Dataflow product in Google Cloud is mandatory for advanced data processing pipelines for machine learning solutions.
This is a summary of Google Cloud Platform (GCP) products relevant for Machine Learning Engineer role.
Comparing the major machine learning platforms AWS SageMaker, Azure Machine Learning, Google Vertex AI and Databricks.
A tutorial for parallel computation with Spark and Python. The example has been ran on AWS cloud computing platform.
AWS Glue service works especially well for big data batch processing. Read the full post from data.solita.fi.
Read about Solita team's solution in a hackathon organized by Hiab. The task was to take advantage of data to maximize machine uptime.
It's easy to spot these hype terms like data science, big data in LinkedIn or exhibition posters. I summarized the definitions.