EN FI SV
Google Colab, Databricks Community Edition, Visual Studio Code and Dcoker are some options to create a free data science workspace.

Free data science workspaces

I have written multiple blog posts about machine learning (ML) engineering and machine learning platforms. Those systems are usually target to productionize ML solutions, are somewhat big investments and focus on managing the whole ML lifecycle.

Comparing the major machine learning platforms AWS SageMaker, Azure Machine Learning, Google Vertex AI and Databricks.

Comparison of machine learning platforms in major clouds

This blog post compares machine learning platforms from major cloud providers Azure, AWS and Google Cloud. Also Databricks platform has been included.

What is a machine learning platform? Introducing different components such as workbench, MLOps tools and cloud computation.

What is a machine learning platform?

Machine learning is going towards the direction where data scientist does the creative work and ML platform takes care of unpleasant process management.

Machine learning in predictive maintenance. The two-part blog series provides insights for cost savings and an example script in Python.

Machine learning in predictive maintenance

Predictive maintenance aims to repair the equipment before the failure actually happens. Scheduled maintenances minimize the production downtime especially in industrial companies.

In my opinion the big difference is that a data scientist focuses more on business problems while data engineer solves technical problems.

Difference between data scientist and data engineer roles

Working the past few years in both data science and data engineering projects, I have gained pretty good understanding to answer that question.

Experiences from DataCamp online training. Structured data science courses are easy to organize for yourself or a team.

DataCamp - Learn data science online

DataCamp is an online learning platform for data science. The data science course catalog contains wide selection of Python, R, SQL and Excel videos and assignments.

Clustering time series data with SQL - Nice 3D visualization using simple logic. Python notebook example in GitHub with industrial data.

Clustering data using SQL - An example with industrial IoT data

Clustering time series data with SQL. The purpose of this experiment was to prove that doing data science doesn’t always require fancy tools.

I wrote to Solita's blog about text analytics with the headline "Finnish stemming and lemmatization in python". The post has code examples.

Finnish stemming and lemmatization in python

I wrote to Solita’s Data blog about text analytics with the headline Finnish stemming and lemmatization in python. Read the writing here .

Experiences from funding application classification by text analytics

Experiences from funding application classification by text analytics

I wrote to Solita’s data blog about a text analytics project. The goal was to automate manual classification of funding applications.

I give an example about machine learning use case in a format that should be understandable also for less technical people.

Combining machine learning and business - Practical example

You can find the article from Solita’s data related blog site data.solita.fi . Finally I managed to publish my blog post with the topic A Machine Learning Example For Business.

You can make the living by sports betting. The blog is not sponsored as I'm sharing my own experiences. Read the tutorial.

Sports betting tutorial - Can you make the living?

It is actually possible to make your living by doing sports betting. This blog is not sponsored - these are my own experiences.