Running Flask frontend and backend in Kubernetes
Kubernetes has been everywhere lately. Especially in the context of MLOps to manage the plethora of different tasks such as training, serving and registering the models.
An undocumented product_id parameter in Pipedrive API to attach products to deals
I found a “bug” in the Pipedrive API documentation while exploring a customer case together with Pipedrive partner SaaShop .
Free data science workspaces
I have written multiple blog posts about machine learning (ML) engineering and machine learning platforms. Those systems are usually target to productionize ML solutions, are somewhat big investments and focus on managing the whole ML lifecycle.
Comparison of machine learning platforms in major clouds
This blog post compares machine learning platforms from major cloud providers Azure, AWS and Google Cloud. Also Databricks platform has been included.
Machine learning in predictive maintenance
Predictive maintenance aims to repair the equipment before the failure actually happens. Scheduled maintenances minimize the production downtime especially in industrial companies.
DataCamp - Learn data science online
DataCamp is an online learning platform for data science. The data science course catalog contains wide selection of Python, R, SQL and Excel videos and assignments.
PySpark execution logic and code optimization
On last fall I wrote about the PySpark framework at my previous employer’s blog. As the name indicates, the topic is extremely technical.
Clustering data using SQL - An example with industrial IoT data
Clustering time series data with SQL. The purpose of this experiment was to prove that doing data science doesn’t always require fancy tools.
Spark + Python tutorial for data developers
Go to Spark + Python tutorial in AWS Glue in Solita’s data blog. Spark and parallel computing A shop cashier can only serve a limited amount of customers at a given time.
Finnish stemming and lemmatization in python
I wrote to Solita’s Data blog about text analytics with the headline Finnish stemming and lemmatization in python. Read the writing here .
Should you start investing if a depression is coming? - Data analysis
This is a summary from the original Finnish blog article. Data analysis result: Invest only the money you don’t need at the moment The purpose of the stock market analysis was to answer this question: Is it good idea time to start regular investing if a depression is coming?
Csv headers to list using Python
A datawarehouse project required documentation for incoming CSV-files. The intent was to list all header fields of tens of CSV files grouped by the file name.
Django tutorial - For data oriented web developers
Django is a web framework for Python programming language which in practise means well designed folder structure and pre-made class modules for most common functionalities in web service.