Mikael Ahonen

Inspiration | Attitude | Stories

  • Blog
    • Data Science
    • Software
    • Business
    • Self improvement
    • Free time
    • Society
  • Mikael
  • Business
    • Microsoft Excel
    • Pipedrive CRM
  • English
    • Suomi
    • English
    • Svenska

Category: Data Science

Solving problems with statistics, programming and business understanding. Machine learning is a subtopic of data science.

See another category to find posts about business software and mobile apps.

28.05.2022 | 07.06.2022 | Data Science

Running Flask frontend and backend in Kubernetes

Kubernetes has been everywhere lately, especially in the context of MLOps. I gave it a try by creating a web app with Python Flask.

12.04.2022 | 12.04.2022 | Data Science

An undocumented product_id parameter in Pipedrive API to attach products to deals

I found an undocumented product_id parameter in the Pipedrive API to attach products to deals. The issue has been reported to the Pipedrive dev team.

Alternatives for a free data science workspace.
25.12.2021 | 29.12.2021 | Data Science, Machine learning platforms in cloud

Free data science workspaces

Google Colab, Databricks Community Edition, Visual Studio Code and Docker are some options for creating a free data science workspace.

27.11.2021 | 27.05.2022 | Data Science, Machine learning platforms in cloud

Comparison of machine learning platforms in major clouds

Comparing the major machine learning platforms AWS SageMaker, Azure Machine Learning, Google Vertex AI and Databricks.

Example of a simple machine learning process.
21.11.2021 | 28.05.2022 | Data Science, Machine learning platforms in cloud

What is a machine learning platform?

What is a machine learning platform? Introducing different components such as workbench, MLOps tools and cloud computation.

Using machine learning in predictive maintenance. The two-part series covers cost savings and includes an example in Python.
29.10.2020 | 27.10.2020 | Data Science

Machine learning in predictive maintenance

Machine learning in predictive maintenance. The two-part blog series provides insights into cost savings and an example script in Python.

28.10.2020 | 21.11.2021 | Data Science

Faking your geographical location to a web service – A hobby project

How to fool a web service about your actual location? In an experiment I pretended to be in Ireland while traveling in Sweden.

Difference between Data Science and Data Engineering: 1. Data platform built by a Data Engineer. 2. Predictions and recommendations developed by a Data Scientist.
17.08.2020 | 25.12.2021 | Data Science

Difference between data scientist and data engineer roles

In my opinion the big difference is that a data scientist focuses more on business problems while a data engineer solves technical problems.

Review of DataCamp data science courses. Read the review of the online training and get familiar with the pricing and service offering.
13.08.2020 | 10.02.2022 | Data Science

DataCamp – Learn data science online

Experiences from DataCamp online training. Structured data science courses are easy to organize for yourself or a team.

09.02.2020 | 09.02.2020 | Data Science

PySpark execution logic and code optimization

The article goes through the PySpark execution logic and provides guidelines for optimizing speed and performance.

Clustering time series data with SQL. The purpose is to prove that doing data science doesn't always require fancy tools. The examples in this repository might be helpful if you must use SQL instead of proper data science tools such as Python. This repository is therefore not a comprehensive guide to time series data clustering. Even though clustering is often connected to machine learning, this showcase relies only on logical decision making. I have focused on IoT-related data in the field of predictive maintenance.
10.11.2019 | 10.11.2019 | Data Science

Clustering data using SQL – An example with industrial IoT data

Clustering time series data with SQL – Nice 3D visualization using simple logic. Python notebook example on GitHub with industrial data.

Spark and Python tutorial for data developers. The example was run on the AWS cloud computing platform.
07.10.2019 | 04.10.2019 | Data Science

Spark + Python tutorial for data developers

A tutorial for parallel computation with Spark and Python. The example was run on the AWS cloud computing platform.

AWS Glue service works especially well for big data batch processing.
01.09.2019 | 01.09.2019 | Data Science

Introduction to AWS Glue for big data ETL

AWS Glue service works especially well for big data batch processing. Read the full post from data.solita.fi.

The Excel Power Map feature is designed for visualizing spatial data. Watch the demo video made from asylum seeker data.
15.07.2019 | 04.11.2019 | Data Science

Excel Power Map – Spatial data visualization as a time series

Excel Power Map is designed to visualize spatial data. Watch the demo video about visualizing annual asylum seeker data.

Finnish language stemming and lemmatization in Python.
23.06.2019 | 13.08.2020 | Data Science

Finnish stemming and lemmatization in python

I wrote to Solita’s blog about text analytics with the headline “Finnish stemming and lemmatization in python”. The post has code examples.

27.01.2019 | 08.02.2022 | Data Science

Experiences from funding application classification by text analytics

Experiences from funding application classification by text analytics

Example: How to combine machine learning and business?
15.05.2018 | 08.02.2022 | Data Science

Combining machine learning and business – Practical example

I give an example of a machine learning use case in a format that should also be understandable to less technical people.

01.11.2017 | 08.02.2022 | Data Science

Maximizing uptime in Hiab hackathon

Read about Solita team’s solution in a hackathon organized by Hiab. The task was to take advantage of data to maximize machine uptime.

Unpivoting columns to rows with the Excel PowerQuery tool.
01.10.2017 | 08.02.2022 | Data Science

Unpivot columns to rows with Excel PowerQuery

Unpivoting columns to rows with Excel PowerQuery. Watch a 30-second video on how to do it without any formulas.

01.09.2017 | 08.02.2022 | Data Science

Csv headers to list using Python

Python code to automatically list header fields of multiple CSV files. The original use case was related to data warehouse documentation.

22.02.2017 | 08.02.2022 | Data Science

Parsing first name, last name and company from email in Excel – Download Excel template

Do you have a list of emails that you want to split by first […]

Sports betting tutorial - Can you make a living with it?
26.10.2016 | 08.02.2022 | Data Science

Sports betting tutorial – Can you make a living?

You can make a living from sports betting. The blog is not sponsored; I’m sharing my own experiences. Read the tutorial.

Visualization of earthquake data
25.10.2016 | 22.11.2021 | Data Science

Visualization and clustering of earthquake dataset

The built-in quakes dataset in R has 1000 records of earthquakes near Fiji. The first year of observations is 1964 but the last year remains […]

Virus problem - a statistical puzzle.
22.10.2016 | 08.02.2022 | Data Science

Virus problem – A statistical puzzle

The problem: A virus is spreading across the world – it kills without treatment. Your task is to solve a statistical puzzle.

Introducing Excel Power tools. Read the review.
22.10.2016 | 08.02.2022 | Data Science

Introduction to Excel Power tools

A quick introduction to the Excel Power BI add-in family: Power Query, Power Pivot, Power Map and Power View.

Data science, business intelligence and big data. Definitions of the vocabulary.
26.08.2016 | 08.02.2022 | Data Science

Data science and business intelligence – Definitions

It’s easy to spot hype terms like data science and big data on LinkedIn or exhibition posters. I summarized the definitions.

Logo for the Python programming language. Python is a general-purpose programming language that is nowadays used for cloud platform development, web services and data analytics.
01.07.2016 | 08.02.2022 | Data Science

Django tutorial – For data oriented web developers

The Python-based Django web framework offers a great platform for creating data-oriented web applications of any size.

LINKS

  • Home
  • Blog
  • About Mikael
  • Business
    • Microsoft Excel
    • Pipedrive CRM
  • Contact
  • Subscribe

BLOG SERIES

  • Around the world trip
  • Free Excel course
  • Machine learning platforms in cloud
Mikael Ahonen | mikaelahonen.com