In my opinion the big difference is that a data scientist focuses more on business problems while data engineer solves technical problems.
Experiences from DataCamp online training. Structured data science courses are easy to organize for yourself or a team.
Databox enables elegant reports in a SaaS interface to be shared and published both internally and outside the organization.
The article goes through the PySpark execution logic and provides guidelines to optimize the speed and performance.
Pipedrive Essential and Advanced are by far the most popular plans among my customers. A comparison of different subscriptions.
Clustering time series data with SQL – Nice 3D visualization using simple logic. Python notebook example in GitHub with industrial data.
A tutorial for parallel computation with Spark and Python. The example has been ran on AWS cloud computing platform.
AWS Glue service works especially well for big data batch processing. Read the full post from data.solita.fi.
Excel Power Map is designed to visualize spatial data. Watch the demo video about visualizing annual asylum seeker data.
I wrote to Solita’s blog about text analytics with the headline “Finnish stemming and lemmatization in python”. The post has code examples.
Experiences from Pipedrive CRM implementation. Pipedrive is simple for the sales people, but the migration from an old CRM requires planning.
Simple instructions to sign up and get started with Pipedrive CRM. Redeem the extended trial period for Pipedrive.
Experiences from funding application classification by text analytics
You can find the article from Solita’s data related blog site data.solita.fi. Finally I managed to publish my blog post with the topic A Machine […]
Read about Solita team’s solution in a hackathon organized by Hiab. The task was to take advantage of data to maximize machine uptime.
Unpivoting columns to rows with Excel PowerQuery. Watch 30 seconds video how to do it without any formulas.
Python code to automatically list header fields of multiple CSV files. The original use case was related to data warehouse documentation.
Parsing first name, last name and company from email in Excel Do you have a list of emails that you want to split by first […]
Personally, I use a note app to store ideas, observations, travel tickets and debts that should be quickly accessible.
Toggl is a mobile app for work hour recording and management. Toggl is not great because of the uniqueness but simplicity.
The free Excel course. Compact video lectures with exercise materials for training. The lectures have an optimal order for learning.
Workbook, worksheet, saving, opening and options in Excel.
Change font color, autofit column width, merge cells, use bold text and set borders.
A brief introduction how to compose your first Excel formula.
Copy and paste only values, formulas or formattings in Excel.
Learn different number formats as well as Excel’s date and time system.
How to do absolute and relative references and name ranges.
Sort, filter and summarize easily with Excel table objects.
Excel has a set of powerful text manipulation formulas. In this lecture you will learn to apply SUBSTITUTE(), MID(), FIND() and LEN().
Yes. It’s me in the picture. It was Halloween. Creating and Android app has been in my task list a good while but now I […]
It is actually possible to make your living by doing sports betting. This blog is not sponsored – these are my own experiences. Betting – […]
The built-in dataset quakes in RStudio had 1000 records of earthquakes nearby Fiji. The first year of observations is 1964 but the last year remains […]
Splitwise solves a traditional problem that exists in every single group of friends – how to split costs easily. Need to split costs? In Splitwise you […]
After testing multiple todo list I dare to announce Wunderlist as the simplest one with just right amount of features. You can categorize tasks to folders and […]
This imaginary problem does not rely on any real situation. A virus is spreading across the world – it kills without treatment. A medicine does exist […]
Next I will introduxe briefly how Power tools in Microsoft Excel Power BI family are related to each other. That way you quickly realize whether […]
It’s easy to spot these hype terms like data science, big data in LinkedIn or exhibition posters. I summarized the definitions of most frequently used […]
Django is a web framework for Python programming language which in practise means well designed folder structure and pre-made class modules for most common functionalities in […]