The article goes through the PySpark execution logic and provides guidelines to optimize the speed and performance.
A tutorial for parallel computation with Spark and Python. The example has been ran on AWS cloud computing platform.
AWS Glue service works especially well for big data batch processing. Read the full post from data.solita.fi.
Excel Power Map is designed to visualize spatial data. Watch the demo video about visualizing annual asylum seeker data.
I wrote to Solita’s blog about text analytics with the headline “Finnish stemming and lemmatization in python”. The post has code examples.
Blogging about professional topics is an excellent way to increase visibility for yourself and your company. Read the tips and experiences!
Experiences from funding application classification by text analytics
I give an example about machine learning use case in a format that should be understandable also for less technical people.
Read about Solita team’s solution in a hackathon organized by Hiab. The task was to take advantage of data to maximize machine uptime.