Breaking news! My business has moved to datatori.com

Spark + Python tutorial for data developers

A tutorial for parallel computation with Spark and Python. The example has been run on the AWS cloud computing platform.

Introduction to AWS Glue for big data ETL

The AWS Glue service works especially well for big data batch processing. Read the full post at data.solita.fi.