I’ve been designing data-centric systems for last 20 years and have gained huge experience in developing and performance tuning of OLTP, DW and BI applications. Last 3 year I’m focused on Data Analytics in Big Data Clusters and implementing different innovative approaches for Fast Data Processing. At the moment my responsibility is to develop competencies for Enterprise level Data Platform at Eleks company.
Topic: Apache Spark: Advanced in-memory BigData Analytics
Short Description: Apache Spark is an open-source parallel processing framework that supports in-memory processing to boost the performance of big-data analytic applications. We will cover approaches of processing Big Data on Spark cluster for real-time analytic, machine learning and iterative BI and also discuss the pros and cons of using popular open source ML servers.