Using Monoids for Large Scale Business Stats by Ashwanth Kumar
As big data processing moves towards stream processing, there is a need to monitor the behaviour of our systems by collecting business metrics in real time. Owing to the very nature of stream processing systems, MR stats jobs are insufficient. Ashwanth, a Principal Engineer at Indix, will talk about Indix's learnings over the years in dealing with metrics for stream-based processing systems.
Democratizing data with an internal data pipeline platform by Manoj Mahalingam
Manoj, a Principal Engineer at Indix, will talk about the platform that Indix built on top of Spark. It enables not just data scientists but just about everyone with access to the data to define datasets, create pipelines and perform operations over them, while increasing resource utilisation and, more importantly, the productivity of the people.
Optimizing Spark by Matild Reema
Reema, a Software Engineer at Indix, will give us an overview of the Spark architecture, the best practices, and what she did to improve the performance and resource utilisation of a Spark cluster.
• 10.00am to 10.45am Using Monoids for Large Scale Business Stats by Ashwanth Kumar
• 10.45am to 11.30am Democratizing data with an internal data pipeline platform by Manoj Mahalingam
• 11.30am to 12.15pm Optimizing Spark by Matild Reema
This is a free event. Please arrive a bit early so that you don't get delayed at the security gate.
Bring: a government-issued photo ID.
Any of these would work: Driving License, PAN card, Aadhaar, Passport.
RSVP closes at 5:00pm the previous day so that the security passes can be arranged.
Please upload your photo to your meetup.com profile so that your security pass can be prepared.