In this meetup, we will learn about monitoring in the cloud, in some of the most demanding environments when it comes to performance and resilience. Don't miss it!
Doors open at 19:00 and talks will start at 19:15 sharp, please be on time!
There will be drinks and food after the talks, as usual!
We will have a book to raffle among those who arrive before 19:15!
Talk #1: Monitoring real-time cloud infrastructure with an open-source stack
Speaker: Diogo Guerra
At Feedzai we have been making a transition to a devops mindset. During 2018 our cloud footprint grew 400% being a significant part of Feedzai's business. In this talk I will cover the effort put on monitoring and alerting of our systems and how our cloud operation changed the way we address monitoring from planning to the alerts that reach engineers phones.
I'll go into details of the transformation process and the differences between our legacy monitoring platform and the current state as well the impact on the quality of our service.
Feedzai monitoring stack is built on top of TIG (Telegraf, InfluxDB and Grafana) and covers mostly open source technologies such as Cassandra, Postgres, RabbitMQ, Hadoop (Spark and Yarn) as well as AWS native services.
Diogo Guerra is VP of Engineering at Feedzai leading the development of Feedzai's Real Time Fraud Detection system. He specializes in distributed systems, high performance, and low latency real-time platforms.
Leading teams to design and build systems that can process high volumes of data and leverage the power of Machine Learning, Diogo works on a daily basis with top financial institutions across the world to fight fraud.
Talk #2: Quantiles over Percentiles: When and How
Speaker: João Acabado
(note: this talk was voted ex aequo the best lightning talk in our meetup last December, and everyone wanted to know more about it! So we asked João to come back with an extended version)
When performance matters there should be a clear and easy way of setting service level expectations. The way you measure latency determines how can we define our objectives.
Let's brush up your statistics, Quantiles and Percentiles helps us define performance objectives and indicators and have everyone on the same page whenever our services get too slow.
João Acabado is Resilience Engineering Manager @ Sky Portugal. He's always looking for the easiest hard problem that can be solved. Has been looking to bridge the delivery of software solutions with their operation in the past couple years.
IMPORTANT NOTE: the talks will be recorded and published in the DevOps Lisbon YouTube channel. By signing up for this event you explicitly agree that your image might appear in the video recordings. Nevertheless, you can always get in touch with the organization ([masked]) to have it removed.
Claim the event and start manage its content.I am the organizer