Data Engineering SD - Data Lifecycle Management

Mar 16, 2022 · San Diego, United States of America

Come out to the Data Engineering group monthly meeting for a presentation followed by group discussions. This is a virtual meeting and all are welcome to attend.

Presentation: Data Lifecycle Management - Applying Engineering Best Practices for Data
Presenter: Itai David

Today, when working with data lakes over object storage it is difficult to test changes in isolation, stage new data pipelines/ML models in parallel to production, ensure best practices, debug issues or revert in case of a quality issue.

lakeFS is an open source project that enables managing data the same way as code. Enabling isolated development, safe data ingestion and resilient production. lakeFS provides git-like capabilities such as branches, merges and commits on top of format agnostic data repositories kept on object storage.

Itai David is a software engineer with Treeverse, the company behind lakeFS; With over 15 years of experience in developing software. He is based out of Winnipeg, Canada - so is very jealous of the San Diego weather ;)

As usual we will have group discussions on topics voted on at the meeting. Show up with a question or discussion topic or just hang out and participate in the conversation.

Contact [masked] if you are interested in presenting at a future meetup.

Agenda for the Meeting

- Presentation + Questions (5:30 - 6:15)
- Group Discussions (please participate) (6:15 - 7:00)

Zoom Link for Meeting: RSVP for Link

Event organizers

Are you organizing Data Engineering SD - Data Lifecycle Management?

Claim the event and start manage its content.

I am the organizer

based on 0 reviews

Featured Events