For this meetup we are excited to be joined by two great presenters: Sam Deng, owner of Shipshape Labs, and Russell Spitzer, Software Engineer and Junior Park Ranger at DataStax.
What You'll Learn At This Meetup:
Sam Deng: Google results for "RESTful Cassandra API" are less than awesome. Come hear about how I got burned so you don't have to! This talk will cover techniques for avoiding pitfalls dealing with caching, updates, pagination, lack of referential integrity, and more.
Russell Spitzer: Not since peanut butter and jelly has there been such an epic combo. Spark is the world’s foremost distributed analytics platform, delivering in-memory analytics with a speed and ease of use unheard of in Hadoop. Cassandra is the lightning-fast distributed database powering such IT giants as Outbrain and Netflix. Did you know you can combine them with free open source technology? Integrate them easily with the DataStax Open Source Spark Cassandra Connector. This feature-rich integration allows Spark to fully take advantage of Cassandra as well as use Cassandra-specific Spark optimizations. Increase the efficiency of your application with the insider knowledge delivered by one of the main authors of the connector. In this session we’ll go over some of the most common use cases of the Spark Cassandra Connector and highlight how to avoid the most common pitfalls. We will walk through:
Spark Cassandra Basic Features:
- How the Spark Cassandra Connector reads and writes data to C*
- How Spark DataFrames are integrated with Cassandra
- How to use Cassandra data locality to your advantage
- How Cassandra predicate pushdown works in SparkSQL
Building and Tuning Spark Streaming Applications with Cassandra:
- Tuning standard RDD operations for maximum throughput
- Using the internal C* driver pool for flexibility and efficient access
- Understanding how receivers work and interact with Cassandra locality
Use Spark to Perform Common Cassandra Maintenance:
- Migrating data from RDBMS sources directly into Cassandra
- Using Spark to migrate information between different Cassandra clusters
- Bulk loading Cassandra using Spark and DataFrames
- Rebuilding Cassandra tables with different indexes using Spark
*A big thank you to Sony for hosting!
*Food and drinks will be served. Hope to see you there!
About Sam Deng: I'm a full stack of pancakes and owner of Shipshape Labs. Recent technical lead at PBS KIDS. Recovering east coast workaholic.
About Russell Spitzer: After earning his Ph.D. in bioinformatics from UCSF, Russell Spitzer took his love of big data to DataStax. There he has worked on all aspects of integrating Cassandra with other Apache technologies like Spark, Hadoop and Solr. Now his main focus is the integration of Cassandra with Apache Spark via the Spark Cassandra Connector.