In this tutorial, students will learn how to use Python with Apache
Hadoop to store, process, and analyze incredibly large data sets. Hadoop
has become the standard in distributed data processing, but has mostly
required Java in the past. Today, there are a numerous open source
projects that support Hadoop in Python and this tutorial will show
students how to use them.