Hadoop, Pig, and Twitter http://spkr8.com/t/4013

Description:

How does Twitter analyze its massive dataset? What tools do we use, and where do we focus our analysis? In this talk, I will discuss our transition from a MySQL-based to a Hadoop-based data infrastructure and our use of Pig (a scripting language built on top of Hadoop) to democratize big-data analysis across the company. I will present concrete examples of interesting analyses at each step.
