Thinking in Map Reduce http://spkr8.com/t/1492

Description:

Hadoop is an Apache project that provides a framework for running applications that process vast amounts of data (hundreds of terabytes) on large clusters (thousands of computers) of commodity hardware. The Hadoop framework transparently provides applications both reliability and data motion. Hadoop implements a distributed file system and Map Reduce. This presentation presents the motivation and approach for Hadoop, an overview of the components and architecture, and an overview of some of the tools built on top of Hadoop, such as Hbase, Pig, and Hive.

Comments on this Talk

Have an account? Sign in.

Leave a Comment

Remember to keep it constructive! Identify strengths and areas for improvement, and make suggestions!

Your Rating: 2.5

I'll Rate It! I was there.

Welcome to the SpeakerRate Beta! Have feedback? Let us know over at Get Satisfaction (but be nice).