About the Talk
October 30, 2009 6:45 AM
Georgia Tech Research Institute Conference CenterGeorgia Tech Research Institute Conference Center
Map/Reduce, as provided in Apache’s Hadoop, is simple in concept, but difficult in practice. Many real world tasks require multiple map/reduce tasks and sometimes figuring out which step to perform an operation in can be a challenge. Cascading is an open source abstraction layer on top of Hadoop that provides an easy to use, but powerful API that figures out what problems you are trying to solve and builds the appropriate map, reduce and dependency logic. In this presentation I will be explaining the concepts of Cascading and showing how it is used to solve real world problems.