Monday, August 19, 2013

Mongo-Hadoop 1.1

Hadoop is a powerful, JVM-based platform for running Map/Reduce jobs on clusters of many machines, and it excels at doing analytics and processing tasks on very large data sets.

Since MongoDB excels at storing large operational data sets for applications, it makes sense to explore using these together - MongoDB for storage and querying, and Hadoop for batch processing.

The Mongo-Hadoop Adapter 
We recently released the 1.1 release of the Mongo-Hadoop Adapter. The Mongo-Hadoop adapter makes it easy to use Mongo databases, or mongoDB backup files in .bson format, as the input source or output destination for Hadoop Map/Reduce jobs. By inspecting the data and computing input splits, Hadoop can process the data in parallel so that very large datasets can be processed quickly.

Read more here

Leave a Reply

All Tech News IN © 2011 & Main Blogger .