Cloudera just announced an opensource package for bulk loading of data from from databases for use by Map Reduce applications. The application is called sqoop, written primarily by aaron kimbal.
http://www.cloudera.com/hadoop-sqoop
My small sample of experience using hadoop map tasks to read from a database or to write to a database resulted in the database falling over almost immediately due to excessive loading. On the flip side the db was mysql running on a single 8way linu…
Jason,
What is the best practice using Hadoop to migrate data from one Oracle remote database to an Oracle local database assuming wre have dblink to the remote database.
I saw examples to read data from a database. But, I feel we should be able t…