A Community for Hadoop Users
Hi All,
We are reading two txt files which contains relational data and then we generate java objects from these data.
Then we compare the objects of two files. We want to find which objects differ in both the files.
Note : We can not compare Strings (line by line) of one file to another file because one line contains reference of another line of the same file and this way it has tree of references.
Again each line may have different data structure (we…
ContinueAdded by Chaula Ganatra on January 13, 2012 at 1:37am — No Comments
Scenario:
I have one subset of database and one dataware house. I have bring this both things on HDFS. I want to analyse the result based on subset and datawarehouse. (In short, for one record in subset I have to scan each and every record in dataware house).
Question:
I want to do this task using Map-Reduce algo. I am not getting that how to take both files as a input in mapper and also how to handle both files in map phase of map-reduce. Pls suggest me some idea so…
ContinueAdded by Bhavesh Shah on January 3, 2012 at 12:35am — 2 Comments
I am currently doing project in energy efficiency and reliability in hadoop.. Can any one tell me which class in hadoop doing block spliting and block allocation and block replication.. Also whether is it possible to dynamically add nodes in cluster..if it is possible tell me the steps.. Can any one give the answer..? Plz soon..
Added by radhakrishnan_cse on December 28, 2011 at 8:30am — 2 Comments
A major brokerage firm is looking for a senior Hadoop architect/developer. You will work with a variety of financial data, trading and back-office-related. Security Master knowledge is helpful.
You will work on the infrastructure end, working on logical and physical architecture. Your main responsibility will be working with Terabytes of data and organizing it into data domains. You should have extensive experience with data implementations, data storage, data access and…
ContinueAdded by Dmitriy Goldin on June 20, 2011 at 6:55am — No Comments
Added by Muhammed Irshad on May 31, 2011 at 10:12pm — No Comments
Flurry is Hiring! Here are just a couple of the open positions!
Please go to our website www.flurry.com to apply!
About Us
Added by Amy Caruso on April 18, 2011 at 3:45pm — No Comments
It’s clear that now we are living in the era of big data. The stores of data on which modern businesses rely are already vast and increasing at an unprecedented pace. Organizations are capturing data at deeper levels of detail and keeping more history than they ever have before. Managing all of the data is thus emerging as one of the key challenges of the new decade.
The solutions to this challenge vary, but interest in them seems to be universal. The largest database vendors and…
Added by Mendeil Bailey on April 11, 2011 at 11:20am — No Comments
Added by MIT on December 29, 2010 at 8:12am — No Comments
Introduction
Hadoop is rapidly becoming the technology of choice for enterprises that need to effectively collect, store and process large amounts of structured and complex data.
The Pentaho BI project is open…
ContinueAdded by Anantha Srivathsa N on December 19, 2010 at 11:00pm — No Comments
Added by Adarsh on November 18, 2010 at 3:18pm — No Comments
Conde Group, Inc. (www.condegroup.com) a consulting and staffing firm based in southern California focuses on helping our clients to acquire and retain…
ContinueAdded by Amy Tierney on November 10, 2010 at 7:36am — No Comments
Added by Vincent Garrigoux on August 17, 2010 at 10:41pm — 1 Comment
Added by Vincent Garrigoux on August 17, 2010 at 8:47am — 1 Comment
Added by Louisa Landry on March 23, 2010 at 8:45am — No Comments
Added by Marc Sturlese on February 15, 2010 at 7:00am — 4 Comments
Added by Mark Cejas on February 13, 2010 at 10:41am — 2 Comments
Added by Mark Cejas on December 31, 2009 at 12:23pm — 1 Comment
Added by Jason Venner on November 17, 2009 at 12:57pm — No Comments
5 members
3 members
9 members
1 member
8 members
© 2012 Created by Jason Venner.