A Community for Hadoop Users
This network is a place to discuss and learn Hadoop, Solr, Katta, MapReduce, Machine Learning and Big Data.
Started by arup sarkar on Tuesday.
Started by Prabha Satya on Tuesday.
Started by John Yard Jan 19.
Hi All,
We are reading two text files that contain relational data, and we generate Java objects from that data.
Then we compare the objects from the two files. We want to find which objects differ between the files.
Note: We cannot compare the files as strings (line by line), because a line can contain a reference to another line in the same file, so each file forms a tree of references.
Also, each line may have a different data structure (we…
Posted by Chaula Ganatra on January 13, 2012 at 1:37am
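One possible way to approach the comparison described above is to give every object a canonical form that includes the objects it references, and then compare canonical forms instead of raw lines. The sketch below is only an illustration under assumptions that are not in the post: the line format `id|value|ref1,ref2`, the class name `RecordDiff`, and the idea that both files use the same ids are all hypothetical.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Collections;
import java.util.HashSet;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.Set;
import java.util.TreeSet;

final class RecordDiff {
    // One parsed line: an id, a payload, and the ids of the lines it references.
    static final class Node {
        final String value;
        final List<String> refs;
        Node(String value, List<String> refs) {
            this.value = value;
            this.refs = refs;
        }
    }

    // Parses the hypothetical "id|value|ref1,ref2" format into a map keyed by id.
    static Map<String, Node> parse(List<String> lines) {
        Map<String, Node> nodes = new LinkedHashMap<>();
        for (String line : lines) {
            String[] parts = line.split("\\|", -1);
            List<String> refs = parts.length > 2 && !parts[2].isEmpty()
                    ? new ArrayList<>(Arrays.asList(parts[2].split(",")))
                    : Collections.<String>emptyList();
            nodes.put(parts[0], new Node(parts[1], refs));
        }
        return nodes;
    }

    // Builds a canonical string for an object by expanding its references recursively.
    // The 'path' set guards against reference cycles.
    static String canonical(String id, Map<String, Node> nodes, Set<String> path) {
        Node n = nodes.get(id);
        if (n == null) return "<missing:" + id + ">";
        if (!path.add(id)) return "<cycle:" + id + ">";
        StringBuilder sb = new StringBuilder(n.value).append('[');
        for (String ref : n.refs) {
            sb.append(canonical(ref, nodes, path)).append(';');
        }
        path.remove(id);
        return sb.append(']').toString();
    }

    // Returns the ids whose expanded form differs between the two files,
    // including ids that appear in only one of them.
    static Set<String> differingIds(List<String> fileA, List<String> fileB) {
        Map<String, Node> a = parse(fileA);
        Map<String, Node> b = parse(fileB);
        Set<String> ids = new TreeSet<>(a.keySet());
        ids.addAll(b.keySet());
        Set<String> diff = new TreeSet<>();
        for (String id : ids) {
            String ca = canonical(id, a, new HashSet<String>());
            String cb = canonical(id, b, new HashSet<String>());
            if (!ca.equals(cb)) diff.add(id);
        }
        return diff;
    }
}
```

With this approach a change in a referenced line also marks every object that points to it as different, which may or may not be what is wanted. If only direct changes matter, comparing `value` and `refs` per id without the recursive expansion would be enough.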
Scenario:
I have one subset of a database and one data warehouse. I have brought both onto HDFS. I want to analyse the results based on the subset and the data warehouse. (In short, for each record in the subset I have to scan every record in the data warehouse.)
Question:
I want to do this task using MapReduce. I do not understand how to take both files as input to the mapper, or how to handle both files in the map phase. Please suggest some idea so…
Posted by Bhavesh Shah on January 3, 2012 at 12:35am — 2 Comments
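A common pattern for this kind of question, when the subset is small enough to fit in memory, is a map-side join: every map task loads the whole subset once and then compares it against each data warehouse record it receives. The sketch below imitates that idea in plain Java with no Hadoop classes, so it is not an actual job. The class name `MapSideScan`, the `matches` rule and the tab-separated output are hypothetical. In a real Hadoop job the subset would typically be shipped through the distributed cache and loaded in the mapper's `setup` method, and if both inputs are large, a reduce-side join using `MultipleInputs` is the usual alternative.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.function.BiPredicate;

final class MapSideScan {
    // Small side of the join: held fully in memory by every map task.
    private final List<String> subset;
    // Hypothetical rule deciding whether a subset record and a warehouse record belong together.
    private final BiPredicate<String, String> matches;

    MapSideScan(List<String> subset, BiPredicate<String, String> matches) {
        this.subset = new ArrayList<>(subset);
        this.matches = matches;
    }

    // Plays the role of map(): called once per data warehouse record,
    // it scans the in-memory subset and emits one output line per match.
    List<String> map(String warehouseRecord) {
        List<String> out = new ArrayList<>();
        for (String subsetRecord : subset) {
            if (matches.test(subsetRecord, warehouseRecord)) {
                out.add(subsetRecord + "\t" + warehouseRecord);
            }
        }
        return out;
    }
}
```

Each warehouse record is read only once, and the "scan every record" requirement is met because every map task sees the complete subset. This only works while the subset fits comfortably in the memory of a single task.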
I am currently doing a project on energy efficiency and reliability in Hadoop. Can anyone tell me which classes in Hadoop handle block splitting, block allocation, and block replication? Also, is it possible to add nodes to a cluster dynamically? If so, please tell me the steps. Can anyone answer? Please reply soon.
Posted by radhakrishnan_cse on December 28, 2011 at 8:30am — 1 Comment
A major brokerage firm is looking for a senior Hadoop architect/developer. You will work with a variety of financial data, trading and back-office-related. Security Master knowledge is helpful.
You will work on the infrastructure end, working on logical and physical architecture. Your main responsibility will be working with Terabytes of data and organizing it into data domains. You should have extensive experience with data implementations, data storage, data access and…
Posted by Dmitriy Goldin on June 20, 2011 at 6:55am
Posted by Muhammed Irshad on May 31, 2011 at 10:12pm
© 2012 Created by Jason Venner.