A Community for Hadoop Users
Hi All,
We are reading two txt files which contains relational data and then we generate java objects from these data.
Then we compare the objects of two files. We want to find which objects differ in both the files.
Note : We can not compare Strings (line by line) of one file to another file because one line contains reference of another line of the same file and this way it has tree of references.
Again each line may have different data structure (we have around 700 different data structure).
The files which we will be comparing would be of size ranging from 10 MB to 1GB.
Can you please suggest whether Hadoop MapReduce would be suitable for our requirement?
Your suggestions are highly appreciated.
Regards,
Chaula
6 members
4 members
11 members
1 member
9 members
© 2012 Created by Jason Venner.
You need to be a member of Hadoop Professionals to add comments!
Join Hadoop Professionals