Hadoop Professionals

A Community for Hadoop Users

Hadoop MapReduce to Compare Relational Data Stored in Text Files

Hi All,

We read two text files that contain relational data and generate Java objects from that data.

We then compare the objects from the two files. We want to find which objects differ between the two files.

Note: We cannot compare the files as strings, line by line, because a line can contain a reference to another line in the same file, so the lines in each file form a tree of references.

In addition, each line may have a different data structure (we have around 700 different data structures).
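To illustrate the kind of comparison we mean, here is a minimal sketch in plain Java (no Hadoop). It assumes a hypothetical line format `id|type|payload|refId`, which is not our real format, and only one reference per line. The class `RecordDiff` and its methods are made-up names for this example. Each record is expanded along its reference chain into an id-independent string, so two files can be compared even when the line ids differ.

```java
import java.util.*;

final class RecordDiff {
    // Hypothetical line format (an assumption for this sketch): id|type|payload|refId
    // refId is empty when the line does not reference another line.

    /** Parses the lines of one file into a map from record id to its four fields. */
    static Map<String, String[]> parse(List<String> lines) {
        Map<String, String[]> byId = new HashMap<>();
        for (String line : lines) {
            String[] f = line.split("\\|", -1); // -1 keeps a trailing empty refId
            if (f.length != 4) {
                throw new IllegalArgumentException("Expected 4 fields: " + line);
            }
            byId.put(f[0], f);
        }
        return byId;
    }

    /** Expands a record and the chain of records it references into one string without ids. */
    static String canonical(String id, Map<String, String[]> byId, Set<String> visiting) {
        String[] f = byId.get(id);
        if (f == null) return "<missing>";      // dangling reference
        if (!visiting.add(id)) return "<cycle>"; // guard against circular references
        String ref = f[3].isEmpty() ? "" : canonical(f[3], byId, visiting);
        visiting.remove(id);
        return f[1] + "(" + f[2] + ")" + (ref.isEmpty() ? "" : "->" + ref);
    }

    /** Returns the sorted set of canonical forms of all records in one file. */
    static Set<String> canonicalForms(List<String> lines) {
        Map<String, String[]> byId = parse(lines);
        Set<String> forms = new TreeSet<>();
        for (String id : byId.keySet()) {
            forms.add(canonical(id, byId, new HashSet<>()));
        }
        return forms;
    }

    /** Returns the canonical forms that occur in a but not in b. */
    static Set<String> onlyIn(Set<String> a, Set<String> b) {
        Set<String> diff = new TreeSet<>(a);
        diff.removeAll(b);
        return diff;
    }

    public static void main(String[] args) {
        List<String> left = Arrays.asList("1|Customer|Alice|", "2|Order|book|1");
        List<String> right = Arrays.asList("7|Customer|Alice|", "9|Order|pen|7");
        Set<String> l = canonicalForms(left);
        Set<String> r = canonicalForms(right);
        System.out.println("Only in left:  " + onlyIn(l, r));
        System.out.println("Only in right: " + onlyIn(r, l));
    }
}
```

In this example the two `Customer` records count as equal although their ids differ (1 and 7), while the two `Order` records are reported as different. Because the sketch uses sets, it ignores how many times an identical record occurs in a file, and it holds one whole file in memory.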

The files we will be comparing range in size from 10 MB to 1 GB.

Can you please suggest whether Hadoop MapReduce would be suitable for our requirement?

Your suggestions are highly appreciated.

Regards,

Chaula


Tags: Data, File, Hadoop, MapReduce, Relational, Stored, compare, in, text, to


© 2012   Created by Jason Venner.
