Hello Friends,

I want to compare two columns in Excel 2010 (each column has around 8 to 10 million rows). Since I am not able to load them in Excel, I converted them to .txt files and used the following Linux commands.

1. join -j 1 -t $'\t' <(sort macysBase.txt ) <(sort DwpShoes.txt ) > combined
2. join -v 1 -t $'\t' <(sort file1) <(sort file2) > unique1
3. join -v 2 -t $'\t' <(sort file1) <(sort file2) > unique2

Command 1: get the rows common to both files
Command 2: get the rows that appear only in file 1
Command 3: get the rows that appear only in file 2

The issue is that it takes a long time to finish, and sometimes I do not get the desired results. Can anyone suggest a shortcut for this task, or a program to get it done?

Thanks in advance.
Comments
Mehdi Gholam 22-Mar-15 12:17pm    
Excel only supports about 1 million rows (1,048,576 per worksheet).
Zoltán Zörgő 22-Mar-15 14:41pm    
1) Why Excel?
2) What do you mean exactly by "comparing"... what do you want to see as result?
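
One possible shortcut for the three outputs described in the question, sketched under some assumptions: each .txt file holds a single column (one value per line), duplicate values can be collapsed, and the files may carry Windows line endings from the Excel export. The helper name `compare_columns` and the `sorted1`/`sorted2` intermediate files are made up for illustration. The sketch sorts each file only once (the original commands sort both files three times) and uses `comm` in place of `join`. Setting `LC_ALL=C` makes `sort` and `comm` agree on byte-wise ordering; mismatched collation and stray carriage returns are two things that could plausibly cause unexpected results, though the question does not say which results were wrong.

```shell
#!/usr/bin/env bash
# Sketch only: compare two single-column text files.
# compare_columns is a hypothetical helper; output names mirror the question.

compare_columns() {
    local file1=$1 file2=$2 outdir=$3
    mkdir -p "$outdir"

    # Strip carriage returns (assumed possible after an Excel export),
    # then sort each input once, dropping duplicate lines.
    tr -d '\r' < "$file1" | LC_ALL=C sort -u > "$outdir/sorted1"
    tr -d '\r' < "$file2" | LC_ALL=C sort -u > "$outdir/sorted2"

    # comm prints three columns: only in file 1, only in file 2, in both.
    # Each of -1, -2, -3 suppresses the corresponding column.
    LC_ALL=C comm -12 "$outdir/sorted1" "$outdir/sorted2" > "$outdir/combined"
    LC_ALL=C comm -23 "$outdir/sorted1" "$outdir/sorted2" > "$outdir/unique1"
    LC_ALL=C comm -13 "$outdir/sorted1" "$outdir/sorted2" > "$outdir/unique2"
}
```

It could be called as `compare_columns macysBase.txt DwpShoes.txt results`. Note that `comm` compares whole lines, whereas `join` compares only the first tab-separated field, so this is a substitute only if the files really are single-column.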

This content, along with any associated source code and files, is licensed under The Code Project Open License (CPOL)


