Click here to Skip to main content
15,890,512 members
Please Sign up or sign in to vote.
0.00/5 (No votes)
See more:
I have 2 files (text) that I want to load into my DB (SQL) table.But my both the files have dupes in them (primary key dupes and duplicate rows).So how to eradicate or eliminate dupes from files before loading it to DB in SSIS packages.

Can some one tell me in detail what all the tasks (Controlflow task and Data flow task) I have to include in a SSIS package to achieve the above task?
Posted

1 solution

I had to do that recently inside an SSIS package. I used the sort transformation and checked the box to "remove rows with duplicate sort values".

This is a pretty good article on how to do that.
http://www.mssqltips.com/sqlservertip/3036/removing-duplicates-rows-with-ssis-sort-transformation/[^]
 
Share this answer
 
Comments
Member 10658693 10-Mar-14 14:39pm    
Thanks!! I'll try this.
Member 10658693 11-Mar-14 4:42am    
Well it worked, though I made few changes to my SSIS as my scenarion was quite different! thx

This content, along with any associated source code and files, is licensed under The Code Project Open License (CPOL)



CodeProject, 20 Bay Street, 11th Floor Toronto, Ontario, Canada M5J 2N8 +1 (416) 849-8900