Cleansing of the data has become a major part of the Data Warehousing effort. If the records are truly duplicates (exactly), then there are ways to remove them either during the ETL process directly or after loading. In addition, it can be done in a pre-process using something like the Unix uniq command, or the sed or awk utilities, or a small VB or java program.
-------------------------
The reasonable man adapts himself to the world. The unreasonable one persists in trying to adapt the world to himself. Therefore all progress depends on the unreasonable man. - George Bernard Shaw