Please start any new threads on our new site at https://forums.sqlteam.com. We've got lots of great SQL Server experts to answer whatever question you can come up with.

 All Forums
 SQL Server 2000 Forums
 SQL Server Development (2000)
 Duplicate Rows

Author  Topic 

AskSQLTeam
Ask SQLTeam Question

0 Posts

Posted - 2001-03-13 : 11:47:49
Dan writes "Hi,

The Situation: I recently set up a SQL Server 7 database to handle cusomer contact records.

The problem: The data sources were 15 seperate Excel worksheets with numerous dulpicate entries. However, there is no specific field which can be used to identify a duplicate record uniformly.

For example: Worksheet A -represents the customer name as 'ABC Machine'. Worksheet B - represents the same customer as 'ABC Machine, Inc'. These entries are obviously viewed as unique when attempting to import the data into SQL Server 7. These may represent sub accounts or different locations for the same company -or they may be true duplicates.

My question: Does SQL Server provide any utility to group or check very similar entries for manual review? Or, is this a purely manual process which must be performed pre-import?

I ended up writing a parsing routine in C to examine and compare individual fields and filter off records which were suspect - time consuming...

I'd love to know any techniques or available utilities offering an alternative.

Thanks
Dan"
   

- Advertisement -