Date: Wed, 6 Jun 2007 12:05:30 -0600
Reply-To: Eugenio Grant <eugenio.grant@ipsos-ca.com>
Sender: "SPSSX(r) Discussion" <SPSSX-L@LISTSERV.UGA.EDU>
From: Eugenio Grant <eugenio.grant@ipsos-ca.com>
Subject: Finding Duplicate Names
Content-Type: text/plain; charset="us-ascii"
Hi:
I have a BIG database (45.000) records. It has a variable called "name" that
has the name of the company. I'm trying to do 2 things with it.
1. Identify (not eliminate) duplicate names.
2. Identify similar names, meaning not the same but a similar. For example
Time Warner Inc, and Time Warner might be the same.
Any ideas.
Regards,
|