|
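How do I reference the data set 'correct_domains' inside the do loop of the
last data step below (at the ??? placeholder), so I can compare each domain
against the known-good list and flag close Levenshtein distances (to find
misspelled domains)?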
data correct_domains;
input domain $200.;
infile datalines truncover;
datalines;
yahoo.com
gmail.com
hotmail.com
aol.com
comcast.net
msn.com
sbcglobal.net
verizon.net
bellsouth.net
cox.net
att.net
;;;;
run;
data check_these_domains;
input domain $200.;
infile datalines truncover;
datalines;
yahoo.cm
gmail.co
hotmial.com
aol.com
comcast.net
;;;;
run;
data checked;
set check_these_domains;
/* 11 = number of rows in correct_domains */
do _i_ = 1 to 11;
r = COMPLEV(???, domain);
if r in (1,2) then leave /* do something useful */;
end;
run;
Andrew
|