|
|
楼主

楼主 |
发表于 2009-3-6 05:31:46
|
只看该作者
容错合并文件
I think this is a very challenging problem I am facing and I have no idea how to deal with it
Suppose I have two csv files
A.csv
Toyota Camry,1998,blue
Honda Civic,1999,blue
Acura Inf,2000,yellow
B.csv
Toyota Inc. Camry, 2000km
Honda Corp Civic,1500km
HondaUSA Inf, 2000, 2300km
I want to generate C.csv
Toyota Camry,1998,blue ,2000km
Honda Civic,1999,blue,1500km
HondaUSA Inf,2000,yellow,2300km
The worst part of the task is that there needs to be error tolerance to deal with the variations in the company name
1.extra spaces
2.extra dots
3.phrases such as Inc, corp.
4.Create a list of manual translation tables(Acura translates to HondaUSA)
Is this mission impossible? |
|