b***t 发帖数: 42 | 1 【 以下文字转载自 JobHunting 讨论区 】
发信人: bnapt (bnapt), 信区: JobHunting
标 题: natrural language processing question!!!!
发信站: BBS 未名空间站 (Fri Mar 6 18:15:18 2009)
given 10 same length strings, each string is 128 characters. Each strings
are ordinary English text with all spaces, numbers, punctuation, and other
non-alphabetic characters removed. They were copied or transcribed from
obscure sources, and have all been changed slightly from the original (
without intentionally introducing spelling or grammat | f*********g 发帖数: 632 | 2 我没有看得特别明白。
你是否统计一下这十个字符串中的相同字母出现的频率,然后拿英语字母频率相比对?
比对近似的,就应该把这十个字符串中的字母换成对应的英语字母。这样就解决问题了
。 |
|