![]() ![]() If an opening sentence in English was 20 words long but the French opening was two sentences of about 10 words, the computer would pair the English sentence with the two French ones. It did this by pairing sentences from the same point in the proceedings that were of roughly equal lengths. Jelinek's group began by using a computer to automatically align sentences in the French and English versions of the parliamentary documents. "A couple guys drove to Canada and left with two suitcases full of tapes that contained the proceedings," says Daniel Marcu, co-founder of Language Weaver, the first start-up to use the new statistical techniques in 2002. The IBM effort began with proceedings from the Canadian parliament, which were published in English and French. (Warren Weaver, a mathematician at the Rockefeller Foundation, had first raised the idea of a statistical model for translation in a 1947 letter in which he wrote: "When I look at an article in Russian, I say: 'This is really written in English, but it has been coded in some strange symbols.' ") The computer wouldn't understand the meaning of what it was translating, but by creating a huge database of words and sentences in different languages, the computer could be programmed to find the most common sentence constructions and alignment of words, and how these were likely to correspond between languages. Translation, they basically argued, was as much a mathematical problem as a linguistic one. Jelinek and his group at IBM argued that by using statistics and probability theory, instead of language rules, a computer could do a better job of converting one language into another. But the differing rules in different languages made it difficult. By getting a computer to understand how a sentence worked grammatically in one language, the thought was, it would be possible to create a sentence meaning the same thing in another language. Up to that point, researchers working on machine translation used linguistic models. And all because of an effort begun in the 1980s to remove humans from the equation.Īs the late Frederick Jelinek, who pioneered work on speech recognition at IBM in the 1970s, is widely quoted as saying: "Every time I fire a linguist, my translation improves." (He later denied putting it so harshly.) But in the last few years, as anyone using Google Translate, Babel Fish or many other translation Web sites can tell you, things have changed dramatically. ![]() For decades, machine translation was mostly useful if you were trying to be funny. Libby's quandary will come as no surprise to anyone who has tried to use a computer to translate things. Stumped, she pressed some buttons and eventually managed to wash her clothes, in an elongated wash cycle that kept her pinned down for three hours. "It occurs nobody endlschleudern, however, intercatapults" is one result she got. The guide was written in German, which Casey cannot read, so she typed bits of it into an Internet translation tool. When she couldn't figure out how to use the washing machine, she opened up the instruction manual. Early one morning in 2007, Libby Casey was trying to do her laundry in a guesthouse in Reykjavik, Iceland. ![]()
0 Comments
Leave a Reply. |
Details
AuthorWrite something about yourself. No need to be fancy, just an overview. ArchivesCategories |