Monday, February 11, 2013

Algorithm learns how to revive lost languages

Like living things, languages evolve. Words mutate, sounds shift, and new tongues arise from old.

Charting this landscape is usually done through manual research. But now a computer has been taught to reconstruct lost languages using the sounds uttered by those who speak their modern successors.

Alexandre Bouchard-Côté at the University of British Columbia in Vancouver, Canada, and colleagues have developed a machine-learning algorithm that uses rules about how the sounds of words can vary to infer the most likely phonetic changes behind a language's divergence.

For example, in a recent change known as the Canadian Shift, many Canadians now say "aboot" instead of "about". "It happens in all words with a similar sound," says Bouchard-Côté.

The team applied the technique to thousands of word pairings used across 637 Austronesian languages, the family that includes Fijian, Hawaiian and Tongan.

Tracking human history

The system was able to suggest how ancestor languages might have sounded and also identify which sounds were most likely to change. When the team compared the results with reconstructions made by human specialists, they found that over 85 per cent of the system's suggestions were within a single character of the specialists' words.

For example, the modern word for "wind" in Fijian is cagi. Using this and the same word in other modern Austronesian languages, the automatic system reconstructed the ancestor word beliu and the human experts reconstructed bali.
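To make the "within a single character" comparison concrete, here is a minimal sketch in Python. It assumes the measure is plain Levenshtein edit distance on the written forms, which the article does not confirm; the study may count differences over phonetic symbols instead. The function names `edit_distance` and `share_within` are hypothetical and chosen only for illustration.

```python
def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance: the minimum number of single-character
    insertions, deletions and substitutions that turn a into b."""
    prev = list(range(len(b) + 1))  # distances from the empty prefix of a
    for i, ca in enumerate(a, start=1):
        curr = [i]  # turning a[:i] into the empty string takes i deletions
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(
                prev[j] + 1,         # delete ca
                curr[j - 1] + 1,     # insert cb
                prev[j - 1] + cost,  # substitute ca with cb, or keep a match
            ))
        prev = curr
    return prev[-1]


def share_within(pairs, max_distance=1):
    """Fraction of (automatic, expert) pairs whose edit distance is at
    most max_distance. The threshold of 1 is an assumption standing in
    for the article's "within a single character"."""
    if not pairs:
        raise ValueError("pairs must not be empty")
    hits = sum(1 for auto, expert in pairs
               if edit_distance(auto, expert) <= max_distance)
    return hits / len(pairs)


# The one pair quoted in the article: automatic vs. expert reconstruction.
print(edit_distance("beliu", "bali"))
```

Under this assumption the quoted pair differs by two edits (e becomes a, and the final u is dropped), so it would sit among the minority of suggestions that are more than one character away from the specialists' form.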

Reconstructing ancient languages can reveal details of our ancient history. Looking at when the word for "wheel" diverges in the family tree of European languages helps us date the human settlement of different parts of the continent, for instance.

The technique could improve machine translation of phonetically similar languages, such as Portuguese and French.

Endangered languages could also be preserved if they are phonetically related to more widely spoken tongues, says Bouchard-Côté. He is now working on an online version of the tool for linguists to use.

Journal: PNAS, 10.1073/pnas.1204678110

Source: http://feeds.newscientist.com/c/749/f/10897/s/28756ef6/l/0L0Snewscientist0N0Carticle0Cdn23160A0Ealgorithm0Elearns0Ehow0Eto0Erevive0Elost0Elanguages0Bhtml0Dcmpid0FRSS0QNSNS0Q20A120EGLOBAL0Qonline0Enews/story01.htm
