Now, a computer programme to decode lost languages

Scientists have developed a new computer programme that can automatically translate an ancient language into a known language, a discovery they hope could help them decipher some of the scripts that are yet to be understood.

Updated on: Jul 20, 2010, 21:41:06 IST

PTI | By HT Correspondent, London

Prefer HTon Google

Share via

Copy link

Scientists have developed a new computer programme that can automatically translate an ancient language into a known language, a discovery they hope could help them decipher some of the scripts that are yet to be understood.

Developed by researchers at the Massachusetts Institute of Technology, the programme has successfully decoded the over three-thousand-year-old Ugaritic language.

Ugaritic was last used around 1200 BC in western Syria and it consists of dots on clay tablets. It was first discovered in 1920 but was not deciphered until 1932.

To evaluate the efficiency of their programme, the researchers gave reference of the Hebrew language, which is similar to Ugaritic.

The system is then able to make assumptions about the way different words are formed and whether they consist of a prefix and a suffix, for example.

Through repeated analysis, the program linked letters and words to map nearly all Ugaritic symbols to their Hebrew equivalents in a matter of hours, the Daily Mail reported.

Professor Regina Barzilay, who was leading the research, said: "Traditionally, decipherment has been viewed as a sort of scholarly detective game, and computers weren't thought to be of much use.

"Our aim is to bring to bear the full power of modern machine learning and statistics to this problem."

According to the scientists, the programme looks for commonly used symbols in the two languages and gradually refines its mapping of the alphabet until it can go no further.

The Ugaritic alphabet has 30 letters, and the system correctly mapped 29 of them to their Hebrew counterparts.

Of the words that the two languages shared the programme was able to correctly identify 60 per cent of them.

However, experts have expressed scepticism about the programme and said that it is of little use because many of the undeciphered texts have no known ancestor to map against.

The programme also assumes that the computer knows where one word begins and another ends, something, which is not always the case.

Get the latest headlines from US news and global updates from Pakistan, Nepal, UK, Bangladesh, Russia and US Iran war Live, get all the latest headlines in one place on Hindustan Times.