Bible's 'divine data' could be perfect for text algorithms

IANS New York

Last Updated : Oct 24 2018 | 5:35 PM IST

Besides being a source of spiritual guidance for many people around the globe, the Bible can also help improve computer-based text translators, researchers say.

Using data from the Bible, a team from Dartmouth College in New Hampshire, US, developed an algorithm, trained on various versions of the sacred text, that can convert written works into different styles for different audiences.

The team saw in the Bible "a large, previously untapped dataset of aligned parallel text."

"The English-language Bible comes in many different written styles, making it the perfect source text to work with for style translation," said lead author Keith Carlson, a doctoral student at Dartmouth.

According to the study, published in the journal Royal Society Open Science, this is not the first parallel dataset created for style translation. But it is the first that uses the Bible.

Beyond providing infinite inspiration, each version of the Bible contains more than 31,000 verses that the researchers used to produce over 1.5 million unique pairings of source and target verses for machine-learning training sets.

The Bible is already thoroughly indexed by the consistent use of book, chapter and verse numbers. This predictable organisation across versions eliminates the risk of alignment errors that automatic methods of matching different versions of the same text could introduce, the researchers noted.
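To illustrate why shared verse numbering makes alignment straightforward, here is a minimal Python sketch, not the researchers' actual code. It assumes each version is stored as a dictionary keyed by (book, chapter, verse). The "PLAIN" version, its wording and the helper name build_pairs are invented for illustration.

```python
from itertools import permutations

# Toy data: each version maps (book, chapter, verse) -> verse text.
# "PLAIN" is a made-up version with invented wording, used only for illustration.
versions = {
    "KJV": {
        ("Genesis", 1, 1): "In the beginning God created the heaven and the earth.",
        ("John", 11, 35): "Jesus wept.",
        ("Psalms", 23, 1): "The LORD is my shepherd; I shall not want.",
    },
    "PLAIN": {
        ("Genesis", 1, 1): "First, God made the sky and the land.",
        ("John", 11, 35): "Jesus cried.",
        # Psalms 23:1 is deliberately missing to show how gaps are skipped.
    },
}


def build_pairs(versions):
    """Pair every verse in one version with the same verse in every other version.

    Alignment relies only on the shared (book, chapter, verse) key,
    so no automatic sentence-matching step is needed.
    """
    pairs = []
    # Ordered pairs of versions: each version serves as source and as target.
    for src_name, tgt_name in permutations(versions, 2):
        src, tgt = versions[src_name], versions[tgt_name]
        # Keep only references present in both versions.
        for ref in sorted(src.keys() & tgt.keys()):
            pairs.append((ref, src_name, tgt_name, src[ref], tgt[ref]))
    return pairs


pairs = build_pairs(versions)
for ref, src_name, tgt_name, src_text, tgt_text in pairs:
    print(f"{ref} {src_name} -> {tgt_name}: {src_text!r} => {tgt_text!r}")
```

With two versions sharing two verses, the sketch yields four source-target pairs, one per verse in each direction. The same keyed lookup extends to any number of versions.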

"The Bible is a 'divine' data set to work with to study this task," said Daniel Rockmore, Professor of computer science at Dartmouth.

The team used 34 stylistically distinct Bible versions ranging in linguistic complexity from the "King James Version" to the "Bible in Basic English."

The texts were fed into two algorithms -- a statistical machine translation system called "Moses" and a neural network framework commonly used in machine translation, "Seq2Seq."

Other texts that have been used in the past, ranging from Shakespeare to Wikipedia entries, provide data sets that are either much smaller or not as well suited for the task of learning style translation.

"Humans have been performing the task of organizing Bible texts for centuries, so we didn't have to put our faith into less reliable alignment algorithms," Rockmore said.

--IANS

Disclaimer: No Business Standard Journalist was involved in creation of this content
