A deep learning artificial intelligence (AI) model can predict missing words, fragments, and sentences from cuneiform tablets that are up to 4500 years old.
Clay tablets inscribed with cuneiform text in the Akkadian language are key tools for understanding the cultures that existed in Mesopotamia (roughly the area of modern-day Iraq) between 2500 BC and 100 AD. Many of these tablets, given the age, are damaged and missing key sections of the text.The computer scientist Gabriel Stanovsky from the Hebrew University of Jerusalem and colleagues from different departments collaborated to use artificial intelligence to unlock the secrets of these tables, completing the missing cuneiform text.
What is cuneiform writing
Cuneiform writing is a writing system used in ancient Mesopotamia. It is considered the oldest form of writing in the world and has been used for over 3.000 years. Cuneiform writing consists of small wedge-shaped signs that were inscribed on wet clay tablets.
Encode tables in cuneiform writing
In the past, research has already "read" old documents (letters of the renaissance: rolls of herculaneum), but never with this type of approach to the writings of the Sumerian civilization.
The team used a deep learning AI model already trained on 104 different languages. These include some Semitic languages such as Hebrew, which shares similarities with Akkadian. They then trained the algorithm by transcribing 10.000 cuneiform tablets. The AI model was able to suggest contextually accurate words and phrases to fill in the gaps. Take it as a kind of T9, but with the Mesopotamian.
How do we know that the suggestions are relevant? The researchers also tested AI on already known parts of the tablets, and completion was excellent there as well. The artificial intelligence has reconstructed the sentences in cuneiform writing with an amazing 89% accuracy, in some cases even expanding the possible interpretations of the texts.
The importance of knowing languages
“The main finding of this study,” says Stanovsky, “is that the use of other languages really helped codify Akkadian.” Indeed, without pre-training the model on those 104 different languages, the reading accuracy of the cuneiform tablets was nearly 30 percentage points lower.
It is a tool that in the next few years, I am sure, will unleash enormous potential for the deciphering of important historical documents.
References: arxiv.org/abs/2109.04513