Secrets Of 160,000 Ancient Texts Kept In The Abbey Library Of St. Gall May Soon Be Unlocked By AI

Jan Bartek – AncientPages.com – The Abbey Library of St. Gall in Switzerland is home to approximately 160,000 volumes of literary and historical manuscripts dating back to the eighth century, all of which are written by hand, on parchment, in languages rarely spoken in modern times.

To preserve these historical accounts of humanity, such texts, numbering in the millions, have been kept safely stored away in libraries and monasteries all over the world. A significant portion of these collections are available to the general public through digital imagery, but experts say there is an extraordinary amount of material that has never been read,  a treasure trove of insight into the world’s history hidden within.

Secrets Of 160,000 Ancient Texts Kept In The Abbey Library Of St. Gall May Soon Be Unlocked By AI

Abbey Library St. Gall. Credit: Stiftsbibliothek St. Gallen – Public Domain

Now, researchers at University of Notre Dame are developing an artificial neural network to read complex ancient handwriting based on human perception to improve capabilities of deep learning transcription.

“We’re dealing with historical documents written in styles that have long fallen out of fashion, going back many centuries, and in languages like Latin, which are rarely ever used anymore,” said Walter Scheirer, the Dennis O. Doughty Collegiate ᴀssociate Professor in the Department of Computer Science and Engineering at Notre Dame. “You can get beautiful pH๏τos of these materials, but what we’ve set out to do is automate transcription in a way that mimics the perception of the page through the eyes of the expert reader and provides a quick, searchable reading of the text.”

In research published in the Insтιтute of Electrical and Electronics Engineers journal Transactions on Pattern Analysis and Machine Intelligence, Scheirer outlines how his team combined traditional methods of machine learning with visual psychophysics — a method of measuring the connections between physical stimuli and mental phenomena, such as the amount of time it takes for an expert reader to recognize a specific character, gauge the quality of the handwriting or identify the use of certain abbreviations.

Scheirer’s team studied digitized Latin manuscripts that were written by scribes in the Cloister of St. Gall in the ninth century. Readers entered their manual transcriptions into a specially designed software interface. The team then measured reaction times during transcription for an understanding of which words, characters and pᴀssages were easy or difficult. Scheirer explained that including that kind of data created a network more consistent with human behavior, reduced errors and provided a more accurate, more realistic reading of the text.

“It’s a strategy not typically used in machine learning,” Scheirer said. “We’re labeling the data through these psychophysical measurements, which comes directly from psychological studies of perception — by taking behavioral measurements. We then inform the network of common difficulties in the perception of these characters and can make corrections based on those measurements.”

Using deep learning to transcribe ancient texts is something of great interest to scholars in the humanities.

“There’s a difference between just taking the pH๏τos and reading them, and having a program to provide a searchable reading,” said Hildegund Müller, ᴀssociate professor in the Department of Classics at Notre Dame. “If you consider the texts used in this study — ninth-century manuscripts — that’s an early stage of the Middle Ages. It’s a long time before the printing press. That’s a time when an enormous amount of manuscripts was produced. There is all sorts of information hidden in these manuscripts — unidentified texts that nobody has seen before.”

Secrets Of 160,000 Ancient Texts Kept In The Abbey Library Of St. Gall May Soon Be Unlocked By AI

Page 3 of the „Insтιтutio de arte grammatica“ in the manuscript St. Gallen, Stiftsbibliothek. Credit: Public Domain

Scheirer said challenges remain. His team is working on improving accuracy of transcriptions, especially in the case of damaged or incomplete documents, as well as how to account for illustrations or other aspects of a page that could be confusing to the network.

However, the team was able to adjust the program to transcribe Ethiopian texts, adapting it to a language with a completely different set of characters — a first step toward developing a program with the capability to transcribe and translate information for users.

“In the literary field, it could be really helpful. Every good literary work is surrounded by a vast amount of historical documents, but where it’s really going to be useful is in historical archival research,” said Müller. “There is a great need to advance the digital humanities. When you talk about the Middle Ages and early modern times, if you want to understand the details and consequences of historical events, you have to look through the written material, and these texts are the only thing we have.

See also: More Archaeology News

The problem may be even greater outside the Western world. Think of languages that are disappearing in cultures that are under threat. We must first of all preserve these works, make them accessible and, at some point, incorporate translations to make them a part of cultural processes that are still underway — and we are racing against time.”

Written by Jan Bartek – AncientPages.com Staff Writer

Related Posts

Andalusia Was First Inhabited By Neolithic People From The Southern Part Of The Iberian Peninsula 6,200 Years Ago

Conny Waters – AncientPages.com – The island of San Fernando, Cadiz in Andalusia, was home to the first Neolithic farmers and shepherds who decided to permanently settle there around 6,200 years ago. They practised shellfish collection and consumption all year round, with a preference for winter. Location of Campo de Hockey site in southern Iberian […]

Unknown Bronze Age Settlement Discovered Accidentally In Heimberg, Switzerland

Jan Bartek – AncientPages.com – Sometimes, when archaeologists look for one thing, they find something entirely different. This is exactly what happened in Switzerland when researchers were excavating, hoping to find an ancient Roman brick workshop, but they unearthed a previously unknown Bronze Age settlement instead. The excavation in Heimberg, on the right edge of […]

Unexplained Mystery Of The Dangerous Invisible Enemy In A French Town

Ellen Lloyd – AncientPages.com – It was an ordinary day in a small, sleepy town in France. There were no indications anything strange was about to happen. Yet, an inexplicable and extraordinary event left the unsuspecting residents completely bewildered and unsure of what was unfolding. The situation that unfolded was indeed unusual, if not bizarre. […]

Rare 2,800-Year-Old ᴀssyrian Scarab Amulet Found In Lower Galilee

Jan Bartek – AncientPages.com – Erez Avrahamov, a 45-year-old inhabitant of Peduel, made an incredible discovery while hiking in the Tabor Stream Nature Reserve located in Lower Galilee. He stumbled upon an ancient seal shaped like a scarab that dates back to the First Temple period. Credit: Israel Antiquities Authority This ancient artifact is as […]

Dinas Powys: Late ‘Antique Hillfort Phenomenon’ In Post-Roman Western Britain

Conny Waters – AncientPages.com – Dinas Powys, Glamorgan, located about 9km southwest of Cardiff, is a small inland fort of approximately 0.35ha. The hillfort was first excavated by a team of archaeologists led by Leslie Alcock from 1954 through to 1958. The site is often referenced as a prime example of elite settlements in post-Roman […]

Puzzling Vasconic Inscription On Ancient Irulegi Hand Resembles Basque Language

Jan Bartek – AncientPages.com – A few years ago, archaeologists excavating an Iron Age site known as Irulegi in northern Spain discovered a flat bronze artifact shaped like a human hand. After careful cleaning, they found it bore inscriptions of words from a Vasconic language. This language family includes Basque and several other languages that […]