AI Helps Researchers Unlock Mysteries of Vatican Archives

Researchers in Italy recently published the findings of a project called Codice Ratio, or “The Code System,” which demonstrates how deep learning is helping the Vatican transcribe a portion of their archives.
The Vatican Secret Archives are made up of some fifty-three miles of shelving, thirty-five thousand volumes of the catalog, and twelve centuries worth of documents. The records contain private letters and other records of past popes, including letters and documents from Michelangelo, Henry VIII requesting a marriage annulment, and even letters from Abraham Lincoln to the Pope.
Like many historic archives from the around the world, the Vatican has their own photographic and conservation team, however, the records are so large, that the idea of transcribing them by hand seems highly impractical.
“Our approach requires minimal training efforts, making the transcription process more scalable as the production of training sets requires a few pages and can be easily crowdsourced,” the researchers said. “Our system has been able to produce good transcriptions that can be used by paleographers as a solid basis to speedup the transcription process at a large scale.”

*Sample text from the manuscript “Liber* septimus regestorum domini Honorii pope III” (Vatican Registers).

For training and inference, the researchers used an NVIDIA GeForce GTX GPU, CUDA and the cuDNN-accelerated TensorFlow deep learning framework. The team trained their neural network on images and documents provided to them by the Vatican, to recognize entire words rather than letters.
The researchers were very successful, generating the exact transcription for 65% of the word images of their dataset.
The team says they will continue to improve their training data set, obtaining more documents from the archives to improve their system.
Read more >

AI Helps Researchers Unlock Mysteries of Vatican Archives

Related resources

Tags

About the Authors

AI Helps Researchers Unlock Mysteries of Vatican Archives

Related resources

Tags

About the Authors

Comments

Related posts

Deciphering Ancient Texts with AI

U.S. Library of Congress Processes over 16 Million Historic Newspaper Pages Using AI

Inception Spotlight: DeepZen Uses AI to Generate Speech for Audiobooks

Using AI to Encrypt Messages in Plain Sight

New Translator Provides More Human-Like Translations

Related posts

Just Released: NVIDIA Modulus v24.04

New Video Series: OpenUSD for Developers

Generative AI for Digital Humans and New AI-powered NVIDIA RTX Lighting

NVIDIA Speech and Translation AI Models Set Records for Speed and Accuracy

Breaking Barriers in Healthcare with New Models for Generative AI and Cellular Imaging