test
Search publications, data, projects and authors

Free full text available

Article

English

ID: <

10670/1.5omfgl

>

Where these data come from
A Medieval Epigraphic Corpus and its Retro-Developments (CIFM-CBMA). The Exploratory Research of the COSME2 Consortium

Abstract

International audience The digital “Burgundian Epigraphic Corpus” is the result of the collaboration between two teams, the CIFM (Corpus of Inscriptions of Medieval France) and the CBMA (Corpus of Medieval Burgundian Texts), as part of the Cosme2 (Consortium Sources Médiévales - linked to TGIR Huma-Num from CNRS - France), dedicated to the digital approaches of the historical corpora. This article stress how a complex set of documents mixing Latin, Greek, and Old French texts, accompanied by rich metadata, has been processed in order to allow new surveys by humanists. It shows how the corpus is constantly reinvested and how its exploitation, thanks to artificial intelligence, generates new data and metadata that can be reinjected into the corpus and in turn operated creating a kind of virtuous circle. Three retro-developments are briefly discussed here: 1. Semantic Web, Connectivity and Named Entities; 2. GIS and Automated Extraction of New Metadata; 3. Lemmatization and Automatic Language Detection.

Your Feedback

Please give us your feedback and help us make GoTriple better.
Fill in our satisfaction questionnaire and tell us what you like about GoTriple!