• Artificial Intelligence to the Rescue of the Spanish Golden Age: Automatic Transcription and Modernization of One Thousand Three Hundred Theatrical Prints and Manuscripts

    Author(s):
    Álvaro Cuéllar (see profile)
    Date:
    2023
    Group(s):
    Artificial Intelligence, Digital Humanists, Global DH, Spanish Golden Age Literature
    Subject(s):
    Artificial intelligence, Spanish language, Spanish literature, Neural networks (Computer science), Manuscripts, Prints, Vega, Lope de, 1562-1635
    Item Type:
    Article
    Tag(s):
    artificial intelligence, htr, lope de vega, manuscript, Neural Networks, Siglo de Oro, Stylometry, transcription
    Permanent URL:
    https://doi.org/10.17613/60kr-7q23
    Abstract:
    A high percentage of theatrical prints and manuscripts from the aurisecular period have never been transcribed in an analogical or, of course, digital format. It is therefore impossible to use these documents to carry out searches of our interest or for the valuable computer analyses (stylometry, topic modelling, sentiment analysis, etc.) that have been developed in recent years. Thanks to Artificial Intelligence (Transkribus) and HTR (Handwritten Text Recognition) techniques, I have trained three models, already public for the research community, capable of transcribing and orthographically modernizing these documents automatically with a high degree of precision: around 97% of success in prints and 91% in manuscripts. Through these models I have been able to process some 1,300 theatrical plays contained in prints and manuscripts from numerous libraries, archives, and other digitized sources. The resulting transcripts are now part of the ETSO project, of the TEXORO search engine and, in addition to being an advanced starting point for careful editing of the texts, they themselves have sufficient quality to be subjected to stylometric analysis, which is yielding authorship attributions of interest.
    Notes:
    This is a translation of the article: Cuéllar, Álvaro. (2023). «La Inteligencia Artificial al rescate del Siglo de Oro. Transcripción y modernización automática de mil trescientos impresos y manuscritos teatrales», Hipogrifo. Revista de literatura y cultura del Siglo de Oro, vol. 11, núm. 1, pp. 101-115, https://doi.org/10.13035/H.2023.11.01.08.
    Metadata:
    Published as:
    Journal article    
    Status:
    Published
    Last Updated:
    5 months ago
    License:
    Attribution

    Downloads

    Item Name: pdf cuellar_inteligenciaartificialenglish.pdf
      Download View in browser
    Activity: Downloads: 29