STSM: Consolidating a software infrastructure and workflow for a multilingual corpus and terminological resource of Neology
Name: David Lindemann
Start : 12/01/2025
End: 18/01/2025
David Lindemann, leader of task 2 in ENEOLI working group 1, used the opportunity granted by this
STSM at FCSH, Universidade NOVA de Lisboa, to work closely together with ENEOLI members at the
host institution, Rute Costa (action vice-chair), Ana Salgado (WG1 chair), and Margarida Ramos (WG1
Task 3 leader), on different topics: The consolidation of the workflow implemented for the WG1 tasks and beyond, based on a Wikibase instance, other free software modules and own scripts, on the one hand, and the integration of task 1.3 in that digital environment, on the other. In addition, the grantee gave a conference at FCSH, which was streamed online and recorded, entitled “FAIR data on Wiki-Platforms,” and all hosts attended a workshop organized by the CODA group at Universidade do Porto and Wikimedia Portugal, about lexical datasets in Wikidata and Wikibase, in order to explore possibilities for defining collaborative lexicographical workflows regarding Mirandese, a minority language spoken in the northeast of Portugal.
The outcomes of this STSM include the inclusion of polylexical candidate terms in the NeoVoc dataset
and the implementation of a NLP pipeline for further ENEOLI working languages, as successfully tested
before for French and German. This now allows to measure the use of Neology meta-terminology
(NeoVoc) in scientific articles of the field (NeoCorpus), in 12 languages. In addition, NeoCorpus full text
body contents have been made available to the ENEOLI community in plain TXT format, so that they can
be used for the extraction of additional term candidates. On the other hand, NeoVoc concepts can now be
annotated with domain descriptors and with fine-grained lexical innovation process descriptors, stemming from the scheme proposed by Sableyrolles (task 1.3). The collaboration with the CODA group and Wikimedia Portugal initiated during this STSM will certainly continue and contribute to enriching and disseminating the ENEOLI action.