STSM: Extraction and modeling of bilingual French–Chinese (FR–ZH) prototypes of phraseological neologisms (2015–2025)

Name: Lian Chen

Start: 19/01/2026
End: 30/01/2026

In January 2026, I was invited by Dr.-Ing. Besim Kabashi as a Visiting Scholar in Computational Corpus Linguistics (Prof. Stephanie Evert) at the Centre for Research on Lexicography, Valency and Collocation (CoCoLex; Director: Prof. Stephanie Evert), Friedrich-Alexander University of Erlangen–Nuremberg (FAU), Germany. This Short-Term Scientific Mission (STSM), funded by the COST Action ENEOLI (European Network on Lexical Innovation), was carried out within the framework of the PhrasNeoLex project.

The objective of the mission was the extraction and ontological modeling of bilingual French–Chinese prototypes of phraseological neologisms attested between 2015 and 2025. The research focused on three main axes: (1) formal and semantic characterization of emerging phraseological units; (2) syntactic patterning and formal representation; and (3) conceptual alignment for ontology-based modeling within the ENEOLI framework.

The validated outputs are intended for integration into the ENEOLI Wikibase, contributing to sustainable and interoperable lexical innovation resources. I am also interested in applying the same methodology to Cantonese neologisms at a later stage; Cantonese is an important dialect in China and is written in traditional Chinese characters.

During the stay, consolidated bilingual reference corpora were constructed, and scalable computational pipelines were implemented for differential lexicographic detection, idiomatic variant identification, and embedding-based semantic validation. The methodological exchange with IZ CoCoLex significantly strengthened the integration of corpus linguistics, lexicography, and NLP approaches.
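
To make the idea of differential lexicographic detection and embedding-based validation more concrete, the following minimal Python sketch may help. It is not the pipeline implemented during the mission: the function names, the toy French corpus, the reference lexicon, and the frequency threshold are all hypothetical and chosen only for illustration. The sketch keeps multiword sequences that recur in a corpus but are absent from a reference lexicon, and shows the cosine measure that an embedding-based check could rely on.

```python
from collections import Counter
from math import sqrt


def ngrams(tokens, n):
    """Return all contiguous n-grams of a token list as tuples."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def differential_candidates(sentences, reference_lexicon, n_values=(2,), min_freq=2):
    """Keep frequent n-grams that are NOT recorded in the reference lexicon.

    Assumption: `sentences` is a list of pre-tokenized sentences and
    `reference_lexicon` is a set of space-joined multiword entries.
    """
    counts = Counter()
    for tokens in sentences:
        for n in n_values:
            counts.update(ngrams(tokens, n))
    return {
        " ".join(gram): freq
        for gram, freq in counts.items()
        if freq >= min_freq and " ".join(gram) not in reference_lexicon
    }


def cosine(u, v):
    """Cosine similarity between two equal-length vectors (0.0 if either is null)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = sqrt(sum(a * a for a in u))
    norm_v = sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


# Toy data (hypothetical, for illustration only).
toy_corpus = [
    ["distanciation", "sociale", "obligatoire"],
    ["nouvelle", "distanciation", "sociale"],
    ["sécurité", "sociale", "française"],
    ["la", "sécurité", "sociale"],
]
toy_lexicon = {"sécurité sociale"}  # already recorded, hence excluded

candidates = differential_candidates(toy_corpus, toy_lexicon)
print(candidates)
```

In a realistic setting, the candidates would still need filtering (for example by part-of-speech pattern) and manual validation, and the vectors passed to `cosine` would come from whatever embedding model is chosen; neither step is shown here.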

Beyond the technical objectives, the mission fostered close scientific exchange with the CoCoLex team. I participated in methodological discussions and laboratory visits, gaining insight into ongoing projects at CoCoLex. These exchanges enabled the alignment of phraseological neology detection methods with construction-grammar-based modeling approaches developed at FAU.

The STSM reinforced long-term collaboration between FAU and the ENEOLI network. Ongoing joint initiatives include contributions to an edited volume on Multiword Expressions and Neology (Language Science Press), the organization of a EURALEX 2026 workshop, the special issue "Lexicography in Asia and Generative AI" (Lexicography: Journal of Asialex), and future activities aimed at integrating computational phraseology into broader European research infrastructures.