LChange'23 is the fourth workshop for computational approaches to historical language change with the focus on digital text corpora. Come join us for this exciting adventure!
The workshop builds upon its first iteration in 2019, and the subsequent events (2021, 2022). It will be colocated with EMNLP 2023 in Singapore, as a hybrid event. The workshop will take place on Wednesday 6 December 2023.
|KEYNOTE 1, Moderator: Andrey Kutuzov|
|9.30-10.30||Neural language models for word usage representation and analysis||Mario Giulianelli|
|SESSION 1 Chair: Pierluigi Cassotti|
|11.00-11.20||EvoSem: A database of polysemous cognate sets||Mathieu Dehouck, Alex François, Siva Kalyan, Martial Pastor, David Kletz|
|11.20-11.40||Semantic Shifts in Mental Health-Related Concepts||Naomi Baes, Nick Haslam, Ekaterina Vylomova|
|11.40-12.00||Scent and Sensibility: Perception Shifts in the Olfactory Domain||Teresa Paccosi, Stefano Menini, Elisa Leonardelli, Ilaria Barzon, Sara Tonelli|
|KEYNOTE 2, Moderator: Syrielle Montariol|
|13.30-14.30||What does semantic change have to do with Hello Kitty? Referring as the source of change||Gemma Boleda|
|SESSION 2 Chair: Bill Noble|
|14.30-14.50||Political dogwhistles and community divergence in semantic change||Max Boholm, Asad B. Sayeed|
|14.50-15.10||Automating Sound Change Prediction for Phylogenetic Inference: A Tukanoan Case Study||Kalvin Chang, Nathaniel Romney Robinson, Anna Cai, Ting Chen, Annie Zhang, David R Mortensen|
|15.10-15.30||Domain-Adapting BERT for Attributing Manuscript, Century and Region in Pre-Modern Slavic Texts||Piroska Lendvai, Anna Jouravel, Elena Renje|
|15.30-16.30||POSTER SESSION||See list below|
|SESSION 3 Chair: Andrey Kutuzov|
|16.30-16.50||Representing and Computing Uncertainty in Phonological Reconstruction||Johann-Mattis List, Nathan Hill, Robert Forkel, Frederic Blum|
|16.50-17.10||Anchors in Embedding Space: A Simple Concept Tracking Approach to Support Conceptual History Research||Jetske Adams, Martha Larson, Jaap Verheul, Michael Boyden|
|17.10-17.30||GHisBERT – Training BERT from scratch for lexical semantic investigations across historical German language stages||Christin Beck, Marisa Köllner|
Gemma Boleda (Universitat Pompeu Fabra)
Title of talk: What does semantic change have to do with Hello Kitty? Referring as the source of change
It has long been noted that lexical semantic change is rooted in specific utterances, specific reference acts: for instance, in Old English “deor” (“deer”) meant “wild animal”, and it acquired its current meaning “probably via hunting, deer being the favorite animal of the chase”. However, traditional historical linguistics lacked the tools to explore the process from reference to semantic change on a large scale. Current methods in computational linguistics, as well as the increasing availability of large-scale linguistic resources, afford precisely that. In this talk, I will present work that links reference to change by examining different phenomena (production of referring expressions, regular polysemy) at different timescales (language development, synchronic use, language evolution), using quantitative and computational methods.
Mario Giulianelli (University of Amsterdam)
Title of talk: Neural language models for word usage representation and analysis
Neural language models are powerful tools for language scientists interested in studying language users’ interpretation of words and the evolution of word usage over time. Drawing from a series of recent findings, I will argue that contemporary language models (LMs) can infer contextually appropriate interpretations for diverse usages of the same word, akin to how human readers comprehend lexical meaning. Through the integration of artificial attention mechanisms that operate across extended contextual sequences and an extensive phase of cross-situational learning using vast amounts of texts, current LMs serve as highly generalisable engines for lexical interpretation; they offer distinct access to the first-order and second-order co-occurrence statistics of word usage; and they allow for quantitative and interpretable comparisons, synchronically and diachronically. I will discuss methods to engage with LMs for obtaining word representations, including the collection and analysis of neural representations generated during the processing of word usage examples, and the direct input of natural language instructions to induce human-readable word definitions. These approaches hold significant relevance for examining shifts and variations in word usage across the temporal and spatial dimensions.
Deadlines are to be understood as 23:59 AoE.
Thanks to iguanodon.ai, we are sponsoring the registration fees for the EMNLP conference, including the yearly ACL membership fee, for several students and early-career researchers.
We therefore would like to invite interested candidates to apply by email, by October 15th 23:59 CEST, to email@example.com with the following information:
This workshop explores state-of-the-art computational methodologies, theories and digital text resources on exploring the time-varying nature of human language.
The aim of this workshop is three-fold. First, we want to provide pioneering researchers who work on computational methods, evaluation, and large-scale modelling of language change an outlet for disseminating cutting-edge research on topics concerning language change. We want to utilize this workshop as a platform for sharing state-of-the-art research progress in this fundamental domain of natural language research.
Second, in doing so we want to bring together domain experts across disciplines by connecting researchers in historical linguistics with those that develop and test computational methods for detecting semantic change and laws of semantic change; and those that need knowledge (of the occurrence and shape) of language change, for example, in digital humanities and computational social sciences where text mining is applied to diachronic corpora subject to e.g., lexical semantic change.
Third, the detection and modelling of language change using diachronic text and text mining raise fundamental theoretical and methodological challenges for future research.
Besides these goals, this workshop will also support discussion on the evaluation of computational methodologies for uncovering language change. SemEval2020 Task1 on unsupervised detection of lexical semantic change attracted three figure submission numbers and a total of 21 submitted system papers. Since then, three more tasks have been completed in Italian, Russian, and Spanish.
We invite original research papers from a wide range of topics, including but not limited to:
URL for submissions: https://openreview.net/group?id=EMNLP/2023/Workshop/LChange.
Long and short papers may consist of up to eight (8) and four (4) pages of content, respectively, plus unlimited references; final versions will be given one additional page of content so that reviewers' comments can be taken into account.
LChange’23 also welcomes papers focusing on releasing a dataset or a model; these papers fall into the short paper category. To encourage model and dataset sharing at the reviewing phase, model and dataset papers do not need to be anonymous.
Accepted papers will be presented orally or as posters and included in the workshop proceedings. Submissions are open to all, and are to be submitted anonymously. All papers will be refereed through a double-blind peer review process by at least three reviewers with final acceptance decisions made by the workshop organizers.
If you have published in the field previously, and are inrerested in helping out in the PC to review papers, send us an email.