
Help shape the future of Lexical Semantic Change Detection! Join the SemEval-2027 shared task on tracking word meanings across multiple time periods.
Lexical Semantic Change Detection (LSCD) studies how word meanings change over time. Existing benchmarks have mostly focused on comparing two time periods, but semantic change is often gradual and unfolds across decades. This task extends the evaluation setting from two time periods to ten, enabling the modeling of individual word senses and their dynamics over time.
The task is based on a novel, multilingual benchmark covering Swedish, English, Italian, Spanish, Dutch, and Russian, with manually annotated data across ten time periods and approximately 126,000 human annotations.
Given dated usages of a target word, participants must identify which senses the word has in which time periods in an unsupervised manner.
Participants are given a specific sense description and must identify which usages across time instantiate that sense. This subtask reflects common research scenarios in the humanities and social sciences, where researchers are interested in tracing one particular meaning.
The benchmark contains dated usages from historical corpora spanning approximately 1880–2023. Each language is divided into ten time periods of roughly equal length.
For each target word, usages are sampled from each time period. Each usage is manually annotated by at least three annotators.
The data and the evaluation will be hosted in Codalab. The link to Codalab will be provided soon.
Organisers: Nina Tahmasebi, Pierluigi Cassotti Felix Morger, Lucia Siciliani, Eduardo Calò, Pablo Mosteiro, Stefano De Pascale, and Mariia Fedorova