Mar

2026

How Neural Networks Detect and Interpret Wordplay: New Insights from HSE Researchers

An international team including researchers from the HSE Faculty of Computer Science has presented KoWit-24, an annotated dataset of 2,700 Russian-language Kommersant news headlines containing wordplay. The dataset makes it possible to assess how artificial intelligence detects and interprets wordplay. Experiments with five large language models show that even advanced systems still make mistakes, and that interpreting wordplay is more challenging for them than detecting it. The results were presented at the RANLP conference; the paper is available on arXiv.org, and the dataset and the code for reproducing the experiments are available on GitHub.

Wordplay refers to the deliberate use of language that violates linguistic norms in order to attract attention, entertain, or amuse the reader. It is common in Russian news headlines and can take various forms. For example, the headline ‘Osobo bumazhnye persony’ plays on the phrase ‘Osobo vazhnye persony’ (Russian for ‘very important persons’). The word vazhnye (‘important’) is replaced with bumazhnye (‘paper-related’), which rhymes with the original and shifts the meaning toward the topic of paper production. Another example is ‘Kod naklikal,’ the headline of an article about open-source code. It closely resembles ‘kot naplakal,’ an idiom meaning ‘very little,’ thereby creating a humorous ambiguity.

For human readers, such wordplay in headlines is immediately apparent and requires no explanation. However, large language models such as ChatGPT or GigaChat Max are often at a loss: they struggle to detect the wordplay and struggle even more to explain the joke. One reason for this difficulty is the limited range of humour datasets on which LLMs are trained. In most cases, humour in these datasets is represented by canned internet jokes explicitly labelled as ‘jokes,’ which is insufficient for the models to learn why something is funny. In addition, such datasets contain almost no annotation: there are no machine- or human-readable layers of description indicating whether wordplay is present, what type of technique is used, what the headline refers to, and so on.

Researchers from the HSE Faculty of Computer Science, in collaboration with colleagues from IT:U—Interdisciplinary Transformation University Austria—and independent researchers, have created KoWit-24, a dataset dedicated to wordplay. It comprises 2,700 headlines from the Russian business daily Kommersant published between January 2021 and December 2023, along with contextual information: each headline is accompanied by a short description of the news story (the lead) and a summary. For each instance of wordplay, the authors manually annotated the type of technique, identified the anchors—the words that trigger the wordplay—and, where possible, linked the original expressions to relevant Wikipedia articles.
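One way to picture a single record in such a dataset is sketched below in Python. The class and field names (`AnnotatedHeadline`, `WordplayAnnotation`, `technique`, `anchors`, `original`, `wikipedia_url`) are hypothetical and are not the actual KoWit-24 schema. The example values are illustrative, based on a headline discussed in this article, and the lead and summary strings are placeholders.

```python
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class WordplayAnnotation:
    """One annotated instance of wordplay (hypothetical field names)."""
    technique: str                       # type of wordplay technique
    anchors: list[str]                   # words that trigger the wordplay
    original: Optional[str] = None       # original expression, if there is one
    wikipedia_url: Optional[str] = None  # link to a relevant article, where possible


@dataclass
class AnnotatedHeadline:
    """A headline with its context and zero or more wordplay annotations."""
    headline: str
    lead: str      # short description of the news story
    summary: str
    annotations: list[WordplayAnnotation] = field(default_factory=list)

    @property
    def has_wordplay(self) -> bool:
        return bool(self.annotations)


# Illustrative values only; not copied from the dataset files.
example = AnnotatedHeadline(
    headline="Osobo bumazhnye persony",
    lead="<news lead>",
    summary="<summary>",
    annotations=[
        WordplayAnnotation(
            technique="modified phrase",   # assumed label
            anchors=["bumazhnye"],         # assumed anchor
            original="Osobo vazhnye persony",
        )
    ],
)
```

Keeping the annotations in a list lets a headline without wordplay be represented by the same record type with an empty list.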

The authors adopted linguist Alan Scott Partington’s definition of wordplay, according to which wordplay occurs when the same expression can be interpreted in at least two ways and this effect is intentional. Wordplay can arise in several ways. One case involves ambiguity inherent in a word or its sound. For example, in the headline ‘Volgu ne mogut zastavit’ tech’ bystree,’ the word Volgu (Volga) refers both to the river and to a federal highway with the same name. Another case involves a slight modification of a well-known phrase or title, in which the author alters the wording while relying on the reader to recognise the original and complete the joke. For instance, ‘Missiya sokratima’ alludes to ‘Missiya nevypolnima,’ the Russian title of the film Mission: Impossible, while the headline itself suggests that a diplomatic mission can be downsized.

The researchers also distinguished ‘nonce words’ (coined for a single occasion) and oxymorons, which combine two contradictory meanings. This approach allowed them not only to collect and describe examples but also to compare the performance of different language models.

After annotation, the authors used the dataset to evaluate five LLMs: GPT-4o, YandexGPT-4, GigaChat Lite, GigaChat Max, and Mistral NeMo. Each model received a headline with the corresponding news lead and was asked to perform two tasks: first, to determine whether the headline contained wordplay, and second, to interpret it by identifying the original phrase or reference. The researchers compared two types of prompts: a simple prompt asking whether the headline contained wordplay, and an extended prompt providing a definition along with examples of different wordplay types. The extended prompt improved detection performance for three of the five models, and GPT-4o performed best in both detection and interpretation. For all models, interpreting the source of the joke proved significantly more difficult than simply detecting the presence of wordplay.
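The detection part of such an experiment can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' code: the prompt wording is invented, the extended template here contains only a definition (the study's extended prompt also included examples), `ask_model` is a hypothetical stand-in for a call to any LLM, and precision, recall and F1 are assumed as metrics because they are a common choice for a yes/no task.

```python
from typing import Callable, Iterable

# Hypothetical prompt templates; the wording used in the study is not reproduced here.
SIMPLE_PROMPT = (
    "Headline: {headline}\nLead: {lead}\n"
    "Does the headline contain wordplay? Answer yes or no."
)
EXTENDED_PROMPT = (
    "Wordplay occurs when an expression can be interpreted in at least two ways "
    "and this effect is intentional.\n" + SIMPLE_PROMPT
)


def detect(ask_model: Callable[[str], str], template: str,
           headline: str, lead: str) -> bool:
    """Send one prompt and map the model's reply to a yes/no decision."""
    reply = ask_model(template.format(headline=headline, lead=lead))
    return reply.strip().lower().startswith("yes")


def precision_recall_f1(gold: Iterable[bool],
                        predicted: Iterable[bool]) -> tuple[float, float, float]:
    """Score binary wordplay detection against the manual annotation."""
    pairs = list(zip(gold, predicted))
    tp = sum(g and p for g, p in pairs)          # wordplay correctly detected
    fp = sum((not g) and p for g, p in pairs)    # wordplay reported where there is none
    fn = sum(g and (not p) for g, p in pairs)    # wordplay missed
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if precision + recall else 0.0)
    return precision, recall, f1
```

Running `detect` once with each template over the same headlines and scoring both runs with `precision_recall_f1` gives a like-for-like comparison of the two prompt types. Interpretation is harder to score automatically, since it requires judging whether the model named the right original phrase or reference.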

‘KoWit-24 addresses two key limitations of earlier datasets: it provides context for each headline and includes multi-level annotation. This transforms a collection of examples into a full-fledged “testbed” for AI. It now allows for an objective comparison of models—whether a model can detect wordplay, identify the anchor, and correctly recall the original phrase or reference. Such verifiable metrics not only allow for a more accurate evaluation of current systems but also support their intentional improvement through selection of prompts, training examples, and fact-checking strategies. In the future, we plan to investigate whether this dataset can be used to enhance humour generation,’ says Pavel Braslavski, Associate Professor at the HSE Faculty of Computer Science and co-author of the paper.

In addition, the dataset establishes a common and transparent standard for evaluation, as researchers use the same data and experimental scripts. This reduces variability in the results and helps develop models that better understand natural language, rather than merely following the logical structure of the text.

Date

30 March

Topics

HSE Development Programme until 2030

Keywords

research projects, frontiers of science, HSE as a Technological University, neural networks, Priority 2030, wordplay
