LLMs Special Issue

Comparing children and large language models in word sense disambiguation: Insights and challenges

Authors
  • Francesco Cabiddu (University College London, UK)
  • Mitja Nikolaus (CerCo, CNRS, France)
  • Abdellah Fourtassi (Aix-Marseille University, France)

Abstract

Understanding how children process ambiguous words is a challenge because sense disambiguation is a complex task that depends on both bottom-up and top-down cues. Here, we seek insight into this phenomenon by investigating how such a competence might arise in large distributional learners (Transformers), which have been claimed to acquire sense representations from language input in a largely unsupervised fashion. We investigated how sense disambiguation might be achieved using model representations derived from naturalistic child-directed speech. We tested a large pool of Transformer models, varying in the size and nature of their pretraining input as well as in the size of their parameter space. Testing these models across three behavioral experiments from the developmental literature, we found that they capture some essential properties of child sense disambiguation, although most still struggle with the more challenging tasks involving contrastive cues. We discuss implications both for theories of word learning and for using Transformers to capture child language processing.
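
To make the general probing approach concrete, below is a minimal sketch, not the paper's actual protocol, of how one might test whether a Transformer's contextual representations separate two senses of an ambiguous word. It compares the embedding of "bat" in two disambiguating contexts against embeddings of unambiguous sense anchors. The model name (bert-base-uncased), the probe sentences, the anchor words, and the mean-pooling choice are all illustrative assumptions; the example uses the Hugging Face transformers library.

```python
# Illustrative sketch only: probing sense separation with contextual
# embeddings. Model, sentences, and anchors are assumptions, not the
# paper's materials or method.
import torch
from transformers import AutoModel, AutoTokenizer

MODEL_NAME = "bert-base-uncased"  # stand-in; the paper tests many models
tokenizer = AutoTokenizer.from_pretrained(MODEL_NAME)
model = AutoModel.from_pretrained(MODEL_NAME)
model.eval()

def word_embedding(sentence: str, word: str) -> torch.Tensor:
    """Mean-pool hidden states over the subword tokens covering `word`."""
    inputs = tokenizer(sentence, return_tensors="pt")
    with torch.no_grad():
        hidden = model(**inputs).last_hidden_state[0]  # (seq_len, dim)
    # Heuristic: locate the word's subword ids inside the sentence ids.
    word_ids = tokenizer(word, add_special_tokens=False)["input_ids"]
    ids = inputs["input_ids"][0].tolist()
    for i in range(len(ids) - len(word_ids) + 1):
        if ids[i : i + len(word_ids)] == word_ids:
            return hidden[i : i + len(word_ids)].mean(dim=0)
    raise ValueError(f"{word!r} not found in tokenized sentence")

# Two contexts that cue different senses of "bat".
animal_ctx = word_embedding("The bat flew out of the cave at night.", "bat")
sports_ctx = word_embedding("She swung the bat and hit the ball.", "bat")

# Sense anchors: unambiguous exemplars in neutral frames.
animal_anchor = word_embedding("The mouse is a small animal.", "animal")
sports_anchor = word_embedding("He plays baseball every weekend.", "baseball")

cos = torch.nn.functional.cosine_similarity
print("animal ctx vs animal anchor:", cos(animal_ctx, animal_anchor, dim=0).item())
print("animal ctx vs sports anchor:", cos(animal_ctx, sports_anchor, dim=0).item())
print("sports ctx vs animal anchor:", cos(sports_ctx, animal_anchor, dim=0).item())
print("sports ctx vs sports anchor:", cos(sports_ctx, sports_anchor, dim=0).item())
```

If the model's contextual representations encode sense distinctions, each context embedding should be closer to its matching anchor than to the mismatching one; a model trained only on child-directed speech could in principle be probed the same way.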

Keywords: Child Word Sense Disambiguation, Transformers, Usage-Based Learning


Published on
2024-08-28

Peer Reviewed