REAL

Evaluating an ensemble model of linguistic categorization on three variable morphological patterns in Hungarian

Rácz, Péter and Rebrus, Péter and Tóth, Szilárd (2024) Evaluating an ensemble model of linguistic categorization on three variable morphological patterns in Hungarian. PROCEEDINGS OF THE ANNUAL CONFERENCE OF THE COGNITIVE SCIENCE SOCIETY, 46. pp. 4706-4711. ISSN 1069-7977

[img]
Preview
Text
eScholarshipUCitem914866s3.pdf - Published Version
Available under License Creative Commons Attribution.

Download (299kB) | Preview

Abstract

We implemented two instance-based learners, the K-Nearest Neighbors model and the Generalized Context Model, and a rule-based learner, the Minimal Generalization Learner, adapted for linguistic data. We fit these on three distinct, variable patterns of word variation in Hungarian: paradigmatic leveling and vowel deletion in verbs and vowel harmony in nouns. We tested their predictions using a Wug task. The best learners were combined into an ensemble model for each pattern. All three learners explain variation in the test data. The best ensemble models of inflectional variation in the data combine instance-based and rule-based learners. This result suggests that the best psychologically plausible learning model of morphological variation combines instance-based and rule-based approaches and might vary from case to case.

Item Type: Article
Uncontrolled Keywords: morphology; natural language processing; corpus studies; computational modelling
Subjects: P Language and Literature / nyelvészet és irodalom > P0 Philology. Linguistics / filológia, nyelvészet
Q Science / természettudomány > QA Mathematics / matematika > QA75 Electronic computers. Computer science / számítástechnika, számítógéptudomány
SWORD Depositor: MTMT SWORD
Depositing User: MTMT SWORD
Date Deposited: 19 Feb 2025 14:20
Last Modified: 19 Feb 2025 14:20
URI: https://real.mtak.hu/id/eprint/215759

Actions (login required)

Edit Item Edit Item