REAL

Leveraging Open Large Language Models for Multilingual Policy Topic Classification: The Babel Machine Approach

Sebők, Miklós and Máté, Ákos and Ring, Orsolya and Kovács, Viktor and Lehoczki, Richárd (2024) Leveraging Open Large Language Models for Multilingual Policy Topic Classification: The Babel Machine Approach. SOCIAL SCIENCE COMPUTER REVIEW. pp. 1-23. ISSN 0894-4393 (print); 1552-8286 (online)

[img]
Preview
Text (A tanulmány teljes szövege)
sebok-et-al-2024-leveraging-open-large-language-models-for-multilingual-policy-topic-classification-the-babel-machine.pdf
Available under License Creative Commons Attribution.

Download (1MB) | Preview
[img]
Preview
Text (Kiegészítő táblázatok)
sj-pdf-1-ssc-10.1177_08944393241259434.pdf
Available under License Creative Commons Attribution.

Download (162kB) | Preview

Abstract

The article presents an open-source and freely available natural language processing system for comparative policy studies. The CAP Babel Machine allows for the automated classification of input files based on the 21 major policy topics of the codebook of the Comparative Agendas Project (CAP). By using multilingual XLM-RoBERTa large language models, the pipeline can produce state-of-the-art level outputs for selected pairs of languages and domains (such as media or parliamentary speech). For 24 cases out of 41, the weighted macro F1 of our language-domain models surpassed 0.75 (and, for 6 language-domain pairs, 0.90). Besides macro F1, for most major topic categories, the distribution of micro F1 scores is also centered around 0.75. These results show that the CAP Babel machine is a viable alternative for human coding in terms of validity at less cost and higher reliability. The proposed research design also has significant possibilities for scaling in terms of leveraging new models, covering new languages, and adding new datasets for fine-tuning. Based on our tests on manifesto data, a different policy classification scheme, we argue that model-pipeline frameworks such as the Babel Machine can, over time, potentially replace double-blind human coding for a multitude of comparative classification problems.

Item Type: Article
Uncontrolled Keywords: quantitative text analysis, deep learning, large language models, classification policy agendas, comparative agendas project
Subjects: P Language and Literature / nyelvészet és irodalom > P0 Philology. Linguistics / filológia, nyelvészet
Q Science / természettudomány > QA Mathematics / matematika > QA75 Electronic computers. Computer science / számítástechnika, számítógéptudomány
SWORD Depositor: MTMT SWORD
Depositing User: MTMT SWORD
Date Deposited: 12 Jun 2024 07:10
Last Modified: 12 Jun 2024 07:10
URI: https://real.mtak.hu/id/eprint/197063

Actions (login required)

Edit Item Edit Item