REAL

Book Review. Grimmer, J., Roberts, M.E., & Stewart, B.M. (2022). Text as Data: A New Framework for Machine Learning and the Social Sciences. Princeton University Press.

Varga, Tamás (2024) Book Review. Grimmer, J., Roberts, M.E., & Stewart, B.M. (2022). Text as Data: A New Framework for Machine Learning and the Social Sciences. Princeton University Press. INTERSECTIONS: EAST EUROPEAN JOURNAL OF SOCIETY AND POLITICS, 10 (4). pp. 160-165. ISSN 2416-089X

[img]
Preview
Text
1410-ArticleText-5736-1-10-20250217.pdf - Published Version

Download (165kB) | Preview

Abstract

Social scientists, digital humanities scholars and industry professionals now regularly leverage large-scale document corpora. A large dataset of texts, while providing a wealth of information, is insufficient on its own to generate meaningful insights. It is essential to approach the dataset with well-defined research questions that guide the analytical process and ensure the relevance of the findings. Moreover, deriving meaningful answers requires the application of appropriate methodologies that are aligned with the research objectives. In addition to methodological rigor, scholars must critically assess the limitations of the dataset's validity. This involves evaluating the accuracy, reliability, and completeness of the data, as well as recognizing any inherent biases. The book book aims to illustrate how to treat “text as data” for social science tasks and social science problems. It adopts a six-part structure, combined with several chapters and subchapters. Each part is structured around five fundamental concepts: representation, discovery, measurement, prediction, and causal inference. By doing this, it serves as a comprehensive guide for researchers, delineating the capabilities and limitations inherent in text data methodologies.

Item Type: Article
Subjects: H Social Sciences / társadalomtudományok > H Social Sciences (General) / társadalomtudomány általában
Q Science / természettudomány > QA Mathematics / matematika > QA75 Electronic computers. Computer science / számítástechnika, számítógéptudomány
SWORD Depositor: MTMT SWORD
Depositing User: MTMT SWORD
Date Deposited: 16 Feb 2026 12:05
Last Modified: 16 Feb 2026 12:05
URI: https://real.mtak.hu/id/eprint/234166

Actions (login required)

Edit Item Edit Item