REAL

Relevance Segmentation of Long Documents

Szántó, Zsolt and Sliz-Nagy, Alex and Nagy T., István and Csuma-Kovács, Ádám and Vincze, Veronika and Farkas, Richárd (2018) Relevance Segmentation of Long Documents. In: XIV. Magyar Számítógépes Nyelvészeti Konferencia.

[img]
Preview
Text
teljesB5-415-422.pdf

Download (1MB) | Preview

Abstract

In this paper, we present our methods to identify the most salient topics for a selected domain based on topic modeling. We propose a topic relevance score and segmentation procedure which can split the document into parts referring to various topics. We also offer a solution for visualizing textual spans that are related to a given topic. In this way, it can be easily determined which are the most relevant and most irrelevant segments of a long document (like blog posts or news articles).

Item Type: Conference or Workshop Item (Paper)
Subjects: T Technology / alkalmazott, műszaki tudományok > T2 Technology (General) / műszaki tudományok általában
Depositing User: Dr Richárd Farkas
Date Deposited: 30 Sep 2018 17:49
Last Modified: 30 Sep 2018 17:49
URI: http://real.mtak.hu/id/eprint/86146

Actions (login required)

Edit Item Edit Item