FAQ

A novel text classification problem and its solution

Publication date: 25.09.2014

Technical Transactions, 2013, Automatic Control Issue 4-AC (12) 2013, pp. 7-16

https://doi.org/10.4467/2353737XCT.14.043.3951

Authors

,
Sławomir Zadrożny
Systems Research Institute, Polish Academy of Sciences, Warsaw; Warsaw School of Information Technology
All publications →
,
Janusz Kacprzyk
Katedra Automatyki i Technik Informacyjnych, Wydział Elektrotechniki i Inżynierii Komputerowej, Politechnika Krakowska; Katedra Komputerowych Systemów Automatyki, Instytut Technologii Komputerowych, Automatyki i Metrologii, Uniwersytet Narodowy „Lvivska Politechnika”
All publications →
,
Marek Gajewski
Systems Research Institute, Polish Academy of Sciences, Warsaw
All publications →
Maciej Wysocki
Warsaw School of Information Technology
All publications →

Download full text

Titles

A novel text classification problem and its solution

Abstract

A new text categorization problem is introduced. As in the classical problem, there is a set of documents and a set of categories. However, in addition to being assigned to a specific category, each document belongs to a certain sequence of documents, referred to as a case. It is assumed that all documents in the same case belong to the same category. An example may be a set of news articles. Their categories may be sport, politics, entertainment, etc. In each category there exist cases, i.e., sequences of documents describing, for example evolution of some events. The problem considered is how to classify a document to a proper category and a proper case within this category. In the paper we formalize the problem and discuss two approaches to its solution.

References


Information

Information: Technical Transactions, 2013, Automatic Control Issue 4-AC (12) 2013, pp. 7-16

Article type: Original article

Titles:

Polish:

A novel text classification problem and its solution

English:

A novel text classification problem and its solution

Authors

Systems Research Institute, Polish Academy of Sciences, Warsaw; Warsaw School of Information Technology

Katedra Automatyki i Technik Informacyjnych, Wydział Elektrotechniki i Inżynierii Komputerowej, Politechnika Krakowska; Katedra Komputerowych Systemów Automatyki, Instytut Technologii Komputerowych, Automatyki i Metrologii, Uniwersytet Narodowy „Lvivska Politechnika”

Systems Research Institute, Polish Academy of Sciences, Warsaw

Warsaw School of Information Technology

Published at: 25.09.2014

Article status: Open

Licence: None

Percentage share of authors:

Sławomir Zadrożny (Author) - 25%
Janusz Kacprzyk (Author) - 25%
Marek Gajewski (Author) - 25%
Maciej Wysocki (Author) - 25%

Article corrections:

-

Publication languages:

English

View count: 1682

Number of downloads: 1221

<p> A novel text classification problem and its solution</p>

A novel text classification problem and its solution

cytuj

pobierz pliki

RIS BIB ENDNOTE