TY - JOUR TI - A novel text classification problem and its solution AU - Zadrożny, Sławomir AU - Kacprzyk, Janusz AU - Gajewski, Marek AU - Wysocki, Maciej TI - A novel text classification problem and its solution AB - A new text categorization problem is introduced. As in the classical problem, there is a set of documents and a set of categories. However, in addition to being assigned to a specific category, each document belongs to a certain sequence of documents, referred to as a case. It is assumed that all documents in the same case belong to the same category. An example may be a set of news articles. Their categories may be sport, politics, entertainment, etc. In each category there exist cases, i.e., sequences of documents describing, for example evolution of some events. The problem considered is how to classify a document to a proper category and a proper case within this category. In the paper we formalize the problem and discuss two approaches to its solution. VL - 2013 IS - Automatyka Zeszyt 4-AC (12) 2013 PY - 2014 SN - 0011-4561 C1 - 2353-737X SP - 7 EP - 16 DO - 10.4467/2353737XCT.14.043.3951 UR - https://ejournals.eu/czasopismo/czasopismo-techniczne/artykul/a-novel-text-classification-problem-and-its-solution KW - text categorization KW - sequences of documents KW - sequence mining KW - hidden Markov models