The Use of Electronic Historical Dictionary Data in Corpus Design

Publication date: 19.04.2016

Studies in Polish Linguistics, Volume 11 (2016), Issue 2, pp. 47-56



Renata Bronikowska
Polish Academy of Sciences, Warsaw, Poland
Włodzimierz Gruszczyński
University of Social Sciences and Humanities, Chodakowska 19/31, Warsaw; Polish Academy of Sciences, Plac Defilad 1, Skrytka Pocztowa 24, 00-901 Warsaw, Poland
Maciej Ogrodniczuk
Polish Academy of Sciences, Warsaw, Poland
Marcin Woliński
Polish Academy of Sciences, Warsaw, Poland


The History of the 17th and 18th c. Polish Language Laboratory at the Institute of Polish Language, Polish Academy of Sciences, is creating two large databases: The Electronic Dictionary of the 17th–18th c. Polish and The Electronic Corpus of the 17th and 18th c. Polish Texts (up to 1772), the latter in cooperation with the Institute of Computer Science, Polish Academy of Sciences. Combining the two data sets is expected to help both projects achieve their objectives. The present article shows how the Corpus creators can benefit from the data gathered in the dictionary, with special emphasis on using the grammatical information in the dictionary entries to design tools for automatic annotation of the Corpus texts.


References

Gruszczyński Włodzimierz (ed.) (2004–). Elektroniczny słownik języka polskiego XVII i XVIII wieku. [URL: http://sxvii.pl/; accessed December 15, 2015].

Przepiórkowski Adam, Bańko Mirosław, Górski Rafał L., Lewandowska-Tomaszczyk Barbara (eds.) (2012). Narodowy Korpus Języka Polskiego. Warsaw: Wydawnictwo Naukowe PWN. [URL: http://nkjp.pl; accessed December 15, 2015].

Siekierska Krystyna (ed.) (1999–2004). Słownik języka polskiego XVII i 1. połowy XVIII wieku. Vol. 1. Kraków: Wydawnictwo Instytutu Języka Polskiego PAN.

Saloni Zygmunt, Woliński Marcin, Wołosz Robert, Gruszczyński Włodzimierz, Skowrońska Danuta (2015). Słownik gramatyczny języka polskiego. 3rd ed. Warsaw. [URL: http://sgjp.pl; accessed December 15, 2015].

Woliński Marcin (2006). Morfeusz – a practical tool for the morphological analysis of Polish. In Intelligent Information Processing and Web Mining, Advances in Soft Computing. Mieczysław A. Kłopotek, Sławomir T. Wierzchoń, Krzysztof Trojanowski (eds.), 503–512. Berlin: Springer-Verlag.



Article type: Original article



Article status: Open

Licence: None

Percentage share of authors:

Renata Bronikowska (Author) - 25%
Włodzimierz Gruszczyński (Author) - 25%
Maciej Ogrodniczuk (Author) - 25%
Marcin Woliński (Author) - 25%

