FAQ

The Use of Electronic Historical Dictionary Data in Corpus Design

Publication date: 19.04.2016

Studies in Polish Linguistics, Volume 11 (2016), Vol. 11, Issue 2, pp. 47 - 56

https://doi.org/10.4467/23005920SPL.16.003.4818

Authors

,
Renata Bronikowska
Polish Academy of Sciences, Warsaw, Poland
All publications →
,
Włodzimierz Gruszczyński
University of Social Sciences and Humanities; Polish Academy of Sciences, Chodakowska 19/31, Warszawa; Polska Akademia Nauk; Plac Defilad 1 Skrytka Pocztowa 24, 00-901 Warszawa, Poland
All publications →
,
Maciej Ogrodniczuk
Polish Academy of Sciences, Warsaw, Poland
All publications →
Marcin Woliński
Polish Academy of Sciences, Warsaw, Poland
All publications →

Titles

The Use of Electronic Historical Dictionary Data in Corpus Design

Abstract

The History of the 17th and 18th c. Polish Language Laboratory, Institute of Polish Language, Polish Academy of Sciences, is in the process of creating two large databases: The Electronic Dictionary of the 17th−18th c. Polish and The Electronic Corpus of the 17th and 18th c. Polish Texts (up to 1772), the latter in cooperation with the Institute of Computer Science, Polish Academy of Sciences. It is expected that combining these two sets of data will help to achieve the objectives established for both database projects. The present article shows the benefits that the Corpus creators can get from the data gathered in the dictionary, with special emphasis put on the use of grammatical information included in the dictionary entries to design tools for automatic text annotation in the Corpus.

References

Download references

Gruszczyński Włodzimierz (ed.) (2004–). Elektroniczny słownik języka polskiego XVII i XVIII wieku. [URL: http://sxvii.pl/; accessed December 15, 2015].

Przepiórkowski Adam, Bańko Mirosław, Górski Rafał L., Lewandowska-Tomaszczyk Barbara (eds.) (2012). Narodowy Korpus Języka Polskiego. Warsaw: Wydawnictwo Naukowe PWN. [URL: http://nkjp.pl; accessed December 15, 2015].

Siekierska Krystyna (ed.) (1999−2004). Słownik języka polskiego XVII i 1. połowy XVIII wieku. Vol. 1. Kraków: Wydawnictwo Instytutu Języka Polskiego PAN.

Saloni Zygmunt, Woliński Marcin, Wołosz Robert, Gruszczyński Włodzimierz, Skowrońska Danuta (2015). Słownik gramatyczny języka polskiego. 3rd ed. Warsaw. [URL: http://sgjp.pl; accessed December 15, 2015].

Woliński Marcin (2006). Morfeusz − a practical tool for the morphological analysis of Polish. In Intelligent Information Processing and Web Mining, Advances in Soft Computing. Mieczysław A. Kłopotek, Sławomir T. Wierzchoń, Krzysztof Trojanowski (eds.), 503−512. Berlin: Springer-Verlag.

Information

Information: Studies in Polish Linguistics, Volume 11 (2016), Vol. 11, Issue 2, pp. 47 - 56

Article type: Original article

Titles:

English:

The Use of Electronic Historical Dictionary Data in Corpus Design

Authors

Polish Academy of Sciences, Warsaw, Poland

University of Social Sciences and Humanities; Polish Academy of Sciences, Chodakowska 19/31, Warszawa; Polska Akademia Nauk; Plac Defilad 1 Skrytka Pocztowa 24, 00-901 Warszawa, Poland

Polish Academy of Sciences, Warsaw, Poland

Polish Academy of Sciences, Warsaw, Poland

Published at: 19.04.2016

Article status: Open

Licence: None

Percentage share of authors:

Renata Bronikowska (Author) - 25%
Włodzimierz Gruszczyński (Author) - 25%
Maciej Ogrodniczuk (Author) - 25%
Marcin Woliński (Author) - 25%

Article corrections:

-

Publication languages:

English