TY  - JOUR
TI  - Success Rates in Most-frequent-word-based Authorship Attribution. A Case Study of 1000 Polish Novels from Ignacy Krasicki to Jerzy Pilch
AU  - Rybicki, Jan
AB  - The success rate of authorship attribution by multivariate analysis of most-frequent-word frequencies is studied in a 1000-novel corpus of Polish literary works from the late 18th to the early 21st century. The results are examined for possible influences of the number of authors and/or the number of texts to be attributed. Also, the success rates achieved in this study are compared to those obtained in earlier studies for smaller corpora, too small perhaps to produce regular patterns. This study shows that text sets of this size confirm the intuitive predictions as to those influences: 1) the more authors, the less successful attribution; 2) for the same number of authors, the number of texts to be attributed does not influence success rate.
VL  - Volume 10 (2015)
IS  - Vol. 10, Issue 2
PY  - 2015
SN  - 1732-8160
C1  - 2300-5920
SP  - 87
EP  - 104
DO  - 10.4467/23005920SPL.15.004.3561
UR  - https://ejournals.eu/en/journal/studies-in-polish-linguistics/article/success-rates-in-most-frequent-word-based-authorship-attribution-a-case-study-of-1000-polish-novels-from-ignacy-krasicki-to-jerzy-pilch
KW  - multivariate analysis
KW  - authorship attribution
KW  - Polish literature
KW  - stylometry
ER  - 