FAQ

Parallelization of the Levenshtein distance algorithm

Data publikacji: 10.02.2015

Czasopismo Techniczne, 2014, Nauki Podstawowe Zeszyt 3 NP (17) 2014, s. 109 - 112

https://doi.org/10.4467/2353737XCT.14.319.3407

Autorzy

,
Artur Niewiarowski
Institute of Computer Science, Faculty of Physics, Mathematics and Computer Science of Cracow University of Technology
Wszystkie publikacje autora →
Marek Stanuszek
Institute of Computer Science, Faculty of Physics, Mathematics and Computer Science of Cracow University of Technology
Wszystkie publikacje autora →

Tytuły

Parallelization of the Levenshtein distance algorithm

Abstrakt

This paper presents a method for the parallelization of the Levenshtein distance algorithm deployed on very large strings. The proposed approach was accomplished using .NET Framework 4.0 technology with a specific implementation of threads using the System. Threading.Task namespace library. The algorithms developed in this study were tested on a high performance machine using Xamarin Mono (for Linux RedHat/Fedora OS). The computational results demonstrate a high level of efficiency of the proposed parallelization procedure.

Bibliografia

Niewiarowski A., Stanuszek M., The mechanism of identification and classification of content, Studia Informatica, Vol. 34, 2B(112), Silesian University of Technology Press, Gliwice 2013, 205-222.

Niewiarowski A., Stanuszek M., Mechanism of analysis of similarity short texts, based on the Levenshtein distance, Studia Informatica. Vol. 34, 1 (110), Silesian University of Technology Press, Gliwice 2013, 107-114.

Niewiarowski A., Term frequency optimization for the vector space model, Czasopismo Techniczne, 9-M/2012, 155-165.

Kobzdej P., Waligóra D., Wielebińska K., Paprzycki M., Parallel Application of Levenshtein Distance to Establish Similarity Between Strings, International Journal of Computer Research, Vol. 12, No. 4, 2003, 625-633.

Mono-project (www.mono-project.com/What_is_Mono).

Левенштейн В.И., Двоичные коды с исправлением выпадений, вставок и замещений символов, Доклады Академий Наук СCCP 163 (4), 1965, 845-848.

Wypych M., Stochastic Spelling Correction of Texts in Polish, Institute of Linguistics, Adam Mickiewicz University, Poznań, Poland; Speech and Language Technology. Volume 6, Poznań 2002.

Damerau F.J., A technique for computer detection and correction of spelling errors, Communications of the ACM, 7 (3), 1964, 171-176.

Runkler T.A., Bezdek J.C., Web mining with relational clustering, International Journal of Approximate Reasoning, Vol. 32, Issues 2–3, February 2003, 217-236.

Niewiarowski A., Działanie parsera ‚Part-of-Speech Tagging’ w ujęciu mechanizmu Web Content Mining, Wydawnictwo VI Ogólnopolskiej Konferencji Naukowej Nauka i Przemysł, Politechnika Krakowska im. Tadeusza Kościuszki, Kraków 2011, 93-100.

Niewiarowski A., Stanuszek M., Parallelize edit distance algorithm, Proceedings. Seventh ACC Cyfronet AGH Users’ Conference, Academic Computer Centre Cyfronet AGH, Zakopane 2014, 31-32.

Niewiarowski A., Stanuszek M., Performance and quality of method for short text similarity algorithm based on edit distance and thesaurus, Proceedings. Seventh ACC Cyfronet AGH Users’ Conference, Academic Computer Centre Cyfronet AGH, Zakopane 2014, 33-34.

Ramos J., Using tf-idf to determine word relevance in document queries, Proceedings of the First Instructional Conference on Machine Learning, 2003.

Campbell C., Johnson R., Miller A., Toub S., Parallel Programming with Microsoft. NET. Design Patterns for Decomposition and Coordination on Multicore Architectures, Microsoft Press, 2010.

Niewiarowski A., Szybko zrozumieć Visual Basic 2012, Self Publishing. Kraków 2013, 66-73.

Informacje

Informacje: Czasopismo Techniczne, 2014, Nauki Podstawowe Zeszyt 3 NP (17) 2014, s. 109 - 112

Typ artykułu: Oryginalny artykuł naukowy

Tytuły:

Polski:

Parallelization of the Levenshtein distance algorithm

Angielski:

Parallelization of the Levenshtein distance algorithm

Autorzy

Institute of Computer Science, Faculty of Physics, Mathematics and Computer Science of Cracow University of Technology

Institute of Computer Science, Faculty of Physics, Mathematics and Computer Science of Cracow University of Technology

Publikacja: 10.02.2015

Status artykułu: Otwarte __T_UNLOCK

Licencja: Żadna

Udział procentowy autorów:

Artur Niewiarowski (Autor) - 50%
Marek Stanuszek (Autor) - 50%

Korekty artykułu:

-

Języki publikacji:

Angielski

Liczba wyświetleń: 2054

Liczba pobrań: 1701

<p> Parallelization of the Levenshtein distance algorithm</p>