ejournals

Informacje

Afiliacja

Instytut Teleinformatyki, Wydział Fizyki, Matematyki i Informatyki, Politechnika Krakowska

Artykuły

Sortowanie:

Autor

Detection algorithm for content on Internet web portals

Krzysztof Ulman

Czasopismo Techniczne, Nauki Podstawowe Zeszyt 1-NP (18) 2012, 2012, s. 1 - 1

https://doi.org/10.4467/2353737XCT.14.090.1867

The paper shows steps, made during designing and implementing automatic web pages contents recognition algorithm, based on HTML structure analysis. A web page contents is the article text with its headline, without any other text like menu, advertisements, user’s comments, image captions, etc.

PDF

Czytaj więcej