FAQ

Detection algorithm for content on Internet web portals

Publication date: 2012

Technical Transactions, 2012, Fundamental Sciences Issue 1-NP (18) 2012, pp. 1-1

https://doi.org/10.4467/2353737XCT.14.090.1867

Authors

,
Krzysztof Ulman
Instytut Teleinformatyki, Wydział Fizyki, Matematyki i Informatyki, Politechnika Krakowska
All publications →
Krzysztof Rzecki
Instytut Teleinformatyki, Wydział Fizyki, Matematyki i Informatyki, Politechnika Krakowska
Contact with author
All publications →

Download full text

Titles

Detection algorithm for content on Internet web portals

Abstract

The paper shows steps, made during designing and implementing automatic web pages contents recognition algorithm, based on HTML structure analysis. A web page contents is the article text with its headline, without any other text like menu, advertisements, user’s comments, image captions, etc.

References


Information

Information: Technical Transactions, 2012, Fundamental Sciences Issue 1-NP (18) 2012, pp. 1-1

Article type: Original article

Titles:

Polish:

Detection algorithm for content on Internet web portals

English:

Detection algorithm for content on Internet web portals

Authors

Instytut Teleinformatyki, Wydział Fizyki, Matematyki i Informatyki, Politechnika Krakowska

Instytut Teleinformatyki, Wydział Fizyki, Matematyki i Informatyki, Politechnika Krakowska

Published at: 2012

Article status: Open

Licence: None

Percentage share of authors:

Krzysztof Ulman (Author) - 50%
Krzysztof Rzecki (Author) - 50%

Article corrections:

-

Publication languages:

Polish

View count: 2639

Number of downloads: 1917

<p> Detection algorithm for content on Internet web portals</p>

Detection algorithm for content on Internet web portals

cytuj

pobierz pliki

RIS BIB ENDNOTE