FAQ

Detection algorithm for content on Internet web portals

Publication date: 2012

Technical Transactions, 2012, Fundamental Sciences Issue 1-NP (18) 2012, pp. 1 - 1

https://doi.org/10.4467/2353737XCT.14.090.1867

Authors

,
Krzysztof Ulman
Instytut Teleinformatyki, Wydział Fizyki, Matematyki i Informatyki, Politechnika Krakowska
All publications →
Krzysztof Rzecki
Instytut Teleinformatyki, Wydział Fizyki, Matematyki i Informatyki, Politechnika Krakowska
All publications →

Titles

Detection algorithm for content on Internet web portals

Abstract

The paper shows steps, made during designing and implementing automatic web pages contents recognition algorithm, based on HTML structure analysis. A web page contents is the article text with its headline, without any other text like menu, advertisements, user’s comments, image captions, etc.

References


Information

Information: Technical Transactions, 2012, Fundamental Sciences Issue 1-NP (18) 2012, pp. 1 - 1

Article type: Original article

Titles:

Polish:

Detection algorithm for content on Internet web portals

English:

Detection algorithm for content on Internet web portals

Authors

Instytut Teleinformatyki, Wydział Fizyki, Matematyki i Informatyki, Politechnika Krakowska

Instytut Teleinformatyki, Wydział Fizyki, Matematyki i Informatyki, Politechnika Krakowska

Published at: 2012

Article status: Open

Licence: None

Percentage share of authors:

Krzysztof Ulman (Author) - 50%
Krzysztof Rzecki (Author) - 50%

Article corrections:

-

Publication languages:

Polish

View count: 2575

Number of downloads: 1888

<p> Detection algorithm for content on Internet web portals</p>