Gesellschaft für Informatik e.V.

Lecture Notes in Informatics


Datenbanksysteme in Business, Technologie und Web, 11. Fachtagung des GI-Fachbereichs “Datenbanken und Informationssysteme” (DBIS), 2.-4. März 2005, Karlsruhe. GI 2005 P-65, 48-65 (2005).

GI, Gesellschaft für Informatik, Bonn
2005


Editors

Gottfried Vossen, Frank Leymann, Peter Lockemann, Wolffried Stucky (eds.)


Copyright © GI, Gesellschaft für Informatik, Bonn

Contents

Web data extraction for business intelligence: the Lixto approach

Georg Gottlob

Abstract


Knowledge about market developments and competitor activities is increasingly a critical success factor for enterprises. The World Wide Web provides publicly available information, which can be retrieved, for example, from Web sites or online shops. Extraction from such semi-structured information sources is mostly done manually and is therefore very time-consuming. This paper describes how public information can be extracted automatically from Web sites, transformed into structured data formats, and used for data analysis in Business Intelligence systems.



ISBN 3-885794-6

