Lecture Notes in Informatics

Information Systems Technology and its Applications, International Conference ISTA'2003, June 19-21, 2003, Kharkiv, Ukraine, Proceedings. P-30, 21-33 (2003).

Mikhail Godlevsky (ed.), Stephen W. Liddle (ed.), Heinrich C. Mayr (ed.)

An integrated ontology development environment for data extraction

Stephen W. Liddle, Kimball A. Hewett and David W. Embley


Data extraction is a necessary technology for dealing with the huge and growing collection of unstructured and semistructured information available on the World Wide Web. Ontology-based data extraction is a robust approach, but constructing ontologies is a technical task that requires a human expert. We present a Java-based tool for the graphical creation and testing of data extraction ontologies. The tool leverages standards such as Java and XML to provide a portable, extensible, maintainable, feature-rich environment, which reduces the burden on expert ontology developers and simplifies the task of ontology creation.

