Paper
27 March 2001 Schema extraction and levelization for XML data
Jong P. Yoon, Sung-Rim Kim
Author Affiliations +
Abstract
XML is a new standard for representing and exchanging information on the Internet. An XML data is a data that is tagged by XML elements. Such an XML data can be retrieved not only by a Boolean connection with keywords on the Internet. Keyword-based information retrieval does not precisely result in user requests partly because user requests cannot be properly conveyed. Either too many or too few matches are produced. It is not trivial to formulate what to retrieve for a good-sized query-result. In conventional approaches, a database schema is useful for users to formulate queries and for query processing. Likewise, this paper proposes a method of schema extraction for XML data collection. Obtaining one single schema is not sufficient to serve for the good size of information retrieval and adaptively for the various requests from Internet users. To support this, schemas are then levelized with respect to the frequency of topological data structures in a database. The topological structural information of these schemas is used to formulate queries and further to rewrite queries for relaxation and restriction. Without modification, the method proposed in this paper is used not only for multimedia XML data collections but also for general XML databases.
© (2001) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Jong P. Yoon and Sung-Rim Kim "Schema extraction and levelization for XML data", Proc. SPIE 4384, Data Mining and Knowledge Discovery: Theory, Tools, and Technology III, (27 March 2001); https://doi.org/10.1117/12.421065
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Raster graphics

Databases

Multimedia

Internet

Binary data

Data modeling

Computer science

RELATED CONTENT

Negative comment recognition model based on lightGBM
Proceedings of SPIE (March 28 2023)
How efficient is BitTorrent?
Proceedings of SPIE (January 16 2006)
Data modeling for data mining
Proceedings of SPIE (March 12 2002)

Back to Top