Schema extraction and levelization for XML data

Jong P. Yoon; Sung-Rim Kim

doi:10.1117/12.421065

27 March 2001 Schema extraction and levelization for XML data

Jong P. Yoon, Sung-Rim Kim

Proceedings Volume 4384, Data Mining and Knowledge Discovery: Theory, Tools, and Technology III; (2001) https://doi.org/10.1117/12.421065
Event: Aerospace/Defense Sensing, Simulation, and Controls, 2001, Orlando, FL, United States

Abstract

XML is a new standard for representing and exchanging information on the Internet. An XML data is a data that is tagged by XML elements. Such an XML data can be retrieved not only by a Boolean connection with keywords on the Internet. Keyword-based information retrieval does not precisely result in user requests partly because user requests cannot be properly conveyed. Either too many or too few matches are produced. It is not trivial to formulate what to retrieve for a good-sized query-result. In conventional approaches, a database schema is useful for users to formulate queries and for query processing. Likewise, this paper proposes a method of schema extraction for XML data collection. Obtaining one single schema is not sufficient to serve for the good size of information retrieval and adaptively for the various requests from Internet users. To support this, schemas are then levelized with respect to the frequency of topological data structures in a database. The topological structural information of these schemas is used to formulate queries and further to rewrite queries for relaxation and restriction. Without modification, the method proposed in this paper is used not only for multimedia XML data collections but also for general XML databases.

Citation Download Citation

Jong P. Yoon and Sung-Rim Kim "Schema extraction and levelization for XML data", Proc. SPIE 4384, Data Mining and Knowledge Discovery: Theory, Tools, and Technology III, (27 March 2001); https://doi.org/10.1117/12.421065

ACCESS THE FULL ARTICLE

INSTITUTIONAL
Select your institution to access the SPIE Digital Library.

SELECT YOUR INSTITUTION

PERSONAL
Sign in with your SPIE account to access your personal subscriptions or to use specific features such as save to my library, sign up for alerts, save searches, etc.

PERSONAL SIGN IN

No SPIE Account? Create one

PURCHASE THIS CONTENT

SUBSCRIBE TO DIGITAL LIBRARY

50 downloads per 1-year subscription

Members: $195

Non-members: $335 ADD TO CART

25 downloads per 1 - year subscription

Members: $145

Non-members: $250 ADD TO CART

PURCHASE SINGLE ARTICLE

Includes PDF, HTML & Video, when available