Publication Type
Conference Paper
Version
submittedVersion
Publication Date
6-2000
Abstract
XML documents are semistructured and the structure of the documents is embedded in the tags. Although XML documents can be accompanied by a DTD that defines the structure of the documents, the presence of a DTD is not mandatory. The difficulty in deriving the DTD for XML documents lies in the fact that DTDs are of different syntax as XML and that prior knowledge of the structure of the documents is required. In this paper, we introduce DTD-Miner, an automatic structure-mining tool for XML documents. Using a Web-based interface, the user will be able to submit a set of similarly structured XML documents and the system will automatically suggest a DTD. The user is also able to further refine the DTD generated to reduce the complexity by relaxing some the rules used in the system.
Discipline
Databases and Information Systems | Numerical Analysis and Scientific Computing
Publication
Second International Workshop on Advanced Issues of E-Commerce and Web-based Information Systems (WECWIS 2000)
City or Country
San Jose, IEEE Computer Society Press
Citation
HUE, Moh Chuang; LIM, Ee Peng; and NG, Wee-Keong.
DTD-Miner: A tool for mining DTDs from XML documents. (2000). Second International Workshop on Advanced Issues of E-Commerce and Web-based Information Systems (WECWIS 2000).
Available at: https://ink.library.smu.edu.sg/sis_research/990
Creative Commons License
This work is licensed under a Creative Commons Attribution-NonCommercial-No Derivative Works 4.0 International License.
Additional URL
http://portal.acm.org/citation.cfm?id=885174
Included in
Databases and Information Systems Commons, Numerical Analysis and Scientific Computing Commons