Publication: Measuring structural similarity of semistructured data based on information-theoretic approaches
Measuring structural similarity of semistructured data based on information-theoretic approaches
Date
Date
Date
Citations
Helmer, S., Augsten, N., & Böhlen, M. (2012). Measuring structural similarity of semistructured data based on information-theoretic approaches. The VLDB Journal, 21(5), 677–702. https://doi.org/10.1007/s00778-012-0263-0
Abstract
Abstract
Abstract
We propose and experimentally evaluate different approaches for measuring the structural similarity of semistructured documents based on information-theoretic concepts. Common to all approaches is a two-step procedure: first, we extract and linearize the structural information from documents, and then, we use similarity measures that are based on, respectively, Kolmogorov complexity and Shannon entropy to determine the distance between the documents. Compared to other approaches, we are able to achieve a linear run-time complexity and
Metrics
Downloads
Views
Additional indexing
Creators (Authors)
Volume
Volume
Volume
Number
Number
Number
Page range/Item number
Page range/Item number
Page range/Item number
Page end
Page end
Page end
Item Type
Item Type
Item Type
In collections
Dewey Decimal Classifikation
Dewey Decimal Classifikation
Dewey Decimal Classifikation
Keywords
Scope
Scope
Scope
Language
Language
Language
Publication date
Publication date
Publication date
Date available
Date available
Date available
ISSN or e-ISSN
ISSN or e-ISSN
ISSN or e-ISSN
OA Status
OA Status
OA Status
Publisher DOI
Other Identification Number
Other Identification Number
Other Identification Number
Metrics
Downloads
Views
Citations
Helmer, S., Augsten, N., & Böhlen, M. (2012). Measuring structural similarity of semistructured data based on information-theoretic approaches. The VLDB Journal, 21(5), 677–702. https://doi.org/10.1007/s00778-012-0263-0