Publication:

Measuring structural similarity of semistructured data based on information-theoretic approaches

Date

Date

Date
2012
Journal Article
Published version

Citations

Citation copied

Helmer, S., Augsten, N., & Böhlen, M. (2012). Measuring structural similarity of semistructured data based on information-theoretic approaches. The VLDB Journal, 21(5), 677–702. https://doi.org/10.1007/s00778-012-0263-0

Abstract

Abstract

Abstract

We propose and experimentally evaluate different approaches for measuring the structural similarity of semistructured documents based on information-theoretic concepts. Common to all approaches is a two-step procedure: first, we extract and linearize the structural information from documents, and then, we use similarity measures that are based on, respectively, Kolmogorov complexity and Shannon entropy to determine the distance between the documents. Compared to other approaches, we are able to achieve a linear run-time complexity and

Metrics

Downloads

1 since deposited on 2013-01-29
Acq. date: 2025-11-12

Views

157 since deposited on 2013-01-29
Acq. date: 2025-11-12

Additional indexing

Creators (Authors)

Journal/Series Title

Journal/Series Title

Journal/Series Title

Volume

Volume

Volume
21

Number

Number

Number
5

Page range/Item number

Page range/Item number

Page range/Item number
677

Page end

Page end

Page end
702

Item Type

Item Type

Item Type
Journal Article

Dewey Decimal Classifikation

Dewey Decimal Classifikation

Dewey Decimal Classifikation

Keywords

Hardware and Architecture, Information Systems

Scope

Scope

Scope
Discipline-based scholarship (basic research)

Language

Language

Language
English

Publication date

Publication date

Publication date
2012

Date available

Date available

Date available
2013-01-29

Publisher

Publisher

Publisher

ISSN or e-ISSN

ISSN or e-ISSN

ISSN or e-ISSN
1066-8888

OA Status

OA Status

OA Status
Closed

Other Identification Number

Other Identification Number

Other Identification Number
merlin-id:7762

Metrics

Downloads

1 since deposited on 2013-01-29
Acq. date: 2025-11-12

Views

157 since deposited on 2013-01-29
Acq. date: 2025-11-12

Citations

Citation copied

Helmer, S., Augsten, N., & Böhlen, M. (2012). Measuring structural similarity of semistructured data based on information-theoretic approaches. The VLDB Journal, 21(5), 677–702. https://doi.org/10.1007/s00778-012-0263-0

Closed
Loading...
Thumbnail Image

Files

Files

Files
Files available to download:1

Files

Files

Files
Files available to download:1
Loading...
Thumbnail Image