< Terug naar vorige pagina

Publicatie

Satisfiability for SCULPT-Schemas for CSV-Like Data.

Boekbijdrage - Boekhoofdstuk Conferentiebijdrage

SCULPT is a simple schema language inspired by the recent working effort towards a recommendation by the World Wide Web Consortium (W3C) for tabular data and metadata on the Web. In its core, a SCULPT schema consists of a set of rules where left-hand sides select sets of regions in the tabular data and the right-hand sides describe the contents of these regions. A document (divided in cells by row- and column-delimiters) then satisfies a schema if it satisfies every rule. In this paper, we study the satisfiability problem for SCULPT schemas. As SCULPT describes grid-like structures, satisfiability obviously becomes undecidable rather quickly even for very restricted schemas. We define a schema language called L-SCULPT (Lego SCULPT) that restricts the walking power of SCULPT by selecting rectangular shaped areas and only considers tables for which selected regions do not intersect. Depending on the axes used by L-SCULPT, we show that satisfiability is PTIME-complete or undecidable. One of the tractable fragments is practically useful as it extends the structural core of the current W3C proposal for schemas over tabular data. We therefore see how the navigational power of the W3C proposal can be extended while still retaining tractable satisfiability tests.
Boek: 21st International Conference on Database Theory (ICDT 2018)
Series: Leibniz International Proceedings in Informatics (LIPIcs)
Pagina's: 14:1 - 14:19
ISBN:9783959770637
Jaar van publicatie:2018
Trefwoorden:CSV, Schema languages, Semi-structured data
Toegankelijkheid:Open