Learning deterministic regular expressions for the inference of schemas from XML data Hasselt University
Inferring an appropriate DTD or XML Schema Definition (XSD) for a given collection of XML documents essentially reduces to learning deterministic regular expressions from sets of positive example words. Unfortunately, there is no algorithm capapble of learning the complete class of deterministic regular expressions from positive examples only, as we will show. The regular expressions occurring in practical DTD's and XSD's, however, are such that ...