In the present paper, we introduce a variant of Gold-style learners that is not
required to infer precise descriptions of the languages in a class, but that must
nd descriptive patterns, i. e., optimal generalisations within a class of pattern
languages. Our rst main result characterises those indexed families of recursive
languages that can be inferred by such learners, and we demonstrate that
this characterisation shows enlightening connections to Angluin's corresponding
result for exact inference. Furthermore, this result reveals that our model
can be interpreted as an instance of a natural extension of Gold's model of
language identi cation in the limit. Using a notion of descriptiveness that is
restricted to the natural subclass of terminal-free E-pattern languages, we introduce
a generic inference strategy, and our second main result characterises
those classes of languages that can be generalised by this strategy. This characterisation
demonstrates that there are major classes of languages that can be
generalised in our model, but not be inferred by a normal Gold-style learner.
Our corresponding technical considerations lead to insights of intrinsic interest
into combinatorial and algorithmic properties of pattern languages.
History
School
Science
Department
Computer Science
Citation
FREYDENBERGER, D.D. and REIDENBACH, D., 2013. Inferring descriptive generalisations of formal languages. Journal of Computer and System Sciences, 79 (5), pp.622-639.
NOTICE: this is the author’s version of a work that was accepted for publication in Journal of Computer and System Sciences. Changes resulting from the publishing process, such as peer review, editing, corrections, structural formatting, and other quality control mechanisms may not be reflected in this document. Changes may have been made to this work since it was submitted for publication. A definitive version was subsequently published at http://dx.doi.org/10.1016/j.jcss.2012.10.001