selected publications conference paper Finding optimal probabilistic generators for XML collections. . 127-139. 2012