selected publications
-
academic article
- Intensional data on the web. ACM SIGWEB Newsletter. 2015:1-12. 2015
- The ARCOMEM Architecture for Social- and Semantic-Driven Web Archiving. Future Internet. 6:688-716. 2014
- ARCOMEM Crawling Architecture. Future Internet. 6:518-541. 2014
- Discovering interesting information with advances in web technology. ACM SIGKDD Explorations Newsletter. 14:63-81. 2013
- Discovering interesting information with advances in web technology. School of Electrical Engineering & Computer Science; Science & Engineering Faculty. 2012
- Capturing continuous data and answering aggregate queries in probabilistic XML. ACM Transactions on Database Systems. 36:1-45. 2011
- PARIS. Proceedings of the VLDB Endowment. 5:157-168. 2011
- Probabilistic XML via Markov Chains. Proceedings of the VLDB Endowment. 3:770-781. 2010
- Schema mapping discovery from data instances. Journal of the ACM. 57:1-37. 2010
- On the expressiveness of probabilistic XML models. The VLDB Journal. 18:1041-1064. 2009
-
blog posting
-
chapter
- A Framework for Sampling-Based XML Data Pricing. Lecture notes in computer science. 116-138. 2016
- Provenance Circuits for Trees and Treelike Instances. Lecture notes in computer science. 56-68. 2015
- Get a Sample for a Discount. Lecture notes in computer science. 20-34. 2014
- Intelligent and Adaptive Crawling of Web Applications for Web Archiving. Lecture notes in computer science. 306-322. 2013
- On the Connections between Relational and XML Probabilistic Data Models. Lecture notes in computer science. 121-134. 2013
- Probabilistic XML: Models and Complexity. Studies in fuzziness and soft computing. 39-66. 2013
- Exploiting the Social and Semantic Web for Guided Web Archiving. Lecture notes in computer science. 426-432. 2012
- Monadic Datalog Containment. Lecture notes in computer science. 79-91. 2012
-
conference paper
- Truth Finding with Attribute Partitioning. . 27-33. 2015
- Monitoring moving objects using uncertain web data. . 565-568. 2014
- Scalable, generic, and adaptive systems for focused crawling. . 35-45. 2014
- Disseminate your research. . 1-2. 2013
- Demonstrating intelligent crawling and archiving of web applications. . 2481-2484. 2013
- Uncertain version control in open collaborative editing of tree-structured documents. . 27-36. 2013
- Demonstrating ProApproX 2.0. . 2734-2736. 2012
- Auto-completion learning for XML. . 669-672. 2012
- Optimal top-k generation of attribute combinations based on ranked lists. . 409-420. 2012
- Finding optimal probabilistic generators for XML collections. . 127-139. 2012
- ProApproX. . 1295-1298. 2011
- Efficient query evaluation over probabilistic XML with long-distance dependencies. . 32-37. 2011
- Value joins are expensive over (probabilistic) XML. . 41-48. 2011
- A probabilistic XML merging tool. . 538-541. 2011
- The hidden web, XML and the Semantic Web. . 534-537. 2011
- Aggregate queries for discrete and continuous probabilistic XML. . 50-61. 2010
- Corroborating information from disagreeing views. . 2010