Simon Razniewski


Senior Researcher
Department 5
Max Planck Institute for Informatics
Room 425
Campus E1 4 (map)
Saarbrücken
Germany
+49(0)68193255126
srazniew@mpi-inf.mpg.de (PGP)

About

Simon Razniewski is senior researcher at the Max Planck Institute for Informatics in Saarbrücken, Germany, where he heads the Knowledge Base Construction and Quality area.

He was previously assistant professor at the Free University of Bozen Bolzano¹ (2014-2017). He holds a PhD from the Free University of Bozen-Bolzano (2014), and a Diplom (MSc.) from TU Dresden (2010; not this Dresden). He spent time as visitor at the Max-Planck Institute for Informatics (2016), the University of Queensland (2015), AT&T Labs-Research (2013), the University of California, San Diego (2012), and has previous industrial experience from Globalfoundries (2010) and Siemens IT (2009). He has published 11 papers at premier² conferences in the area of data science and management (and more than 35 papers in total).

¹ 2018 world’s 9th best small university according to THE
² A* or A in the CORE 2018 ranking

News

  • 22.1. Short paper “TiFi: Taxonomy Induction for Fictional Domains” (joint work with Cuong Xuan Chu and Gerhard Weikum) has been accepted at The Web conference 2019
  • 5.12. Paper “Completeness and Soundness Guarantees for Conjunctive SPARQL Queries over RDF Data Sources with Completeness Statements” (joint work with Fariz Darari, Werner Nutt, Sebastian Rudolph) accepted for the Semantic Web Journal
  • 29.10. Talk “Identifying and Linking Knowledge Base Counting Quantifiers” at FU Bozen-Bolzano
  • 7.10. Attending ISWC
  • 10.9. 700k views of lookslikescanned.com
  • 24.8. Talk “What knowledge bases know (and what they don’t)” at the HCC lab of the FU Berlin
  • 20.7. Offering the seminar “Advanced topics in Knowledge Bases” in the winter term 2018/19. More infos and signup here
  • 3.7. SIGWEB newsletter on knowledge base recall (joint work with Gerhard Weikum)
  • 26.5. Our paper “Enriching Knowledge Bases with Counting Quantifiers” (joint work with Paramita Mirza, Fariz Darari and Gerhard Weikum) has been accepted at ISWC 2018 (PDF)
  • 23.-27.4. At the WWW conference
  • 19.4.2018 Received NVIDIA GPU grant (1x Geforce Titan Xp)
  • 14.3.2018 Appointed as senior researcher
  • 6.12.2017 Recoin, the relative completeness indicator for Wikidata, has been extended to all entities
  • 4.11.2017 At CIKM and ADMA
  • 1.11.2017 Joined the Max Planck Institute for Informatics
  • 28.10.2017 Will give a talk at WikidataCon: “Data completeness: How to know what Wikidata knows?”
  • 2.9.-16.9.2017 Away from email in Kyrgyzstan
  • 28.9.2017 Seminar “Knowledge Bases” has been announced. Register till 4.10.
  • 19.8.2017 Our paper “Doctoral Advisor or Medical Condition: Towards Entity-specific Rankings of Knowledge Base Properties” (joint work with Vevake Balaraman and Werner Nutt) has been accepted as spotlight research paper at ADMA 2017
  • 31.7.-11.8.2017 At ACL
  • 14.7.2017 Our paper “Completeness-aware Rule Learning from Knowledge Graphs” (joint work with Thomas Pellissier Tanon, Daria Stepanova, Paramita Mirza and Gerhard Weikum) has been accepted at ISWC 2017
  • 29.5.2017 At ESWC
  • 12.4.2017 Our poster “Assessing the Completeness of Entities in
  • Knowledge Bases” (with Albin Ahmeti and Axel Polleres) has been accepted at ESWC 2017
  • 11.4.2017 Visiting the University of Auckland
  • 31.4.2017 Our short paper “Cardinal Virtues: Extracting Relation Cardinalities from Text” has been accepted at ACL 2017 (with Paramita Mirza, Fariz Darari, Gerhard Weikum)
  • 22.3.2017 Our short paper “Enabling Completeness-aware Querying in SPARQL” has been accepted at the WebDB 2017 workshop (with Luis Galárraga and Katja Hose)
  • 7.3.2017 Times Higher Education ranks FUB as world’s tenth best small university
  • 17.2.2017 Visiting the LIRIS lab in Lyon
  • 10.2.2017 KRDB Lunch seminar talk on recent research
  • 11.1.2017 Visiting Telecom ParisTech University, DBWeb team
  • 15.11.2016 Our relative completeness tool for Wikidata, ReCoin, has been released
  • 12.11.2016 200k views of lookslikescanned.com
  • 18.10.2016 Our paper “Predicting Completeness in Knowledge Bases” (with Luis Galárraga, Antoine Amarilli and Fabian M. Suchanek) has been accepted at WSDM 2017
  • Visiting the Max-Planck-Institute for Informatics, 10/2016 – 1/2017
  • Our poster “Expanding Wikidata’s Parenthood Information by 178%, or How To Mine Relation Cardinality Information” (with Paramita Mirza and Werner Nutt) has been accepted at ISWC 2016
  • Our paper “Managing and Consuming Completeness Information for Wikidata Using COOL-WD” (with Radityo Eko Prasojo, Fariz Darari and Werner Nutt) has been accepted at the COLD workshop at ISWC 2016
  • Our paper “Optimizing Update Frequencies for Decaying Information” has been accepted at CIKM 2016
  • Our paper “Exploiting Hierarchies for Efficient Detection of Completeness in Stream Data” (with Shazia Sadiq and Xiaofang Zhou) has been accepted at ADC 2016
  • Mostly away from email till 22.7. in Indonesia and Georgia
  • 30.4.2016 Our proposal “The Call for Recall” has been approved by the FUB (budget 20k€)
  • 19.4.2016 Our paper “Turning The Partial-closed World Assumption Upside Down” has been accepted at AMW 2016
  • 4.4.2016 Our vision paper “But What Do We Actually Know?” (with Fabian M. Suchanek and Werner Nutt) has been accepted at the AKBC workshop at NAACL 2016
  • 1.3.2016: Our paper “Enabling Fine-grained RDF Data Completeness Assessment” (with Fariz Darari, Radityo Eko Prasojo and Werner Nutt) has been accepted at ICWE 2016
  • 28.2.2016: 100k views of lookslikescanned.com

Publications

Premier conference publications¹

    • [11] TiFi: Taxonomy Induction for Fictional Domains, Cuong Xuan Chu, Simon Razniewski and Gerhard Weikum, WWW (short paper), 2019 – Short paper acceptance rate 20%
    • [10] Enriching Knowledge Bases with Counting Quantifiers, Paramita Mirza, Simon Razniewski, Fariz Darari and Gerhard Weikum, ISWC, 2018 – Acceptance rate 23%
    • [9] Completeness-aware Rule Learning from Knowledge Graphs, Thomas Pellissier Tanon, Daria Stepanova, Simon Razniewski, Paramita Mirza and Gerhard Weikum, ISWC, 2017 – Acceptance rate 22.3%
    • [8] Cardinal Virtues: Extracting Relation Cardinalities from Text, Paramita Mirza, Simon Razniewski, Fariz Darari and Gerhard Weikum, ACL (short paper), 2017 – Short paper acceptance rate: 18.1%
    • [7] Predicting Completeness in Knowledge Bases, Luis Galárraga, Simon Razniewski, Antoine Amarilli, Fabian M. Suchanek, WSDM, 2017 – Acceptance rate 16%
    • [6] Optimizing Update Frequencies for Decaying Information, Simon Razniewski, CIKM, 2016 – Acceptance rate: 22.8%
    • [5] Identifying the Extent of Completeness of Query Answers over Partially Complete Databases, Simon Razniewski, Flip Korn, Werner Nutt, Divesh Srivastava, SIGMOD, 2015 – Acceptance rate: 25.5%
    • [4] Verification of Query Completeness over Processes, Simon Razniewski, Marco Montali, Werner Nutt, BPM, 2013 – Acceptance rate: 14.4%
    • [3] Completeness Statements about RDF Data Sources und Their Use for Query Answering, Fariz Darari, Werner Nutt, Giuseppe Pirro, Simon Razniewski, ISWC, 2013 – Acceptance rate: 21.5%
    • [2] Completeness of Queries over SQL Databases, Werner Nutt, Simon Razniewski, CIKM, 2012 – Acceptance rate: 13.4%
    • [1] Completeness of Queries over Incomplete Databases, Simon Razniewski, Werner Nutt, VLDB, 2011 – Acceptance rate: 18.1%

¹Rank A* or A in the Core 2014 ranking (http://www.core.edu.au/conference-portal)

Other publications

  • [24] Completeness and Soundness Guarantees for Conjunctive SPARQL Queries over RDF Data Sources with Completeness Statements, Fariz Darari, Werner Nutt, Simon Razniewski and Sebastian Rudolph, Semantic Web Journal (SWJ), 2018
  • [23] Knowledge Base Recall: Detecting and Resolving the Unknown Unknowns, Simon Razniewski and Gerhard Weikum, SIGWEB newsletter, 2018
  • [22] Completeness Management for RDF Data Sources, Fariz Darari, Werner Nutt, Giuseppe Pirro, Simon Razniewski, ACM Transactions on the Web (TWEB), 2018
  • [21] Completeness-aware Rule Learning from Knowledge Graphs, Thomas Pellissier Tanon, Daria Stepanova, Simon Razniewski, Paramita Mirza and Gerhard Weikum, IJCAI best sister conference track, 2018
  • [20] Comparing Index Structures for Completeness Reasoning, Fariz Darari, Werner Nutt, Simon Razniewski, IWBIS, 2018
  • [19] Recoin: Relative Completeness in Wikidata, Vevake Balaraman, Simon Razniewski and Werner Nutt, Wiki workshop at The Web Conference, 2018
  • [18] Doctoral Advisor or Medical Condition: Towards Entity-specific Rankings of Knowledge Base Properties, Simon Razniewski, Vevake Balaraman and Werner Nutt, ADMA, 2017 – Acceptance rate spotlight research papers: 16.9%
  • [17] Cool-WD: A Completeness Tool for Wikidata, Fariz Darari, Radityo Eko Prasojo, Simon Razniewski and Werner Nutt, ISWC (Demo), 2017
  • [16] Assessing the Completeness of Entities in Knowledge Bases, Albin Ahmeti, Simon Razniewski and Axel Polleres, ESWC (Poster), 2017
  • [15] Expanding Wikidata’s Parenthood Information by 178%, or How To Mine Relation Cardinalities, Paramita Mirza, Simon Razniewski and Werner Nutt, ISWC (Poster), 2016
  • [14] Exploiting Hierarchies for Efficient Detection of Completeness in Stream Data, Simon Razniewski, Shazia Sadiq, and Xiaofang Zhou, ADC, 2016
  • [13] Managing and Consuming Completeness Information for Wikidata Using COOL-WD, Radityo Eko Prasojo, Fariz Darari, Simon Razniewski, and Werner Nutt, COLD workshop at ISWC, 2016 – Acceptance rate: 46%
  • [12] But What Do We Actually Know?, Simon Razniewski, Fabian Suchanek, Werner Nutt, AKBC workshop at NAACL, 2016
  • [11] Turning The Partial-closed World Assumption Upside Down, Simon Razniewski, Ognjen Savkovic, Werner Nutt, AMW, 2016
  • [10] Enabling Fine-grained RDF Data Completeness Assessment, Fariz Darari, Simon Razniewski, Radityo Eko Prasojo, Werner Nutt, ICWE, 2016 – Acceptance rate: 21.1%
  • [9] Long-term Optimization of Update Frequencies for Decaying Information, Simon Razniewski, Werner Nutt, International Workshop on Web and Databases (WebDB) at SIGMOD, 2015 – Acceptance rate: 29%
  • [8] Gamified Children Universities: An Exploratory Study, Vincenzo Del Fatto, Gabriella Dodero, Rosella Gennari, Alessandra Melonio, Marco Montali, Simon Razniewski, Santina Torello, Xiaofeng Wang, Floriano Zini, CHI PLAY, 2014
  • [7] Adding Completeness Information to Query Answers over Spatial Data, Simon Razniewski und Werner Nutt, SIGSPATIAL, 2014 – Acceptance rate: 21%
  • [6] Databases under the Partial Closed-world Assumption: A Survey, Simon Razniewski, Werner Nutt, Grundlagen von Datenbanken (GvDB), 2014
  • [5] Bridging the Semantic Gap between RDF and SPARQL using Completeness Statements, Fariz Darari, Simon Razniewski, Werner Nutt, ISWC (poster), 2014
  • [4] Assessing the Completeness of Geographical Data, Simon Razniewski, Werner Nutt, British National Conference on Databases (BNCOD), 2013 (Short Paper) – Acceptance rate: 47.6%
  • [3] Incomplete Databases: Missing Records and Missing Values, Werner Nutt, Simon Razniewski, Gil Vegliach, Workshop on Data Quality in Data Integration Systems (DQDI), 2012
  • [2] Checking Query Completeness over Incomplete Data, Simon Razniewski, Werner Nutt, Short Paper, Workshop on Logic in Databases (LID), 2011
  • [1] Checking Query Completeness over Incomplete Data, Simon Razniewski, Werner Nutt, KRDB Research Centre Technical Report KRDB11-2, Free University of Bozen-Bolzano, 2011

Thesis

  • Query-driven Data Completeness Management, PhD Thesis, Free University of Bozen-Bolzano, 2014

Tools, Scripts, Datasets

  • Cinex: Code, experiment data and SPARQL endpoint with Wikipedia results for counting quantifier extraction
  • Property Ranking: Dataset of 350 (entity, property1, property2) pairs for humans in Wikidata, along with a preference judgment [O18]
  • ReCoin: A user script for adding relative completeness annotations to Wikidata. Developed by Vevake Balaraman and Albin Ahmeti in the context of the TaDaQua project (Video in Danish)
  • COOL-WD: A completeness tool for Wikidata. Developed by Radityo Eko Prasojo and Fariz Darari
  • MAGIC: A tool for reasoning about the completeness of relational databases, developed by Ognjen Savkovic, Paramita Mirza and Alex Tomasi
  • A dataset of about 2000 crowdsourced completeness assertions for YAGO and Wikidata.
  • Lookslikescanned: A website to make PDFs look appear like scanned

Research

Topics

My research is centered around the theme of knowledge base construction and curation. It is rooted in foundations of logics/data management, machine learning and natural language processing, and finds current application in KB recall assessment, and encyclopedic, fictional and common-sense knowledge bases. KB recall assessment is also my best-known research topic, with further details on this page.

Selected grants

  • PI of “The Call for Recall” (2016-2017), funded by the Free University of Bozen-Bolzano with 20.000 €
  • PI of “TQTK – The Quest to Know What We Know” (2016-2017), funded by the Free University of Bozen-Bolzano with 20.000 €
  • PI of “TaDaQua – Tangible Data Quality” (2016-2018), funded by the Free University of Bozen-Bolzano with 50.000 €
  • Coauthor of “MAGIC – Managing Completeness of Data” (2013-2016), funded by the province of South Tyrol with 250.000 €

Reviewing

  • 2019: PC member of WWW, AAAI, NAACL, ISWC, AKBC, LDK, Data Quality in Wikidata workshop, Quality of Open Data workshop
  • 2018: PC member of ACL, CIKM, EMNLP, DASFAA, SemDeep-4, DL4KGs, QOD, RoD. Reviewer for VLDB Journal
  • 2017: PC member of CIKM, WebDB. Reviewer for SWJ
  • 2011-2016: External reviewer for AAMAS 2016, SEBD 2015, CIKM 2014, COOPIS 2013, TbiLLC 2013, CIKM 2011, BNCOD 2011.

Teaching

Saarland University

FU Bozen-Bolzano


Theses and internships

I have some capacities for excellent students for PhD/Master/Bachelor theses, research immersion labs and HiWi positions, on topics related to NLP, knowledge bases and information extraction.

For Master and PhD studies our institute runs a research school, which offers scholarships also at the Master level.

For Master/PhD theses I expect an intermediate knowledge of information retrieval, machine learning and computational linguistics such as taught in the courses “IRDM” and “Statistical NLP” at Saarland University.

For internships, note that I generally cannot offer internships below 6 months duration.

Feel free to reach out with questions.


Other

  • I’m operating a service for making PDFs appear like scanned at http://www.lookslikescanned.com
  • I hold a glider pilot license, which I used until recently to fly over the Eastern Alps with the Aeroclub Bozen-Bolzano. My flightbook: [1]. Some favourite videos (not by me): [2], [3]. Very old one by myself: [4].
  • If not replying to email, I might occasionally be traveling in places like Kyrgyzstan, Iran, or Uganda.
  • I’ve circled the globe (crossed the international date line without return) 3 times.