Simon Razniewski is senior researcher at the Max Planck Institute for Informatics in Saarbrücken, Germany. His research interests include Data Quality and Management, Semantic Web, Knowledge Engineering and Machine Learning.
He was previously assistant professor at the Free University of Bozen Bolzano (2014-2017). He holds a PhD from the Free University of Bozen-Bolzano (2014), and a Diplom (MSc.) from TU Dresden (2010; not this Dresden). He spent time as visitor at the Max-Planck Institute for Informatics (2016), the University of Queensland (2015), AT&T Labs-Research (2013), the University of California, San Diego (2012), and has previous industrial experience from Globalfoundries (2010) and Siemens IT (2009). He has published 8 full papers at premier² conferences in the area of data science and management (and 26 papers in total).
- 14.3.2018 Appointed as senior researcher
- 5.3.2018 Job offers for student assistants on information extraction in the food domain (position 1 – 850€, position 2 – 1650€)
- 6.12.2017 Recoin, the relative completeness indicator for Wikidata, has been extended to all entities
- 4.11.2017 At CIKM and ADMA
- 1.11.2017 Joined the Max Planck Institute for Informatics
- 28.10.2017 Will give a talk at WikidataCon: “Data completeness: How to know what Wikidata knows?”
- 2.9.-16.9.2017 Away from email in Kyrgyzstan
- 28.9.2017 Seminar “Knowledge Bases” has been announced. Register till 4.10.
- 19.8.2017 Our paper “Doctoral Advisor or Medical Condition: Towards Entity-specific Rankings of Knowledge Base Properties” (joint work with Vevake Balaraman and Werner Nutt) has been accepted as spotlight research paper at ADMA 2017
- 31.7.-11.8.2017 At ACL
- 14.7.2017 Our paper “Completeness-aware Rule Learning from Knowledge Graphs” (joint work with Thomas Pellissier Tanon, Daria Stepanova, Paramita Mirza and Gerhard Weikum) has been accepted at ISWC 2017
- 29.5.2017 At ESWC
- 12.4.2017 Our poster “Assessing the Completeness of Entities in
- Knowledge Bases” (with Albin Ahmeti and Axel Polleres) has been accepted at ESWC 2017
- 11.4.2017 Visiting the University of Auckland
- 31.4.2017 Our short paper “Cardinal Virtues: Extracting Relation Cardinalities from Text” has been accepted at ACL 2017 (with Paramita Mirza, Fariz Darari, Gerhard Weikum)
- 22.3.2017 Our short paper “Enabling Completeness-aware Querying in SPARQL” has been accepted at the WebDB 2017 workshop (with Luis Galárraga and Katja Hose)
- 7.3.2017 Times Higher Education ranks FUB as world’s tenth best small university
- 17.2.2017 Visiting the LIRIS lab in Lyon
- 10.2.2017 KRDB Lunch seminar talk on recent research
- 11.1.2017 Visiting Telecom ParisTech University, DBWeb team
- 15.11.2016 Our relative completeness tool for Wikidata, ReCoin, has been released
- 12.11.2016 200k views of lookslikescanned.com
- 18.10.2016 Our paper “Predicting Completeness in Knowledge Bases” (with Luis Galárraga, Antoine Amarilli and Fabian M. Suchanek) has been accepted at WSDM 2017
- Visiting the Max-Planck-Institute for Informatics, 10/2016 – 1/2017
- Our poster “Expanding Wikidata’s Parenthood Information by 178%, or How To Mine Relation Cardinality Information” (with Paramita Mirza and Werner Nutt) has been accepted at ISWC 2016
- Our paper “Managing and Consuming Completeness Information for Wikidata Using COOL-WD” (with Radityo Eko Prasojo, Fariz Darari and Werner Nutt) has been accepted at the COLD workshop at ISWC 2016
- Our paper “Optimizing Update Frequencies for Decaying Information” has been accepted at CIKM 2016
- Our paper “Exploiting Hierarchies for Efficient Detection of Completeness in Stream Data” (with Shazia Sadiq and Xiaofang Zhou) has been accepted at ADC 2016
- Mostly away from email till 22.7. in Indonesia and Georgia
- 30.4.2016 Our proposal “The Call for Recall” has been approved by the FUB (budget 20k€)
- 19.4.2016 Our paper “Turning The Partial-closed World Assumption Upside Down” has been accepted at AMW 2016
- 4.4.2016 Our vision paper “But What Do We Actually Know?” (with Fabian M. Suchanek and Werner Nutt) has been accepted at the AKBC workshop at NAACL 2016
- 1.3.2016: Our paper “Enabling Fine-grained RDF Data Completeness Assessment” (with Fariz Darari, Radityo Eko Prasojo and Werner Nutt) has been accepted at ICWE 2016
- 28.2.2016: 100k views of lookslikescanned.com
Premier full conference publications¹
-  Completeness-aware Rule Learning from Knowledge Graphs, Thomas Pellissier Tanon, Daria Stepanova, Simon Razniewski, Paramita Mirza and Gerhard Weikum, ISWC, Vienna, Austria, 2017 – Acceptance rate 22.3%
-  Predicting Completeness in Knowledge Bases, Luis Galárraga, Simon Razniewski, Antoine Amarilli, Fabian M. Suchanek, WSDM, Cambridge, UK, 2017 – Acceptance rate 16%
-  Optimizing Update Frequencies for Decaying Information, Simon Razniewski, CIKM, Indianapolis, USA, 2016 – Acceptance rate: 22.8%
-  Identifying the Extent of Completeness of Query Answers over Partially Complete Databases, Simon Razniewski, Flip Korn, Werner Nutt, Divesh Srivastava, SIGMOD, Melbourne, Australia, 2015 – Acceptance rate: 25.5%
-  Verification of Query Completeness over Processes, Simon Razniewski, Marco Montali, Werner Nutt, International Conference on Business Process Management (BPM), Beijing, China, 2013 – Acceptance rate: 14.4%
-  Completeness Statements about RDF Data Sources und Their Use for Query Answering, Fariz Darari, Werner Nutt, Giuseppe Pirro, Simon Razniewski, Int. Semantic Web Conference (ISWC), Australia, 2013 – Acceptance rate: 21.5%
-  Completeness of Queries over SQL Databases, Werner Nutt, Simon Razniewski, Conference on Information und Knowledge Management (CIKM), Maui, USA, 2012 – Acceptance rate: 13.4%
-  Completeness of Queries over Incomplete Databases, Simon Razniewski, Werner Nutt, Int. Conference on Very Large Databases (VLDB), Seattle, USA, 2011 – Acceptance rate: 18.1%
¹Rank A* or A in the Core 2014 ranking (http://www.core.edu.au/conference-portal)
-  Doctoral Advisor or Medical Condition: Towards Entity-specific Rankings of Knowledge Base Properties, Simon Razniewski, Vevake Balaraman and Werner Nutt, ADMA, Singapore, 2017 – Acceptance rate spotlight research papers: 16.9%
-  Assessing the Completeness of Entities in Knowledge Bases, Albin Ahmeti, Simon Razniewski and Axel Polleres, ESWC (Poster), Portoroz, Slovenia, 2017
-  Cardinal Virtues: Extracting Relation Cardinalities from Text, Paramita Mirza, Simon Razniewski, Fariz Darari and Gerhard Weikum, ACL (short paper), Vancouver, Canada, 2017 – Acceptance rate: 18.1%
-  Expanding Wikidata’s Parenthood Information by 178%, or How To Mine Relation Cardinalities, Paramita Mirza, Simon Razniewski and Werner Nutt, ISWC (Poster), Kobe, Japan, 2016
-  Exploiting Hierarchies for Efficient Detection of Completeness in Stream Data, Simon Razniewski, Shazia Sadiq, and Xiaofang Zhou, ADC, Sydney, Australia, 2016
-  Managing and Consuming Completeness Information for Wikidata Using COOL-WD, Radityo Eko Prasojo, Fariz Darari, Simon Razniewski, and Werner Nutt, COLD workshop at ISWC, Kobe, Japan, 2016 – Acceptance rate: 46%
-  But What Do We Actually Know?, Simon Razniewski, Fabian Suchanek, Werner Nutt, AKBC workshop at NAACL, San Diego, USA, 2016
-  Turning The Partial-closed World Assumption Upside Down, Simon Razniewski, Ognjen Savkovic, Werner Nutt, AMW, Panama-City, Panama, 2016
-  Enabling Fine-grained RDF Data Completeness Assessment, Fariz Darari, Simon Razniewski, Radityo Eko Prasojo, Werner Nutt, ICWE, Lugano, Switzerland, 2016 – Acceptance rate: 21.1%
-  Long-term Optimization of Update Frequencies for Decaying Information, Simon Razniewski, Werner Nutt, International Workshop on Web and Databases at SIGMOD, Melbourne, Australia, 2015 – Acceptance rate: 29%
-  Gamified Children Universities: An Exploratory Study, Vincenzo Del Fatto, Gabriella Dodero, Rosella Gennari, Alessandra Melonio, Marco Montali, Simon Razniewski, Santina Torello, Xiaofeng Wang, Floriano Zini, CHI PLAY, 2014
-  Adding Completeness Information to Query Answers over Spatial Data, Simon Razniewski und Werner Nutt, SIGSPATIAL, Dallas, USA, 2014 – Acceptance rate: 21%
-  Databases under the Partial Closed-world Assumption: A Survey, Simon Razniewski, Werner Nutt, Grundlagen von Datenbanken, 2014
-  Bridging the Semantic Gap between RDF and SPARQL using Completeness Statements, Fariz Darari, Simon Razniewski, Werner Nutt, International Semantic Web Conference (ISWC), 2014
-  Assessing the Completeness of Geographical Data, Simon Razniewski, Werner Nutt, British National Conference on Databases (BNCOD), Oxford, UK, 2013 (Short Paper) – Acceptance rate: 47.6%
-  Incomplete Databases: Missing Records and Missing Values, Werner Nutt, Simon Razniewski, Gil Vegliach, Workshop on Data Quality in Data Integration Systems (DQDI), Busan, South Korea, 2012
-  Checking Query Completeness over Incomplete Data, Simon Razniewski, Werner Nutt, Short Paper, Workshop on Logic in Databases (LID), Uppsala, Sweden, 2011
-  Checking Query Completeness over Incomplete Data, Simon Razniewski, Werner Nutt, KRDB Research Centre Technical Report KRDB11-2, Faculty of Computer Science, Free University of Bozen-Bolzano, Italy, 2011
Tools, Scripts, Datasets
- Property Ranking: Dataset of 350 (entity, property1, property2) pairs for humans in Wikidata, along with a preference judgment [O18]
- ReCoin: A user script for adding relative completeness annotations to Wikidata. Developed by Albin Ahmeti in the context of the TaDaQua project (Video in Danish) [O17]
- COOL-WD: A completeness tool for Wikidata. Developed by Radityo Eko Prasojo and Fariz Darari [O10, O13]
- MAGIC: A tool for reasoning about the completeness of relational databases, implementing functionalities from [P1], and more. Developed by Ognjen Savkovic, Paramita Mirza and Alex Tomasi
- A dataset of about 2000 crowdsourced completeness assertions for YAGO and Wikidata.
- Java script to calculate optimal recrawl frequencies [P6, O9]
- GeoCompleteness: An (incomplete) prototype for showing reasoning about the completeness of geographic data [O4, O7]
- Lookslikescanned: A website to make PDFs look appear like scanned
Member of the program committee of ACL 2018, IJCAI 2018, RoD 2018, CIKM 2017 and WebDB 2017. Reviewer for SWJ (2017). External reviewer for AAMAS 2016, SEBD 2015, CIKM 2014, COOPIS 2013, CIKM 2011, BNCOD 2011.
Previous (at FUB)
The following topics are available as Bachelor/Master theses. Please get in contact.
1. Metrics for Relative Completeness
The goal of this project is the development and evaluation of metrics for quantifying the completeness of knowledge base entities. The technical work would include the utilization of property ranking techniques  and entity similarity metrics .
- : Assessing the Completeness of Entities in Knowledge Bases, Albin Ahmeti, Simon Razniewski and Axel Polleres, ESWC, Portoroz, Slovenia, 2017
- : Doctoral Advisor or Medical Condition: Towards Entity-specific Rankings of Knowledge Base Properties, Simon Razniewski, Vevake Balaraman and Werner Nutt, ADMA, 2017
- : Sematch: Semantic similarity framework for Knowledge Graphs, Ganggao Zhu, Carlos A. Iglesias, Knowledge-Based Systems, 2017
2. Multilingual Coverage of Wikipedia
Wikipedia pages in different languages about the same entity often vary widely in size and content. The goal of this project is to quantify and qualify these differences, and to visualize them via a user script.
On the technical level, the idea is to pursue two approaches: (1) Multilingual topic modelling, to discover topics covered more/less in one article or the other, and (2) Interlinking, which can be further structured based on the information available in Wikidata. The results should be turned into a Wikipedia plugin, similar to Recoin.
3. Recommender System for Gliding
Gliding is both a recreational and competitive sport. On good days, glider pilots can be in the air up to 8 hours or more, during which they can cover significant distances (800km and more). Most glider pilots upload their competitive flights to an online platform (Onlinecontest.org), where flights are daily listed and ranked using points that are based on the covered distance and the performance of the aircraft.
To achieve high points on a given day, glider pilots have to carefully choose a task (flight route) that, given their plane, skills and the weather conditions, allows them to cover the maximal distance. All of weather conditions, skills and plane can make a huge difference, as overestimating the weather conditions may lead to not completing the task, and as it is not uncommon that experienced pilots travel twice or more the distance that beginners travel.
The goal of this project is to develop a prototype of a gliding task recommendation system, which takes into account the factors mentioned above. The core component of the prototype will be the similarity function for tasks, which will then be used in a standard recommender systems framework (i.e., collaborative filtering or content-based filtering).
4. Exploiting existential information
Existential information, i.e., knowledge about numbers of facts that hold in reality (e.g., MPII has 5 departments) are recent addition to knowledge bases that classically focus on facts that link entities (e.g., D5 is a department of MPII). The goal of this work is to exploit existential information as derived by  in some part of the KB lifecycle, i.e., either information extraction, KB consolidation, or question answering.