Simon Razniewski


Senior Researcher
Department 5
Max Planck Institute for Informatics
Room 427
Campus E1 4 (map)
Saarbrücken
Germany
+49(0)68193255126
srazniew@mpi-inf.mpg.de (PGP)

About

Simon Razniewski is senior researcher at the Max Planck Institute for Informatics in Saarbrücken, Germany, where he heads the Knowledge Base Construction and Quality area.

He was previously assistant professor at the Free University of Bozen Bolzano¹ (2014-2017). He holds a PhD from the Free University of Bozen-Bolzano (2014), and a Diplom (MSc.) from TU Dresden (2010; not this Dresden). He spent time as visitor at the Max-Planck Institute for Informatics (2016), the University of Queensland (2015), AT&T Labs-Research (2013), the University of California, San Diego (2012), and has previous industrial experience from Globalfoundries (2010) and Siemens IT (2009). He has published 16 papers at premier² conferences in the area of data science and management (and more than 44 papers in total).

¹ 2018 world’s 9th best small university according to THE
² A* or A in the CORE 2018 ranking

News

  • 6/2020 Our AKBC paper “Enriching Knowledge Bases with Negative Statements” won the best poster award (=audience-choice-best-paper) at AKBC 2020
  • 6/2020 Paper accepted in JWS: “Uncovering Hidden Semantics of Set Information in Knowledge Bases” (w/ Shrestha Ghosh and Gerhard Weikum)
  • 5/2020 Papers accepted at AKBC: “Enriching Knowledge Bases with Negative Statements” (w/ Hiba Arnaout and Gerhard Weikum) and “Joint Reasoning for Multi-Faceted Commonsense Knowledge” (w/ Yohan Chalier and Gerhard Weikum)
  • 3/2020 Offering a tutorial on commonsense knowledge extraction and consolidation at KI 2020 (August)
  • 2/2020 Will be serving on the senior program committee of IJCAI, ISWC, CIKM this year
  • 2/2020 Attending WSDM
  • 1/2020 Visiting AllenAI (slides), Microsoft Research, Amazon Seattle (slides)
  • 12/2019 Visiting eXascale Infolab (Fribourg)
  • 11/2019 Attending EMNLP
  • 10/2019 Paper “ENTYFI: Entity Typing in Fictional Texts” (joint work with Cuong Xuan Chu and Gerhard Weikum) accepted at WSDM 2020
  • 9/2019 Obtained a Google Cloud Credit grant of $5000 for research on commonsense knowledge
  • 8/2019 Short paper “Coverage of Information Extraction from Sentences and Paragraphs” (joint work with Nitisha Jain, Paramita Mirza, Gerhard Weikum) accepted at EMNLP 2019
  • 8/2019 Paper “Commonsense Properties from Query Logs and Question Answering Forums” (joint work with Julien Romero et al.) accepted at CIKM 2019
  • 6/2019 Visiting L3S (Hannover)
  • 6/2019 900k views for lookslikescanned.com
  • 5/2019 Visiting Megagon Labs (Mountain View)
  • 5/2019 Attending WWW 2019 conference
  • 5/2019 Visiting Diffbot (Mountain View)

Publications

Premier conference publications¹

  • [16] Counting Query Answers over a DL-Lite KB, Julien Corman, Davide Lanti, Diego Calvanese, Simon Razniewski, IJCAI, 2020 – Acceptance rate: 13%
  • [15] ENTYFI: Entity Typing in Fictional Texts , Cuong Xuan Chu, Simon Razniewski, Gerhard Weikum, WSDM, 2020 – Acceptance rate: 15%
  • [14] Wikidata Completeness Profiling Using ProWD, Avicenna Wisesa, Fariz Darari, Adila Krisnadhi, Werner Nutt and Simon Razniewski, K-CAP, 2019
  • [13] Coverage of Information Extraction from Sentences and Paragraphs, Simon Razniewski, Nitisha Jain, Paramita Mirza, Gerhard Weikum, EMNLP, 2019 – Acceptance rate: 20%
  • [12] Commonsense Properties from Query Logs and Question Answering Forums , Julien Romero, Simon Razniewski, Koninika Pal, Jeff Z. Pan, Archit Sakhadeo, Gerhard Weikum, CIKM, 2019 – Acceptance rate: 20%
  • [11] TiFi: Taxonomy Induction for Fictional Domains, Cuong Xuan Chu, Simon Razniewski, Gerhard Weikum, WWW, 2019 – Acceptance rate: 20%
  • [10] Enriching Knowledge Bases with Counting Quantifiers, Paramita Mirza, Simon Razniewski, Fariz Darari and Gerhard Weikum, ISWC, 2018 – Acceptance rate: 23%
  • [9] Completeness-aware Rule Learning from Knowledge Graphs, Thomas Pellissier Tanon, Daria Stepanova, Simon Razniewski, Paramita Mirza and Gerhard Weikum, ISWC, 2017 – Acceptance rate: 22%
  • [8] Cardinal Virtues: Extracting Relation Cardinalities from Text, Paramita Mirza, Simon Razniewski, Fariz Darari, Gerhard Weikum, ACL, 2017 – Acceptance rate: 18%
  • [7] Predicting Completeness in Knowledge Bases, Luis Galárraga, Simon Razniewski, Antoine Amarilli, Fabian M. Suchanek, WSDM, 2017 – Acceptance rate: 16%
  • [6] Optimizing Update Frequencies for Decaying Information, Simon Razniewski, CIKM, 2016 – Acceptance rate: 23%
  • [5] Identifying the Extent of Completeness of Query Answers over Partially Complete Databases, Simon Razniewski, Flip Korn, Werner Nutt, Divesh Srivastava, SIGMOD, 2015 – Acceptance rate: 26%
  • [4] Verification of Query Completeness over Processes, Simon Razniewski, Marco Montali and Werner Nutt, BPM, 2013 – Acceptance rate: 14%
  • [3] Completeness Statements about RDF Data Sources und Their Use for Query Answering, Fariz Darari, Werner Nutt, Giuseppe Pirro, Simon Razniewski, ISWC, 2013 – Acceptance rate: 22%
  • [2] Completeness of Queries over SQL Databases, Werner Nutt and Simon Razniewski, CIKM, 2012 – Acceptance rate: 13%
  • [1] Completeness of Queries over Incomplete Databases, Simon Razniewski and Werner Nutt, VLDB, 2011 – Acceptance rate: 18%

¹Rank A* or A in the Core 2018 ranking (http://www.core.edu.au/conference-portal)

Journal publications

  • [3] Uncovering Hidden Semantics of Set Information in Knowledge Bases , Shrestha Ghosh, Simon Razniewski, Gerhard Weikum, JWS, 2020
  • [2] Completeness and Soundness Guarantees for Conjunctive SPARQL Queries over RDF Data Sources with Completeness Statements, Fariz Darari, Werner Nutt, Simon Razniewski, Sebastian Rudolph, SWJ, 2018
  • [1] Completeness Management for RDF Data Sources, Fariz Darari, Werner Nutt, Giuseppe Pirro, Simon Razniewski, TWEB, 2018

Other peer-reviewed publications

  • [27] Joint Reasoning for Multi-Faceted Commonsense Knowledge , Yohan Chalier, Simon Razniewski and Gerhard Weikum, AKBC, 2020
  • [26] Enriching Knowledge Bases with Negative Statements , Hiba Arnaout, Simon Razniewski and Gerhard Weikum, AKBC, 2020
  • [25] CounQER: A System for Discovering and Linking Count Information in Knowledge Bases , Shrestha Ghosh, Simon Razniewski, Gerhard Weikum, ESWC demo, 2020
  • [24] SANDI: A Tool for Alignment of Images within Text, William Cheng, Sreyasi Nag Chowdhury, Gerard de Melo, Simon Razniewski and Gerhard Weikum, WSDM demo, 2020
  • [23] Epitath or Breaking News? Analyzing and Predicting the Stability of Knowledge Base Properties, Ioannis Dikeoulias, Jannik Strötgen, Simon Razniewski, TempWeb@WWW, 2019
  • [22] Knowledge Base Recall: Detecting and Resolving the Unknown Unknowns, Simon Razniewski and Gerhard Weikum, SIGWEB newsletter, 2018
  • [21] Completeness-aware Rule Learning from Knowledge Graphs, Thomas Pellissier Tanon, Daria Stepanova, Simon Razniewski, Paramita Mirza and Gerhard Weikum, IJCAI best sister conference track, 2018
  • [20] Comparing Index Structures for Completeness Reasoning, Fariz Darari, Werner Nutt, Simon Razniewski, IWBIS, 2018
  • [19] Recoin: Relative Completeness in Wikidata, Vevake Balaraman, Simon Razniewski and Werner Nutt, Wiki Workshop@WWW, 2018
  • [18] Doctoral Advisor or Medical Condition: Towards Entity-specific Rankings of Knowledge Base Properties, Simon Razniewski, Vevake Balaraman, Werner Nutt, ADMA, 2017
  • [17] Enabling Completeness-aware Querying in SPARQL, Luis Galárraga, Katja Hose, Simon Razniewski, WebDB@SIGMOD, 2017
  • [16] COOL-WD: A Completeness Tool for Wikidata, Fariz Darari, Radityo Eko Prasojo, Simon Razniewski and Werner Nutt, ISWC demo, 2017
  • [15] Assessing the Completeness of Entities in Knowledge Bases, Albin Ahmeti, Simon Razniewski, Axel Polleres, ESWC poster, 2017
  • [14] Expanding Wikidata’s Parenthood Information by 178%, or How To Mine Relation Cardinalities, Paramita Mirza, Simon Razniewski and Werner Nutt, ISWC poster, 2016
  • [13] Exploiting Hierarchies for Efficient Detection of Completeness in Stream Data, Simon Razniewski, Shazia Sadiq, and Xiaofang Zhou, ADC, 2016
  • [12] Managing and Consuming Completeness Information for Wikidata Using COOL-WD, Radityo Eko Prasojo, Fariz Darari, Simon Razniewski, and Werner Nutt, COLD@ISWC, 2016
  • [11] But What Do We Actually Know?, Simon Razniewski, Fabian Suchanek and Werner Nutt, AKBC, 2016
  • [10] Turning The Partial-closed World Assumption Upside Down, Simon Razniewski, Ognjen Savkovic and Werner Nutt, AMW, 2016
  • [9] Enabling Fine-grained RDF Data Completeness Assessment, Fariz Darari, Simon Razniewski, Radityo Eko Prasojo, Werner Nutt, ICWE, 2016
  • [8] Long-term Optimization of Update Frequencies for Decaying Information, Simon Razniewski and Werner Nutt, WebDB@SIGMOD, 2015
  • [7] Gamified Children Universities: An Exploratory Study, Vincenzo Del Fatto, Gabriella Dodero, Rosella Gennari, Alessandra Melonio, Marco Montali, Simon Razniewski, Santina Torello, Xiaofeng Wang, Floriano Zini, CHI PLAY, 2014
  • [6] Adding Completeness Information to Query Answers over Spatial Data, Simon Razniewski and Werner Nutt, SIGSPATIAL, 2014
  • [5] Databases under the Partial Closed-world Assumption: A Survey, Simon Razniewski and Werner Nutt, GvDB, 2014
  • [4] Bridging the Semantic Gap between RDF and SPARQL using Completeness Statements, Fariz Darari, Simon Razniewski and Werner Nutt, ISWC poster, 2014
  • [3] Assessing the Completeness of Geographical Data, Simon Razniewski and Werner Nutt, BNCOD, 2013
  • [2] Incomplete Databases: Missing Records and Missing Values, Werner Nutt, Simon Razniewski and Gil Vegliach, DQDI@DASFAA, 2012
  • [1] Checking Query Completeness over Incomplete Data, Simon Razniewski and Werner Nutt, LID@EDBT/ICDT, 2011

Thesis

  • Query-driven Data Completeness Management, PhD Thesis, Free University of Bozen-Bolzano, 2014

Demos

  • Entyfi (2020): Entity typing in fiction
  • Sandi (2020): Story-image alignment
  • Dice (2020): Joint reasoning for multifaceted commonsense knowledge
  • CounQER (2020): Counting queries and entity valued relations
  • ReCoin (2017): A user script for adding relative completeness annotations to Wikidata. Developed by Vevake Balaraman and Albin Ahmeti in the context of the TaDaQua project (Video in Danish)
  • COOL-WD (2017): A completeness tool for Wikidata. Developed by Radityo Eko Prasojo and Fariz Darari
  • MAGIC (2012): A tool for reasoning about the completeness of relational databases, developed by Ognjen Savkovic, Paramita Mirza and Alex Tomasi
  • Lookslikescanned: A website to make PDFs look appear like scanned

Other scripts and datasets

  • Cinex: Code, experiment data and SPARQL endpoint with Wikipedia results for counting quantifier extraction
  • Property Ranking: Dataset of 350 (entity, property1, property2) pairs for humans in Wikidata, along with a preference judgment [O18]
  • A dataset of about 2000 crowdsourced completeness assertions for YAGO and Wikidata.

Research

Topics

My research is centered around the theme of knowledge base construction and curation. It is rooted in foundations of logics/data management, machine learning and natural language processing, and finds current application in KB recall assessment, and encyclopedic, fictional and common-sense knowledge bases (sample slides). KB recall assessment also unifies much of my research, with further details on this project page, and sample slides here.

Selected grants

  • Google Cloud Credit grant of $5000 for research on commonsense knowledge, 2019
  • NVIDIA hardware grant (~$1100) for research on information extraction, 2018
  • PI of “The Call for Recall” (2016-2017), funded by the Free University of Bozen-Bolzano with 20.000 €
  • PI of “TQTK – The Quest to Know What We Know” (2016-2017), funded by the Free University of Bozen-Bolzano with 20.000 €
  • PI of “TaDaQua – Tangible Data Quality” (2016-2018), funded by the Free University of Bozen-Bolzano with 50.000 €
  • Coauthor of “MAGIC – Managing Completeness of Data” (2013-2016), funded by the province of South Tyrol with 250.000 €

Reviewing

  • 2020: Senior PC member of CIKM, IJCAI, ISWC, PC member of WSDM, ACL, ESWC, QOD, Wikidata workshop
  • 2019: PC member of WWW, ACL, AAAI, NAACL, ISWC, K-CAP, AKBC, LDK, Data Quality in Wikidata workshop, Quality of Open Data workshop, Commonsense Inference workshop
  • 2018: PC member of ACL, CIKM, EMNLP, DASFAA, SemDeep-4, DL4KGs, QOD, RoD. Reviewer for VLDB Journal
  • 2017: PC member of CIKM, WebDB. Reviewer for SWJ
  • 2011-2016: External reviewer for AAMAS 2016, SEBD 2015, CIKM 2014, COOPIS 2013, TbiLLC 2013, CIKM 2011, BNCOD 2011.

Teaching

Saarland University


Theses and internships

I have limited capacities for PhD/Master/Bachelor theses, research immersion labs and HiWi positions, on topics related to NLP, knowledge bases and information extraction. Requirements vary by project, but in most cases I expect an intermediate knowledge of information retrieval, machine learning and computational linguistics such as taught in the courses “IRDM” and “Statistical NLP” at Saarland University.

Open thesis topics:

  1. Estimating text recall (theme: NLP)
  2. Extracting and organizing social commonsense knowledge (theme: information extraction)
  3. Detecting interesting absences in image collections (theme: computer vision and data management)
  4. Predicting knowledge base completeness and stability (theme: machine learning)

For external candidates, note that our institute runs a research school, which offers scholarships both for Master and PhD studies.


Other

  • I’m operating a service for making PDFs appear like scanned at http://www.lookslikescanned.com
  • I hold a glider pilot license, which I used until recently to fly over the Eastern Alps with the Aeroclub Bozen-Bolzano. My flightbook: [1]. Some favourite videos (not by me): [2], [3]. Very old one by myself: [4].
  • If not replying to email, I might occasionally be traveling in places like Kyrgyzstan, Iran, or Uganda.
  • I’ve circled the globe (crossed the international date line without return) 3 times.
  • Curious fact: The extent of Indonesia (main islands only) at 5300 km is more than the distance from Germany to China, Kenya, or the North Pole (4630, 5200, 3920 km, respectively).
  • Apologies for any typos in my emails, I didn’t write them