Gerhard Weikum
Scientific Director at the Max Planck Institute for Informatics in Saarbruecken, Germany
Gerhard Weikum is a Scientific Director at the Max Planck Institute for Informatics in Saarbruecken, Germany, and an Adjunct Professor at Saarland University. He co-authored a comprehensive textbook on transactional systems, received the VLDB Test-of-Time Award 2002 for his work on automatic database tuning, and is one of the creators of the YAGO knowledge base which was recognized by the WWW Test-of-Time Award in 2018. Weikum is an ACM Fellow and elected member of various academies. He received the ACM SIGMOD Contributions Award in 2011, a Google Focused Research Award in 2011, an ERC Synergy Grant in 2014, the ACM SIGMOD Edgar F. Codd Innovations Award in 2016, and the Konrad Zuse Medal in 2021.
Abstract: What Computers Know, and What They Should Know
Large knowledge graphs have become a key asset for search engines and other use cases. They are partly based on automatically extracting structured information from web contents and other texts, using a variety of pattern-matching and machine-learning methods. The semantically organized machine knowledge can be harnessed to better interpret text in news, social media and web tables, contributing to question answering, natural language processing (NLP) and data analytics.http://dx.doi.org/10.1561/1900000064 for a survey). Moreover, the talk identifies open challenges and new research opportunities. In particular, it discusses potential synergies of knowledge graphs and language models.
A recent trend that has revolutionized NLP is to capture knowledge latently by billions of parameters of language models, learned at scale from huge text collections in a self-supervised manner. These pre-trained models form the basis of fine-tuning machine-learning solutions for tasks that involve both input texts and broad world knowledge, such as question answering, commonsense reasoning and human-computer conversations. This talk reviews these advances and discusses lessons learned and limitations (see