Knowledge Graph Generation from Web Text

Lead Research Organisation: University of Edinburgh

Department Name: Sch of Informatics

Abstract

The overall goal of the project is Knowledge Graph creation through the automated reading of large scale web text corpora. This means extracting information from text and organizing it in one structured resource, which makes it more accessible for computational use. The main use case of such a resource is question answering. The novelty of our approach on the following factors: (1) our method will allow for construction of such a graph using source text in multiple languages and will also make it possible to query it in multiple languages. (2) Moreover, in contrast to existing Knowledge Graphs, the interface between natural language queries and our graph will be direct and unproblematic because the representation language of the graph is the same as semantic representation of natural language utterances. Traditionally knowledge representation semantic parsing languages were developed separately which means that the task of mapping natural language queries onto knowledge databases is challenging. (3) Finally, our graph will make use of automatically mined information about entailments between events in order to supplement information extracted from text with the facts that were not mentioned but that are implied.
The question the project asks is whether the above listed aspects will provide benefits to downstream applications. The most obvious of those is question answering, which we are using as our evaluation task. The project involves designing a suitable representation language, developing a Knowledge Graph generation algorithm, integrating it with entailment resources, testing on available question answering datasets and potentially developing a dataset specifically for questions requiring access and integration of information from multiple documents.

Student:

Katarzyna Szubert

Period of Study:

Aug 17 - Feb 23

Funder:

EPSRC

Project Status:

Closed

Project Category:

Studentship

Project Reference:

1931627

Research Topic:

Unclassified

Organisations

University of Edinburgh (Lead Research Organisation)

People	ORCID iD
Mark Steedman (Primary Supervisor)
Katarzyna Szubert (Student)

Publications

Author Name

Title Publication Date Published

10 25 50

Szubert I (2019) Node Embeddings for Graph Merging: Case of Knowledge Graph Construction

Studentship Projects

Project Reference	Relationship	Related To	Start	End	Student Name
EP/N509644/1			30/09/2016	29/09/2021
1931627	Studentship	EP/N509644/1	31/08/2017	28/02/2023	Katarzyna Szubert

Abstract

Organisations

People

ORCID iD

Publications

Studentship Projects