Studying the appropriateness of different formulations of a discourse relation in context

Lead Research Organisation: University of Cambridge
Department Name: Computer Science and Technology

Abstract

Abstracts are not currently available in GtR for all funded research. This is normally because the abstract was not required at the time of proposal submission, but may be because it included sensitive information such as personal details.

Publications

10 25 50
 
Description Methods: We have demonstrated the potential for creating data sets that can produce useful insights in both behavioural (psycholinguistic) and computational disciplines. We have shown that it is possible to create a data set using reformulations of sentences extracted from corpora that is controlled enough to test specific hypotheses and varied enough for supervised machine learning. We believe such mixed methodologies provide a useful basis for collaboration between data-driven computational sciences and behavioural sciences (linguistics and psychology of language).

Tools: We have developed software for text reformulation based on applying transformation rules to typed dependency representations. The software produced for text regeneration has been further developed under grant EP/J018805/1 and has now been released as the RegenT text simplification system.

Networks: This project has lead to new collaborations. Dr Katsos is working with Dr Naveed Ahmed, from the University of Islamabad, on a 9- month project on simplifying legal text for lay audiences, with emphasis on laws on women's rights in pakistan. Dr Siddharthan has demonstrated the text simplification software developed on this project to researchers working on deaf education and on ageing. Based on their feedback, various extensions to the system are planned.
Exploitation Route The key outcome is the development of a framework for lexico-syntactic text reformulation. This has been developed further under grant EP/J018805/1 and the software has now been released.
Sectors Digital/Communication/Information Technologies (including Software)

Education

 
Description The key outcome is the development of a framework for lexico-syntactic text reformulation. This has been developed further under grant EP/J018805/1 and attempts are now ongoing for developing applications of the resultant software for text simplification (RegenT) .
Sector Digital/Communication/Information Technologies (including Software)