Research Areas

The group's research interests fall into the broad areas of:

  • Information Access: Building applications to improve access to information in massive text collections, such as the web, newswires, scientific documents, particularly the biomedical literature, and clinical records. Subtopics include: information extraction, text mining and semantic annotation, question answering and summarization.

  • Language Resources and Architectures for NLP: Platforms for developing and deploying real world language processing applications, most notably GATE, the General Archicture for Text Engineering.

  • Human-Computer Dialogue Systems: Building systems to allow spoken language interaction with computers or embodied conversational agents, with applications in areas such as keyboard-free access to information, games and entertainment, companionship.

  • Detection of Reuse and Anomaly: Investigating techniques for determining when texts or portions of texts have been reused or where portions of text do not fit with surrounding text. Applications in areas such as plagiarism and authorship detection and in discovery of hidden content.

  • Foundational Topics: Developing applications with human-like capabilities for processing language requires progress in foundational topics in language processing. Areas of interest include: word sense disambiguation, semantics of time and events, approaches to language modelling.

  • Machine Translation: Building applications to translate automatically between human languages, allowing access to the vast amount of information written in foreign languages and easier communication between speakers of different languages.

Publications

A list of group publications is available from the publications page.
A list of awarded PhDs can be found here

Publications are also available from group member's pages and in many cases from project pages.