The group's research interests fall into the broad areas of:
- Information Access: Building
applications to improve access to information in massive text
collections, such as the web, newswires, scientific
documents, particularly the biomedical literature, and clinical records. Subtopics include: information extraction, text mining and semantic annotation,
question answering and summarization.
- Language Resources and Architectures for NLP:
Platforms for developing and deploying real world language processing
applications, most notably GATE,
the General Archicture for Text Engineering.
- Human-Computer Dialogue Systems:
Building systems to allow spoken language interaction with computers
or embodied conversational agents, with applications in areas such as
keyboard-free access to information, games and entertainment,
- Detection of Reuse and Anomaly:
Investigating techniques for determining when texts or portions of
texts have been reused or where portions of text do not fit with
surrounding text. Applications in areas such as plagiarism and
authorship detection and in discovery of hidden content.
- Foundational Topics: Developing
applications with human-like capabilities for processing language
requires progress in foundational topics in language processing. Areas
of interest include: word sense disambiguation, semantics of time and
events, approaches to language modelling.
- Machine Translation: Building
applications to translate automatically between human languages, allowing access to the
vast amount of information written in foreign languages and easier communication between
speakers of different languages.
A list of group publications is available from the publications page.
A list of awarded PhDs can be found here
Publications are also available from group member's pages and in many
cases from project pages.