- Papers accepted for COLING 2016:
Representation and Learning of Temporal Relations. L. Derczynski (2016). COLING
Broad Twitter Corpus: A Diverse Named Entity Recognition Resource. L. Derczynski, K. Bontcheva, I. Roberts (2016). COLING
Stance classification in Rumours as a Sequential Task Exploiting the Tree Structure of Social Media Conversations. A. Zubiaga, E. Kochkina, M. Liakata, R. Procter, M. Lukasik. (2016). COLING
Anita: An Intelligent Text Adaptation Tool. G. Paetzold, L. Specia. (2016). COLING
Understanding the Lexical Simplification Needs of non-Native Speakers of English. G. Paetzold, L. Specia. (2016). COLING
Collecting and Exploring Everyday Language for Predicting Psycholinguistic Properties of Words. G. Paetzold, L. Specia. (2016). COLING
Imitation learning for language generation from unaligned data. G. Lampouras, A. Vlachos. (2016). COLING
- Carolina Scarton, Gustavo Paetzold and Lucia Specia will give a tutorial at COLING 2016, titled Quality estimation for language output applications
- We are pleased to announce that Gustavo Paetzold has passed his PhD viva, having submitted only 2 years after joining as a PhD student.
- Leon Derczynski will give a course at ESSLLI 2017 with Matteo Magnani, titled "Networks and User-generated Content"
- Book in press in Springer Studies in Computational Intelligence: Automatically ordering events and times in text - L Derczynski
- Diana Maynard has had an article on automatic sarcasm detection published in Quartz Magazine
- Diana Maynard will give tutorials on NLP and Social Media Analysis at the 1st International Deep Learning, Big Data and Big Compute Camp, Rabat, Morocco, 24-28 October 2016. https://dlwensias.wordpress.com/2016/09/05/3/
- Paper published in European Psychiatry: Novel psychoactive substances: an investigation of temporal trends in social media and electronic health records - A Kolliakou, M Ball, L Derczynski, D Chandran, G Gkotsis, P Deluca, R Jackson, H Shetty, R Stewart
- Mark Stevenson and Adam Poulson are collaborating with ScHaRR and Human on a project to visualise emotion in social media at the Festival of the Mind - Link to the Guardian Article
- Paper: An IR-based Approach Utilising Query Expansion for Plagiarism Detection in MEDLINE. R. Nawab, M. Stevenson and P. Clough (2016). IEEE/ACM Transactions on Computational Biology and Bioinformatics.
- Paper: The Effect of Word Sense Disambiguation Accuracy on Literature Based Discovery. J. Preiss and M. Stevenson (2016). BMC Medical Informatics and Decision Making.
- Paper: A Corpus of Potentially Contradictory Research Claims from Cardiovascular Research Abstracts. A. Alamri and M. Stevenson (2016). Journal of Biomedical Semantics, 7 (36).
- Papers accepted for EMNLP 2016: Stance Detection with Bidirectional Conditional Encoding, Isabelle Augenstein, Tim Rocktäschel, Andreas Vlachos and Kalina Bontcheva
- Leon Derczynski has won an NVIDIA hardware grant for summary generation from collections of text.
- Prof. Lucia Specia has been awarded an EC H2020 funded ERC Starting Grant. The project on Multimodal Context Modelling for Machine Translation (MultiMT) will start on 1 July 2016 for 5 years.
- Papers accepted for ACL 2016: Hawkes Processes for Continuous Time Sequence Classification: an Application to Rumour Stance Classification in Twitter. Michal Lukasik, P. K. Srijith, Duy Vu, Kalina Bontcheva, Arkaitz Zubiaga, Trevor Cohn.
- Papers accepted for TSD2016: Automatic Restoration of Diacritics for Igbo Language. Ignatius Ezeani, Mark Hepple and Ikechukwu Onyenwe.
- Paper nominated for Best Paper Award at WebSci 2016: Miriam Fernandez, Harith Alani, Lara Piccolo, Christoph Meili, Diana Maynard and Meia Wippoo. Talking Climate Change via Social Media: Communication, Engagement and Behaviour, May 22-25 2016, Hannover, Germany.
- Diana Maynard taught a 3-hour practical tutorial at the AI Seminar on Social Media Content Analysis, UPC Barcelona, 9-13 May 2016.
- Leon Derczynski is co-organising a workshop on Noisy User-generated Text (WNUT) at COLING in Osaka, Japan, 10th December 2016.
- Diana Maynard will teach two 6-hour courses, "Introduction to NLP" and "Practical social media and sentiment analysis" at
the University of Essex Big Data and Analytics Summer School in September 2016. http://www.essex.ac.uk/iads/events/summer-school.aspx
- Andreas Vlachos will be speaking at the Lisbon Machine Learning Summer School about imitation learning for structured prediction.
- Andreas Vlachos will be speaking at the Knowledge Representation Workshop at the University of Liverpool on 28th June 2016.
- Paper: Noise reduction and targeted exploration in imitation learning for Abstract Meaning Representation parsing. James Goodman, Andreas Vlachos and Jason Naradowsky. ACL 2016.
- Paper: Emergent: A novel data-set for stance classification. William Ferreira and Andreas Vlachos. NAACL 2016.
- Paper: Large-scale Multitask Learning for Machine Translation Quality Estimation. Kashif Shah and Lucia Specia. NAACL 2016.
- Paper: Phrase Level Segmentation and Labelling of Machine Translation Errors. Frederic Blain, Varvara Logacheva, and Lucia Specia. In Proc. of Language Resources and Evaluation Conference (LREC), May 2016, Portoroz, Slovenia
- Paper: Challenges of Evaluating Sentiment Analysis Tools on Social Media. Diana Maynard and Kalina Bontcheva. In Proc. of Language Resources and Evaluation Conference (LREC), May 2016, Portoroz, Slovenia
- Paper: Complementarity, F-score, and NLP Evaluation. Leon Derczynski. In Proc. of Language Resources and Evaluation Conference (LREC), May 2016, Portoroz, Slovenia
- Paper: GATE-Time: Extraction of Temporal Expressions and Events. Leon Derczynski, Jannik Strötgen, Diana Maynard, Mark A. Greenwood, Manuel Jung. In Proc. of Language Resources and Evaluation Conference (LREC), May 2016, Portoroz, Slovenia
- Dr. Diana Maynard has been awarded a grant for a fully-funded 4-year PhD student project by the Grantham Centre for Sustainable Futures,
to start in October 2016, on the topic of disaster relief reporting and climate change. The Grantham Scholar will be supervised by Diana Maynard and co-supervised by Prof. Jacqueline
Harrison from the Dept of Journalism and Prof. Shaun Quegan from the Centre for Terrestrial Carbon Dynamics.
- The next annual GATE training course will be held from 6-10 June 2016.
- Mark Stevenson was awarded a grant from Defence Science and Technology Laboratory: "Hypothesis Generation and Visualisation from Data"
- Paper: A Graph-based Approach to Topic Clustering for Online News. Ahmet Aker, Emina Kurtic, Balamurali Andiyakkal Rajendran, Monica Paramita, Emma Barker, Mark Hepple and
Rob Gaizauskas. ECIR 2016.
- Paper: Automated Content Analysis: A Sentiment Analysis on Malaysian Government Social Media. Siti Salwa Hasbullah and Diana Maynard. In Proc. of ACM International
Conference on Ubiquitous Information Management and Communication (IMCOM), January 2016, Danang, Vietnam.
- The COMRADES project has started. A 3 year EC H2020 project from 1 Jan'16 - 31 Dec'18.
The University of Sheffield PI is Prof. Kalina Bontcheva
- We are pleased to announce two new NLP Professors: Kalina Bontcheva and Lucia Specia
have both been promoted to Personal Chair.
Timeline extraction using distant supervision and joint inference, Savelie Cornegruta and Andreas Vlachos
Metrics for Evaluation of Word-level Machine Translation Quality Estimation. Varvara Logacheva, Michal Lukasik and Lucia Specia.
Predicting Morphologically-Complex Unknown Words in Igbo. Ikechukwu Onyenwe and Mark Hepple
- A piece was published in the
Guardian technology blog on Tuesday 8.12.2015 on our work in the EU-funded SENSEI project.
- Tutorial given by Diana Maynard at Search Solutions 2015, British Computer Society, London, November 2015:
"Text analysis with GATE"
- Mark Stevenson is co-organising a workshop on Topic Models: Post-processing and Applications at
CIKM 2015 with Nikolaos Aletras (UCL), Jey Han Lau (King's College London) and Timothy Baldwin (University of Melbourne).
- Andrés Duque from UNED in Madrid visited the group for 3 months (October - December 2015)
- Paper: Understanding climate change tweets: an open source toolkit for social media analysis. D. Maynard and K. Bontcheva. In Proc. of EnviroInfo 2015, Copenhagen,
- Poster: Real-time Social Media Analytics through Semantic Annotation and Linked Open Data. D. Maynard, M. A. Greenwood, I. Roberts, G. Windsor, K. Bontcheva.
Proceedings of WebSci 2015, Oxford, UK
- Paper: "Generalised Brown Clustering and Roll-Up Feature Generation". Leon Derczynski, Sean Chester. AAAI 2016.
- We are pleased to announce that Dr. Andreas Vlachos has joined the group from 1 September 2015.
- Paper: Evaluating Topic Representations for Exploring Document Collections. N. Aletras, T. Baldwin, J. Lau and M. Stevenson (to appear), Journal of the Association for
Information Science and Technology
- Paper: Exploring Relation Types for Literature-based Discovery. J. Preiss, M. Stevenson and R. Gaizauskas. (to appear),
Journal of the American Medical Informatics Association.
- Paper: Why are these similar? Investigating item similarity types in a large Digital Library. A. Gonzalez-Agirre, N. Aletras, G. Rigau, M. Stevenson and E. Agirre. (to appear),
Journal of the Association for Information Science and Technology
- Paper: Cognitive Styles within an Exploratory Search System for Digital Libraries. P. Goodale, P. Clough, S. Fernando, N. Ford and M. Stevenson (2014),
Journal of Documentation, 70(6):970-996.
- Paper: Improving Distant Supervision using Inference Learning. R. Roller, E. Agirre, A. Soroa and M. Stevenson (2015). In Proceedings of the 53rd Annual Meeting of the Association
for Computational Linguistics and the 7th International Conference on Natural Language Processing of the Asian Federation of Natural Language Processing (ACL-IJCNLP 2015),
- Paper: A Hybrid Distributional and Knowledge-based Model of Lexical Semantics. N. Aletras and M. Stevenson (2015). In Proceedings of the Fourth Joint Conference on Lexical and
Computational Semantics, pages 20--29, Denver, Colorado
- Paper: Investigating Continuous Space Language Models for Machine Translation Quality Estimation. Kashif Shah, Raymond W. M. Ng, Fethi Bougares and Lucia Specia. EMNLP, 2015 (To Appear)
- Paper: SHEF-NN: Translation Quality Estimation with Neural Networks. Kashif Shah, Varvara Logacheva, Gustavo Paetzold, Frédéric Blain, Daniel Beck, Fethi Bougares and Lucia Specia. WMT, 2015 (To Appear)
- Paper: A study on the stability and effectiveness of features in quality estimation for spoken language translation. Raymond W. M. Ng, Kashif Shah, Lucia Specia and Thomas Hain. Interspeech, 2015.
- Paper: Quality estimation for ASR K-best list rescoring in spoken language translation. Raymond W. M. Ng, Kashif Shah, Wilker Aziz, Lucia Specia and Thomas Hain. ICASSP, 2015.
- Article: A Bayesian non-linear method for feature selection in machine translation quality estimation. Kashif Shah, Trevor Cohn and Lucia Specia. Journal of Machine Translation, 2015.
- The Pheme project is co-supporting Clinical TempEval again in 2016, a shared evaluation task with the NIH THYME project and Harvard Children's Hospital, which will run at SemEval.
- Special issue on "Time and Information Retrieval" in the Information Processing & Management journal was published, with Leon Derczynski as lead guest editor.
- Martin Leginus from Aalborg University, co-supervised by Leon Derczynski, won the Best Student Paper award at WEBIST with his work improving tag clouds using entity disambiguation in streams.
- Sean Chester from Aarhus University will visit and give a seminar in late September.
- Book deal signed with O'Reilly on Temporal Information Processing for Language, by Leon Derczynski working with James Pustejovsky and Marc Verhagen (both from Brandeis).
- Our entry in the W-NUT entity recognition challenge in tweets won 3rd place for untyped entity recognition.
- Paper: Extracting Relations Between Non-Standard Entities using Distant Supervision and Imitation Learning. Isabelle Augenstein, Andreas Vlachos, Diana Maynard. EMNLP 2015.
- Article: Distantly Supervised Web Relation Extraction for Knowledge Base Population. Isabelle Augenstein, Diana Maynard, Fabio Ciravegna. Semantic Web Journal.
- Tutorial with Barry Norton at ESWC Summer School 2015: "Information Extraction with Linked Data"
- Article from the group published in the journal Information Processing and Management: Leon Derczynski, Diana Maynard, Giuseppe Rizzo, Marieke van Erp,
Genevieve Gorrell, Raphaël Troncy, Johann Petrak, Kalina Bontcheva. 2015. Analysis of Named Entity Recognition and Linking for Tweets.
- Paper presented at the SemEval workshop: Steven Bethard, Leon Derczynski, Guergana Savova, James Pustejovsky, Marc Verhagen. 2015. SemEval-2015 Task 6: Clinical TempEval.
- Paper presented at the SemEval workshop: Fatih Uzdilli, Martin Jaggi, Dominic Egger, Pascal Julmy, Leon Derczynski, Mark Cieliebak. 2015.
Swiss-Chocolate: Combining Flipout Regularization and Random Forest with Artificially Built Subsystems to Boost Text-Classification for Sentiment.
- Paper from the group presented at the SemEval workshop: Hegler Tissot, Genevieve Gorrell, Angus Roberts, Leon Derczynski, Marcos Didonet del Fabro. 2015.
UFPRSheffield: Contrasting Rule-based and Support Vector Machine Approaches to Time Expression Identification in Clinical TempEval.
- Book chapter from the group to appear in The Handbook of Linguistic Annotation (edited by Nancy Ide and James Pustejovsky):
Kalina Bontcheva, Leon Derczynski, Ian Roberts. 2015. Crowdsourcing Named Entity Recognition and Entity Linking Corpora.
- Paper from the group presented at the ISA-11 workshop: Hegler Tissot, Angus Roberts, Leon Derczynski, Genevieve Gorrell, Marcos Didonet del Fabro.
2015. Analysis of Temporal Expressions Annotated in Clinical Notes.
- Paper presented at the WEBIST conference: Martin Leginus, Leon Derczynski, Peter Dolog. 2015. Enhanced Information Access to Social Streams through Word Clouds with Entity Grouping.
- Paper from the group at the W-NUT workshop: Leon Derczynski, Isabelle Augenstein, Kalina Bontcheva. 2015. USFD: Twitter NER with Drift Compensation and Linked Data.
- Diana Maynard will give a Tutorial on "Practical Sentiment Analysis" at Essex University Summer School on Big Data and Analytics, 24-28 August 2015
- Book chapter publication. Diana Maynard and Jonathon Hare. Entity-based Opinion Mining from Text and Multimedia. In "Advances
in Social Media Analysis", Mohamed Gaber, Nirmalie Wiratunga, Ayse Goker, and Mihaela Cocea (eds.) 2015, Springer.
- Diana Maynard gave a keynote speech at 5th International Conference on Web Intelligence, Mining and Semantics (WIMS), July 13-15, 2015, Cyprus. "What you Tweet is What You Get: challenges and opportunities for social media analysis in industry"
- The annual GATE training course was held in Sheffield from 8-12 June, with 21 participants.
- Diana Maynard gave a tutorial on "Text Analysis with GATE" at the Reading University Workshop on Big Social Data, 24 April 2015.
- A paper by Roland Roller and Mark Stevenson (Self-supervised Relation Extraction using UMLS) won the best paper award at
- Paper published in the Journal of Biomedical Informatics:
B. McInnes and M. Stevenson (2014) Determining the Difficulty of Word Sense Disambiguation. Journal of Biomedical Informatics, 47:83-90.
- Paper accepted for the journal Studies in the Digital Humanities:
- Paper published in the journal Information Retrieval:
- Paper accepted for the journal Science of Computer Programming
- Paper from the group published at ACL 2014:
- Paper from the group published at Digital Libraries 2014:
- Paper from the group published at EACL 2014:
- Papers from the group published at EMNLP 2014:
- Kashif Shah, Trevor Cohn and Lucia Specia. 2014. A Bayesian non-Linear Method for Feature Selection in Machine Translation Quality Estimation. Machine Translation.
- The University of Sheffield (Sheffield NLP Group) was ranked 3rd in the list of institutions that have published the most LREC papers.
- The Clinical TempEval exercise will run at SemEval 2015, a collaboration between researchers at Brandeis University, U. Alabama Birmingham and Leon Derczynski for the University of Sheffield
- Leon Derczynski will give two guest lectures at a course on Network Science and online Social Network Analysis at Uppsala Universitet in May
- Members of the group have chapters in 2 new books
- We have 2 demos accepted at EACL 2014
- We have a short paper accepted at EACL 2014
- We have a tutorial at EACL 2014
- Long paper accepted at EACL 2014
- We have a tutorial at LREC 2014
- The group have 6 papers accepted at LREC 2014
- A paper is accepted in JASIST journal
- The PHEME project has started. A 3 year EC FP7 project from 1 Jan'14 - 31 Dec'16 with 9 partners worth a total of € 4,269,938 with an EC contribution of € 2,916,000. The University of Sheffield PI is Dr Kalina Bontcheva
M. Hall, P. Goodale, P. Clough and M. Stevenson (2014) The PATHS System for Exploring Digital Cultural Heritage. Studies in the Digital Humanities.
M. Hall, S. Fernando, P. Clough, A. Soroa, E. Agirre and M. Stevenson (2014) Evaluating hierarchical organisation structures for exploring digital libraries. Information Retrieval 17(4):351-379.
M. Shahbaz, P. McMinn and M. Stevenson (2014) Automatic generation of valid and invalid test data for string validation routines using web searches and regular expressions. Science of Computer Programming.
N. Aletras and M. Stevenson (2014) Labelling Topics using Unsupervised Graph-based Methods. In Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (ACL 2014), pages 631--636, Baltimore, Maryland
N. Aletras, T. Baldwin, J. Lau and M. Stevenson (2014) Representing Topics Labels for Exploring Digital Libraries. In Digital Libraries 2014 (ACM/IEEE Joint Conference on Digital Libraries (JCDL 2014) and International Conference on Theory and Practice of Digital Libraries (TPDL 2014)), London, UK
N. Aletras and M. Stevenson (2014) Measuring the Similarity between Automatically Generated Topics. In Proceedings of the 14th Conference of the European Chapter of the Association for Computational Linguistics, pages 22--27, Gothenburg, Sweden
Wilker Aziz and Lucia Specia. 2014. Exact Decoding for Phrase-Based Statistical Machine Translation. EMNLP, Doha.
Daniel Beck, Trevor Cohn and Lucia Specia. 2014. Joint Emotion Analysis via Multi-task Gaussian Processes. EMNLP, Doha.
Documenting Contemporary Society by Preserving Relevant Information from Twitter In 'Twitter and Society', edited by K. Weller, A. Bruns, J. Burgess, M. Mahrt and C. Puschmann, 2014. T. Risse, W. Peters, P. Senellart, D. Maynard
Crowdsourcing Named Entity Recognition and Entity Linking Corpora in "The Handbook of Linguistic Annotation" edited by Nancy Ide & James Pustejovsky. Kalina Bontcheva, Leon Derczynski, Ian Roberts
The GATE Crowdsourcing Plugin: Crowdsourcing Annotated Corpora Made Easy Kalina Bontcheva, Ian Roberts and Leon Derczynski
DKIE: Open Source Information Extraction for Danish Leon Derczynski, Camilla Vilhelmsen Derczynski Field, Kenneth Sejdenfaden Bøgh
Passive-Aggressive Sequence Labeling with Discriminative Post-Editing for Recognising Person Entities in Tweets Leon Derczynski and Kalina Bontcheva
Natural Language Processing for Social Media Kalina Bontcheva and Leon Derczynski
Predicting and Characterising User Impact on Twitter Vasileios Lampos, Nikolaos Aletras, Daniel Preotiuc-Pietro, Trevor Cohn
Practical Social Media Analysis: finding utility in trivia Diana Maynard and Leon Derczynski
Corpus Annotation through Crowdsourcing: Towards Best Practice Guidelines Marta Sabou, Kalina Bontcheva, Leon Derczynski, Arno Scharl
An efficient and user-friendly tool for machine translation quality estimation Kashif Shah, Marco Turchi, Lucia Specia
Who cares about sarcastic tweets? Investigating the impact of sarcasm on sentiment analysis Diana Maynard
Bilingual dictionaries for all EU languages, LREC Ahmet Aker, Monica Paramita, Marcis Pinnis, Robert Gaizauskas
Bootstrapping Term Extractors for Multiple Languages Ahmet Aker, Monica Paramita, Emma Barker, Robert Gaizauskas
Spatio-temporal grounding of claims made on the web, in Pheme Leon Derczynski, Kalina Bontcheva
Generating Descriptive Multi-Document Summaries of Geo-Located Entities Using Entity Type Models. JASIST Ahmet Aker, Robert Gaizauskas
- Leon Derczynski & Kalina Bontcheva taught a week-long course at the University of Szeged, on "NLP for Social Media"
- Leon Derczynski has been awarded his PhD.
Determining the Types of Temporal Relations in Discourse
(Award Date: December 2013)
- Long paper accepted at EMNLP'13: A temporal model of text periodicities using Gaussian Processes. Daniel Preotiuc-Pietro, Trevor Cohn
- Mark Stevenson is a keynote speaker at RANLP 2013
- Three full papers from the group have been accepted at RANLP 2013, to be held in the spa town of Hisarya, Bulgaria
"Twitter Part-of-Speech Tagging for All: Overcoming Sparse and Noisy Data" Derczynski, L., Ritter, A., Clarke, S. & Bontcheva, K.
"Recognising and Interpreting Named Temporal Expressions" Brucato, M., Derczynski, L., Llorens, H., Bontcheva, K. & Jensen, C.S.
"TwitIE: A Fully-featured Information Extraction Pipeline for Microblog Text" Bontcheva, K., Derczynski, L., Funk, A., Greenwood, M.A., Maynard, D. & Aswani, N.
- The group has had a discussion paper accepted at the International Conference on the Theory of Information Retrieval
- 2 short papers & 3 demonstrations have been accepted by the group at ACL 2013
- Two of the three nominations for the ACM SIGWEB Ted Nelson prize at Hypertext 2013, Paris, are from Sheffield's NLP group.
- 5 papers by the group accepted at ACL 2013
- 3 papers by the group accepted for NAACL 2013
- The ForgetIT: Concise Preservation by combining Managed Forgetting and Contextualized Remembering project has started. A 3 year EC FP7 project from 1 Feb'13 - 31 Jan'16. The project has 11 partners worth a total of € 9,085,190 with an EC contribution of € 6,590,000. The University of Sheffield PI is Prof. Hamish Cunningham
- The VisualSense: Tagging visual data with semantic descriptions project has started. A 3 year EPSRC project from 1 Jan'13 - 31 Dec'15. The project has 4 partners and is part of the Chist-Era EC funding programme. The University of Sheffield PI is Prof. Rob Gaizauskas
"Information Retrieval for Temporal Bounding" Derczynski, L. & Gaizauskas, R.
"Reducing Annotation Effort for Quality Estimation via Active Learning" Beck, D., Specia, L. & Cohn, T.
"Temporal Signals Help Label Temporal Relations" Derczynski, L. & Gaizauskas, R.
"QuEst - A translation quality estimation framework" Specia, L., Shah, K., Guilherme Camargo de Souza, J. & Cohn, T.
"PATHS: A System for Accessing Cultural Heritage Collections" Agirre, E., Aletras, N., Clough, P., Fernando, S., Goodale, P., Hall, M., Soroa, A. & Stevenson, M.
"AnnoMarket: An Open Cloud Platform for NLP" Bontcheva, K., Tablan, V., Roberts, I., Cunningham, H. & Dimitrov, M.
"Extracting bilingual terminologies from comparable corpora" Aker, A., Paramita, M. & Gaizauskas, R.
"An Infinite Hierarchical Bayesian Model of Phrasal Translation" Cohn, T. & Haffari, G.
"Modelling Annotator Bias with Multi-task Gaussian Processes: An Application to Machine Translation Quality Estimation" Cohn, T & Specia, L.
"Markov Translation using Non-parametric Bayesian Inference" Feng, Y. & Cohn, T.
"A user-centric model of voting intention from Social Media" Lampos, V., Preotiuc-Pietro, D. & Cohn, T.
"Representing Topics Using Images" Aletras, N. and Stevenson, M.
"Unsupervised Domain Tuning to Improve Word Sense Disambiguation" Preiss, J. and Stevenson, M.
"DALE: A Word Sense Disambiguation System for Biomedical Documents Trained using Automatically Labeled Examples (demo)" Preiss, J. and Stevenson, M.
Angus Roberts has been awarded his PhD.
Clinical Information Extraction: Lowering the Barrier
(Award Date: 18 December 2012)
- The uComp: Embedded Human Computation for Knowledge Extraction and Evaluation project has started. A 3 year EPSRC project from 15 Nov'12 - 14 Nov'15. The project has 4 partners and is part of the Chist-Era EC funding programme. The University of Sheffield PI is Dr Wim Peters
- The EXPERT: EXPloiting Empirical appRoaches to Translation project has started. A 4 year Marie Curie ITN project from 1 Oct'12 - 30 Sept'16. The project has 9 partners worth a total of € 3,935,340. The University of Sheffield PI is Dr Lucia Specia
- Rao Muhammad Adeel Nawab has been awarded his PhD.
Mono-lingual Paraphrased Text reuse and Plagiarism detection
(Award Date: 18 September 2012)
- Niraj Aswani has been awarded his PhD.
Evolving a General Framework for Text Alignment: Case Studies with Two Asian Languages
(Award Date: 7 August 2012)
- The QTLaunchpad: Preparation and Launch of a Large-Scale Action for Quality Translation Technology project has started. A 2 year EC CSA FP7 project from 1 Jul'12 - 30 Jun'14 with 4 partners worth a total of € 2,731,060 with an EC contribution of € 1,187,767. The University of Sheffield PI is Dr Lucia Specia
- The AnnoMarket: Cloud-based Text Annotation Marketplace project has started. A 2 year EC FP7 project from 1 Jun'12 - 31 May'14 with 4 partners worth a total of € 2,003,654 with an EC contribution of € 1,430,000. The University of Sheffield PI is Prof. Hamish Cunningham
- The Medics: Language Processing for Literature Based Discovery in Medicine project has started. A 3 year EPSRC project from 1 Jun'12 - 31 May'15 worth a total of £ 293,167. The University of Sheffield PI is Dr Mark Stevenson
- The TaaS: Terminology as a Service project has started. A 2 year EC FP7 project from 1 Jun'12 - 31 May'14 with 5 partners worth a total of € 2,298,600 with an EC contribution of € 1,820,000. The University of Sheffield PI is Prof. Rob Gaizauskas
Kumutha Swampillai has been awarded her PhD.
Information Extraction Across Sentences
(Award Date: 7 March 2012)
Angelo Dalli has been awarded his PhD.
Timeline Extraction From Hyperlinked Text Corpora
(Award Date: 10 October 2011)
Danica Damljanovic has been awarded her PhD.
Natural Language Interfaces to Conceptual Models
(Award Date: 18 August 2011)
- First Grant Scheme: Non-Parametric Models of Phrase-based Machine Translation project has been awarded from the EPSRC with a start date of 1 September 2011. An 18 month EPSRC funded project from 1 Sept'11 - 28 Feb'13 worth a total of £ 101,251. The PI is Dr. Trevor Cohn
- The GATE Cloud Exploratory: Adapting the General Architecture for Text Engineering to Cloud Computing project has started. A 6 month EPSRC funded project from 1 Feb'11 - 31 Jul'11 worth a total of £ 71,677. The PI is Prof. Hamish Cunningham
- The PATHS: Personalised Access To cultural Heritage Spaces project has started. A 3 year EC FP7 project from 1 Jan'11 - 31 Dec'13 involving 6 partners worth a total of € 3,199,299 with an EC contribution of € 2,300,000. The University of Sheffield PIs are Dr. Mark Stevenson & Dr. Paul Clough (The Information School)
- The ARCOMEM: From Collect-All Archives to Community Memories - Leveraging the Wisdom of the Crowds for Intelligent Preservation project has started. A 3 year EC FP7 project from 1 Jan'11 - 31 Dec'13 involving 12 partners worth a total of € 8,013,031 with an EC contribution of € 6,000,000. The University of Sheffield PI is Prof. Hamish Cunningham
- Ben Allison has been awarded his PhD.
An Improved Hierarchical Bayesian Model of Language for Document Classification
(Award Date: 21 October 2010)
- Dr Kalina Bontcheva has been awarded a Career Acceleration Fellowship from the EPSRC. The project will start on 1 October 2010 for 5 years and is worth £ 591,755.
- Dr. Mark Stevenson has been elected as a member of the SIGLEX board from 2010 until 2012.
- We are interested in hearing from students who would like to study for a PhD in topics related to Natural Language Processing. Some suggestions can be found on the homepages of individual academics: Dr Trevor Cohn, Prof. Rob Gaizauskas, Dr Mark Hepple & Dr Mark Stevenson. For more information on the PhD application process, please see the PhD applications information page.
- Professor Yorick Wilks has been awarded
an ACM fellowship
(Added: 17 May 2010)
Nick Webb has been awarded his PhD.
Cue-based dialogue act classification
(Award Date: 16 March 2010)
- Sanaz Jabbari has been awarded her PhD.
A Statistical Model of Lexical Context
(Award Date: 23 February 2010)
Valentin Tablan has been awarded his PhD.
Toward Portable Information Extraction
(Award Date: 25 January 2010)
- The ACCURAT: Analysis and Evaluation of Comparable Corpora for Under Resourced Areas of Machine Translation Project has started. Duration is 30 months from 1 Jan'10 - 30 Jun'12. The grant is worth € 3.53m in total and of that the EC will fund € 2.82m. USFD Principal Investigators are Prof. Rob Gaizauskas & Dr Mark Sanderson (Information Studies)
- The NLP group's Hamish Cunningham will be general chair of the first Information Retrieval Facility Conference (IRFC2010) in Vienna next May.
- Dr. Trevor Cohn has joined the NLP Group as a lecturer in Computer Science.
Johann Petrak is visiting the NLP Group. He is located in room G25.
(Visit Dates: 1 June 2009 - 30 September 2009); host: Hamish Cunningham
- Professor Yorick Wilks receives BCS Lovelace Medal.
(Added: 1 May 2009)
Antoine Belin & Simon Loitier are visiting the NLP Group. They are located in room 135 (Yorick Wilks's office).
(Visit Dates: 1 May 2009 - 31 July 2009); host: Yorick Wilks
David Guthrie has been awarded his PhD.
Unsupervised Detection of Anomalous Text
(Award Date: 3 December 2008)
Joe Polifroni has been awarded his PhD.
Enabling Browsing in Interactive Systems
(Award Date: 18 November 2008)
Christopher Brewster has been awarded his PhD.
Mind the Gap: Bridging from text to ontological Knowledge
(Award Date: 1 October 2008)
François Mairesse has been awarded his PhD.
Learning to Adapt in Dialogue Systems: Data-driven Models for Personality Recognition and Generation
(Award Date: 30 September 2008)
- Professor Yorick Wilks is awarded ACL Lifetime Achievement Award.
(Added: 26 June 2008)
- Professor Yorick Wilks wins Zampolli Prize.
(Added: 2 June 2008)
Hrafn Loftsson has been awarded his PhD.
Tagging and Parsing Icelandic Text
(Award Date: 5 February 2008)
Michael Conway has been awarded his PhD.
Approaches to Automatic Biographical Sentence Classification: An Empirical Study
(Award Date: 27 July 2007)
Fang Huang has been awarded her PhD.
Multi-Document Summarization with Latent Semantic Analysis
(Award Date: 19 May 2005)
Ekaterini Pastra has been awarded her PhD.
Vision-Language Integration: a Double-Grounding Case
(Award Date: 5 January 2005)
Alexiei Dingli has been awarded his PhD.
Annotating the Semantic Web
(Award Date: 6 December 2004)
Wim Peters has been awarded his PhD.
Detection and Characterization of Figurative Language Use in WordNet
(Award Date: 29 November 2004)
Diego Uribe has been awarded his PhD.
LEEP: Learning Event Extraction Patterns
(Award Date: 18 October 2004)
Brian Mitchell has been awarded his PhD.
Prepositional Phrase Attachment using Machine Learning Algorithms
(Award Date: 5 July 2004)
Tomas By has been awarded his PhD.
Tears in the Rain
(Award Date: 15 March 2002)
Andrea Setzer has been awarded her PhD.
Temporal information in newswire articles: An annotation scheme and corpus study
(Award Date: 15 March 2002)
Kalina Bontcheva has been awarded her PhD.
Generating Adaptive Hypertext
(Award Date: 17 September 2001)
Alexandar Krotov has been awarded his PhD.
Parsing with a Compacted Treebank Grammar
(Award Date: 17 September 2001)
Harmain Harmain has been awarded his PhD.
Automated Software Engineering
(Award Date: December 2000)
ChunYu Kit has been awarded his PhD.
Unsupervised Lexical Learning as Inductive Inference
(Award Date: 15 November 2000)
Hamish Cunningham has been awarded his PhD.
Software Architecture for Language Engineering
(Award Date: 10 July 2000)
Paul Woods has been awarded his PhD.
Cognitive Schemas for Chinese Noun Classifiers: A Corpus-Based Investigation
(Award Date: 25 February 2000)
Ted Dunning has been awarded his PhD.
Finding Structure In Text, Genome And Other Symbolic Sequences
(Award Date: 29 November 1999)
Mark Stevenson has been awarded his PhD.
Multiple Knowledge Sources for Word Sense Disambiguation
(Award Date: 27 September 1999)
Hammid Khosravi has been awarded his PhD.
Extracting Pragmatic Content From Email
(Award Date: 9 August 1999)
Mark Lee has been awarded his PhD.
Belief, Rationality and Inference
(Award Date: 14 December 1998)
Rob Collier has been awarded his PhD.
Automatic Template Creation for Information Extraction
(Award Date: 10 August 1998)