About
My research interests are focussed on natural language processing / computational linguistics focussing on information extraction and human-in-the-loop NLP. He has been a PI and CoI on various UKRI projects, EU H2020, Innovate UK, Home Office and DSTL projects. Many of these projects are cross-disciplinary in nature, featuring consortia with a mixture of academic and commercial partners experienced in a range of domains and disciplines. His PhD looked into recommender systems and ontologies, working under supervisors Prof Dave de Roure and Prof Sir Nigel Shadbolt, and was completed in Oct 2002.
You can update this in Pure (opens in a new tab). Select ‘Edit profile’. Under the heading and then ‘Curriculum and research description’, select ‘Add profile information’. In the dropdown menu, select - ‘About’.
Write about yourself in the third person. Aim for 100 to 150 words covering the main points about who you are and what you currently do. Clear, simple language is best. You can include specialist or technical terms.
You’ll be able to add details about your research, publications, career and academic history to other sections of your staff profile.
Research
Research interests
- Natural Language Processing
- Human-in-the-loop NLP: Active Learning, Adversarial Training, Rationale-based Learning, Interactive Sense Making
- Information Extraction: Few/Zero Shot Learning, Graph-based Models, Behaviour Classification, Geoparsing/Location Extraction, Event Extraction, Argument Mining
- Domains: Law Enforcement, Defence, Mental Health, Environmental Science, Social Science
Current research
Natural Language Processing
My research interest lies in the natural language processing area of information extraction and human-in-the-loop NLP, developing novel algorithms to discover and exploit patterns in free text and metadata to extract actionable human intelligence and machine-readable knowledge. In a juxtaposition to big data approaches, my research has focussed on developing novel solutions to problems where training sets are small, sparse or fragmented in nature. This is very common in areas such as social media posts during breaking news events, emerging topics within online community forums, criminal marketplaces exhibiting deliberate obfuscation, and historical datasets where information can be inaccurately recorded, corrupted or lost over time. I am interested in investigating socio-technical NLP approaches promoting explainable and trustworthy AI, human-in-the-loop approaches and information extraction based on techniques such as few/zero-shot learning, graph-based models, sentence embeddings, domain adaption and argument mining.
Research outputs
open source software github
geoparsepy PyPI
Research projects
ProTechThem : an ESRC funded project. ProTechThem will explore sharenting (parents sharing online information about minors). Motivation for sharenting and automated detection of risk behaviours online will be explored through online ethnography, criminological analysis and Natural Language Processing (NLP) algorithms to support improvement to cybersecurity behaviours.
SafeSpacesNLP - an UKRI TAS Hub funded project. Behaviour classification NLP in a socio-technical AI setting for online harmful behaviours for children and young people. Exploring human-in-the-loop graph-based and few shot NLP models for behaviour classification of online forum posts.
GloSAT - an UK NERC platform grant. Global Surface Air Temperature (GloSAT) aims to improve understanding of climate variability and change. Objectives include information extraction and data rescue of climate change sensors data from historical texts.
Multimodal audio-textual argumentation mining of political debates : a Web Science Institute grant. Development of a multimodel dataset for training NLP models to perform argument mining of political debates.
CYShadowWatch - Aa UK DSTL funded project. Automated Multilingual Information Extraction for Online Cybercrime Sites. CYShadowWatch will explore statistical machine translation and information extraction of online Russian cybercrime forums.
LPLP - Legal and property language processing : UK Innovate UK funded project. LPLP will develop cutting-edge AI techniques to extract and analyse legal rights and obligations related to property and land, including Natural Language Processing (NLP) algorithms to extract legal rights and obligations from HM Land Registry documents.
FloraGuard project : an UK ESRC funded project. FloraGuard will examine and map from a multidisciplinary perspective the criminal market in endangered plants affecting the UK. Quantitative evidence will come from a combination of surface (web forums, social media) and dark web (TOR forums) crawling of cyber-criminal activity; natural language and machine learning used to socio-economically map this activity at a community level.
Intel-Analysis DSTL : a UK DSTL funded project. Intel-Analysis DSTL uses argumentation schemes and evidential reasoning to support teams of analysts trying to evaluate conflicting hypotheses during real-time events. Evidence is obtained in real-time from a combination of human intelligence reports and information extraction from social media via natural language processing.
REVEAL project : an EU funded FP7 project. REVEAL aims to advance the necessary technologies for making a higher level analysis of social media possible. Focussed on social media verification, including digita ltext forensics, trust and credibility analytics and decision support for journalists verifying user generated content.
You can update the information for this section in Pure (opens in a new tab).
Research groups
Any research groups you belong to will automatically appear on your profile. Speak to your line manager if these are incorrect. Please do not raise a ticket in Ask HR.
Research interests
Add up to 5 research interests. The first 3 will appear in your staff profile next to your name. The full list will appear on your research page. Keep these brief and focus on the keywords people may use when searching for your work. Use a different line for each one.
In Pure (opens in a new tab), select ‘Edit profile’. Under the heading 'Curriculum and research description', select 'Add profile information'. In the dropdown menu, select 'Research interests: use separate lines'.
Current research
Update this in Pure (opens in a new tab). Select ‘Edit profile’ and then ‘Curriculum and research description - Current research’.
Describe your current research in 100 to 200 words. Write in the third person. Include broad key terms to help people discover your work, for example, “sustainability” or “fashion textiles”.
Research projects
Research Council funded projects will automatically appear here. The active project name is taken from the finance system.
Publications
Pagination
- 1
- 2
- 3
- 4
- 5
- …
-
Next page
Next
Public outputs that list you as an author will appear here, once they’re validated by the ePrints Team. If you’re missing any outputs that you’ve added to Pure, they may be waiting for validation.
Supervision
Current PhD Students
Contact your Faculty Operating Service team to update PhD students you supervise and any you’ve previously supervised. Making this information available will help potential PhD applicants to find you.
Teaching
ECS Deputy Examinations Officer and Prizes
Module Leader COMP3225 Natural Language Processing
Teaching Team COMP3222/COMP6246 Machine Learning Technologies; COMP3208 Social Computing Techniques; COMP6214 Open Data Innovation
If you are a student interested in a PhD in NLP then have a look at my research projects and contact me to discuss ideas.
You can update your teaching description in Pure (opens in a new tab). Select ‘Edit profile’. Under the heading and then ‘Curriculum and research description’ , select ‘Add profile information’. In the dropdown menu, select – ‘Teaching Interests’. Describe your teaching interests and your current responsibilities. Aim for 200 words maximum.
Courses and modules
Contact the Curriculum and Quality Assurance (CQA) team for your faculty to update this section.
External roles and responsibilities
You can update your external roles and responsibilities in Pure (opens in a new tab). Select ‘+ Add content’ and then ‘Activity’, your ‘Personal’ tab and then ‘Activities’. Choose which activities you want to show on your public profile.
You can hide activities from your public profile. Set the visibility as 'Backend' to only show this information within Pure, or 'Confidential' to make it visible only to you.
Biography
A chance to go into more detail about your work and interests.
This section will only display on your public profile if you’ve added content.
You can update your biography section in Pure (opens in a new tab). Select your ‘Personal’ tab then ‘Edit profile’. Under the heading, and ‘Curriculum and research description’, select ‘Add profile information’. In the dropdown menu, select - ‘Biography’. Aim for no more than 400 words.
This section will only appear if you enter the information into Pure (opens in a new tab).
Prizes
You can update this section in Pure (opens in a new tab). Select ‘+Add content’ and then ‘Prize’. using the ‘Prizes’ section.
You can choose to hide prizes from your public profile. Set the visibility as ‘Backend’ to only show this information within Pure, or ‘Confidential’ to make it visible only to you.