About the project
This PhD project aims to advance current generative modelling techniques in computer vision. You'll address challenges related to language-vision integration and the transition from specialised, task-oriented approaches to adaptable generalist models. Our research aims to unlock the potential of generative models that combine expertise and versatility through language-vision integration.
The landscape of generative modelling in computer vision has undergone a remarkable transformation, evolving from conventional vision-based techniques to sophisticated language-vision models. This evolution has been propelled by the fusion of natural language understanding and image generation within computer vision systems, leading to ground-breaking research and practical applications such as DALL·E 3 integrated with ChatGPT.
This integration enables machines not only to interpret visual data but also to generate contextually rich, human-like descriptions, blurring the lines between artificial intelligence and human cognition. Furthermore, this shift has given rise to versatile generalist models, capable of handling diverse tasks, necessitating innovative solutions to seamlessly integrate language and vision.
The Vision, Learning, and Control (VLC) group at the School of Electronics and Computer Science (ECS) is opening two PhD positions. We cordially invite individuals who are passionate about computer vision and machine learning to apply. The successful candidates will engage in cutting-edge research within the supportive and collaborative environment of the VLC group.