Module overview
In this module, we introduce corpus linguistics as an approach to and method for analysing large-scale text data. We will develop an understanding of building and curating datasets, annotating data, and using quantitative and statistical measures for language analysis. We will explore the uses, advantages and disadvantages of applying corpus methods in research. As we move through the module, we will experience each step of the process of doing research with large-scale linguistic data, considering the choices we make in research and how we can justify them in light of our aims.
Aims and Objectives
Learning Outcomes
Subject Specific Practical Skills
Having successfully completed this module you will be able to:
- use general and specialist software for large-scale textual analysis
- design, develop, and pre-process a large-scale text corpus for analysis
- apply basic approaches to visualising research findings from linguistic data
Knowledge and Understanding
Having successfully completed this module, you will be able to demonstrate knowledge and understanding of:
- methods for designing, developing, and analysing large-scale text corpora
- quantitative and statistical measures in large-scale text analysis
- the methodological and theoretical underpinnings of corpus linguistics
Subject Specific Intellectual and Research Skills
Having successfully completed this module you will be able to:
- critically evaluate the uses, advantages, and disadvantages of using corpus linguistic methods
- apply corpus linguistic methods and techniques to primary data analysis
- justify methodological choices in data collection, curation and analysis
Transferable and Generic Skills
Having successfully completed this module you will be able to:
- communicate the development, design and findings of research in multiple modes and formats (e.g. oral; written; multimodal)
- work independently towards specific research aims
- critically evaluate methodological choices in existing research
Syllabus
Indicative content for the module will include:
- Text corpora, their forms, and functions
- Linguistic intuition vs evidence
- Statistical measures in corpus linguistics
- Using existing corpora online
- Building and curating corpora
- Annotating and pre-processing text data
- Using specialist and generic software for corpus analysis
Learning and Teaching
Teaching and learning methods
Teaching and learning activities will take the form of active workshops, where students work in groups to develop experiential understanding of all aspects of corpus linguistics research. This will variably include – inter alia – working through research problems in active discussion (e.g. what data do we need to answer research question X?), analysing data and presenting findings (e.g. how has the use of X changed over time in the British National Corpus?), and/or critiquing existing research using corpus methods (e.g. in light of the aims of this study, what would you do differently and why?).
This module places explicit emphasis on the development of practical skills for working with data, using specialist software, and justifying choices made in the research process. A collaborative ethos is also embedded in the teaching and learning experience, which is also supported in asynchronous activities via the virtual learning environment.
Type | Hours |
---|---|
Seminar | 24 |
Independent Study | 126 |
Total study time | 150 |
Resources & Reading list
Textbooks
McEnery, T. & Hardie, A. (2012). Corpus Linguistics: Method, Theory and Practice. Cambridge University Press.
O’Keeffe, A. & McCarthy, M. (2010). The (2010). The Routledge Handbook of Corpus Linguistics. Routledge.
Barth, D. & Schnell, S. (2021). Understanding Corpus Linguistics. Routledge.
Stefanowitsch, A. (2020). Corpus Linguistics: A Guide to the Methodology. Language Science Press.
Assessment
Summative
This is how we’ll formally assess what you have learned in this module.
Method | Percentage contribution |
---|---|
Project plan | 20% |
Final project | 80% |
Referral
This is how we’ll assess you if you don’t meet the criteria to pass this module.
Method | Percentage contribution |
---|---|
Assessment | 100% |
Repeat
An internal repeat is where you take all of your modules again, including any you passed. An external repeat is where you only re-take the modules you failed.
Method | Percentage contribution |
---|---|
Assessment | 100% |
Repeat Information
Repeat type: Internal & External