Lancaster University

Lancaster Summer Schools
in GIS and NLP (#LancsSS18)

Lancaster University, UK – 2nd to 5th July 2018

Statistics for Corpus Linguistics

Statistics for Corpus Linguistics is aimed at students and researchers with a background in corpus linguistics who wish to learn more about the use of statistics to explore language corpora. No prior knowledge of statistics is required.

The summer school offers a practical introduction to the statistical procedures used for the analysis language corpora. The curriculum provides an overview of the main statistical procedures used in the field of corpus linguistics together with simple examples of application of these methods. It is taught by Vaclav Brezina and Matt Timperley with contributions from other staff from Lancaster University and members of the CASS Challenge Panel.

This summer school takes place under the aegis of The ESRC Centre for Corpus Approaches to Social Science (CASS), which is the recipient of The Queen's Anniversary Prize for Higher and Further Education, and UCREL, one of the world's leading and longest-established centres for corpus-based research.

The topics include, for example:

  • Null hypothesis significance testing and effect sizes
  • Sampling methods and representativeness
  • Frequency and dispersion; descriptive and inferential statistics
  • Register variation and multi-dimensional analysis

Application: The registration is now closed due to a large number of applications.

N.B.: This Summer School event is free to attend, but registration in advance is compulsory, as places are limited.

This page was last modified on Monday 6 March 2017 at 11:22 am.