Brezina, V. (2018) Statistics in Corpus Linguistics: A Practical Guide. Cambridge: Cambridge University Press.
Do you use corpora in your research or study, but find that you struggle with statistics? This practical introduction will equip you to understand the key principles of statistical thinking and apply these concepts to your own research, without the need for prior statistical knowledge. The book gives step-by-step guidance through the process of statistical analysis and provides multiple examples of how statistical techniques can be used to analyse and visualise linguistic data. It also includes a useful selection of discussion questions and exercises which you can use to check your understanding.
Lancaster Stats Tools is a companion website to the book. It contains additional materials (video lectures, exercises, data, slides and lesson plans) as well as easy-to-use tools for calculating statistics and producing graphs.
Concordance for the lemma GO [csv] [xlsx]
Passives in BE06 - genres [csv] [xlsx]
'The' and 'I' in BNC64 [csv] [xlsx]
'Go'/'travel' in BNC [csv] [xlsx]
'Lovely' in BNC64: Male and female speech [csv] [xlsx]
'Lovely' in BNC64: Age [csv] [xlsx]
Modals in the Brown family - frequencies [csv] [xlsx]
Modals in the Brown family - concordances [csv] [xlsx]
Modals in the Brown family - summary [csv] [xlsx]
Data visualization [xlsx]
#LancsBox can identify collocations and keywords, among other things. It is not available as a web tool, but it is free to download and run on your own computer.
Inter-rater agreement (exercise 9) [csv] [xlsx]
Inter-rater agreement (example) [csv] [xlsx]
Guardian comments [txt]
Daily Mail comments [txt]
Watch instructional videos about statistics and why it matters in language and everyday life.
Download pptx slides on a range of statistical topics related to each of the eight chapters in the book.
Download lesson plans for teachers related to each of the eight chapters in the book.