Hansard Debates through a Telescope: Two centuries of digital Parliamentary records

The Hansard Corpus (1803-2003) contains 7.6 million speeches from the UK Parliament, which are not verbatim. Its size – 1.6 billion words – means that it is particularly unwieldy to explore digitally. As a result, in 2015 it was tagged semantically using the tagset of the Historical Thesaurus of English in order to enable semantic queries and aggregation. In this talk, I will discuss what the corpus represents, the overall picture of the Parliamentary record from a semantic point of view (‘through a telescope’), and what such digital parliamentary records can tell us.

Marc Alexander is Professor of English Linguistics at the University of Glasgow and is Director of the Historical Thesaurus of English. His research primarily focuses on the study of words, meaning, and effect in English, including historical lexicology, Parliamentary discourse from 1803 to the present, and the stylistics of detective fiction. He is also a trustee of the Glasgow Educational and Marshall Trust, and a trustee and the Chair of the Board of Directors of Scottish Language Dictionaries.

 

Digital Scholarship Centre

Digital Scholarship Centre, 6th floor

Main Library 

University of Edinburgh 

Edinburgh EH8 9LJ

You might be interested in

image of head

CDCS Digital Research Prizes Award Ceremony

image of people drinking coffee

CDCS May Fika

An illustrative collage with & symbol and an old photograph

Explainable Machine Learning (XAI)

An illustrative collage with & symbol and old graphs

Getting Started with Regression in R

An illustrative collage with & symbol and a historical item

Getting Started with Bayesian Statistics

An illustrative collage with & symbol and a maths graph

Linear Mixed Effects Modelling

An illustrative collage with & symbol and an old photograph

Building Personal and Project Websites