BNC64, which consists of approximately 1.5 million tokens, was extracted from the demographic part of the British National Corpus (BNC). It represents the speech of 64 British English speakers: 32 women and 32 men. The speakers were selected according to the following criteria:

Each speaker contributes between 6,400 and 64,000 tokens.
The speakers form a balanced sample in terms of gender, age and socioeconomic status.
The speakers come from various parts of the UK.
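
The token-range criterion and the gender balance can be checked mechanically. The following is a minimal sketch in Python, assuming a hypothetical list of speaker records; the field names, identifiers and token counts are invented for illustration and are not taken from the BNC64 metadata.

```python
from collections import Counter
from dataclasses import dataclass

# Token limits per speaker, as stated in the selection criteria.
MIN_TOKENS = 6_400
MAX_TOKENS = 64_000


@dataclass
class Speaker:
    speaker_id: str  # hypothetical identifier, not a real BNC speaker code
    gender: str
    tokens: int


def within_token_range(speaker: Speaker) -> bool:
    """Return True if the speaker's contribution lies within the stated limits."""
    return MIN_TOKENS <= speaker.tokens <= MAX_TOKENS


def select_speakers(speakers: list[Speaker]) -> list[Speaker]:
    """Keep only the speakers that satisfy the token-range criterion."""
    return [s for s in speakers if within_token_range(s)]


def gender_counts(speakers: list[Speaker]) -> Counter:
    """Count speakers per gender, to check whether the sample is balanced."""
    return Counter(s.gender for s in speakers)


# Made-up candidate records (assumption, for illustration only).
candidates = [
    Speaker("S01", "female", 12_500),
    Speaker("S02", "male", 3_200),     # too few tokens
    Speaker("S03", "male", 48_000),
    Speaker("S04", "female", 70_100),  # too many tokens
]

selected = select_speakers(candidates)
print([s.speaker_id for s in selected])
print(gender_counts(selected))
```

The same counting step could be repeated for age and socioeconomic status if those fields were added to the record; how the actual corpus compilers balanced these categories is not specified here.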

BNC64 is distinctive in that it enables us to investigate both individual and social variation.

Corpus characteristics (basic)
Individual speakers

