Frequency lists
Here we provide plain text versions of the frequency lists contained in WFWSE.
These are raw unedited frequency lists produced by our software and do not
contain the many additional notes supplied in the book itself. The lists
are tab delimited plain text so can be imported into your prefered spreadsheet
format. For the main lists we provide a key to the columns. More details
on the process undertaken in the preparation of the lists can be found in the
introduction to the book.
These lists show dispersion ranging between
0 and 1 rather than 0 and 100 as in the book.
We multiplied the value by 100 and rounded to zero decimal places in the book
for reasons of space. Log likelihood values are shown here to one decimal place rather
than zero as in the book.
Please note, all frequencies are per million words.
There are some extra notes explaining
the dummy values (:, @, and %) in the
lemmatised lists.
CHAPTER 1: Frequencies in the Whole Corpus (Spoken and Written English)
CHAPTER 2: Spoken and Written English
- List 2.1: Alphabetical frequency list: speech v. writing (lemmatized):
list
key
- List 2.2: Rank frequency order: spoken English (not lemmatized):
list
key
- List 2.3: Rank frequency order: written English (not lemmatized)
list
key
- List 2.4: Distinctiveness list: contrasting speech and writing (ordered by log likelihood):
list
key
CHAPTER 3: Two Main Varieties of Spoken English Compared
- List 3.1: Alphabetical frequency list: conversational v. task-oriented speech (lemmatized):
list
key
- List 3.2: Distinctiveness list: contrasting conversational v. task-oriented speech (not lemmatized):
list
key
CHAPTER 4: Two Main Varieties of Written English Compared
- List 4.1: Alphabetical frequency list: imaginative v. informative writing (lemmatized):
list
key
- List 4.2: Distinctiveness list: imaginative v. informative writing (not lemmatized):
list
key
CHAPTER 5: Rank Frequency Lists of Words within Word Classes (Parts of Speech) in the whole corpus
- List 5.1: Frequency list of nouns (by lemma):
list
- List 5.2: Frequency list of verbs (by lemma):
list
- List 5.3: Frequency list of adjectives (by lemma):
list
- List 5.4: Frequency list of adverbs (not lemmatized):
list
- List 5.5: Frequency list of pronouns (not lemmatized):
list
- List 5.6: Frequency list of determiners:
list
- List 5.7: Frequency list of determiner/pronouns:
list
- List 5.8: Frequency list of prepositions:
list
- List 5.9: Frequency list of conjunctions:
list
- List 5.10: Frequency list of interjections and discourse particles:
list
CHAPTER 6: Frequency Lists of Grammatical Word Classes (based on the Sampler Corpus)
- List 6.1.1: Alphabetical list: the whole sampler corpus (spoken and written English):
list
- List 6.1.2: Rank frequency list: the whole sampler corpus:
list
- List 6.2.1: Alphabetical list: spoken v. written English:
list
- List 6.2.2: Rank frequency list: spoken English compared with written English:
list
- List 6.2.3: Rank frequency list: written English compared with spoken English:
list
- List 6.2.4: Distinctiveness list: spoken v. written English:
list
- List 6.3.1: Alphabetical list: conversation v. task-oriented speech:
list
- List 6.3.2: Distinctiveness list: conversation v. task-oriented speech:
list
- List 6.4.1: Alphabetical list: imaginative v. informative writing:
list
- List 6.4.2: Distinctiveness list: imaginative v. informative writing:
list