Contracted forms and associated tags in BNC2

[ Related documents: Multiwords | Introduction to the Manual | Guidelines to Wordclass Tagging | Error rates | Automatic tagging of the BNC ]

For an explanation of contracted forms, please see the Introduction to the Manual and Guidelines to Wordclass Tagging.

The list consists of two parts

Orthographic Form

Broken down into

Component tags

'd've

'd + 've

VM0 + VHI

'tis

't + is

PNP + VBZ

'twas

't + was

PNP + VBD

'twere

't + were

PNP + VBD

'twould

't + would

PNP + VM0

I'd've

I + 'd + 've

PNP + VM0 + VHI

ain't

ai + n't

UNC + XX0

aint

ai + nt

UNC + XX0

aintcha

ai + nt + cha

UNC + XX0 + PNP

an'all

an' + all / an'all

CJC + DT0 / multiword AV0

arent

are + nt

VBB + XX0

c'mon

c'm + on

VVB + AVP

cannae

can + nae

VM0 + XX0

can't

ca + n't

VM0 + XX0

cannot

can + not

VM0 + XX0

couldnt

could + nt

VM0 + XX0

d'ya

d' + ya

VDB + PNP

d'you

d' + you

VDB + PNP

didnt

did + nt

VDD + XX0

doesnt

does + nt

VDZ + XX0

dont

do + nt

VDB + XX0

dunnit

dun + n + it

VDZ + XX0 + PNP

dunno

du + n + no

VDB + XX0 + VVI

geroff

ger + off

VVB/VVI + AVP/PRP

gimme

gim + me

VVB/VVI + PNP

gonna

gon + na

VVG + TO0

gorra

gor + ra

VVN + AT0

gotta

got + ta

VVN + TO0/AT0

hadnt

had + nt

VHD + XX0

hasnt

has + nt

VHZ + XX0

havent

have + nt

VHB + XX0

he'd've

he + 'd + 've

PNP + VM0 + VHI

hes

he + s

PNP + VBZ

i'd've

i + 'd + 've

PNP + VM0 + VHI

innit

in + n + it

VBZ + XX0 + PNP

isnt

is + nt

VBZ + XX0

it'd've

it + 'd + 've

PNP + VM0 + VHI

lorra

lor + ra

NN1 + PRF

m'lud

m' + lud

DPS + NN1

ought'a

ough + t + 'a

VM0 + TO0 + VHI

oughta

ought + a

VM0 + TO0

shan't

sha + n't

VM0 + XX0

she'd've

she + 'd + 've

PNP + VM0 + VHI

shes

she + s

PNP + VBZ

shouldn't've

should + n't + 've

VM0 + XX0 + VHI

shouldnt

should + nt

VM0 + XX0

t'other

t' + other

AT0 + NN1

thats

that + s

DT0 + VBZ

theres

there + s

EX0 + VBZ

they'd've

they + 'd + 've

PNP + VM0 + VHI

theyve

they + ve

PNP + VHB

tis

t + is

PNP + VBZ

twas

t + was

PNP + VBD

twere

t + were

PNP + VBD

twould

t + would

PNP + VM0

t'other

t' + other

AT0 + NN1

wanna

wan + na

VVB/VVI + TO0/AT0

wannit

wann + it

VVB/VVI + PNP

wasnae

was + nae

VBD + XX0

wasnt

was + nt

VBD + XX0

we'd've

we + 'd + 've

PNP + VM0 + VHI

werent

were + nt

VBD + XX0

weve

we + ve

PNP + VHB

won't

wo + n't

VM0 + XX0

wotta

wott + a

DTQ + AT0

wouldnt

would + nt

VM0 + XX0

you'd've

you + 'd + 've

PNP + VM0 + VHI

 

List of trailing enclitics

These items can attach -- without intervening spaces - to a variety of wordforms, typically nouns and pronouns, as in

" <w NP0>Mary<w VM0>'d never forgive you if it was," Blanche answered .    [CDY]

I mean, if <w PNP>she<w VHD>'d told Doris, Doris would have phoned.    [KST]

The contraction n't attaches to auxiliary/modal verbs (can't play, haven't looked etc.)

Enclitic formAvailable Tags
'dVM0 / VHD
'mVBB
'sVBZ / VHZ / VDZ / POS
'llVM0
n'tXX0
'reVBB
'veVHB

[ Related documents: Multiwords | Introduction to the Manual | Guidelines to Wordclass Tagging | Error rates | Automatic tagging of the BNC ]

Date: 17 March 2000