diff --git a/data/normalization/en/ngrams b/data/normalization/en/ngrams index ef97921fd1dea545681c7df7e00439f25c36a98e..20174559c1c7a8d83891663cf8e456fe1dd1945e 100644 --- a/data/normalization/en/ngrams +++ b/data/normalization/en/ngrams @@ -227,5 +227,24 @@ gonna going to 0 ain't is n't 0 Gonna Going to 0 Wanna Want to 0 -I gotta I have to 0 +I gotta I have got to 0 gotta got to 0 +dunno do n't know 0 +oughta ought to 0 +kinda kind of 0 +kindsa kinds of 0 +lemme let me 0 +gimme give me 0 +outta out of 0 +lotsa lots of 0 +whatcha what are you 0 +shoulda should have 0 +coulda could have 0 +woulda would have 0 +'cos because 0 +'cause because 0 +hasta has to 0 +hafta have to 0 +needa need to 0 +lotta a lot of 0 +getcha get you 0