From 649ee6eea77531d0a4918f4cbfa777dfc5a825d0 Mon Sep 17 00:00:00 2001
From: Benoit Sagot <benoit.sagot@inria.fr>
Date: Thu, 28 Apr 2016 14:09:35 +0000
Subject: [PATCH] git-svn-id: https://scm.gforge.inria.fr/authscm/cfourrie/svn/lingwb/MElt/trunk@5677 dc05b511-7f1d-0410-9f1c-d6f32a2df9e4

---
 data/normalization/en/ngrams | 21 ++++++++++++++++++++-
 1 file changed, 20 insertions(+), 1 deletion(-)

diff --git a/data/normalization/en/ngrams b/data/normalization/en/ngrams
index ef97921..2017455 100644
--- a/data/normalization/en/ngrams
+++ b/data/normalization/en/ngrams
@@ -227,5 +227,24 @@ gonna going to 0
 ain't is n't 0
 Gonna Going to 0
 Wanna Want to 0
-I gotta I have to 0
+I gotta I have got to 0
 gotta got to 0
+dunno do n't know 0
+oughta ought to 0
+kinda kind of 0
+kindsa kinds of 0
+lemme let me 0
+gimme give me 0
+outta out of 0
+lotsa lots of 0
+whatcha what are you 0
+shoulda should have 0
+coulda could have 0
+woulda would have 0
+'cos because 0
+'cause because 0
+hasta has to 0
+hafta have to 0
+needa need to 0
+lotta a lot of 0
+getcha get you 0
-- 
GitLab