kazu.utils.spacy_pipeline

Functions

basic_spacy_pipeline()

A basic spaCy pipeline with a sentence splitter and a customised tokenizer.

Classes

KazuCustomEnglish

KazuCustomEnglishDefaults

SpacyPipelines

Wraps spaCy pipelines in a singleton, so that multiple pipelines can be accessed from different locations without loading them again and incurring additional memory overhead.
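The singleton idea described for SpacyPipelines can be illustrated without spaCy. The following is a minimal sketch under stated assumptions, not the Kazu implementation: PipelineRegistry, add, get and load_fake_pipeline are hypothetical names, and a plain dict stands in for an expensive spaCy pipeline object.

```python
from typing import Callable, Dict


class PipelineRegistry:
    """Hypothetical registry: one shared instance, each pipeline loaded once."""

    _instance = None

    def __new__(cls):
        # Create the single shared instance on first use, then always reuse it.
        if cls._instance is None:
            cls._instance = super().__new__(cls)
            cls._instance._pipelines = {}  # type: Dict[str, object]
        return cls._instance

    def add(self, name: str, loader: Callable[[], object]) -> None:
        # Only call the (potentially expensive) loader if the name is new.
        if name not in self._pipelines:
            self._pipelines[name] = loader()

    def get(self, name: str) -> object:
        return self._pipelines[name]


load_count = 0


def load_fake_pipeline() -> object:
    """Stand-in for loading a real pipeline; counts how often it runs."""
    global load_count
    load_count += 1
    return {"name": "fake pipeline"}


# Two different call sites register and fetch the same pipeline.
PipelineRegistry().add("basic", load_fake_pipeline)
PipelineRegistry().add("basic", load_fake_pipeline)
shared = PipelineRegistry().get("basic")
```

Because every call to PipelineRegistry() returns the same object, the second add call finds the name already registered and does not run the loader again.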

class kazu.utils.spacy_pipeline.KazuCustomEnglish

Bases: English

Defaults

alias of KazuCustomEnglishDefaults

default_config = {'components': {}, 'corpora': {'dev': {'@readers': 'spacy.Corpus.v1', 'augmenter': None, 'gold_preproc': False, 'limit': 0, 'max_length': 0, 'path': '${paths.dev}'}, 'train': {'@readers': 'spacy.Corpus.v1', 'augmenter': None, 'gold_preproc': False, 'limit': 0, 'max_length': 0, 'path': '${paths.train}'}}, 'initialize': {'after_init': None, 'before_init': None, 'components': {}, 'init_tok2vec': '${paths.init_tok2vec}', 'lookups': None, 'tokenizer': {}, 'vectors': '${paths.vectors}', 'vocab_data': None}, 'nlp': {'after_creation': None, 'after_pipeline_creation': None, 'batch_size': 1000, 'before_creation': None, 'disabled': [], 'lang': 'kazu_custom_en', 'pipeline': [], 'tokenizer': {'@tokenizers': 'spacy.Tokenizer.v1'}, 'vectors': {'@vectors': 'spacy.Vectors.v1'}}, 'paths': {'dev': None, 'init_tok2vec': None, 'train': None, 'vectors': None}, 'system': {'gpu_allocator': None, 'seed': 0}, 'training': {'accumulate_gradient': 1, 'annotating_components': [], 'batcher': {'@batchers': 'spacy.batch_by_words.v1', 'discard_oversize': False, 'size': {'@schedules': 'compounding.v1', 'compound': 1.001, 'start': 100, 'stop': 1000}, 'tolerance': 0.2}, 'before_to_disk': None, 'before_update': None, 'dev_corpus': 'corpora.dev', 'dropout': 0.1, 'eval_frequency': 200, 'frozen_components': [], 'gpu_allocator': '${system.gpu_allocator}', 'logger': {'@loggers': 'spacy.ConsoleLogger.v1'}, 'max_epochs': 0, 'max_steps': 20000, 'optimizer': {'@optimizers': 'Adam.v1', 'L2': 0.01, 'L2_is_weight_decay': True, 'beta1': 0.9, 'beta2': 0.999, 'eps': 1e-08, 'grad_clip': 1.0, 'learn_rate': 0.001, 'use_averages': False}, 'patience': 1600, 'score_weights': {}, 'seed': '${system.seed}', 'train_corpus': 'corpora.train'}}

lang: str | None = 'kazu_custom_en'

class kazu.utils.spacy_pipeline.KazuCustomEnglishDefaults

Bases: EnglishDefaults

infixes: Sequence[str | Pattern] | None = ['\\(', '/', '\\.\\.+', '…', '[\\u00A6\\u00A9\\u00AE\\u00B0\\u0482\\u058D\\u058E\\u060E\\u060F\\u06DE\\u06E9\\u06FD\\u06FE\\u07F6\\u09FA\\u0B70\\u0BF3-\\u0BF8\\u0BFA\\u0C7F\\u0D4F\\u0D79\\u0F01-\\u0F03\\u0F13\\u0F15-\\u0F17\\u0F1A-\\u0F1F\\u0F34\\u0F36\\u0F38\\u0FBE-\\u0FC5\\u0FC7-\\u0FCC\\u0FCE\\u0FCF\\u0FD5-\\u0FD8\\u109E\\u109F\\u1390-\\u1399\\u1940\\u19DE-\\u19FF\\u1B61-\\u1B6A\\u1B74-\\u1B7C\\u2100\\u2101\\u2103-\\u2106\\u2108\\u2109\\u2114\\u2116\\u2117\\u211E-\\u2123\\u2125\\u2127\\u2129\\u212E\\u213A\\u213B\\u214A\\u214C\\u214D\\u214F\\u218A\\u218B\\u2195-\\u2199\\u219C-\\u219F\\u21A1\\u21A2\\u21A4\\u21A5\\u21A7-\\u21AD\\u21AF-\\u21CD\\u21D0\\u21D1\\u21D3\\u21D5-\\u21F3\\u2300-\\u2307\\u230C-\\u231F\\u2322-\\u2328\\u232B-\\u237B\\u237D-\\u239A\\u23B4-\\u23DB\\u23E2-\\u2426\\u2440-\\u244A\\u249C-\\u24E9\\u2500-\\u25B6\\u25B8-\\u25C0\\u25C2-\\u25F7\\u2600-\\u266E\\u2670-\\u2767\\u2794-\\u27BF\\u2800-\\u28FF\\u2B00-\\u2B2F\\u2B45\\u2B46\\u2B4D-\\u2B73\\u2B76-\\u2B95\\u2B98-\\u2BC8\\u2BCA-\\u2BFE\\u2CE5-\\u2CEA\\u2E80-\\u2E99\\u2E9B-\\u2EF3\\u2F00-\\u2FD5\\u2FF0-\\u2FFB\\u3004\\u3012\\u3013\\u3020\\u3036\\u3037\\u303E\\u303F\\u3190\\u3191\\u3196-\\u319F\\u31C0-\\u31E3\\u3200-\\u321E\\u322A-\\u3247\\u3250\\u3260-\\u327F\\u328A-\\u32B0\\u32C0-\\u32FE\\u3300-\\u33FF\\u4DC0-\\u4DFF\\uA490-\\uA4C6\\uA828-\\uA82B\\uA836\\uA837\\uA839\\uAA77-\\uAA79\\uFDFD\\uFFE4\\uFFE8\\uFFED\\uFFEE\\uFFFC\\uFFFD\\U00010137-\\U0001013F\\U00010179-\\U00010189\\U0001018C-\\U0001018E\\U00010190-\\U0001019B\\U000101A0\\U000101D0-\\U000101FC\\U00010877\\U00010878\\U00010AC8\\U0001173F\\U00016B3C-\\U00016B3F\\U00016B45\\U0001BC9C\\U0001D000-\\U0001D0F5\\U0001D100-\\U0001D126\\U0001D129-\\U0001D164\\U0001D16A-\\U0001D16C\\U0001D183\\U0001D184\\U0001D18C-\\U0001D1A9\\U0001D1AE-\\U0001D1E8\\U0001D200-\\U0001D241\\U0001D245\\U0001D300-\\U0001D356\\U0001D800-\\U0001D9FF\\U0001DA37-\\U0001DA3A\\U0001DA6D-\\U0001DA74\\U0001DA76-\\U0001DA83\\U0001DA85\\U000
1DA86\\U0001ECAC\\U0001F000-\\U0001F02B\\U0001F030-\\U0001F093\\U0001F0A0-\\U0001F0AE\\U0001F0B1-\\U0001F0BF\\U0001F0C1-\\U0001F0CF\\U0001F0D1-\\U0001F0F5\\U0001F110-\\U0001F16B\\U0001F170-\\U0001F1AC\\U0001F1E6-\\U0001F202\\U0001F210-\\U0001F23B\\U0001F240-\\U0001F248\\U0001F250\\U0001F251\\U0001F260-\\U0001F265\\U0001F300-\\U0001F3FA\\U0001F400-\\U0001F6D4\\U0001F6E0-\\U0001F6EC\\U0001F6F0-\\U0001F6F9\\U0001F700-\\U0001F773\\U0001F780-\\U0001F7D8\\U0001F800-\\U0001F80B\\U0001F810-\\U0001F847\\U0001F850-\\U0001F859\\U0001F860-\\U0001F887\\U0001F890-\\U0001F8AD\\U0001F900-\\U0001F90B\\U0001F910-\\U0001F93E\\U0001F940-\\U0001F970\\U0001F973-\\U0001F976\\U0001F97A\\U0001F97C-\\U0001F9A2\\U0001F9B0-\\U0001F9B9\\U0001F9C0-\\U0001F9C2\\U0001F9D0-\\U0001F9FF\\U0001FA60-\\U0001FA6D]', '(?<=[0-9])[+\\-\\*^](?=[0-9-])', '(?<=[a-z\\uFF41-\\uFF5A\\u00DF-\\u00F6\\u00F8-\\u00FF\\u0101\\u0103\\u0105\\u0107\\u0109\\u010B\\u010D\\u010F\\u0111\\u0113\\u0115\\u0117\\u0119\\u011B\\u011D\\u011F\\u0121\\u0123\\u0125\\u0127\\u0129\\u012B\\u012D\\u012F\\u0131\\u0133\\u0135\\u0137\\u0138\\u013A\\u013C\\u013E\\u0140\\u0142\\u0144\\u0146\\u0148\\u0149\\u014B\\u014D\\u014F\\u0151\\u0153\\u0155\\u0157\\u0159\\u015B\\u015D\\u015F\\u0161\\u0163\\u0165\\u0167\\u0169\\u016B\\u016D\\u016F\\u0171\\u0173\\u0175\\u0177\\u017A\\u017C\\u017E\\u017F\\u0180\\u0183\\u0185\\u0188\\u018C\\u018D\\u0192\\u0195\\u0199-\\u019B\\u019E\\u01A1\\u01A3\\u01A5\\u01A8\\u01AA\\u01AB\\u01AD\\u01B0\\u01B4\\u01B6\\u01B9\\u01BA\\u01BD-\\u01BF\\u01C6\\u01C9\\u01CC\\u01CE\\u01D0\\u01D2\\u01D4\\u01D6\\u01D8\\u01DA\\u01DC\\u01DD\\u01DF\\u01E1\\u01E3\\u01E5\\u01E7\\u01E9\\u01EB\\u01ED\\u01EF\\u01F0\\u01F3\\u01F5\\u01F9\\u01FB\\u01FD\\u01FF\\u0201\\u0203\\u0205\\u0207\\u0209\\u020B\\u020D\\u020F\\u0211\\u0213\\u0215\\u0217\\u0219\\u021B\\u021D\\u021F\\u0221\\u0223\\u0225\\u0227\\u0229\\u022B\\u022D\\u022F\\u0231\\u0233-\\u0239\\u023C\\u023F\\u0240\\u0242\\u0247\\u0249\\u024B\\u024D\\u024F\\u2C61\\u2C65\\u2C66\\u2C68\\u2C6A\\u2C6C
\\u2C71\\u2C73\\u2C74\\u2C76-\\u2C7B\\uA723\\uA725\\uA727\\uA729\\uA72B\\uA72D\\uA72F-\\uA731\\uA733\\uA735\\uA737\\uA739\\uA73B\\uA73D\\uA73F\\uA741\\uA743\\uA745\\uA747\\uA749\\uA74B\\uA74D\\uA74F\\uA751\\uA753\\uA755\\uA757\\uA759\\uA75B\\uA75D\\uA75F\\uA761\\uA763\\uA765\\uA767\\uA769\\uA76B\\uA76D\\uA76F\\uA771-\\uA778\\uA77A\\uA77C\\uA77F\\uA781\\uA783\\uA785\\uA787\\uA78C\\uA78E\\uA791\\uA793-\\uA795\\uA797\\uA799\\uA79B\\uA79D\\uA79F\\uA7A1\\uA7A3\\uA7A5\\uA7A7\\uA7A9\\uA7AF\\uA7B5\\uA7B7\\uA7B9\\uA7FA\\uAB30-\\uAB5A\\uAB60-\\uAB64\\u0250-\\u02AF\\u1D00-\\u1D25\\u1D6B-\\u1D77\\u1D79-\\u1D9A\\u1E01\\u1E03\\u1E05\\u1E07\\u1E09\\u1E0B\\u1E0D\\u1E0F\\u1E11\\u1E13\\u1E15\\u1E17\\u1E19\\u1E1B\\u1E1D\\u1E1F\\u1E21\\u1E23\\u1E25\\u1E27\\u1E29\\u1E2B\\u1E2D\\u1E2F\\u1E31\\u1E33\\u1E35\\u1E37\\u1E39\\u1E3B\\u1E3D\\u1E3F\\u1E41\\u1E43\\u1E45\\u1E47\\u1E49\\u1E4B\\u1E4D\\u1E4F\\u1E51\\u1E53\\u1E55\\u1E57\\u1E59\\u1E5B\\u1E5D\\u1E5F\\u1E61\\u1E63\\u1E65\\u1E67\\u1E69\\u1E6B\\u1E6D\\u1E6F\\u1E71\\u1E73\\u1E75\\u1E77\\u1E79\\u1E7B\\u1E7D\\u1E7F\\u1E81\\u1E83\\u1E85\\u1E87\\u1E89\\u1E8B\\u1E8D\\u1E8F\\u1E91\\u1E93\\u1E95-\\u1E9D\\u1E9F\\u1EA1\\u1EA3\\u1EA5\\u1EA7\\u1EA9\\u1EAB\\u1EAD\\u1EAF\\u1EB1\\u1EB3\\u1EB5\\u1EB7\\u1EB9\\u1EBB\\u1EBD\\u1EBF\\u1EC1\\u1EC3\\u1EC5\\u1EC7\\u1EC9\\u1ECB\\u1ECD\\u1ECF\\u1ED1\\u1ED3\\u1ED5\\u1ED7\\u1ED9\\u1EDB\\u1EDD\\u1EDF\\u1EE1\\u1EE3\\u1EE5\\u1EE7\\u1EE9\\u1EEB\\u1EED\\u1EEF\\u1EF1\\u1EF3\\u1EF5\\u1EF7\\u1EF9\\u1EFB\\u1EFD\\u1EFFёа-яәөүҗңһα-ωάέίόώήύа-щюяіїєґѓѕјљњќѐѝ\\u1200-\\u137F\\u0980-\\u09FF\\u0591-\\u05F4\\uFB1D-\\uFB4F\\u0620-\\u064A\\u066E-\\u06D5\\u06E5-\\u06FF\\u0750-\\u077F\\u08A0-\\u08BD\\uFB50-\\uFBB1\\uFBD3-\\uFD3D\\uFD50-\\uFDC7\\uFDF0-\\uFDFB\\uFE70-\\uFEFC\\U0001EE00-\\U0001EEBB\\u0D80-\\u0DFF\\u0900-\\u097F\\u0C80-\\u0CFF\\u0B80-\\u0BFF\\u0C00-\\u0C7F\\uAC00-\\uD7AF\\u1100-\\u11FF\\u3040-\\u309F\\u30A0-\\u30FFー\\u4E00-\\u62FF\\u6300-\\u77FF\\u7800-\\u8CFF\\u8D00-\\u9FFF\\u3400-\\u4DBF\\U00020000-\\U000215FF\\U00021600-\\U
000230FF\\U00023100-\\U000245FF\\U00024600-\\U000260FF\\U00026100-\\U000275FF\\U00027600-\\U000290FF\\U00029100-\\U0002A6DF\\U0002A700-\\U0002B73F\\U0002B740-\\U0002B81F\\U0002B820-\\U0002CEAF\\U0002CEB0-\\U0002EBEF\\u2E80-\\u2EFF\\u2F00-\\u2FDF\\u2FF0-\\u2FFF\\u3000-\\u303F\\u31C0-\\u31EF\\u3200-\\u32FF\\u3300-\\u33FF\\uF900-\\uFAFF\\uFE30-\\uFE4F\\U0001F200-\\U0001F2FF\\U0002F800-\\U0002FA1F\\\'"”“`‘´’‚,„»«「」『』()〔〕【】《》〈〉〈〉⟦⟧])\\.(?=[A-Z\\uFF21-\\uFF3A\\u00C0-\\u00D6\\u00D8-\\u00DE\\u0100\\u0102\\u0104\\u0106\\u0108\\u010A\\u010C\\u010E\\u0110\\u0112\\u0114\\u0116\\u0118\\u011A\\u011C\\u011E\\u0120\\u0122\\u0124\\u0126\\u0128\\u012A\\u012C\\u012E\\u0130\\u0132\\u0134\\u0136\\u0139\\u013B\\u013D\\u013F\\u0141\\u0143\\u0145\\u0147\\u014A\\u014C\\u014E\\u0150\\u0152\\u0154\\u0156\\u0158\\u015A\\u015C\\u015E\\u0160\\u0162\\u0164\\u0166\\u0168\\u016A\\u016C\\u016E\\u0170\\u0172\\u0174\\u0176\\u0178\\u0179\\u017B\\u017D\\u0181\\u0182\\u0184\\u0186\\u0187\\u0189-\\u018B\\u018E-\\u0191\\u0193\\u0194\\u0196-\\u0198\\u019C\\u019D\\u019F\\u01A0\\u01A2\\u01A4\\u01A6\\u01A7\\u01A9\\u01AC\\u01AE\\u01AF\\u01B1-\\u01B3\\u01B5\\u01B7\\u01B8\\u01BC\\u01C4\\u01C7\\u01CA\\u01CD\\u01CF\\u01D1\\u01D3\\u01D5\\u01D7\\u01D9\\u01DB\\u01DE\\u01E0\\u01E2\\u01E4\\u01E6\\u01E8\\u01EA\\u01EC\\u01EE\\u01F1\\u01F4\\u01F6-\\u01F8\\u01FA\\u01FC\\u01FE\\u0200\\u0202\\u0204\\u0206\\u0208\\u020A\\u020C\\u020E\\u0210\\u0212\\u0214\\u0216\\u0218\\u021A\\u021C\\u021E\\u0220\\u0222\\u0224\\u0226\\u0228\\u022A\\u022C\\u022E\\u0230\\u0232\\u023A\\u023B\\u023D\\u023E\\u0241\\u0243-\\u0246\\u0248\\u024A\\u024C\\u024E\\u2C60\\u2C62-\\u2C64\\u2C67\\u2C69\\u2C6B\\u2C6D-\\u2C70\\u2C72\\u2C75\\u2C7E\\u2C7F\\uA722\\uA724\\uA726\\uA728\\uA72A\\uA72C\\uA72E\\uA732\\uA734\\uA736\\uA738\\uA73A\\uA73C\\uA73E\\uA740\\uA742\\uA744\\uA746\\uA748\\uA74A\\uA74C\\uA74E\\uA750\\uA752\\uA754\\uA756\\uA758\\uA75A\\uA75C\\uA75E\\uA760\\uA762\\uA764\\uA766\\uA768\\uA76A\\uA76C\\uA76E\\uA779\\uA77B\\uA77D\\uA77E\\uA780\\uA782\\uA784
\\uA786\\uA78B\\uA78D\\uA790\\uA792\\uA796\\uA798\\uA79A\\uA79C\\uA79E\\uA7A0\\uA7A2\\uA7A4\\uA7A6\\uA7A8\\uA7AA-\\uA7AE\\uA7B0-\\uA7B4\\uA7B6\\uA7B8\\u1E00\\u1E02\\u1E04\\u1E06\\u1E08\\u1E0A\\u1E0C\\u1E0E\\u1E10\\u1E12\\u1E14\\u1E16\\u1E18\\u1E1A\\u1E1C\\u1E1E\\u1E20\\u1E22\\u1E24\\u1E26\\u1E28\\u1E2A\\u1E2C\\u1E2E\\u1E30\\u1E32\\u1E34\\u1E36\\u1E38\\u1E3A\\u1E3C\\u1E3E\\u1E40\\u1E42\\u1E44\\u1E46\\u1E48\\u1E4A\\u1E4C\\u1E4E\\u1E50\\u1E52\\u1E54\\u1E56\\u1E58\\u1E5A\\u1E5C\\u1E5E\\u1E60\\u1E62\\u1E64\\u1E66\\u1E68\\u1E6A\\u1E6C\\u1E6E\\u1E70\\u1E72\\u1E74\\u1E76\\u1E78\\u1E7A\\u1E7C\\u1E7E\\u1E80\\u1E82\\u1E84\\u1E86\\u1E88\\u1E8A\\u1E8C\\u1E8E\\u1E90\\u1E92\\u1E94\\u1E9E\\u1EA0\\u1EA2\\u1EA4\\u1EA6\\u1EA8\\u1EAA\\u1EAC\\u1EAE\\u1EB0\\u1EB2\\u1EB4\\u1EB6\\u1EB8\\u1EBA\\u1EBC\\u1EBE\\u1EC0\\u1EC2\\u1EC4\\u1EC6\\u1EC8\\u1ECA\\u1ECC\\u1ECE\\u1ED0\\u1ED2\\u1ED4\\u1ED6\\u1ED8\\u1EDA\\u1EDC\\u1EDE\\u1EE0\\u1EE2\\u1EE4\\u1EE6\\u1EE8\\u1EEA\\u1EEC\\u1EEE\\u1EF0\\u1EF2\\u1EF4\\u1EF6\\u1EF8\\u1EFA\\u1EFC\\u1EFEЁА-ЯӘӨҮҖҢҺΑ-ΩΆΈΊΌΏΉΎА-ЩЮЯІЇЄҐЃЅЈЉЊЌЀЍ\\u1200-\\u137F\\u0980-\\u09FF\\u0591-\\u05F4\\uFB1D-\\uFB4F\\u0620-\\u064A\\u066E-\\u06D5\\u06E5-\\u06FF\\u0750-\\u077F\\u08A0-\\u08BD\\uFB50-\\uFBB1\\uFBD3-\\uFD3D\\uFD50-\\uFDC7\\uFDF0-\\uFDFB\\uFE70-\\uFEFC\\U0001EE00-\\U0001EEBB\\u0D80-\\u0DFF\\u0900-\\u097F\\u0C80-\\u0CFF\\u0B80-\\u0BFF\\u0C00-\\u0C7F\\uAC00-\\uD7AF\\u1100-\\u11FF\\u3040-\\u309F\\u30A0-\\u30FFー\\u4E00-\\u62FF\\u6300-\\u77FF\\u7800-\\u8CFF\\u8D00-\\u9FFF\\u3400-\\u4DBF\\U00020000-\\U000215FF\\U00021600-\\U000230FF\\U00023100-\\U000245FF\\U00024600-\\U000260FF\\U00026100-\\U000275FF\\U00027600-\\U000290FF\\U00029100-\\U0002A6DF\\U0002A700-\\U0002B73F\\U0002B740-\\U0002B81F\\U0002B820-\\U0002CEAF\\U0002CEB0-\\U0002EBEF\\u2E80-\\u2EFF\\u2F00-\\u2FDF\\u2FF0-\\u2FFF\\u3000-\\u303F\\u31C0-\\u31EF\\u3200-\\u32FF\\u3300-\\u33FF\\uF900-\\uFAFF\\uFE30-\\uFE4F\\U0001F200-\\U0001F2FF\\U0002F800-\\U0002FA1F\\\'"”“`‘´’‚,„»«「」『』()〔〕【】《》〈〉〈〉⟦⟧])', 
'(?<=[A-Za-z\\uFF21-\\uFF3A\\uFF41-\\uFF5A\\u00C0-\\u00D6\\u00D8-\\u00F6\\u00F8-\\u00FF\\u0100-\\u017F\\u0180-\\u01BF\\u01C4-\\u024F\\u2C60-\\u2C7B\\u2C7E\\u2C7F\\uA722-\\uA76F\\uA771-\\uA787\\uA78B-\\uA78E\\uA790-\\uA7B9\\uA7FA\\uAB30-\\uAB5A\\uAB60-\\uAB64\\u0250-\\u02AF\\u1D00-\\u1D25\\u1D6B-\\u1D77\\u1D79-\\u1D9A\\u1E00-\\u1EFFёа-яЁА-ЯәөүҗңһӘӨҮҖҢҺα-ωάέίόώήύΑ-ΩΆΈΊΌΏΉΎа-щюяіїєґА-ЩЮЯІЇЄҐѓѕјљњќѐѝЃЅЈЉЊЌЀЍ\\u1200-\\u137F\\u0980-\\u09FF\\u0591-\\u05F4\\uFB1D-\\uFB4F\\u0620-\\u064A\\u066E-\\u06D5\\u06E5-\\u06FF\\u0750-\\u077F\\u08A0-\\u08BD\\uFB50-\\uFBB1\\uFBD3-\\uFD3D\\uFD50-\\uFDC7\\uFDF0-\\uFDFB\\uFE70-\\uFEFC\\U0001EE00-\\U0001EEBB\\u0D80-\\u0DFF\\u0900-\\u097F\\u0C80-\\u0CFF\\u0B80-\\u0BFF\\u0C00-\\u0C7F\\uAC00-\\uD7AF\\u1100-\\u11FF\\u3040-\\u309F\\u30A0-\\u30FFー\\u4E00-\\u62FF\\u6300-\\u77FF\\u7800-\\u8CFF\\u8D00-\\u9FFF\\u3400-\\u4DBF\\U00020000-\\U000215FF\\U00021600-\\U000230FF\\U00023100-\\U000245FF\\U00024600-\\U000260FF\\U00026100-\\U000275FF\\U00027600-\\U000290FF\\U00029100-\\U0002A6DF\\U0002A700-\\U0002B73F\\U0002B740-\\U0002B81F\\U0002B820-\\U0002CEAF\\U0002CEB0-\\U0002EBEF\\u2E80-\\u2EFF\\u2F00-\\u2FDF\\u2FF0-\\u2FFF\\u3000-\\u303F\\u31C0-\\u31EF\\u3200-\\u32FF\\u3300-\\u33FF\\uF900-\\uFAFF\\uFE30-\\uFE4F\\U0001F200-\\U0001F2FF\\U0002F800-\\U0002FA1F]),(?=[A-Za-z\\uFF21-\\uFF3A\\uFF41-\\uFF5A\\u00C0-\\u00D6\\u00D8-\\u00F6\\u00F8-\\u00FF\\u0100-\\u017F\\u0180-\\u01BF\\u01C4-\\u024F\\u2C60-\\u2C7B\\u2C7E\\u2C7F\\uA722-\\uA76F\\uA771-\\uA787\\uA78B-\\uA78E\\uA790-\\uA7B9\\uA7FA\\uAB30-\\uAB5A\\uAB60-\\uAB64\\u0250-\\u02AF\\u1D00-\\u1D25\\u1D6B-\\u1D77\\u1D79-\\u1D9A\\u1E00-\\u1EFFёа-яЁА-ЯәөүҗңһӘӨҮҖҢҺα-ωάέίόώήύΑ-ΩΆΈΊΌΏΉΎа-щюяіїєґА-ЩЮЯІЇЄҐѓѕјљњќѐѝЃЅЈЉЊЌЀЍ\\u1200-\\u137F\\u0980-\\u09FF\\u0591-\\u05F4\\uFB1D-\\uFB4F\\u0620-\\u064A\\u066E-\\u06D5\\u06E5-\\u06FF\\u0750-\\u077F\\u08A0-\\u08BD\\uFB50-\\uFBB1\\uFBD3-\\uFD3D\\uFD50-\\uFDC7\\uFDF0-\\uFDFB\\uFE70-\\uFEFC\\U0001EE00-\\U0001EEBB\\u0D80-\\u0DFF\\u0900-\\u097F\\u0C80-\\u0CFF\\u0B80-\\u0BFF\\u0C00-\\u0C7
F\\uAC00-\\uD7AF\\u1100-\\u11FF\\u3040-\\u309F\\u30A0-\\u30FFー\\u4E00-\\u62FF\\u6300-\\u77FF\\u7800-\\u8CFF\\u8D00-\\u9FFF\\u3400-\\u4DBF\\U00020000-\\U000215FF\\U00021600-\\U000230FF\\U00023100-\\U000245FF\\U00024600-\\U000260FF\\U00026100-\\U000275FF\\U00027600-\\U000290FF\\U00029100-\\U0002A6DF\\U0002A700-\\U0002B73F\\U0002B740-\\U0002B81F\\U0002B820-\\U0002CEAF\\U0002CEB0-\\U0002EBEF\\u2E80-\\u2EFF\\u2F00-\\u2FDF\\u2FF0-\\u2FFF\\u3000-\\u303F\\u31C0-\\u31EF\\u3200-\\u32FF\\u3300-\\u33FF\\uF900-\\uFAFF\\uFE30-\\uFE4F\\U0001F200-\\U0001F2FF\\U0002F800-\\U0002FA1F])', '(?<=[A-Za-z\\uFF21-\\uFF3A\\uFF41-\\uFF5A\\u00C0-\\u00D6\\u00D8-\\u00F6\\u00F8-\\u00FF\\u0100-\\u017F\\u0180-\\u01BF\\u01C4-\\u024F\\u2C60-\\u2C7B\\u2C7E\\u2C7F\\uA722-\\uA76F\\uA771-\\uA787\\uA78B-\\uA78E\\uA790-\\uA7B9\\uA7FA\\uAB30-\\uAB5A\\uAB60-\\uAB64\\u0250-\\u02AF\\u1D00-\\u1D25\\u1D6B-\\u1D77\\u1D79-\\u1D9A\\u1E00-\\u1EFFёа-яЁА-ЯәөүҗңһӘӨҮҖҢҺα-ωάέίόώήύΑ-ΩΆΈΊΌΏΉΎа-щюяіїєґА-ЩЮЯІЇЄҐѓѕјљњќѐѝЃЅЈЉЊЌЀЍ\\u1200-\\u137F\\u0980-\\u09FF\\u0591-\\u05F4\\uFB1D-\\uFB4F\\u0620-\\u064A\\u066E-\\u06D5\\u06E5-\\u06FF\\u0750-\\u077F\\u08A0-\\u08BD\\uFB50-\\uFBB1\\uFBD3-\\uFD3D\\uFD50-\\uFDC7\\uFDF0-\\uFDFB\\uFE70-\\uFEFC\\U0001EE00-\\U0001EEBB\\u0D80-\\u0DFF\\u0900-\\u097F\\u0C80-\\u0CFF\\u0B80-\\u0BFF\\u0C00-\\u0C7F\\uAC00-\\uD7AF\\u1100-\\u11FF\\u3040-\\u309F\\u30A0-\\u30FFー\\u4E00-\\u62FF\\u6300-\\u77FF\\u7800-\\u8CFF\\u8D00-\\u9FFF\\u3400-\\u4DBF\\U00020000-\\U000215FF\\U00021600-\\U000230FF\\U00023100-\\U000245FF\\U00024600-\\U000260FF\\U00026100-\\U000275FF\\U00027600-\\U000290FF\\U00029100-\\U0002A6DF\\U0002A700-\\U0002B73F\\U0002B740-\\U0002B81F\\U0002B820-\\U0002CEAF\\U0002CEB0-\\U0002EBEF\\u2E80-\\u2EFF\\u2F00-\\u2FDF\\u2FF0-\\u2FFF\\u3000-\\u303F\\u31C0-\\u31EF\\u3200-\\u32FF\\u3300-\\u33FF\\uF900-\\uFAFF\\uFE30-\\uFE4F\\U0001F200-\\U0001F2FF\\U0002F800-\\U0002FA1F0-9])[:<>=/](?=[A-Za-z\\uFF21-\\uFF3A\\uFF41-\\uFF5A\\u00C0-\\u00D6\\u00D8-\\u00F6\\u00F8-\\u00FF\\u0100-\\u017F\\u0180-\\u01BF\\u01C4-\\u0
24F\\u2C60-\\u2C7B\\u2C7E\\u2C7F\\uA722-\\uA76F\\uA771-\\uA787\\uA78B-\\uA78E\\uA790-\\uA7B9\\uA7FA\\uAB30-\\uAB5A\\uAB60-\\uAB64\\u0250-\\u02AF\\u1D00-\\u1D25\\u1D6B-\\u1D77\\u1D79-\\u1D9A\\u1E00-\\u1EFFёа-яЁА-ЯәөүҗңһӘӨҮҖҢҺα-ωάέίόώήύΑ-ΩΆΈΊΌΏΉΎа-щюяіїєґА-ЩЮЯІЇЄҐѓѕјљњќѐѝЃЅЈЉЊЌЀЍ\\u1200-\\u137F\\u0980-\\u09FF\\u0591-\\u05F4\\uFB1D-\\uFB4F\\u0620-\\u064A\\u066E-\\u06D5\\u06E5-\\u06FF\\u0750-\\u077F\\u08A0-\\u08BD\\uFB50-\\uFBB1\\uFBD3-\\uFD3D\\uFD50-\\uFDC7\\uFDF0-\\uFDFB\\uFE70-\\uFEFC\\U0001EE00-\\U0001EEBB\\u0D80-\\u0DFF\\u0900-\\u097F\\u0C80-\\u0CFF\\u0B80-\\u0BFF\\u0C00-\\u0C7F\\uAC00-\\uD7AF\\u1100-\\u11FF\\u3040-\\u309F\\u30A0-\\u30FFー\\u4E00-\\u62FF\\u6300-\\u77FF\\u7800-\\u8CFF\\u8D00-\\u9FFF\\u3400-\\u4DBF\\U00020000-\\U000215FF\\U00021600-\\U000230FF\\U00023100-\\U000245FF\\U00024600-\\U000260FF\\U00026100-\\U000275FF\\U00027600-\\U000290FF\\U00029100-\\U0002A6DF\\U0002A700-\\U0002B73F\\U0002B740-\\U0002B81F\\U0002B820-\\U0002CEAF\\U0002CEB0-\\U0002EBEF\\u2E80-\\u2EFF\\u2F00-\\u2FDF\\u2FF0-\\u2FFF\\u3000-\\u303F\\u31C0-\\u31EF\\u3200-\\u32FF\\u3300-\\u33FF\\uF900-\\uFAFF\\uFE30-\\uFE4F\\U0001F200-\\U0001F2FF\\U0002F800-\\U0002FA1F])']
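
The shorter entries in the infixes list can be tried out with the standard re module alone. The sketch below is an approximation, not spaCy's tokenizer: it assumes the patterns are OR-ed into a single regex and that a whitespace-free chunk is cut at every match, and it ignores prefixes, suffixes and tokenizer_exceptions. INFIX_PATTERNS and split_on_infixes are hypothetical names used only for illustration; the four patterns are copied from the list above, written as raw strings rather than in repr form.

```python
import re
from typing import List

# Four of the shorter infix patterns from the list above.
INFIX_PATTERNS = [
    r"\(",                            # an opening parenthesis
    r"/",                             # a forward slash
    r"\.\.+",                         # two or more dots
    r"(?<=[0-9])[+\-\*^](?=[0-9-])",  # an operator between digits
]

# Assumption: the individual patterns are combined with alternation.
infix_re = re.compile("|".join(INFIX_PATTERNS))


def split_on_infixes(chunk: str) -> List[str]:
    """Cut a whitespace-free chunk at each infix match, keeping the infix."""
    pieces = []
    start = 0
    for match in infix_re.finditer(chunk):
        if match.start() > start:
            pieces.append(chunk[start:match.start()])
        pieces.append(match.group())
        start = match.end()
    if start < len(chunk):
        pieces.append(chunk[start:])
    return pieces


print(split_on_infixes("IL-6/STAT3"))    # ['IL-6', '/', 'STAT3']
print(split_on_infixes("BRCA1(mutant"))  # ['BRCA1', '(', 'mutant']
print(split_on_infixes("2+3"))           # ['2', '+', '3']
```

The hyphen in "IL-6" is left alone because the operator pattern requires a digit immediately before the symbol, whereas the slash and the opening parenthesis split unconditionally.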
suffixes: Sequence[str | Pattern] | None = ['…', '……', ',', ':', ';', '\\!', '\\?', '¿', '؟', '¡', '\\(', '\\)', '\\[', '\\]', '\\{', '\\}', '<', '>', '_', '#', '\\*', '&', '。', '?', '!', ',', '、', ';', ':', '~', '·', '।', '،', '۔', '؛', '٪', '\\.\\.+', '…', "\\'", '"', '”', '“', '`', '‘', '´', '’', '‚', ',', '„', '»', '«', '「', '」', '『', '』', '(', ')', '〔', '〕', '【', '】', '《', '》', '〈', '〉', '〈', '〉', '', '⟦', '⟧', '[\\u00A6\\u00A9\\u00AE\\u00B0\\u0482\\u058D\\u058E\\u060E\\u060F\\u06DE\\u06E9\\u06FD\\u06FE\\u07F6\\u09FA\\u0B70\\u0BF3-\\u0BF8\\u0BFA\\u0C7F\\u0D4F\\u0D79\\u0F01-\\u0F03\\u0F13\\u0F15-\\u0F17\\u0F1A-\\u0F1F\\u0F34\\u0F36\\u0F38\\u0FBE-\\u0FC5\\u0FC7-\\u0FCC\\u0FCE\\u0FCF\\u0FD5-\\u0FD8\\u109E\\u109F\\u1390-\\u1399\\u1940\\u19DE-\\u19FF\\u1B61-\\u1B6A\\u1B74-\\u1B7C\\u2100\\u2101\\u2103-\\u2106\\u2108\\u2109\\u2114\\u2116\\u2117\\u211E-\\u2123\\u2125\\u2127\\u2129\\u212E\\u213A\\u213B\\u214A\\u214C\\u214D\\u214F\\u218A\\u218B\\u2195-\\u2199\\u219C-\\u219F\\u21A1\\u21A2\\u21A4\\u21A5\\u21A7-\\u21AD\\u21AF-\\u21CD\\u21D0\\u21D1\\u21D3\\u21D5-\\u21F3\\u2300-\\u2307\\u230C-\\u231F\\u2322-\\u2328\\u232B-\\u237B\\u237D-\\u239A\\u23B4-\\u23DB\\u23E2-\\u2426\\u2440-\\u244A\\u249C-\\u24E9\\u2500-\\u25B6\\u25B8-\\u25C0\\u25C2-\\u25F7\\u2600-\\u266E\\u2670-\\u2767\\u2794-\\u27BF\\u2800-\\u28FF\\u2B00-\\u2B2F\\u2B45\\u2B46\\u2B4D-\\u2B73\\u2B76-\\u2B95\\u2B98-\\u2BC8\\u2BCA-\\u2BFE\\u2CE5-\\u2CEA\\u2E80-\\u2E99\\u2E9B-\\u2EF3\\u2F00-\\u2FD5\\u2FF0-\\u2FFB\\u3004\\u3012\\u3013\\u3020\\u3036\\u3037\\u303E\\u303F\\u3190\\u3191\\u3196-\\u319F\\u31C0-\\u31E3\\u3200-\\u321E\\u322A-\\u3247\\u3250\\u3260-\\u327F\\u328A-\\u32B0\\u32C0-\\u32FE\\u3300-\\u33FF\\u4DC0-\\u4DFF\\uA490-\\uA4C6\\uA828-\\uA82B\\uA836\\uA837\\uA839\\uAA77-\\uAA79\\uFDFD\\uFFE4\\uFFE8\\uFFED\\uFFEE\\uFFFC\\uFFFD\\U00010137-\\U0001013F\\U00010179-\\U00010189\\U0001018C-\\U0001018E\\U00010190-\\U0001019B\\U000101A0\\U000101D0-\\U000101FC\\U00010877\\U00010878\\U00010AC8\\U0001173F\\U00016B3C-\\U00016B3
F\\U00016B45\\U0001BC9C\\U0001D000-\\U0001D0F5\\U0001D100-\\U0001D126\\U0001D129-\\U0001D164\\U0001D16A-\\U0001D16C\\U0001D183\\U0001D184\\U0001D18C-\\U0001D1A9\\U0001D1AE-\\U0001D1E8\\U0001D200-\\U0001D241\\U0001D245\\U0001D300-\\U0001D356\\U0001D800-\\U0001D9FF\\U0001DA37-\\U0001DA3A\\U0001DA6D-\\U0001DA74\\U0001DA76-\\U0001DA83\\U0001DA85\\U0001DA86\\U0001ECAC\\U0001F000-\\U0001F02B\\U0001F030-\\U0001F093\\U0001F0A0-\\U0001F0AE\\U0001F0B1-\\U0001F0BF\\U0001F0C1-\\U0001F0CF\\U0001F0D1-\\U0001F0F5\\U0001F110-\\U0001F16B\\U0001F170-\\U0001F1AC\\U0001F1E6-\\U0001F202\\U0001F210-\\U0001F23B\\U0001F240-\\U0001F248\\U0001F250\\U0001F251\\U0001F260-\\U0001F265\\U0001F300-\\U0001F3FA\\U0001F400-\\U0001F6D4\\U0001F6E0-\\U0001F6EC\\U0001F6F0-\\U0001F6F9\\U0001F700-\\U0001F773\\U0001F780-\\U0001F7D8\\U0001F800-\\U0001F80B\\U0001F810-\\U0001F847\\U0001F850-\\U0001F859\\U0001F860-\\U0001F887\\U0001F890-\\U0001F8AD\\U0001F900-\\U0001F90B\\U0001F910-\\U0001F93E\\U0001F940-\\U0001F970\\U0001F973-\\U0001F976\\U0001F97A\\U0001F97C-\\U0001F9A2\\U0001F9B0-\\U0001F9B9\\U0001F9C0-\\U0001F9C2\\U0001F9D0-\\U0001F9FF\\U0001FA60-\\U0001FA6D]', "'s", "'S", '’s', '’S', '—', '–', '(?<=[0-9])\\+', '(?<=°[FfCcKk])\\.', '(?<=[0-9])(?:\\$|£|€|¥|฿|US\\$|C\\$|A\\$|₽|﷼|₴|₠|₡|₢|₣|₤|₥|₦|₧|₨|₩|₪|₫|€|₭|₮|₯|₰|₱|₲|₳|₴|₵|₶|₷|₸|₹|₺|₻|₼|₽|₾|₿)', '(?<=[0-9])(?:km|km²|km³|m|m²|m³|dm|dm²|dm³|cm|cm²|cm³|mm|mm²|mm³|ha|µm|nm|yd|in|ft|kg|g|mg|µg|t|lb|oz|m/s|km/h|kmh|mph|hPa|Pa|mbar|mb|MB|kb|KB|gb|GB|tb|TB|T|G|M|K|%|км|км²|км³|м|м²|м³|дм|дм²|дм³|см|см²|см³|мм|мм²|мм³|нм|кг|г|мг|м/с|км/ч|кПа|Па|мбар|Кб|КБ|кб|Мб|МБ|мб|Гб|ГБ|гб|Тб|ТБ|тбكم|كم²|كم³|م|م²|م³|سم|سم²|سم³|مم|مم²|مم³|كم|غرام|جرام|جم|كغ|ملغ|كوب|اكواب)', 
'(?<=[0-9a-z\\uFF41-\\uFF5A\\u00DF-\\u00F6\\u00F8-\\u00FF\\u0101\\u0103\\u0105\\u0107\\u0109\\u010B\\u010D\\u010F\\u0111\\u0113\\u0115\\u0117\\u0119\\u011B\\u011D\\u011F\\u0121\\u0123\\u0125\\u0127\\u0129\\u012B\\u012D\\u012F\\u0131\\u0133\\u0135\\u0137\\u0138\\u013A\\u013C\\u013E\\u0140\\u0142\\u0144\\u0146\\u0148\\u0149\\u014B\\u014D\\u014F\\u0151\\u0153\\u0155\\u0157\\u0159\\u015B\\u015D\\u015F\\u0161\\u0163\\u0165\\u0167\\u0169\\u016B\\u016D\\u016F\\u0171\\u0173\\u0175\\u0177\\u017A\\u017C\\u017E\\u017F\\u0180\\u0183\\u0185\\u0188\\u018C\\u018D\\u0192\\u0195\\u0199-\\u019B\\u019E\\u01A1\\u01A3\\u01A5\\u01A8\\u01AA\\u01AB\\u01AD\\u01B0\\u01B4\\u01B6\\u01B9\\u01BA\\u01BD-\\u01BF\\u01C6\\u01C9\\u01CC\\u01CE\\u01D0\\u01D2\\u01D4\\u01D6\\u01D8\\u01DA\\u01DC\\u01DD\\u01DF\\u01E1\\u01E3\\u01E5\\u01E7\\u01E9\\u01EB\\u01ED\\u01EF\\u01F0\\u01F3\\u01F5\\u01F9\\u01FB\\u01FD\\u01FF\\u0201\\u0203\\u0205\\u0207\\u0209\\u020B\\u020D\\u020F\\u0211\\u0213\\u0215\\u0217\\u0219\\u021B\\u021D\\u021F\\u0221\\u0223\\u0225\\u0227\\u0229\\u022B\\u022D\\u022F\\u0231\\u0233-\\u0239\\u023C\\u023F\\u0240\\u0242\\u0247\\u0249\\u024B\\u024D\\u024F\\u2C61\\u2C65\\u2C66\\u2C68\\u2C6A\\u2C6C\\u2C71\\u2C73\\u2C74\\u2C76-\\u2C7B\\uA723\\uA725\\uA727\\uA729\\uA72B\\uA72D\\uA72F-\\uA731\\uA733\\uA735\\uA737\\uA739\\uA73B\\uA73D\\uA73F\\uA741\\uA743\\uA745\\uA747\\uA749\\uA74B\\uA74D\\uA74F\\uA751\\uA753\\uA755\\uA757\\uA759\\uA75B\\uA75D\\uA75F\\uA761\\uA763\\uA765\\uA767\\uA769\\uA76B\\uA76D\\uA76F\\uA771-\\uA778\\uA77A\\uA77C\\uA77F\\uA781\\uA783\\uA785\\uA787\\uA78C\\uA78E\\uA791\\uA793-\\uA795\\uA797\\uA799\\uA79B\\uA79D\\uA79F\\uA7A1\\uA7A3\\uA7A5\\uA7A7\\uA7A9\\uA7AF\\uA7B5\\uA7B7\\uA7B9\\uA7FA\\uAB30-\\uAB5A\\uAB60-\\uAB64\\u0250-\\u02AF\\u1D00-\\u1D25\\u1D6B-\\u1D77\\u1D79-\\u1D9A\\u1E01\\u1E03\\u1E05\\u1E07\\u1E09\\u1E0B\\u1E0D\\u1E0F\\u1E11\\u1E13\\u1E15\\u1E17\\u1E19\\u1E1B\\u1E1D\\u1E1F\\u1E21\\u1E23\\u1E25\\u1E27\\u1E29\\u1E2B\\u1E2D\\u1E2F\\u1E31\\u1E33\\u1E35\\u1E37\\u1E39\\u1E3B\\u1E
3D\\u1E3F\\u1E41\\u1E43\\u1E45\\u1E47\\u1E49\\u1E4B\\u1E4D\\u1E4F\\u1E51\\u1E53\\u1E55\\u1E57\\u1E59\\u1E5B\\u1E5D\\u1E5F\\u1E61\\u1E63\\u1E65\\u1E67\\u1E69\\u1E6B\\u1E6D\\u1E6F\\u1E71\\u1E73\\u1E75\\u1E77\\u1E79\\u1E7B\\u1E7D\\u1E7F\\u1E81\\u1E83\\u1E85\\u1E87\\u1E89\\u1E8B\\u1E8D\\u1E8F\\u1E91\\u1E93\\u1E95-\\u1E9D\\u1E9F\\u1EA1\\u1EA3\\u1EA5\\u1EA7\\u1EA9\\u1EAB\\u1EAD\\u1EAF\\u1EB1\\u1EB3\\u1EB5\\u1EB7\\u1EB9\\u1EBB\\u1EBD\\u1EBF\\u1EC1\\u1EC3\\u1EC5\\u1EC7\\u1EC9\\u1ECB\\u1ECD\\u1ECF\\u1ED1\\u1ED3\\u1ED5\\u1ED7\\u1ED9\\u1EDB\\u1EDD\\u1EDF\\u1EE1\\u1EE3\\u1EE5\\u1EE7\\u1EE9\\u1EEB\\u1EED\\u1EEF\\u1EF1\\u1EF3\\u1EF5\\u1EF7\\u1EF9\\u1EFB\\u1EFD\\u1EFFёа-яәөүҗңһα-ωάέίόώήύа-щюяіїєґѓѕјљњќѐѝ\\u1200-\\u137F\\u0980-\\u09FF\\u0591-\\u05F4\\uFB1D-\\uFB4F\\u0620-\\u064A\\u066E-\\u06D5\\u06E5-\\u06FF\\u0750-\\u077F\\u08A0-\\u08BD\\uFB50-\\uFBB1\\uFBD3-\\uFD3D\\uFD50-\\uFDC7\\uFDF0-\\uFDFB\\uFE70-\\uFEFC\\U0001EE00-\\U0001EEBB\\u0D80-\\u0DFF\\u0900-\\u097F\\u0C80-\\u0CFF\\u0B80-\\u0BFF\\u0C00-\\u0C7F\\uAC00-\\uD7AF\\u1100-\\u11FF\\u3040-\\u309F\\u30A0-\\u30FFー\\u4E00-\\u62FF\\u6300-\\u77FF\\u7800-\\u8CFF\\u8D00-\\u9FFF\\u3400-\\u4DBF\\U00020000-\\U000215FF\\U00021600-\\U000230FF\\U00023100-\\U000245FF\\U00024600-\\U000260FF\\U00026100-\\U000275FF\\U00027600-\\U000290FF\\U00029100-\\U0002A6DF\\U0002A700-\\U0002B73F\\U0002B740-\\U0002B81F\\U0002B820-\\U0002CEAF\\U0002CEB0-\\U0002EBEF\\u2E80-\\u2EFF\\u2F00-\\u2FDF\\u2FF0-\\u2FFF\\u3000-\\u303F\\u31C0-\\u31EF\\u3200-\\u32FF\\u3300-\\u33FF\\uF900-\\uFAFF\\uFE30-\\uFE4F\\U0001F200-\\U0001F2FF\\U0002F800-\\U0002FA1F%²\\-\\+…|……|,|:|;|\\!|\\?|¿|؟|¡|\\(|\\)|\\[|\\]|\\{|\\}|<|>|_|#|\\*|&|。|?|!|,|、|;|:|~|·|।|،|۔|؛|٪(?:\\\'"”“`‘´’‚,„»«「」『』()〔〕【】《》〈〉〈〉⟦⟧)])\\.', 
'(?<=[A-Z\\uFF21-\\uFF3A\\u00C0-\\u00D6\\u00D8-\\u00DE\\u0100\\u0102\\u0104\\u0106\\u0108\\u010A\\u010C\\u010E\\u0110\\u0112\\u0114\\u0116\\u0118\\u011A\\u011C\\u011E\\u0120\\u0122\\u0124\\u0126\\u0128\\u012A\\u012C\\u012E\\u0130\\u0132\\u0134\\u0136\\u0139\\u013B\\u013D\\u013F\\u0141\\u0143\\u0145\\u0147\\u014A\\u014C\\u014E\\u0150\\u0152\\u0154\\u0156\\u0158\\u015A\\u015C\\u015E\\u0160\\u0162\\u0164\\u0166\\u0168\\u016A\\u016C\\u016E\\u0170\\u0172\\u0174\\u0176\\u0178\\u0179\\u017B\\u017D\\u0181\\u0182\\u0184\\u0186\\u0187\\u0189-\\u018B\\u018E-\\u0191\\u0193\\u0194\\u0196-\\u0198\\u019C\\u019D\\u019F\\u01A0\\u01A2\\u01A4\\u01A6\\u01A7\\u01A9\\u01AC\\u01AE\\u01AF\\u01B1-\\u01B3\\u01B5\\u01B7\\u01B8\\u01BC\\u01C4\\u01C7\\u01CA\\u01CD\\u01CF\\u01D1\\u01D3\\u01D5\\u01D7\\u01D9\\u01DB\\u01DE\\u01E0\\u01E2\\u01E4\\u01E6\\u01E8\\u01EA\\u01EC\\u01EE\\u01F1\\u01F4\\u01F6-\\u01F8\\u01FA\\u01FC\\u01FE\\u0200\\u0202\\u0204\\u0206\\u0208\\u020A\\u020C\\u020E\\u0210\\u0212\\u0214\\u0216\\u0218\\u021A\\u021C\\u021E\\u0220\\u0222\\u0224\\u0226\\u0228\\u022A\\u022C\\u022E\\u0230\\u0232\\u023A\\u023B\\u023D\\u023E\\u0241\\u0243-\\u0246\\u0248\\u024A\\u024C\\u024E\\u2C60\\u2C62-\\u2C64\\u2C67\\u2C69\\u2C6B\\u2C6D-\\u2C70\\u2C72\\u2C75\\u2C7E\\u2C7F\\uA722\\uA724\\uA726\\uA728\\uA72A\\uA72C\\uA72E\\uA732\\uA734\\uA736\\uA738\\uA73A\\uA73C\\uA73E\\uA740\\uA742\\uA744\\uA746\\uA748\\uA74A\\uA74C\\uA74E\\uA750\\uA752\\uA754\\uA756\\uA758\\uA75A\\uA75C\\uA75E\\uA760\\uA762\\uA764\\uA766\\uA768\\uA76A\\uA76C\\uA76E\\uA779\\uA77B\\uA77D\\uA77E\\uA780\\uA782\\uA784\\uA786\\uA78B\\uA78D\\uA790\\uA792\\uA796\\uA798\\uA79A\\uA79C\\uA79E\\uA7A0\\uA7A2\\uA7A4\\uA7A6\\uA7A8\\uA7AA-\\uA7AE\\uA7B0-\\uA7B4\\uA7B6\\uA7B8\\u1E00\\u1E02\\u1E04\\u1E06\\u1E08\\u1E0A\\u1E0C\\u1E0E\\u1E10\\u1E12\\u1E14\\u1E16\\u1E18\\u1E1A\\u1E1C\\u1E1E\\u1E20\\u1E22\\u1E24\\u1E26\\u1E28\\u1E2A\\u1E2C\\u1E2E\\u1E30\\u1E32\\u1E34\\u1E36\\u1E38\\u1E3A\\u1E3C\\u1E3E\\u1E40\\u1E42\\u1E44\\u1E46\\u1E48\\u1E4A\\u1E4C\\u1E4E\\u1
E50\\u1E52\\u1E54\\u1E56\\u1E58\\u1E5A\\u1E5C\\u1E5E\\u1E60\\u1E62\\u1E64\\u1E66\\u1E68\\u1E6A\\u1E6C\\u1E6E\\u1E70\\u1E72\\u1E74\\u1E76\\u1E78\\u1E7A\\u1E7C\\u1E7E\\u1E80\\u1E82\\u1E84\\u1E86\\u1E88\\u1E8A\\u1E8C\\u1E8E\\u1E90\\u1E92\\u1E94\\u1E9E\\u1EA0\\u1EA2\\u1EA4\\u1EA6\\u1EA8\\u1EAA\\u1EAC\\u1EAE\\u1EB0\\u1EB2\\u1EB4\\u1EB6\\u1EB8\\u1EBA\\u1EBC\\u1EBE\\u1EC0\\u1EC2\\u1EC4\\u1EC6\\u1EC8\\u1ECA\\u1ECC\\u1ECE\\u1ED0\\u1ED2\\u1ED4\\u1ED6\\u1ED8\\u1EDA\\u1EDC\\u1EDE\\u1EE0\\u1EE2\\u1EE4\\u1EE6\\u1EE8\\u1EEA\\u1EEC\\u1EEE\\u1EF0\\u1EF2\\u1EF4\\u1EF6\\u1EF8\\u1EFA\\u1EFC\\u1EFEЁА-ЯӘӨҮҖҢҺΑ-ΩΆΈΊΌΏΉΎА-ЩЮЯІЇЄҐЃЅЈЉЊЌЀЍ\\u1200-\\u137F\\u0980-\\u09FF\\u0591-\\u05F4\\uFB1D-\\uFB4F\\u0620-\\u064A\\u066E-\\u06D5\\u06E5-\\u06FF\\u0750-\\u077F\\u08A0-\\u08BD\\uFB50-\\uFBB1\\uFBD3-\\uFD3D\\uFD50-\\uFDC7\\uFDF0-\\uFDFB\\uFE70-\\uFEFC\\U0001EE00-\\U0001EEBB\\u0D80-\\u0DFF\\u0900-\\u097F\\u0C80-\\u0CFF\\u0B80-\\u0BFF\\u0C00-\\u0C7F\\uAC00-\\uD7AF\\u1100-\\u11FF\\u3040-\\u309F\\u30A0-\\u30FFー\\u4E00-\\u62FF\\u6300-\\u77FF\\u7800-\\u8CFF\\u8D00-\\u9FFF\\u3400-\\u4DBF\\U00020000-\\U000215FF\\U00021600-\\U000230FF\\U00023100-\\U000245FF\\U00024600-\\U000260FF\\U00026100-\\U000275FF\\U00027600-\\U000290FF\\U00029100-\\U0002A6DF\\U0002A700-\\U0002B73F\\U0002B740-\\U0002B81F\\U0002B820-\\U0002CEAF\\U0002CEB0-\\U0002EBEF\\u2E80-\\u2EFF\\u2F00-\\u2FDF\\u2FF0-\\u2FFF\\u3000-\\u303F\\u31C0-\\u31EF\\u3200-\\u32FF\\u3300-\\u33FF\\uF900-\\uFAFF\\uFE30-\\uFE4F\\U0001F200-\\U0001F2FF\\U0002F800-\\U0002FA1F][A-Z\\uFF21-\\uFF3A\\u00C0-\\u00D6\\u00D8-\\u00DE\\u0100\\u0102\\u0104\\u0106\\u0108\\u010A\\u010C\\u010E\\u0110\\u0112\\u0114\\u0116\\u0118\\u011A\\u011C\\u011E\\u0120\\u0122\\u0124\\u0126\\u0128\\u012A\\u012C\\u012E\\u0130\\u0132\\u0134\\u0136\\u0139\\u013B\\u013D\\u013F\\u0141\\u0143\\u0145\\u0147\\u014A\\u014C\\u014E\\u0150\\u0152\\u0154\\u0156\\u0158\\u015A\\u015C\\u015E\\u0160\\u0162\\u0164\\u0166\\u0168\\u016A\\u016C\\u016E\\u0170\\u0172\\u0174\\u0176\\u0178\\u0179\\u017B\\u017D\\u0181\\u
0182\\u0184\\u0186\\u0187\\u0189-\\u018B\\u018E-\\u0191\\u0193\\u0194\\u0196-\\u0198\\u019C\\u019D\\u019F\\u01A0\\u01A2\\u01A4\\u01A6\\u01A7\\u01A9\\u01AC\\u01AE\\u01AF\\u01B1-\\u01B3\\u01B5\\u01B7\\u01B8\\u01BC\\u01C4\\u01C7\\u01CA\\u01CD\\u01CF\\u01D1\\u01D3\\u01D5\\u01D7\\u01D9\\u01DB\\u01DE\\u01E0\\u01E2\\u01E4\\u01E6\\u01E8\\u01EA\\u01EC\\u01EE\\u01F1\\u01F4\\u01F6-\\u01F8\\u01FA\\u01FC\\u01FE\\u0200\\u0202\\u0204\\u0206\\u0208\\u020A\\u020C\\u020E\\u0210\\u0212\\u0214\\u0216\\u0218\\u021A\\u021C\\u021E\\u0220\\u0222\\u0224\\u0226\\u0228\\u022A\\u022C\\u022E\\u0230\\u0232\\u023A\\u023B\\u023D\\u023E\\u0241\\u0243-\\u0246\\u0248\\u024A\\u024C\\u024E\\u2C60\\u2C62-\\u2C64\\u2C67\\u2C69\\u2C6B\\u2C6D-\\u2C70\\u2C72\\u2C75\\u2C7E\\u2C7F\\uA722\\uA724\\uA726\\uA728\\uA72A\\uA72C\\uA72E\\uA732\\uA734\\uA736\\uA738\\uA73A\\uA73C\\uA73E\\uA740\\uA742\\uA744\\uA746\\uA748\\uA74A\\uA74C\\uA74E\\uA750\\uA752\\uA754\\uA756\\uA758\\uA75A\\uA75C\\uA75E\\uA760\\uA762\\uA764\\uA766\\uA768\\uA76A\\uA76C\\uA76E\\uA779\\uA77B\\uA77D\\uA77E\\uA780\\uA782\\uA784\\uA786\\uA78B\\uA78D\\uA790\\uA792\\uA796\\uA798\\uA79A\\uA79C\\uA79E\\uA7A0\\uA7A2\\uA7A4\\uA7A6\\uA7A8\\uA7AA-\\uA7AE\\uA7B0-\\uA7B4\\uA7B6\\uA7B8\\u1E00\\u1E02\\u1E04\\u1E06\\u1E08\\u1E0A\\u1E0C\\u1E0E\\u1E10\\u1E12\\u1E14\\u1E16\\u1E18\\u1E1A\\u1E1C\\u1E1E\\u1E20\\u1E22\\u1E24\\u1E26\\u1E28\\u1E2A\\u1E2C\\u1E2E\\u1E30\\u1E32\\u1E34\\u1E36\\u1E38\\u1E3A\\u1E3C\\u1E3E\\u1E40\\u1E42\\u1E44\\u1E46\\u1E48\\u1E4A\\u1E4C\\u1E4E\\u1E50\\u1E52\\u1E54\\u1E56\\u1E58\\u1E5A\\u1E5C\\u1E5E\\u1E60\\u1E62\\u1E64\\u1E66\\u1E68\\u1E6A\\u1E6C\\u1E6E\\u1E70\\u1E72\\u1E74\\u1E76\\u1E78\\u1E7A\\u1E7C\\u1E7E\\u1E80\\u1E82\\u1E84\\u1E86\\u1E88\\u1E8A\\u1E8C\\u1E8E\\u1E90\\u1E92\\u1E94\\u1E9E\\u1EA0\\u1EA2\\u1EA4\\u1EA6\\u1EA8\\u1EAA\\u1EAC\\u1EAE\\u1EB0\\u1EB2\\u1EB4\\u1EB6\\u1EB8\\u1EBA\\u1EBC\\u1EBE\\u1EC0\\u1EC2\\u1EC4\\u1EC6\\u1EC8\\u1ECA\\u1ECC\\u1ECE\\u1ED0\\u1ED2\\u1ED4\\u1ED6\\u1ED8\\u1EDA\\u1EDC\\u1EDE\\u1EE0\\u1EE2\\u1EE4\\u1EE6\\u1E
E8\\u1EEA\\u1EEC\\u1EEE\\u1EF0\\u1EF2\\u1EF4\\u1EF6\\u1EF8\\u1EFA\\u1EFC\\u1EFEЁА-ЯӘӨҮҖҢҺΑ-ΩΆΈΊΌΏΉΎА-ЩЮЯІЇЄҐЃЅЈЉЊЌЀЍ\\u1200-\\u137F\\u0980-\\u09FF\\u0591-\\u05F4\\uFB1D-\\uFB4F\\u0620-\\u064A\\u066E-\\u06D5\\u06E5-\\u06FF\\u0750-\\u077F\\u08A0-\\u08BD\\uFB50-\\uFBB1\\uFBD3-\\uFD3D\\uFD50-\\uFDC7\\uFDF0-\\uFDFB\\uFE70-\\uFEFC\\U0001EE00-\\U0001EEBB\\u0D80-\\u0DFF\\u0900-\\u097F\\u0C80-\\u0CFF\\u0B80-\\u0BFF\\u0C00-\\u0C7F\\uAC00-\\uD7AF\\u1100-\\u11FF\\u3040-\\u309F\\u30A0-\\u30FFー\\u4E00-\\u62FF\\u6300-\\u77FF\\u7800-\\u8CFF\\u8D00-\\u9FFF\\u3400-\\u4DBF\\U00020000-\\U000215FF\\U00021600-\\U000230FF\\U00023100-\\U000245FF\\U00024600-\\U000260FF\\U00026100-\\U000275FF\\U00027600-\\U000290FF\\U00029100-\\U0002A6DF\\U0002A700-\\U0002B73F\\U0002B740-\\U0002B81F\\U0002B820-\\U0002CEAF\\U0002CEB0-\\U0002EBEF\\u2E80-\\u2EFF\\u2F00-\\u2FDF\\u2FF0-\\u2FFF\\u3000-\\u303F\\u31C0-\\u31EF\\u3200-\\u32FF\\u3300-\\u33FF\\uF900-\\uFAFF\\uFE30-\\uFE4F\\U0001F200-\\U0001F2FF\\U0002F800-\\U0002FA1F])\\.', 
'(?<=\\b[A-Z\\uFF21-\\uFF3A\\u00C0-\\u00D6\\u00D8-\\u00DE\\u0100\\u0102\\u0104\\u0106\\u0108\\u010A\\u010C\\u010E\\u0110\\u0112\\u0114\\u0116\\u0118\\u011A\\u011C\\u011E\\u0120\\u0122\\u0124\\u0126\\u0128\\u012A\\u012C\\u012E\\u0130\\u0132\\u0134\\u0136\\u0139\\u013B\\u013D\\u013F\\u0141\\u0143\\u0145\\u0147\\u014A\\u014C\\u014E\\u0150\\u0152\\u0154\\u0156\\u0158\\u015A\\u015C\\u015E\\u0160\\u0162\\u0164\\u0166\\u0168\\u016A\\u016C\\u016E\\u0170\\u0172\\u0174\\u0176\\u0178\\u0179\\u017B\\u017D\\u0181\\u0182\\u0184\\u0186\\u0187\\u0189-\\u018B\\u018E-\\u0191\\u0193\\u0194\\u0196-\\u0198\\u019C\\u019D\\u019F\\u01A0\\u01A2\\u01A4\\u01A6\\u01A7\\u01A9\\u01AC\\u01AE\\u01AF\\u01B1-\\u01B3\\u01B5\\u01B7\\u01B8\\u01BC\\u01C4\\u01C7\\u01CA\\u01CD\\u01CF\\u01D1\\u01D3\\u01D5\\u01D7\\u01D9\\u01DB\\u01DE\\u01E0\\u01E2\\u01E4\\u01E6\\u01E8\\u01EA\\u01EC\\u01EE\\u01F1\\u01F4\\u01F6-\\u01F8\\u01FA\\u01FC\\u01FE\\u0200\\u0202\\u0204\\u0206\\u0208\\u020A\\u020C\\u020E\\u0210\\u0212\\u0214\\u0216\\u0218\\u021A\\u021C\\u021E\\u0220\\u0222\\u0224\\u0226\\u0228\\u022A\\u022C\\u022E\\u0230\\u0232\\u023A\\u023B\\u023D\\u023E\\u0241\\u0243-\\u0246\\u0248\\u024A\\u024C\\u024E\\u2C60\\u2C62-\\u2C64\\u2C67\\u2C69\\u2C6B\\u2C6D-\\u2C70\\u2C72\\u2C75\\u2C7E\\u2C7F\\uA722\\uA724\\uA726\\uA728\\uA72A\\uA72C\\uA72E\\uA732\\uA734\\uA736\\uA738\\uA73A\\uA73C\\uA73E\\uA740\\uA742\\uA744\\uA746\\uA748\\uA74A\\uA74C\\uA74E\\uA750\\uA752\\uA754\\uA756\\uA758\\uA75A\\uA75C\\uA75E\\uA760\\uA762\\uA764\\uA766\\uA768\\uA76A\\uA76C\\uA76E\\uA779\\uA77B\\uA77D\\uA77E\\uA780\\uA782\\uA784\\uA786\\uA78B\\uA78D\\uA790\\uA792\\uA796\\uA798\\uA79A\\uA79C\\uA79E\\uA7A0\\uA7A2\\uA7A4\\uA7A6\\uA7A8\\uA7AA-\\uA7AE\\uA7B0-\\uA7B4\\uA7B6\\uA7B8\\u1E00\\u1E02\\u1E04\\u1E06\\u1E08\\u1E0A\\u1E0C\\u1E0E\\u1E10\\u1E12\\u1E14\\u1E16\\u1E18\\u1E1A\\u1E1C\\u1E1E\\u1E20\\u1E22\\u1E24\\u1E26\\u1E28\\u1E2A\\u1E2C\\u1E2E\\u1E30\\u1E32\\u1E34\\u1E36\\u1E38\\u1E3A\\u1E3C\\u1E3E\\u1E40\\u1E42\\u1E44\\u1E46\\u1E48\\u1E4A\\u1E4C\\u1E4E\
\u1E50\\u1E52\\u1E54\\u1E56\\u1E58\\u1E5A\\u1E5C\\u1E5E\\u1E60\\u1E62\\u1E64\\u1E66\\u1E68\\u1E6A\\u1E6C\\u1E6E\\u1E70\\u1E72\\u1E74\\u1E76\\u1E78\\u1E7A\\u1E7C\\u1E7E\\u1E80\\u1E82\\u1E84\\u1E86\\u1E88\\u1E8A\\u1E8C\\u1E8E\\u1E90\\u1E92\\u1E94\\u1E9E\\u1EA0\\u1EA2\\u1EA4\\u1EA6\\u1EA8\\u1EAA\\u1EAC\\u1EAE\\u1EB0\\u1EB2\\u1EB4\\u1EB6\\u1EB8\\u1EBA\\u1EBC\\u1EBE\\u1EC0\\u1EC2\\u1EC4\\u1EC6\\u1EC8\\u1ECA\\u1ECC\\u1ECE\\u1ED0\\u1ED2\\u1ED4\\u1ED6\\u1ED8\\u1EDA\\u1EDC\\u1EDE\\u1EE0\\u1EE2\\u1EE4\\u1EE6\\u1EE8\\u1EEA\\u1EEC\\u1EEE\\u1EF0\\u1EF2\\u1EF4\\u1EF6\\u1EF8\\u1EFA\\u1EFC\\u1EFEЁА-ЯӘӨҮҖҢҺΑ-ΩΆΈΊΌΏΉΎА-ЩЮЯІЇЄҐЃЅЈЉЊЌЀЍ\\u1200-\\u137F\\u0980-\\u09FF\\u0591-\\u05F4\\uFB1D-\\uFB4F\\u0620-\\u064A\\u066E-\\u06D5\\u06E5-\\u06FF\\u0750-\\u077F\\u08A0-\\u08BD\\uFB50-\\uFBB1\\uFBD3-\\uFD3D\\uFD50-\\uFDC7\\uFDF0-\\uFDFB\\uFE70-\\uFEFC\\U0001EE00-\\U0001EEBB\\u0D80-\\u0DFF\\u0900-\\u097F\\u0C80-\\u0CFF\\u0B80-\\u0BFF\\u0C00-\\u0C7F\\uAC00-\\uD7AF\\u1100-\\u11FF\\u3040-\\u309F\\u30A0-\\u30FFー\\u4E00-\\u62FF\\u6300-\\u77FF\\u7800-\\u8CFF\\u8D00-\\u9FFF\\u3400-\\u4DBF\\U00020000-\\U000215FF\\U00021600-\\U000230FF\\U00023100-\\U000245FF\\U00024600-\\U000260FF\\U00026100-\\U000275FF\\U00027600-\\U000290FF\\U00029100-\\U0002A6DF\\U0002A700-\\U0002B73F\\U0002B740-\\U0002B81F\\U0002B820-\\U0002CEAF\\U0002CEB0-\\U0002EBEF\\u2E80-\\u2EFF\\u2F00-\\u2FDF\\u2FF0-\\u2FFF\\u3000-\\u303F\\u31C0-\\u31EF\\u3200-\\u32FF\\u3300-\\u33FF\\uF900-\\uFAFF\\uFE30-\\uFE4F\\U0001F200-\\U0001F2FF\\U0002F800-\\U0002FA1F])\\.']
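The tokenizer_exceptions mapping documented next keys every token piece by integer rather than by name. As a reading aid, the sketch below assumes that 65 and 67 are the integer IDs of spaCy's ORTH (exact surface text) and NORM (normalised form) token attributes; that mapping is an assumption based on spaCy's attribute enumeration, not something this page states. The helper names are hypothetical and the two sample entries are copied from the dump.

```python
# Assumption: 65 and 67 are spaCy's integer IDs for the ORTH and NORM attributes.
ORTH_ID = 65
NORM_ID = 67
ATTR_NAMES = {ORTH_ID: "ORTH", NORM_ID: "NORM"}


def readable(pieces):
    """Replace known integer attribute keys with names; leave other keys as-is."""
    return [{ATTR_NAMES.get(key, key): value for key, value in piece.items()}
            for piece in pieces]


def split_exception(pieces):
    """Return (surface text, normalised form or None) for each token piece."""
    return [(piece[ORTH_ID], piece.get(NORM_ID)) for piece in pieces]


# Two entries copied verbatim from the tokenizer_exceptions value.
sample = {
    "10am": [{65: "10"}, {65: "am", 67: "a.m."}],
    "Can't": [{65: "Ca", 67: "can"}, {65: "n't", 67: "not"}],
}

for text, pieces in sample.items():
    # In these samples the surface pieces concatenate back to the original key.
    assert "".join(orth for orth, _ in split_exception(pieces)) == text
    print(text, "->", readable(pieces))
```

Read this way, an entry such as '10am' splits into the tokens "10" and "am", with "am" normalised to "a.m.".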
tokenizer_exceptions: Dict[str, List[dict]] = {'\t': [{65: '\t'}], '\n': [{65: '\n'}], ' ': [{65: ' '}], "'": [{65: "'"}], "''": [{65: "''"}], "'Cause": [{65: "'Cause", 67: 'because'}], "'Cos": [{65: "'Cos", 67: 'because'}], "'Coz": [{65: "'Coz", 67: 'because'}], "'Cuz": [{65: "'Cuz", 67: 'because'}], "'S": [{65: "'S", 67: "'s"}], "'bout": [{65: "'bout", 67: 'about'}], "'cause": [{65: "'cause", 67: 'because'}], "'cos": [{65: "'cos", 67: 'because'}], "'coz": [{65: "'coz", 67: 'because'}], "'cuz": [{65: "'cuz", 67: 'because'}], "'d": [{65: "'d"}], "'em": [{65: "'em", 67: 'them'}], "'ll": [{65: "'ll", 67: 'will'}], "'nuff": [{65: "'nuff", 67: 'enough'}], "'re": [{65: "'re", 67: 'are'}], "'s": [{65: "'s", 67: "'s"}], '(*_*)': [{65: '(*_*)'}], '(-8': [{65: '(-8'}], '(-:': [{65: '(-:'}], '(-;': [{65: '(-;'}], '(-_-)': [{65: '(-_-)'}], '(._.)': [{65: '(._.)'}], '(:': [{65: '(:'}], '(;': [{65: '(;'}], '(=': [{65: '(='}], '(>_<)': [{65: '(>_<)'}], '(^_^)': [{65: '(^_^)'}], '(o:': [{65: '(o:'}], '(¬_¬)': [{65: '(¬_¬)'}], '(ಠ_ಠ)': [{65: '(ಠ_ಠ)'}], '(╯°□°)╯︵┻━┻': [{65: '(╯°□°)╯︵┻━┻'}], ')-:': [{65: ')-:'}], '):': [{65: '):'}], '-_-': [{65: '-_-'}], '-__-': [{65: '-__-'}], '._.': [{65: '._.'}], '0.0': [{65: '0.0'}], '0.o': [{65: '0.o'}], '0_0': [{65: '0_0'}], '0_o': [{65: '0_o'}], '10a.m.': [{65: '10'}, {65: 'a.m.', 67: 'a.m.'}], '10am': [{65: '10'}, {65: 'am', 67: 'a.m.'}], '10p.m.': [{65: '10'}, {65: 'p.m.', 67: 'p.m.'}], '10pm': [{65: '10'}, {65: 'pm', 67: 'p.m.'}], '11a.m.': [{65: '11'}, {65: 'a.m.', 67: 'a.m.'}], '11am': [{65: '11'}, {65: 'am', 67: 'a.m.'}], '11p.m.': [{65: '11'}, {65: 'p.m.', 67: 'p.m.'}], '11pm': [{65: '11'}, {65: 'pm', 67: 'p.m.'}], '12a.m.': [{65: '12'}, {65: 'a.m.', 67: 'a.m.'}], '12am': [{65: '12'}, {65: 'am', 67: 'a.m.'}], '12p.m.': [{65: '12'}, {65: 'p.m.', 67: 'p.m.'}], '12pm': [{65: '12'}, {65: 'pm', 67: 'p.m.'}], '1a.m.': [{65: '1'}, {65: 'a.m.', 67: 'a.m.'}], '1am': [{65: '1'}, {65: 'am', 67: 'a.m.'}], '1p.m.': [{65: '1'}, {65: 'p.m.', 67: 
'p.m.'}], '1pm': [{65: '1'}, {65: 'pm', 67: 'p.m.'}], '2a.m.': [{65: '2'}, {65: 'a.m.', 67: 'a.m.'}], '2am': [{65: '2'}, {65: 'am', 67: 'a.m.'}], '2p.m.': [{65: '2'}, {65: 'p.m.', 67: 'p.m.'}], '2pm': [{65: '2'}, {65: 'pm', 67: 'p.m.'}], '3a.m.': [{65: '3'}, {65: 'a.m.', 67: 'a.m.'}], '3am': [{65: '3'}, {65: 'am', 67: 'a.m.'}], '3p.m.': [{65: '3'}, {65: 'p.m.', 67: 'p.m.'}], '3pm': [{65: '3'}, {65: 'pm', 67: 'p.m.'}], '4a.m.': [{65: '4'}, {65: 'a.m.', 67: 'a.m.'}], '4am': [{65: '4'}, {65: 'am', 67: 'a.m.'}], '4p.m.': [{65: '4'}, {65: 'p.m.', 67: 'p.m.'}], '4pm': [{65: '4'}, {65: 'pm', 67: 'p.m.'}], '5a.m.': [{65: '5'}, {65: 'a.m.', 67: 'a.m.'}], '5am': [{65: '5'}, {65: 'am', 67: 'a.m.'}], '5p.m.': [{65: '5'}, {65: 'p.m.', 67: 'p.m.'}], '5pm': [{65: '5'}, {65: 'pm', 67: 'p.m.'}], '6a.m.': [{65: '6'}, {65: 'a.m.', 67: 'a.m.'}], '6am': [{65: '6'}, {65: 'am', 67: 'a.m.'}], '6p.m.': [{65: '6'}, {65: 'p.m.', 67: 'p.m.'}], '6pm': [{65: '6'}, {65: 'pm', 67: 'p.m.'}], '7a.m.': [{65: '7'}, {65: 'a.m.', 67: 'a.m.'}], '7am': [{65: '7'}, {65: 'am', 67: 'a.m.'}], '7p.m.': [{65: '7'}, {65: 'p.m.', 67: 'p.m.'}], '7pm': [{65: '7'}, {65: 'pm', 67: 'p.m.'}], '8)': [{65: '8)'}], '8-)': [{65: '8-)'}], '8-D': [{65: '8-D'}], '8D': [{65: '8D'}], '8a.m.': [{65: '8'}, {65: 'a.m.', 67: 'a.m.'}], '8am': [{65: '8'}, {65: 'am', 67: 'a.m.'}], '8p.m.': [{65: '8'}, {65: 'p.m.', 67: 'p.m.'}], '8pm': [{65: '8'}, {65: 'pm', 67: 'p.m.'}], '9a.m.': [{65: '9'}, {65: 'a.m.', 67: 'a.m.'}], '9am': [{65: '9'}, {65: 'am', 67: 'a.m.'}], '9p.m.': [{65: '9'}, {65: 'p.m.', 67: 'p.m.'}], '9pm': [{65: '9'}, {65: 'pm', 67: 'p.m.'}], ":'(": [{65: ":'("}], ":')": [{65: ":')"}], ":'-(": [{65: ":'-("}], ":'-)": [{65: ":'-)"}], ':(': [{65: ':('}], ':((': [{65: ':(('}], ':(((': [{65: ':((('}], ':()': [{65: ':()'}], ':)': [{65: ':)'}], ':))': [{65: ':))'}], ':)))': [{65: ':)))'}], ':*': [{65: ':*'}], ':-(': [{65: ':-('}], ':-((': [{65: ':-(('}], ':-(((': [{65: ':-((('}], ':-)': [{65: ':-)'}], ':-))': [{65: ':-))'}], 
':-)))': [{65: ':-)))'}], ':-*': [{65: ':-*'}], ':-/': [{65: ':-/'}], ':-0': [{65: ':-0'}], ':-3': [{65: ':-3'}], ':->': [{65: ':->'}], ':-D': [{65: ':-D'}], ':-O': [{65: ':-O'}], ':-P': [{65: ':-P'}], ':-X': [{65: ':-X'}], ':-]': [{65: ':-]'}], ':-o': [{65: ':-o'}], ':-p': [{65: ':-p'}], ':-x': [{65: ':-x'}], ':-|': [{65: ':-|'}], ':-}': [{65: ':-}'}], ':/': [{65: ':/'}], ':0': [{65: ':0'}], ':1': [{65: ':1'}], ':3': [{65: ':3'}], ':>': [{65: ':>'}], ':D': [{65: ':D'}], ':O': [{65: ':O'}], ':P': [{65: ':P'}], ':X': [{65: ':X'}], ':]': [{65: ':]'}], ':o': [{65: ':o'}], ':o)': [{65: ':o)'}], ':p': [{65: ':p'}], ':x': [{65: ':x'}], ':|': [{65: ':|'}], ':}': [{65: ':}'}], ':’(': [{65: ':’('}], ':’)': [{65: ':’)'}], ':’-(': [{65: ':’-('}], ':’-)': [{65: ':’-)'}], ';)': [{65: ';)'}], ';-)': [{65: ';-)'}], ';-D': [{65: ';-D'}], ';D': [{65: ';D'}], ';_;': [{65: ';_;'}], '<.<': [{65: '<.<'}], '</3': [{65: '</3'}], '<3': [{65: '<3'}], '<33': [{65: '<33'}], '<333': [{65: '<333'}], '<space>': [{65: '<space>'}], '=(': [{65: '=('}], '=)': [{65: '=)'}], '=/': [{65: '=/'}], '=3': [{65: '=3'}], '=D': [{65: '=D'}], '=[': [{65: '=['}], '=]': [{65: '=]'}], '=|': [{65: '=|'}], '>.<': [{65: '>.<'}], '>.>': [{65: '>.>'}], '>:(': [{65: '>:('}], '>:o': [{65: '>:o'}], '><(((*>': [{65: '><(((*>'}], '@_@': [{65: '@_@'}], 'Adm.': [{65: 'Adm.'}], "Ain't": [{'number': 2, 65: 'Ai'}, {65: "n't", 67: 'not'}], 'Aint': [{'number': 2, 65: 'Ai'}, {65: 'nt', 67: 'not'}], 'Ain’t': [{'number': 2, 65: 'Ai'}, {65: 'n’t', 67: 'not'}], 'Ak.': [{65: 'Ak.', 67: 'Alaska'}], 'Ala.': [{65: 'Ala.', 67: 'Alabama'}], 'Apr.': [{65: 'Apr.', 67: 'April'}], "Aren't": [{'number': 2, 65: 'Are', 67: 'are'}, {65: "n't", 67: 'not'}], 'Arent': [{'number': 2, 65: 'Are', 67: 'are'}, {65: 'nt', 67: 'not'}], 'Aren’t': [{'number': 2, 65: 'Are', 67: 'are'}, {65: 'n’t', 67: 'not'}], 'Ariz.': [{65: 'Ariz.', 67: 'Arizona'}], 'Ark.': [{65: 'Ark.', 67: 'Arkansas'}], 'Aug.': [{65: 'Aug.', 67: 'August'}], 'Bros.': [{65: 'Bros.'}], 
"C'mon": [{65: "C'm", 67: 'come'}, {65: 'on'}], 'C++': [{65: 'C++'}], 'Calif.': [{65: 'Calif.', 67: 'California'}], "Can't": [{65: 'Ca', 67: 'can'}, {65: "n't", 67: 'not'}], "Can't've": [{65: 'Ca', 67: 'can'}, {65: "n't", 67: 'not'}, {65: "'ve", 67: 'have'}], 'Cannot': [{65: 'Can', 67: 'can'}, {65: 'not'}], 'Cant': [{65: 'Ca', 67: 'can'}, {65: 'nt', 67: 'not'}], 'Cantve': [{65: 'Ca', 67: 'can'}, {65: 'nt', 67: 'not'}, {65: 've', 67: 'have'}], 'Can’t': [{65: 'Ca', 67: 'can'}, {65: 'n’t', 67: 'not'}], 'Can’t’ve': [{65: 'Ca', 67: 'can'}, {65: 'n’t', 67: 'not'}, {65: '’ve', 67: 'have'}], 'Co.': [{65: 'Co.'}], 'Colo.': [{65: 'Colo.', 67: 'Colorado'}], 'Conn.': [{65: 'Conn.', 67: 'Connecticut'}], 'Corp.': [{65: 'Corp.'}], "Could've": [{65: 'Could', 67: 'could'}, {65: "'ve"}], "Couldn't": [{65: 'Could', 67: 'could'}, {65: "n't", 67: 'not'}], "Couldn't've": [{65: 'Could', 67: 'could'}, {65: "n't", 67: 'not'}, {65: "'ve", 67: 'have'}], 'Couldnt': [{65: 'Could', 67: 'could'}, {65: 'nt', 67: 'not'}], 'Couldntve': [{65: 'Could', 67: 'could'}, {65: 'nt', 67: 'not'}, {65: 've', 67: 'have'}], 'Couldn’t': [{65: 'Could', 67: 'could'}, {65: 'n’t', 67: 'not'}], 'Couldn’t’ve': [{65: 'Could', 67: 'could'}, {65: 'n’t', 67: 'not'}, {65: '’ve', 67: 'have'}], 'Couldve': [{65: 'Could', 67: 'could'}, {65: 've'}], 'Could’ve': [{65: 'Could', 67: 'could'}, {65: '’ve'}], 'C’mon': [{65: 'C’m', 67: 'come'}, {65: 'on'}], 'D.C.': [{65: 'D.C.'}], "Daren't": [{65: 'Dare', 67: 'dare'}, {65: "n't", 67: 'not'}], 'Darent': [{65: 'Dare', 67: 'dare'}, {65: 'nt', 67: 'not'}], 'Daren’t': [{65: 'Dare', 67: 'dare'}, {65: 'n’t', 67: 'not'}], 'Dec.': [{65: 'Dec.', 67: 'December'}], 'Del.': [{65: 'Del.', 67: 'Delaware'}], "Didn't": [{65: 'Did', 67: 'do'}, {65: "n't", 67: 'not'}], "Didn't've": [{65: 'Did', 67: 'do'}, {65: "n't", 67: 'not'}, {65: "'ve", 67: 'have'}], 'Didnt': [{65: 'Did', 67: 'do'}, {65: 'nt', 67: 'not'}], 'Didntve': [{65: 'Did', 67: 'do'}, {65: 'nt', 67: 'not'}, {65: 've', 67: 'have'}], 'Didn’t': 
[{65: 'Did', 67: 'do'}, {65: 'n’t', 67: 'not'}], 'Didn’t’ve': [{65: 'Did', 67: 'do'}, {65: 'n’t', 67: 'not'}, {65: '’ve', 67: 'have'}], "Doesn't": [{65: 'Does', 67: 'does'}, {65: "n't", 67: 'not'}], "Doesn't've": [{65: 'Does', 67: 'does'}, {65: "n't", 67: 'not'}, {65: "'ve", 67: 'have'}], 'Doesnt': [{65: 'Does', 67: 'does'}, {65: 'nt', 67: 'not'}], 'Doesntve': [{65: 'Does', 67: 'does'}, {65: 'nt', 67: 'not'}, {65: 've', 67: 'have'}], 'Doesn’t': [{65: 'Does', 67: 'does'}, {65: 'n’t', 67: 'not'}], 'Doesn’t’ve': [{65: 'Does', 67: 'does'}, {65: 'n’t', 67: 'not'}, {65: '’ve', 67: 'have'}], 'Doin': [{65: 'Doin', 67: 'doing'}], "Doin'": [{65: "Doin'", 67: 'doing'}], 'Doin’': [{65: 'Doin’', 67: 'doing'}], "Don't": [{65: 'Do', 67: 'do'}, {65: "n't", 67: 'not'}], "Don't've": [{65: 'Do', 67: 'do'}, {65: "n't", 67: 'not'}, {65: "'ve", 67: 'have'}], 'Dont': [{65: 'Do', 67: 'do'}, {65: 'nt', 67: 'not'}], 'Dontve': [{65: 'Do', 67: 'do'}, {65: 'nt', 67: 'not'}, {65: 've', 67: 'have'}], 'Don’t': [{65: 'Do', 67: 'do'}, {65: 'n’t', 67: 'not'}], 'Don’t’ve': [{65: 'Do', 67: 'do'}, {65: 'n’t', 67: 'not'}, {65: '’ve', 67: 'have'}], 'Dr.': [{65: 'Dr.'}], 'E.G.': [{65: 'E.G.'}], 'E.g.': [{65: 'E.g.'}], 'Feb.': [{65: 'Feb.', 67: 'February'}], 'Fla.': [{65: 'Fla.', 67: 'Florida'}], 'Ga.': [{65: 'Ga.', 67: 'Georgia'}], 'Gen.': [{65: 'Gen.'}], 'Goin': [{65: 'Goin', 67: 'going'}], "Goin'": [{65: "Goin'", 67: 'going'}], 'Goin’': [{65: 'Goin’', 67: 'going'}], 'Gonna': [{65: 'Gon', 67: 'going'}, {65: 'na', 67: 'to'}], 'Gotta': [{65: 'Got', 67: 'got'}, {65: 'ta', 67: 'to'}], 'Gov.': [{65: 'Gov.'}], "Hadn't": [{65: 'Had', 67: 'have'}, {65: "n't", 67: 'not'}], "Hadn't've": [{65: 'Had', 67: 'have'}, {65: "n't", 67: 'not'}, {65: "'ve", 67: 'have'}], 'Hadnt': [{65: 'Had', 67: 'have'}, {65: 'nt', 67: 'not'}], 'Hadntve': [{65: 'Had', 67: 'have'}, {65: 'nt', 67: 'not'}, {65: 've', 67: 'have'}], 'Hadn’t': [{65: 'Had', 67: 'have'}, {65: 'n’t', 67: 'not'}], 'Hadn’t’ve': [{65: 'Had', 67: 'have'}, {65: 'n’t', 
67: 'not'}, {65: '’ve', 67: 'have'}], "Hasn't": [{65: 'Has', 67: 'has'}, {65: "n't", 67: 'not'}], 'Hasnt': [{65: 'Has', 67: 'has'}, {65: 'nt', 67: 'not'}], 'Hasn’t': [{65: 'Has', 67: 'has'}, {65: 'n’t', 67: 'not'}], "Haven't": [{65: 'Have', 67: 'have'}, {65: "n't", 67: 'not'}], 'Havent': [{65: 'Have', 67: 'have'}, {65: 'nt', 67: 'not'}], 'Haven’t': [{65: 'Have', 67: 'have'}, {65: 'n’t', 67: 'not'}], 'Havin': [{65: 'Havin', 67: 'having'}], "Havin'": [{65: "Havin'", 67: 'having'}], 'Havin’': [{65: 'Havin’', 67: 'having'}], "He'd": [{65: 'He', 67: 'he'}, {65: "'d", 67: "'d"}], "He'd've": [{65: 'He', 67: 'he'}, {65: "'d", 67: 'would'}, {65: "'ve", 67: 'have'}], "He'll": [{65: 'He', 67: 'he'}, {65: "'ll", 67: 'will'}], "He'll've": [{65: 'He', 67: 'he'}, {65: "'ll", 67: 'will'}, {65: "'ve", 67: 'have'}], "He's": [{65: 'He', 67: 'he'}, {65: "'s", 67: "'s"}], 'Hed': [{65: 'He', 67: 'he'}, {65: 'd', 67: "'d"}], 'Hedve': [{65: 'He', 67: 'he'}, {65: 'd', 67: 'would'}, {65: 've', 67: 'have'}], 'Hellve': [{65: 'He', 67: 'he'}, {65: 'll', 67: 'will'}, {65: 've', 67: 'have'}], 'Hes': [{65: 'He', 67: 'he'}, {65: 's'}], 'He’d': [{65: 'He', 67: 'he'}, {65: '’d', 67: "'d"}], 'He’d’ve': [{65: 'He', 67: 'he'}, {65: '’d', 67: 'would'}, {65: '’ve', 67: 'have'}], 'He’ll': [{65: 'He', 67: 'he'}, {65: '’ll', 67: 'will'}], 'He’ll’ve': [{65: 'He', 67: 'he'}, {65: '’ll', 67: 'will'}, {65: '’ve', 67: 'have'}], 'He’s': [{65: 'He', 67: 'he'}, {65: '’s', 67: "'s"}], "How'd": [{65: 'How', 67: 'how'}, {65: "'d", 67: "'d"}], "How'd've": [{65: 'How', 67: 'how'}, {65: "'d", 67: 'would'}, {65: "'ve", 67: 'have'}], "How'd'y": [{65: 'How', 67: 'how'}, {65: "'d"}, {65: "'y", 67: 'you'}], "How'll": [{65: 'How', 67: 'how'}, {65: "'ll", 67: 'will'}], "How'll've": [{65: 'How', 67: 'how'}, {65: "'ll", 67: 'will'}, {65: "'ve", 67: 'have'}], "How're": [{65: 'How', 67: 'how'}, {65: "'re", 67: 'are'}], "How's": [{65: 'How', 67: 'how'}, {65: "'s", 67: "'s"}], "How've": [{65: 'How', 67: 'how'}, {65: "'ve"}], 'Howd': 
[{65: 'How', 67: 'how'}, {65: 'd', 67: "'d"}], 'Howdve': [{65: 'How', 67: 'how'}, {65: 'd', 67: 'would'}, {65: 've', 67: 'have'}], 'Howll': [{65: 'How', 67: 'how'}, {65: 'll', 67: 'will'}], 'Howllve': [{65: 'How', 67: 'how'}, {65: 'll', 67: 'will'}, {65: 've', 67: 'have'}], 'Howre': [{65: 'How', 67: 'how'}, {65: 're', 67: 'are'}], 'Hows': [{65: 'How', 67: 'how'}, {65: 's'}], 'Howve': [{65: 'How'}, {65: 've', 67: 'have'}], 'How’d': [{65: 'How', 67: 'how'}, {65: '’d', 67: "'d"}], 'How’d’ve': [{65: 'How', 67: 'how'}, {65: '’d', 67: 'would'}, {65: '’ve', 67: 'have'}], 'How’d’y': [{65: 'How', 67: 'how'}, {65: '’d'}, {65: '’y', 67: 'you'}], 'How’ll': [{65: 'How', 67: 'how'}, {65: '’ll', 67: 'will'}], 'How’ll’ve': [{65: 'How', 67: 'how'}, {65: '’ll', 67: 'will'}, {65: '’ve', 67: 'have'}], 'How’re': [{65: 'How', 67: 'how'}, {65: '’re', 67: 'are'}], 'How’s': [{65: 'How', 67: 'how'}, {65: '’s', 67: "'s"}], 'How’ve': [{65: 'How', 67: 'how'}, {65: '’ve'}], "I'd": [{65: 'I', 67: 'i'}, {65: "'d", 67: "'d"}], "I'd've": [{65: 'I', 67: 'i'}, {65: "'d", 67: 'would'}, {65: "'ve", 67: 'have'}], "I'll": [{65: 'I', 67: 'i'}, {65: "'ll", 67: 'will'}], "I'll've": [{65: 'I', 67: 'i'}, {65: "'ll", 67: 'will'}, {65: "'ve", 67: 'have'}], "I'm": [{65: 'I', 67: 'i'}, {65: "'m", 67: 'am'}], "I'ma": [{65: 'I', 67: 'i'}, {65: "'m", 67: 'am'}, {65: 'a', 67: 'gonna'}], "I've": [{65: 'I', 67: 'i'}, {65: "'ve", 67: 'have'}], 'I.E.': [{65: 'I.E.'}], 'I.e.': [{65: 'I.e.'}], 'Ia.': [{65: 'Ia.', 67: 'Iowa'}], 'Id': [{65: 'I', 67: 'i'}, {65: 'd', 67: "'d"}], 'Id.': [{65: 'Id.', 67: 'Idaho'}], 'Idve': [{65: 'I', 67: 'i'}, {65: 'd', 67: 'would'}, {65: 've', 67: 'have'}], 'Ill.': [{65: 'Ill.', 67: 'Illinois'}], 'Illve': [{65: 'I', 67: 'i'}, {65: 'll', 67: 'will'}, {65: 've', 67: 'have'}], 'Im': [{65: 'I', 67: 'i'}, {65: 'm'}], 'Ima': [{65: 'I', 67: 'i'}, {65: 'm', 67: 'am'}, {65: 'a', 67: 'gonna'}], 'Inc.': [{65: 'Inc.'}], 'Ind.': [{65: 'Ind.', 67: 'Indiana'}], "Isn't": [{65: 'Is', 67: 'is'}, {65: "n't", 67: 
'not'}], 'Isnt': [{65: 'Is', 67: 'is'}, {65: 'nt', 67: 'not'}], 'Isn’t': [{65: 'Is', 67: 'is'}, {65: 'n’t', 67: 'not'}], "It'd": [{65: 'It', 67: 'it'}, {65: "'d", 67: "'d"}], "It'd've": [{65: 'It', 67: 'it'}, {65: "'d", 67: 'would'}, {65: "'ve", 67: 'have'}], "It'll": [{65: 'It', 67: 'it'}, {65: "'ll", 67: 'will'}], "It'll've": [{65: 'It', 67: 'it'}, {65: "'ll", 67: 'will'}, {65: "'ve", 67: 'have'}], "It's": [{65: 'It', 67: 'it'}, {65: "'s", 67: "'s"}], 'Itd': [{65: 'It', 67: 'it'}, {65: 'd', 67: "'d"}], 'Itdve': [{65: 'It', 67: 'it'}, {65: 'd', 67: 'would'}, {65: 've', 67: 'have'}], 'Itll': [{65: 'It', 67: 'it'}, {65: 'll', 67: 'will'}], 'Itllve': [{65: 'It', 67: 'it'}, {65: 'll', 67: 'will'}, {65: 've', 67: 'have'}], 'It’d': [{65: 'It', 67: 'it'}, {65: '’d', 67: "'d"}], 'It’d’ve': [{65: 'It', 67: 'it'}, {65: '’d', 67: 'would'}, {65: '’ve', 67: 'have'}], 'It’ll': [{65: 'It', 67: 'it'}, {65: '’ll', 67: 'will'}], 'It’ll’ve': [{65: 'It', 67: 'it'}, {65: '’ll', 67: 'will'}, {65: '’ve', 67: 'have'}], 'It’s': [{65: 'It', 67: 'it'}, {65: '’s', 67: "'s"}], 'Ive': [{65: 'I', 67: 'i'}, {65: 've', 67: 'have'}], 'I’d': [{65: 'I', 67: 'i'}, {65: '’d', 67: "'d"}], 'I’d’ve': [{65: 'I', 67: 'i'}, {65: '’d', 67: 'would'}, {65: '’ve', 67: 'have'}], 'I’ll': [{65: 'I', 67: 'i'}, {65: '’ll', 67: 'will'}], 'I’ll’ve': [{65: 'I', 67: 'i'}, {65: '’ll', 67: 'will'}, {65: '’ve', 67: 'have'}], 'I’m': [{65: 'I', 67: 'i'}, {65: '’m', 67: 'am'}], 'I’ma': [{65: 'I', 67: 'i'}, {65: '’m', 67: 'am'}, {65: 'a', 67: 'gonna'}], 'I’ve': [{65: 'I', 67: 'i'}, {65: '’ve', 67: 'have'}], 'Jan.': [{65: 'Jan.', 67: 'January'}], 'Jr.': [{65: 'Jr.'}], 'Jul.': [{65: 'Jul.', 67: 'July'}], 'Jun.': [{65: 'Jun.', 67: 'June'}], 'Kan.': [{65: 'Kan.', 67: 'Kansas'}], 'Kans.': [{65: 'Kans.', 67: 'Kansas'}], 'Ky.': [{65: 'Ky.', 67: 'Kentucky'}], 'La.': [{65: 'La.', 67: 'Louisiana'}], "Let's": [{65: 'Let', 67: 'let'}, {65: "'s", 67: 'us'}], 'Let’s': [{65: 'Let', 67: 'let'}, {65: '’s', 67: 'us'}], 'Lovin': [{65: 'Lovin', 
67: 'loving'}], "Lovin'": [{65: "Lovin'", 67: 'loving'}], 'Lovin’': [{65: 'Lovin’', 67: 'loving'}], 'Ltd.': [{65: 'Ltd.'}], "Ma'am": [{65: "Ma'am", 67: 'madam'}], 'Mar.': [{65: 'Mar.', 67: 'March'}], 'Mass.': [{65: 'Mass.', 67: 'Massachusetts'}], "Mayn't": [{65: 'May', 67: 'may'}, {65: "n't", 67: 'not'}], "Mayn't've": [{65: 'May', 67: 'may'}, {65: "n't", 67: 'not'}, {65: "'ve", 67: 'have'}], 'Maynt': [{65: 'May', 67: 'may'}, {65: 'nt', 67: 'not'}], 'Mayntve': [{65: 'May', 67: 'may'}, {65: 'nt', 67: 'not'}, {65: 've', 67: 'have'}], 'Mayn’t': [{65: 'May', 67: 'may'}, {65: 'n’t', 67: 'not'}], 'Mayn’t’ve': [{65: 'May', 67: 'may'}, {65: 'n’t', 67: 'not'}, {65: '’ve', 67: 'have'}], 'Ma’am': [{65: 'Ma’am', 67: 'madam'}], 'Md.': [{65: 'Md.'}], 'Messrs.': [{65: 'Messrs.'}], 'Mich.': [{65: 'Mich.', 67: 'Michigan'}], "Might've": [{65: 'Might', 67: 'might'}, {65: "'ve"}], "Mightn't": [{65: 'Might', 67: 'might'}, {65: "n't", 67: 'not'}], "Mightn't've": [{65: 'Might', 67: 'might'}, {65: "n't", 67: 'not'}, {65: "'ve", 67: 'have'}], 'Mightnt': [{65: 'Might', 67: 'might'}, {65: 'nt', 67: 'not'}], 'Mightntve': [{65: 'Might', 67: 'might'}, {65: 'nt', 67: 'not'}, {65: 've', 67: 'have'}], 'Mightn’t': [{65: 'Might', 67: 'might'}, {65: 'n’t', 67: 'not'}], 'Mightn’t’ve': [{65: 'Might', 67: 'might'}, {65: 'n’t', 67: 'not'}, {65: '’ve', 67: 'have'}], 'Mightve': [{65: 'Might', 67: 'might'}, {65: 've'}], 'Might’ve': [{65: 'Might', 67: 'might'}, {65: '’ve'}], 'Minn.': [{65: 'Minn.', 67: 'Minnesota'}], 'Miss.': [{65: 'Miss.', 67: 'Mississippi'}], 'Mo.': [{65: 'Mo.'}], 'Mont.': [{65: 'Mont.'}], 'Mr.': [{65: 'Mr.'}], 'Mrs.': [{65: 'Mrs.'}], 'Ms.': [{65: 'Ms.'}], 'Mt.': [{65: 'Mt.', 67: 'Mount'}], "Must've": [{65: 'Must', 67: 'must'}, {65: "'ve"}], "Mustn't": [{65: 'Must', 67: 'must'}, {65: "n't", 67: 'not'}], "Mustn't've": [{65: 'Must', 67: 'must'}, {65: "n't", 67: 'not'}, {65: "'ve", 67: 'have'}], 'Mustnt': [{65: 'Must', 67: 'must'}, {65: 'nt', 67: 'not'}], 'Mustntve': [{65: 'Must', 67: 'must'}, 
{65: 'nt', 67: 'not'}, {65: 've', 67: 'have'}], 'Mustn’t': [{65: 'Must', 67: 'must'}, {65: 'n’t', 67: 'not'}], 'Mustn’t’ve': [{65: 'Must', 67: 'must'}, {65: 'n’t', 67: 'not'}, {65: '’ve', 67: 'have'}], 'Mustve': [{65: 'Must', 67: 'must'}, {65: 've'}], 'Must’ve': [{65: 'Must', 67: 'must'}, {65: '’ve'}], 'N.C.': [{65: 'N.C.', 67: 'North Carolina'}], 'N.D.': [{65: 'N.D.', 67: 'North Dakota'}], 'N.H.': [{65: 'N.H.', 67: 'New Hampshire'}], 'N.J.': [{65: 'N.J.', 67: 'New Jersey'}], 'N.M.': [{65: 'N.M.', 67: 'New Mexico'}], 'N.Y.': [{65: 'N.Y.', 67: 'New York'}], 'Neb.': [{65: 'Neb.', 67: 'Nebraska'}], 'Nebr.': [{65: 'Nebr.', 67: 'Nebraska'}], "Needn't": [{65: 'Need', 67: 'need'}, {65: "n't", 67: 'not'}], "Needn't've": [{65: 'Need', 67: 'need'}, {65: "n't", 67: 'not'}, {65: "'ve", 67: 'have'}], 'Neednt': [{65: 'Need', 67: 'need'}, {65: 'nt', 67: 'not'}], 'Needntve': [{65: 'Need', 67: 'need'}, {65: 'nt', 67: 'not'}, {65: 've', 67: 'have'}], 'Needn’t': [{65: 'Need', 67: 'need'}, {65: 'n’t', 67: 'not'}], 'Needn’t’ve': [{65: 'Need', 67: 'need'}, {65: 'n’t', 67: 'not'}, {65: '’ve', 67: 'have'}], 'Nev.': [{65: 'Nev.', 67: 'Nevada'}], "Not've": [{65: 'Not', 67: 'not'}, {65: "'ve", 67: 'have'}], 'Nothin': [{65: 'Nothin', 67: 'nothing'}], "Nothin'": [{65: "Nothin'", 67: 'nothing'}], 'Nothin’': [{65: 'Nothin’', 67: 'nothing'}], 'Notve': [{65: 'Not', 67: 'not'}, {65: 've', 67: 'have'}], 'Not’ve': [{65: 'Not', 67: 'not'}, {65: '’ve', 67: 'have'}], 'Nov.': [{65: 'Nov.', 67: 'November'}], 'Nuthin': [{65: 'Nuthin', 67: 'nothing'}], "Nuthin'": [{65: "Nuthin'", 67: 'nothing'}], 'Nuthin’': [{65: 'Nuthin’', 67: 'nothing'}], "O'clock": [{65: "O'clock", 67: "o'clock"}], 'O.O': [{65: 'O.O'}], 'O.o': [{65: 'O.o'}], 'O_O': [{65: 'O_O'}], 'O_o': [{65: 'O_o'}], 'Oct.': [{65: 'Oct.', 67: 'October'}], 'Okla.': [{65: 'Okla.', 67: 'Oklahoma'}], 'Ol': [{65: 'Ol', 67: 'old'}], "Ol'": [{65: "Ol'", 67: 'old'}], 'Ol’': [{65: 'Ol’', 67: 'old'}], 'Ore.': [{65: 'Ore.', 67: 'Oregon'}], "Oughtn't": [{65: 
'Ought', 67: 'ought'}, {65: "n't", 67: 'not'}], "Oughtn't've": [{65: 'Ought', 67: 'ought'}, {65: "n't", 67: 'not'}, {65: "'ve", 67: 'have'}], 'Oughtnt': [{65: 'Ought', 67: 'ought'}, {65: 'nt', 67: 'not'}], 'Oughtntve': [{65: 'Ought', 67: 'ought'}, {65: 'nt', 67: 'not'}, {65: 've', 67: 'have'}], 'Oughtn’t': [{65: 'Ought', 67: 'ought'}, {65: 'n’t', 67: 'not'}], 'Oughtn’t’ve': [{65: 'Ought', 67: 'ought'}, {65: 'n’t', 67: 'not'}, {65: '’ve', 67: 'have'}], 'O’clock': [{65: 'O’clock', 67: "o'clock"}], 'Pa.': [{65: 'Pa.', 67: 'Pennsylvania'}], 'Ph.D.': [{65: 'Ph.D.'}], 'Prof.': [{65: 'Prof.'}], 'Rep.': [{65: 'Rep.'}], 'Rev.': [{65: 'Rev.'}], 'S.C.': [{65: 'S.C.', 67: 'South Carolina'}], 'Sen.': [{65: 'Sen.'}], 'Sep.': [{65: 'Sep.', 67: 'September'}], 'Sept.': [{65: 'Sept.', 67: 'September'}], "Shan't": [{65: 'Sha', 67: 'shall'}, {65: "n't", 67: 'not'}], "Shan't've": [{65: 'Sha', 67: 'shall'}, {65: "n't", 67: 'not'}, {65: "'ve", 67: 'have'}], 'Shant': [{65: 'Sha', 67: 'shall'}, {65: 'nt', 67: 'not'}], 'Shantve': [{65: 'Sha', 67: 'shall'}, {65: 'nt', 67: 'not'}, {65: 've', 67: 'have'}], 'Shan’t': [{65: 'Sha', 67: 'shall'}, {65: 'n’t', 67: 'not'}], 'Shan’t’ve': [{65: 'Sha', 67: 'shall'}, {65: 'n’t', 67: 'not'}, {65: '’ve', 67: 'have'}], "She'd": [{65: 'She', 67: 'she'}, {65: "'d", 67: "'d"}], "She'd've": [{65: 'She', 67: 'she'}, {65: "'d", 67: 'would'}, {65: "'ve", 67: 'have'}], "She'll": [{65: 'She', 67: 'she'}, {65: "'ll", 67: 'will'}], "She'll've": [{65: 'She', 67: 'she'}, {65: "'ll", 67: 'will'}, {65: "'ve", 67: 'have'}], "She's": [{65: 'She', 67: 'she'}, {65: "'s", 67: "'s"}], 'Shedve': [{65: 'She', 67: 'she'}, {65: 'd', 67: 'would'}, {65: 've', 67: 'have'}], 'Shellve': [{65: 'She', 67: 'she'}, {65: 'll', 67: 'will'}, {65: 've', 67: 'have'}], 'Shes': [{65: 'She', 67: 'she'}, {65: 's'}], 'She’d': [{65: 'She', 67: 'she'}, {65: '’d', 67: "'d"}], 'She’d’ve': [{65: 'She', 67: 'she'}, {65: '’d', 67: 'would'}, {65: '’ve', 67: 'have'}], 'She’ll': [{65: 'She', 67: 'she'}, {65: 
'’ll', 67: 'will'}], 'She’ll’ve': [{65: 'She', 67: 'she'}, {65: '’ll', 67: 'will'}, {65: '’ve', 67: 'have'}], 'She’s': [{65: 'She', 67: 'she'}, {65: '’s', 67: "'s"}], "Should've": [{65: 'Should', 67: 'should'}, {65: "'ve"}], "Shouldn't": [{65: 'Should', 67: 'should'}, {65: "n't", 67: 'not'}], "Shouldn't've": [{65: 'Should', 67: 'should'}, {65: "n't", 67: 'not'}, {65: "'ve", 67: 'have'}], 'Shouldnt': [{65: 'Should', 67: 'should'}, {65: 'nt', 67: 'not'}], 'Shouldntve': [{65: 'Should', 67: 'should'}, {65: 'nt', 67: 'not'}, {65: 've', 67: 'have'}], 'Shouldn’t': [{65: 'Should', 67: 'should'}, {65: 'n’t', 67: 'not'}], 'Shouldn’t’ve': [{65: 'Should', 67: 'should'}, {65: 'n’t', 67: 'not'}, {65: '’ve', 67: 'have'}], 'Shouldve': [{65: 'Should', 67: 'should'}, {65: 've'}], 'Should’ve': [{65: 'Should', 67: 'should'}, {65: '’ve'}], 'Somethin': [{65: 'Somethin', 67: 'something'}], "Somethin'": [{65: "Somethin'", 67: 'something'}], 'Somethin’': [{65: 'Somethin’', 67: 'something'}], 'St.': [{65: 'St.'}], 'Tenn.': [{65: 'Tenn.', 67: 'Tennessee'}], "That'd": [{65: 'That', 67: 'that'}, {65: "'d", 67: "'d"}], "That'd've": [{65: 'That', 67: 'that'}, {65: "'d", 67: 'would'}, {65: "'ve", 67: 'have'}], "That'll": [{65: 'That', 67: 'that'}, {65: "'ll", 67: 'will'}], "That'll've": [{65: 'That', 67: 'that'}, {65: "'ll", 67: 'will'}, {65: "'ve", 67: 'have'}], "That's": [{65: 'That', 67: 'that'}, {65: "'s", 67: "'s"}], 'Thatd': [{65: 'That', 67: 'that'}, {65: 'd', 67: "'d"}], 'Thatdve': [{65: 'That', 67: 'that'}, {65: 'd', 67: 'would'}, {65: 've', 67: 'have'}], 'Thatll': [{65: 'That', 67: 'that'}, {65: 'll', 67: 'will'}], 'Thatllve': [{65: 'That', 67: 'that'}, {65: 'll', 67: 'will'}, {65: 've', 67: 'have'}], 'Thats': [{65: 'That', 67: 'that'}, {65: 's'}], 'That’d': [{65: 'That', 67: 'that'}, {65: '’d', 67: "'d"}], 'That’d’ve': [{65: 'That', 67: 'that'}, {65: '’d', 67: 'would'}, {65: '’ve', 67: 'have'}], 'That’ll': [{65: 'That', 67: 'that'}, {65: '’ll', 67: 'will'}], 'That’ll’ve': [{65: 'That', 
67: 'that'}, {65: '’ll', 67: 'will'}, {65: '’ve', 67: 'have'}], 'That’s': [{65: 'That', 67: 'that'}, {65: '’s', 67: "'s"}], "There'd": [{65: 'There', 67: 'there'}, {65: "'d", 67: "'d"}], "There'd've": [{65: 'There', 67: 'there'}, {65: "'d", 67: 'would'}, {65: "'ve", 67: 'have'}], "There'll": [{65: 'There', 67: 'there'}, {65: "'ll", 67: 'will'}], "There'll've": [{65: 'There', 67: 'there'}, {65: "'ll", 67: 'will'}, {65: "'ve", 67: 'have'}], "There're": [{65: 'There', 67: 'there'}, {65: "'re", 67: 'are'}], "There's": [{65: 'There', 67: 'there'}, {65: "'s", 67: "'s"}], "There've": [{65: 'There', 67: 'there'}, {65: "'ve"}], 'Thered': [{65: 'There', 67: 'there'}, {65: 'd', 67: "'d"}], 'Theredve': [{65: 'There', 67: 'there'}, {65: 'd', 67: 'would'}, {65: 've', 67: 'have'}], 'Therell': [{65: 'There', 67: 'there'}, {65: 'll', 67: 'will'}], 'Therellve': [{65: 'There', 67: 'there'}, {65: 'll', 67: 'will'}, {65: 've', 67: 'have'}], 'Therere': [{65: 'There', 67: 'there'}, {65: 're', 67: 'are'}], 'Theres': [{65: 'There', 67: 'there'}, {65: 's'}], 'Thereve': [{65: 'There'}, {65: 've', 67: 'have'}], 'There’d': [{65: 'There', 67: 'there'}, {65: '’d', 67: "'d"}], 'There’d’ve': [{65: 'There', 67: 'there'}, {65: '’d', 67: 'would'}, {65: '’ve', 67: 'have'}], 'There’ll': [{65: 'There', 67: 'there'}, {65: '’ll', 67: 'will'}], 'There’ll’ve': [{65: 'There', 67: 'there'}, {65: '’ll', 67: 'will'}, {65: '’ve', 67: 'have'}], 'There’re': [{65: 'There', 67: 'there'}, {65: '’re', 67: 'are'}], 'There’s': [{65: 'There', 67: 'there'}, {65: '’s', 67: "'s"}], 'There’ve': [{65: 'There', 67: 'there'}, {65: '’ve'}], "These'd": [{65: 'These', 67: 'these'}, {65: "'d", 67: "'d"}], "These'd've": [{65: 'These', 67: 'these'}, {65: "'d", 67: 'would'}, {65: "'ve", 67: 'have'}], "These'll": [{65: 'These', 67: 'these'}, {65: "'ll", 67: 'will'}], "These'll've": [{65: 'These', 67: 'these'}, {65: "'ll", 67: 'will'}, {65: "'ve", 67: 'have'}], "These're": [{65: 'These', 67: 'these'}, {65: "'re", 67: 'are'}], 
"These've": [{65: 'These', 67: 'these'}, {65: "'ve"}], 'Thesed': [{65: 'These', 67: 'these'}, {65: 'd', 67: "'d"}], 'Thesedve': [{65: 'These', 67: 'these'}, {65: 'd', 67: 'would'}, {65: 've', 67: 'have'}], 'Thesell': [{65: 'These', 67: 'these'}, {65: 'll', 67: 'will'}], 'Thesellve': [{65: 'These', 67: 'these'}, {65: 'll', 67: 'will'}, {65: 've', 67: 'have'}], 'Thesere': [{65: 'These', 67: 'these'}, {65: 're', 67: 'are'}], 'Theseve': [{65: 'These'}, {65: 've', 67: 'have'}], 'These’d': [{65: 'These', 67: 'these'}, {65: '’d', 67: "'d"}], 'These’d’ve': [{65: 'These', 67: 'these'}, {65: '’d', 67: 'would'}, {65: '’ve', 67: 'have'}], 'These’ll': [{65: 'These', 67: 'these'}, {65: '’ll', 67: 'will'}], 'These’ll’ve': [{65: 'These', 67: 'these'}, {65: '’ll', 67: 'will'}, {65: '’ve', 67: 'have'}], 'These’re': [{65: 'These', 67: 'these'}, {65: '’re', 67: 'are'}], 'These’ve': [{65: 'These', 67: 'these'}, {65: '’ve'}], "They'd": [{65: 'They', 67: 'they'}, {65: "'d", 67: "'d"}], "They'd've": [{65: 'They', 67: 'they'}, {65: "'d", 67: 'would'}, {65: "'ve", 67: 'have'}], "They'll": [{65: 'They', 67: 'they'}, {65: "'ll", 67: 'will'}], "They'll've": [{65: 'They', 67: 'they'}, {65: "'ll", 67: 'will'}, {65: "'ve", 67: 'have'}], "They're": [{65: 'They', 67: 'they'}, {65: "'re", 67: 'are'}], "They've": [{65: 'They', 67: 'they'}, {65: "'ve", 67: 'have'}], 'Theyd': [{65: 'They', 67: 'they'}, {65: 'd', 67: "'d"}], 'Theydve': [{65: 'They', 67: 'they'}, {65: 'd', 67: 'would'}, {65: 've', 67: 'have'}], 'Theyll': [{65: 'They', 67: 'they'}, {65: 'll', 67: 'will'}], 'Theyllve': [{65: 'They', 67: 'they'}, {65: 'll', 67: 'will'}, {65: 've', 67: 'have'}], 'Theyre': [{65: 'They', 67: 'they'}, {65: 're', 67: 'are'}], 'Theyve': [{65: 'They', 67: 'they'}, {65: 've', 67: 'have'}], 'They’d': [{65: 'They', 67: 'they'}, {65: '’d', 67: "'d"}], 'They’d’ve': [{65: 'They', 67: 'they'}, {65: '’d', 67: 'would'}, {65: '’ve', 67: 'have'}], 'They’ll': [{65: 'They', 67: 'they'}, {65: '’ll', 67: 'will'}], 'They’ll’ve': 
[{65: 'They', 67: 'they'}, {65: '’ll', 67: 'will'}, {65: '’ve', 67: 'have'}], 'They’re': [{65: 'They', 67: 'they'}, {65: '’re', 67: 'are'}], 'They’ve': [{65: 'They', 67: 'they'}, {65: '’ve', 67: 'have'}], "This'd": [{65: 'This', 67: 'this'}, {65: "'d", 67: "'d"}], "This'd've": [{65: 'This', 67: 'this'}, {65: "'d", 67: 'would'}, {65: "'ve", 67: 'have'}], "This'll": [{65: 'This', 67: 'this'}, {65: "'ll", 67: 'will'}], "This'll've": [{65: 'This', 67: 'this'}, {65: "'ll", 67: 'will'}, {65: "'ve", 67: 'have'}], "This's": [{65: 'This', 67: 'this'}, {65: "'s", 67: "'s"}], 'Thisd': [{65: 'This', 67: 'this'}, {65: 'd', 67: "'d"}], 'Thisdve': [{65: 'This', 67: 'this'}, {65: 'd', 67: 'would'}, {65: 've', 67: 'have'}], 'Thisll': [{65: 'This', 67: 'this'}, {65: 'll', 67: 'will'}], 'Thisllve': [{65: 'This', 67: 'this'}, {65: 'll', 67: 'will'}, {65: 've', 67: 'have'}], 'Thiss': [{65: 'This', 67: 'this'}, {65: 's'}], 'This’d': [{65: 'This', 67: 'this'}, {65: '’d', 67: "'d"}], 'This’d’ve': [{65: 'This', 67: 'this'}, {65: '’d', 67: 'would'}, {65: '’ve', 67: 'have'}], 'This’ll': [{65: 'This', 67: 'this'}, {65: '’ll', 67: 'will'}], 'This’ll’ve': [{65: 'This', 67: 'this'}, {65: '’ll', 67: 'will'}, {65: '’ve', 67: 'have'}], 'This’s': [{65: 'This', 67: 'this'}, {65: '’s', 67: "'s"}], "Those'd": [{65: 'Those', 67: 'those'}, {65: "'d", 67: "'d"}], "Those'd've": [{65: 'Those', 67: 'those'}, {65: "'d", 67: 'would'}, {65: "'ve", 67: 'have'}], "Those'll": [{65: 'Those', 67: 'those'}, {65: "'ll", 67: 'will'}], "Those'll've": [{65: 'Those', 67: 'those'}, {65: "'ll", 67: 'will'}, {65: "'ve", 67: 'have'}], "Those're": [{65: 'Those', 67: 'those'}, {65: "'re", 67: 'are'}], "Those've": [{65: 'Those', 67: 'those'}, {65: "'ve"}], 'Thosed': [{65: 'Those', 67: 'those'}, {65: 'd', 67: "'d"}], 'Thosedve': [{65: 'Those', 67: 'those'}, {65: 'd', 67: 'would'}, {65: 've', 67: 'have'}], 'Thosell': [{65: 'Those', 67: 'those'}, {65: 'll', 67: 'will'}], 'Thosellve': [{65: 'Those', 67: 'those'}, {65: 'll', 67: 
'will'}, {65: 've', 67: 'have'}], 'Thosere': [{65: 'Those', 67: 'those'}, {65: 're', 67: 'are'}], 'Thoseve': [{65: 'Those'}, {65: 've', 67: 'have'}], 'Those’d': [{65: 'Those', 67: 'those'}, {65: '’d', 67: "'d"}], 'Those’d’ve': [{65: 'Those', 67: 'those'}, {65: '’d', 67: 'would'}, {65: '’ve', 67: 'have'}], 'Those’ll': [{65: 'Those', 67: 'those'}, {65: '’ll', 67: 'will'}], 'Those’ll’ve': [{65: 'Those', 67: 'those'}, {65: '’ll', 67: 'will'}, {65: '’ve', 67: 'have'}], 'Those’re': [{65: 'Those', 67: 'those'}, {65: '’re', 67: 'are'}], 'Those’ve': [{65: 'Those', 67: 'those'}, {65: '’ve'}], 'V.V': [{65: 'V.V'}], 'V_V': [{65: 'V_V'}], 'Va.': [{65: 'Va.', 67: 'Virginia'}], 'Wash.': [{65: 'Wash.', 67: 'Washington'}], "Wasn't": [{65: 'Was', 67: 'was'}, {65: "n't", 67: 'not'}], 'Wasnt': [{65: 'Was', 67: 'was'}, {65: 'nt', 67: 'not'}], 'Wasn’t': [{65: 'Was', 67: 'was'}, {65: 'n’t', 67: 'not'}], "We'd": [{65: 'We', 67: 'we'}, {65: "'d", 67: "'d"}], "We'd've": [{65: 'We', 67: 'we'}, {65: "'d", 67: 'would'}, {65: "'ve", 67: 'have'}], "We'll": [{65: 'We', 67: 'we'}, {65: "'ll", 67: 'will'}], "We'll've": [{65: 'We', 67: 'we'}, {65: "'ll", 67: 'will'}, {65: "'ve", 67: 'have'}], "We're": [{65: 'We', 67: 'we'}, {65: "'re", 67: 'are'}], "We've": [{65: 'We', 67: 'we'}, {65: "'ve", 67: 'have'}], 'Wed': [{65: 'We', 67: 'we'}, {65: 'd', 67: "'d"}], 'Wedve': [{65: 'We', 67: 'we'}, {65: 'd', 67: 'would'}, {65: 've', 67: 'have'}], 'Wellve': [{65: 'We', 67: 'we'}, {65: 'll', 67: 'will'}, {65: 've', 67: 'have'}], "Weren't": [{65: 'Were', 67: 'were'}, {65: "n't", 67: 'not'}], 'Werent': [{65: 'Were', 67: 'were'}, {65: 'nt', 67: 'not'}], 'Weren’t': [{65: 'Were', 67: 'were'}, {65: 'n’t', 67: 'not'}], 'Weve': [{65: 'We', 67: 'we'}, {65: 've', 67: 'have'}], 'We’d': [{65: 'We', 67: 'we'}, {65: '’d', 67: "'d"}], 'We’d’ve': [{65: 'We', 67: 'we'}, {65: '’d', 67: 'would'}, {65: '’ve', 67: 'have'}], 'We’ll': [{65: 'We', 67: 'we'}, {65: '’ll', 67: 'will'}], 'We’ll’ve': [{65: 'We', 67: 'we'}, {65: '’ll', 67: 
'will'}, {65: '’ve', 67: 'have'}], 'We’re': [{65: 'We', 67: 'we'}, {65: '’re', 67: 'are'}], 'We’ve': [{65: 'We', 67: 'we'}, {65: '’ve', 67: 'have'}], "What'd": [{65: 'What', 67: 'what'}, {65: "'d", 67: "'d"}], "What'd've": [{65: 'What', 67: 'what'}, {65: "'d", 67: 'would'}, {65: "'ve", 67: 'have'}], "What'll": [{65: 'What', 67: 'what'}, {65: "'ll", 67: 'will'}], "What'll've": [{65: 'What', 67: 'what'}, {65: "'ll", 67: 'will'}, {65: "'ve", 67: 'have'}], "What're": [{65: 'What', 67: 'what'}, {65: "'re", 67: 'are'}], "What's": [{65: 'What', 67: 'what'}, {65: "'s", 67: "'s"}], "What've": [{65: 'What', 67: 'what'}, {65: "'ve"}], 'Whatd': [{65: 'What', 67: 'what'}, {65: 'd', 67: "'d"}], 'Whatdve': [{65: 'What', 67: 'what'}, {65: 'd', 67: 'would'}, {65: 've', 67: 'have'}], 'Whatll': [{65: 'What', 67: 'what'}, {65: 'll', 67: 'will'}], 'Whatllve': [{65: 'What', 67: 'what'}, {65: 'll', 67: 'will'}, {65: 've', 67: 'have'}], 'Whatre': [{65: 'What', 67: 'what'}, {65: 're', 67: 'are'}], 'Whats': [{65: 'What', 67: 'what'}, {65: 's'}], 'Whatve': [{65: 'What'}, {65: 've', 67: 'have'}], 'What’d': [{65: 'What', 67: 'what'}, {65: '’d', 67: "'d"}], 'What’d’ve': [{65: 'What', 67: 'what'}, {65: '’d', 67: 'would'}, {65: '’ve', 67: 'have'}], 'What’ll': [{65: 'What', 67: 'what'}, {65: '’ll', 67: 'will'}], 'What’ll’ve': [{65: 'What', 67: 'what'}, {65: '’ll', 67: 'will'}, {65: '’ve', 67: 'have'}], 'What’re': [{65: 'What', 67: 'what'}, {65: '’re', 67: 'are'}], 'What’s': [{65: 'What', 67: 'what'}, {65: '’s', 67: "'s"}], 'What’ve': [{65: 'What', 67: 'what'}, {65: '’ve'}], "When'd": [{65: 'When', 67: 'when'}, {65: "'d", 67: "'d"}], "When'd've": [{65: 'When', 67: 'when'}, {65: "'d", 67: 'would'}, {65: "'ve", 67: 'have'}], "When'll": [{65: 'When', 67: 'when'}, {65: "'ll", 67: 'will'}], "When'll've": [{65: 'When', 67: 'when'}, {65: "'ll", 67: 'will'}, {65: "'ve", 67: 'have'}], "When're": [{65: 'When', 67: 'when'}, {65: "'re", 67: 'are'}], "When's": [{65: 'When', 67: 'when'}, {65: "'s", 67: "'s"}], 
"When've": [{65: 'When', 67: 'when'}, {65: "'ve"}], 'Whend': [{65: 'When', 67: 'when'}, {65: 'd', 67: "'d"}], 'Whendve': [{65: 'When', 67: 'when'}, {65: 'd', 67: 'would'}, {65: 've', 67: 'have'}], 'Whenll': [{65: 'When', 67: 'when'}, {65: 'll', 67: 'will'}], 'Whenllve': [{65: 'When', 67: 'when'}, {65: 'll', 67: 'will'}, {65: 've', 67: 'have'}], 'Whenre': [{65: 'When', 67: 'when'}, {65: 're', 67: 'are'}], 'Whens': [{65: 'When', 67: 'when'}, {65: 's'}], 'Whenve': [{65: 'When'}, {65: 've', 67: 'have'}], 'When’d': [{65: 'When', 67: 'when'}, {65: '’d', 67: "'d"}], 'When’d’ve': [{65: 'When', 67: 'when'}, {65: '’d', 67: 'would'}, {65: '’ve', 67: 'have'}], 'When’ll': [{65: 'When', 67: 'when'}, {65: '’ll', 67: 'will'}], 'When’ll’ve': [{65: 'When', 67: 'when'}, {65: '’ll', 67: 'will'}, {65: '’ve', 67: 'have'}], 'When’re': [{65: 'When', 67: 'when'}, {65: '’re', 67: 'are'}], 'When’s': [{65: 'When', 67: 'when'}, {65: '’s', 67: "'s"}], 'When’ve': [{65: 'When', 67: 'when'}, {65: '’ve'}], "Where'd": [{65: 'Where', 67: 'where'}, {65: "'d", 67: "'d"}], "Where'd've": [{65: 'Where', 67: 'where'}, {65: "'d", 67: 'would'}, {65: "'ve", 67: 'have'}], "Where'll": [{65: 'Where', 67: 'where'}, {65: "'ll", 67: 'will'}], "Where'll've": [{65: 'Where', 67: 'where'}, {65: "'ll", 67: 'will'}, {65: "'ve", 67: 'have'}], "Where're": [{65: 'Where', 67: 'where'}, {65: "'re", 67: 'are'}], "Where's": [{65: 'Where', 67: 'where'}, {65: "'s", 67: "'s"}], "Where've": [{65: 'Where', 67: 'where'}, {65: "'ve"}], 'Whered': [{65: 'Where', 67: 'where'}, {65: 'd', 67: "'d"}], 'Wheredve': [{65: 'Where', 67: 'where'}, {65: 'd', 67: 'would'}, {65: 've', 67: 'have'}], 'Wherell': [{65: 'Where', 67: 'where'}, {65: 'll', 67: 'will'}], 'Wherellve': [{65: 'Where', 67: 'where'}, {65: 'll', 67: 'will'}, {65: 've', 67: 'have'}], 'Wherere': [{65: 'Where', 67: 'where'}, {65: 're', 67: 'are'}], 'Wheres': [{65: 'Where', 67: 'where'}, {65: 's'}], 'Whereve': [{65: 'Where'}, {65: 've', 67: 'have'}], 'Where’d': [{65: 'Where', 67: 
'where'}, {65: '’d', 67: "'d"}], 'Where’d’ve': [{65: 'Where', 67: 'where'}, {65: '’d', 67: 'would'}, {65: '’ve', 67: 'have'}], 'Where’ll': [{65: 'Where', 67: 'where'}, {65: '’ll', 67: 'will'}], 'Where’ll’ve': [{65: 'Where', 67: 'where'}, {65: '’ll', 67: 'will'}, {65: '’ve', 67: 'have'}], 'Where’re': [{65: 'Where', 67: 'where'}, {65: '’re', 67: 'are'}], 'Where’s': [{65: 'Where', 67: 'where'}, {65: '’s', 67: "'s"}], 'Where’ve': [{65: 'Where', 67: 'where'}, {65: '’ve'}], "Who'd": [{65: 'Who', 67: 'who'}, {65: "'d", 67: "'d"}], "Who'd've": [{65: 'Who', 67: 'who'}, {65: "'d", 67: 'would'}, {65: "'ve", 67: 'have'}], "Who'll": [{65: 'Who', 67: 'who'}, {65: "'ll", 67: 'will'}], "Who'll've": [{65: 'Who', 67: 'who'}, {65: "'ll", 67: 'will'}, {65: "'ve", 67: 'have'}], "Who're": [{65: 'Who', 67: 'who'}, {65: "'re", 67: 'are'}], "Who's": [{65: 'Who', 67: 'who'}, {65: "'s", 67: "'s"}], "Who've": [{65: 'Who', 67: 'who'}, {65: "'ve"}], 'Whod': [{65: 'Who', 67: 'who'}, {65: 'd', 67: "'d"}], 'Whodve': [{65: 'Who', 67: 'who'}, {65: 'd', 67: 'would'}, {65: 've', 67: 'have'}], 'Wholl': [{65: 'Who', 67: 'who'}, {65: 'll', 67: 'will'}], 'Whollve': [{65: 'Who', 67: 'who'}, {65: 'll', 67: 'will'}, {65: 've', 67: 'have'}], 'Whos': [{65: 'Who', 67: 'who'}, {65: 's'}], 'Whove': [{65: 'Who'}, {65: 've', 67: 'have'}], 'Who’d': [{65: 'Who', 67: 'who'}, {65: '’d', 67: "'d"}], 'Who’d’ve': [{65: 'Who', 67: 'who'}, {65: '’d', 67: 'would'}, {65: '’ve', 67: 'have'}], 'Who’ll': [{65: 'Who', 67: 'who'}, {65: '’ll', 67: 'will'}], 'Who’ll’ve': [{65: 'Who', 67: 'who'}, {65: '’ll', 67: 'will'}, {65: '’ve', 67: 'have'}], 'Who’re': [{65: 'Who', 67: 'who'}, {65: '’re', 67: 'are'}], 'Who’s': [{65: 'Who', 67: 'who'}, {65: '’s', 67: "'s"}], 'Who’ve': [{65: 'Who', 67: 'who'}, {65: '’ve'}], "Why'd": [{65: 'Why', 67: 'why'}, {65: "'d", 67: "'d"}], "Why'd've": [{65: 'Why', 67: 'why'}, {65: "'d", 67: 'would'}, {65: "'ve", 67: 'have'}], "Why'll": [{65: 'Why', 67: 'why'}, {65: "'ll", 67: 'will'}], "Why'll've": [{65: 
'Why', 67: 'why'}, {65: "'ll", 67: 'will'}, {65: "'ve", 67: 'have'}], "Why're": [{65: 'Why', 67: 'why'}, {65: "'re", 67: 'are'}], "Why's": [{65: 'Why', 67: 'why'}, {65: "'s", 67: "'s"}], "Why've": [{65: 'Why', 67: 'why'}, {65: "'ve"}], 'Whyd': [{65: 'Why', 67: 'why'}, {65: 'd', 67: "'d"}], 'Whydve': [{65: 'Why', 67: 'why'}, {65: 'd', 67: 'would'}, {65: 've', 67: 'have'}], 'Whyll': [{65: 'Why', 67: 'why'}, {65: 'll', 67: 'will'}], 'Whyllve': [{65: 'Why', 67: 'why'}, {65: 'll', 67: 'will'}, {65: 've', 67: 'have'}], 'Whyre': [{65: 'Why', 67: 'why'}, {65: 're', 67: 'are'}], 'Whys': [{65: 'Why', 67: 'why'}, {65: 's'}], 'Whyve': [{65: 'Why'}, {65: 've', 67: 'have'}], 'Why’d': [{65: 'Why', 67: 'why'}, {65: '’d', 67: "'d"}], 'Why’d’ve': [{65: 'Why', 67: 'why'}, {65: '’d', 67: 'would'}, {65: '’ve', 67: 'have'}], 'Why’ll': [{65: 'Why', 67: 'why'}, {65: '’ll', 67: 'will'}], 'Why’ll’ve': [{65: 'Why', 67: 'why'}, {65: '’ll', 67: 'will'}, {65: '’ve', 67: 'have'}], 'Why’re': [{65: 'Why', 67: 'why'}, {65: '’re', 67: 'are'}], 'Why’s': [{65: 'Why', 67: 'why'}, {65: '’s', 67: "'s"}], 'Why’ve': [{65: 'Why', 67: 'why'}, {65: '’ve'}], 'Wis.': [{65: 'Wis.', 67: 'Wisconsin'}], "Won't": [{65: 'Wo', 67: 'will'}, {65: "n't", 67: 'not'}], "Won't've": [{65: 'Wo', 67: 'will'}, {65: "n't", 67: 'not'}, {65: "'ve", 67: 'have'}], 'Wont': [{65: 'Wo', 67: 'will'}, {65: 'nt', 67: 'not'}], 'Wontve': [{65: 'Wo', 67: 'will'}, {65: 'nt', 67: 'not'}, {65: 've', 67: 'have'}], 'Won’t': [{65: 'Wo', 67: 'will'}, {65: 'n’t', 67: 'not'}], 'Won’t’ve': [{65: 'Wo', 67: 'will'}, {65: 'n’t', 67: 'not'}, {65: '’ve', 67: 'have'}], "Would've": [{65: 'Would', 67: 'would'}, {65: "'ve"}], "Wouldn't": [{65: 'Would', 67: 'would'}, {65: "n't", 67: 'not'}], "Wouldn't've": [{65: 'Would', 67: 'would'}, {65: "n't", 67: 'not'}, {65: "'ve", 67: 'have'}], 'Wouldnt': [{65: 'Would', 67: 'would'}, {65: 'nt', 67: 'not'}], 'Wouldntve': [{65: 'Would', 67: 'would'}, {65: 'nt', 67: 'not'}, {65: 've', 67: 'have'}], 'Wouldn’t': [{65: 'Would', 
67: 'would'}, {65: 'n’t', 67: 'not'}], 'Wouldn’t’ve': [{65: 'Would', 67: 'would'}, {65: 'n’t', 67: 'not'}, {65: '’ve', 67: 'have'}], 'Wouldve': [{65: 'Would', 67: 'would'}, {65: 've'}], 'Would’ve': [{65: 'Would', 67: 'would'}, {65: '’ve'}], 'XD': [{65: 'XD'}], 'XDD': [{65: 'XDD'}], "You'd": [{65: 'You', 67: 'you'}, {65: "'d", 67: "'d"}], "You'd've": [{65: 'You', 67: 'you'}, {65: "'d", 67: 'would'}, {65: "'ve", 67: 'have'}], "You'll": [{65: 'You', 67: 'you'}, {65: "'ll", 67: 'will'}], "You'll've": [{65: 'You', 67: 'you'}, {65: "'ll", 67: 'will'}, {65: "'ve", 67: 'have'}], "You're": [{65: 'You', 67: 'you'}, {65: "'re", 67: 'are'}], "You've": [{65: 'You', 67: 'you'}, {65: "'ve", 67: 'have'}], 'Youd': [{65: 'You', 67: 'you'}, {65: 'd', 67: "'d"}], 'Youdve': [{65: 'You', 67: 'you'}, {65: 'd', 67: 'would'}, {65: 've', 67: 'have'}], 'Youll': [{65: 'You', 67: 'you'}, {65: 'll', 67: 'will'}], 'Youllve': [{65: 'You', 67: 'you'}, {65: 'll', 67: 'will'}, {65: 've', 67: 'have'}], 'Youre': [{65: 'You', 67: 'you'}, {65: 're', 67: 'are'}], 'Youve': [{65: 'You', 67: 'you'}, {65: 've', 67: 'have'}], 'You’d': [{65: 'You', 67: 'you'}, {65: '’d', 67: "'d"}], 'You’d’ve': [{65: 'You', 67: 'you'}, {65: '’d', 67: 'would'}, {65: '’ve', 67: 'have'}], 'You’ll': [{65: 'You', 67: 'you'}, {65: '’ll', 67: 'will'}], 'You’ll’ve': [{65: 'You', 67: 'you'}, {65: '’ll', 67: 'will'}, {65: '’ve', 67: 'have'}], 'You’re': [{65: 'You', 67: 'you'}, {65: '’re', 67: 'are'}], 'You’ve': [{65: 'You', 67: 'you'}, {65: '’ve', 67: 'have'}], '[-:': [{65: '[-:'}], '[:': [{65: '[:'}], '[=': [{65: '[='}], '\\")': [{65: '\\")'}], '\\n': [{65: '\\n'}], '\\t': [{65: '\\t'}], ']=': [{65: ']='}], '^_^': [{65: '^_^'}], '^__^': [{65: '^__^'}], '^___^': [{65: '^___^'}], 'a.m.': [{65: 'a.m.'}], "ain't": [{'number': 2, 65: 'ai'}, {65: "n't", 67: 'not'}], 'aint': [{'number': 2, 65: 'ai'}, {65: 'nt', 67: 'not'}], 'ain’t': [{'number': 2, 65: 'ai'}, {65: 'n’t', 67: 'not'}], 'and/or': [{65: 'and/or', 67: 'and/or'}], "aren't": 
[{'number': 2, 65: 'are', 67: 'are'}, {65: "n't", 67: 'not'}], 'arent': [{'number': 2, 65: 'are', 67: 'are'}, {65: 'nt', 67: 'not'}], 'aren’t': [{'number': 2, 65: 'are', 67: 'are'}, {65: 'n’t', 67: 'not'}], "c'mon": [{65: "c'm", 67: 'come'}, {65: 'on'}], "can't": [{65: 'ca', 67: 'can'}, {65: "n't", 67: 'not'}], "can't've": [{65: 'ca', 67: 'can'}, {65: "n't", 67: 'not'}, {65: "'ve", 67: 'have'}], 'cannot': [{65: 'can'}, {65: 'not'}], 'cant': [{65: 'ca', 67: 'can'}, {65: 'nt', 67: 'not'}], 'cantve': [{65: 'ca', 67: 'can'}, {65: 'nt', 67: 'not'}, {65: 've', 67: 'have'}], 'can’t': [{65: 'ca', 67: 'can'}, {65: 'n’t', 67: 'not'}], 'can’t’ve': [{65: 'ca', 67: 'can'}, {65: 'n’t', 67: 'not'}, {65: '’ve', 67: 'have'}], 'co.': [{65: 'co.'}], "could've": [{65: 'could', 67: 'could'}, {65: "'ve"}], "couldn't": [{65: 'could', 67: 'could'}, {65: "n't", 67: 'not'}], "couldn't've": [{65: 'could', 67: 'could'}, {65: "n't", 67: 'not'}, {65: "'ve", 67: 'have'}], 'couldnt': [{65: 'could', 67: 'could'}, {65: 'nt', 67: 'not'}], 'couldntve': [{65: 'could', 67: 'could'}, {65: 'nt', 67: 'not'}, {65: 've', 67: 'have'}], 'couldn’t': [{65: 'could', 67: 'could'}, {65: 'n’t', 67: 'not'}], 'couldn’t’ve': [{65: 'could', 67: 'could'}, {65: 'n’t', 67: 'not'}, {65: '’ve', 67: 'have'}], 'couldve': [{65: 'could', 67: 'could'}, {65: 've'}], 'could’ve': [{65: 'could', 67: 'could'}, {65: '’ve'}], 'c’mon': [{65: 'c’m', 67: 'come'}, {65: 'on'}], "daren't": [{65: 'dare', 67: 'dare'}, {65: "n't", 67: 'not'}], 'darent': [{65: 'dare', 67: 'dare'}, {65: 'nt', 67: 'not'}], 'daren’t': [{65: 'dare', 67: 'dare'}, {65: 'n’t', 67: 'not'}], "didn't": [{65: 'did', 67: 'do'}, {65: "n't", 67: 'not'}], "didn't've": [{65: 'did', 67: 'do'}, {65: "n't", 67: 'not'}, {65: "'ve", 67: 'have'}], 'didnt': [{65: 'did', 67: 'do'}, {65: 'nt', 67: 'not'}], 'didntve': [{65: 'did', 67: 'do'}, {65: 'nt', 67: 'not'}, {65: 've', 67: 'have'}], 'didn’t': [{65: 'did', 67: 'do'}, {65: 'n’t', 67: 'not'}], 'didn’t’ve': [{65: 'did', 67: 'do'}, {65: 
'n’t', 67: 'not'}, {65: '’ve', 67: 'have'}], "doesn't": [{65: 'does', 67: 'does'}, {65: "n't", 67: 'not'}], "doesn't've": [{65: 'does', 67: 'does'}, {65: "n't", 67: 'not'}, {65: "'ve", 67: 'have'}], 'doesnt': [{65: 'does', 67: 'does'}, {65: 'nt', 67: 'not'}], 'doesntve': [{65: 'does', 67: 'does'}, {65: 'nt', 67: 'not'}, {65: 've', 67: 'have'}], 'doesn’t': [{65: 'does', 67: 'does'}, {65: 'n’t', 67: 'not'}], 'doesn’t’ve': [{65: 'does', 67: 'does'}, {65: 'n’t', 67: 'not'}, {65: '’ve', 67: 'have'}], 'doin': [{65: 'doin', 67: 'doing'}], "doin'": [{65: "doin'", 67: 'doing'}], 'doin’': [{65: 'doin’', 67: 'doing'}], "don't": [{65: 'do', 67: 'do'}, {65: "n't", 67: 'not'}], "don't've": [{65: 'do', 67: 'do'}, {65: "n't", 67: 'not'}, {65: "'ve", 67: 'have'}], 'dont': [{65: 'do', 67: 'do'}, {65: 'nt', 67: 'not'}], 'dontve': [{65: 'do', 67: 'do'}, {65: 'nt', 67: 'not'}, {65: 've', 67: 'have'}], 'don’t': [{65: 'do', 67: 'do'}, {65: 'n’t', 67: 'not'}], 'don’t’ve': [{65: 'do', 67: 'do'}, {65: 'n’t', 67: 'not'}, {65: '’ve', 67: 'have'}], 'e.g.': [{65: 'e.g.'}], 'em': [{65: 'em', 67: 'them'}], 'goin': [{65: 'goin', 67: 'going'}], "goin'": [{65: "goin'", 67: 'going'}], 'goin’': [{65: 'goin’', 67: 'going'}], 'gonna': [{65: 'gon', 67: 'going'}, {65: 'na', 67: 'to'}], 'gotta': [{65: 'got'}, {65: 'ta', 67: 'to'}], "hadn't": [{65: 'had', 67: 'have'}, {65: "n't", 67: 'not'}], "hadn't've": [{65: 'had', 67: 'have'}, {65: "n't", 67: 'not'}, {65: "'ve", 67: 'have'}], 'hadnt': [{65: 'had', 67: 'have'}, {65: 'nt', 67: 'not'}], 'hadntve': [{65: 'had', 67: 'have'}, {65: 'nt', 67: 'not'}, {65: 've', 67: 'have'}], 'hadn’t': [{65: 'had', 67: 'have'}, {65: 'n’t', 67: 'not'}], 'hadn’t’ve': [{65: 'had', 67: 'have'}, {65: 'n’t', 67: 'not'}, {65: '’ve', 67: 'have'}], "hasn't": [{65: 'has', 67: 'has'}, {65: "n't", 67: 'not'}], 'hasnt': [{65: 'has', 67: 'has'}, {65: 'nt', 67: 'not'}], 'hasn’t': [{65: 'has', 67: 'has'}, {65: 'n’t', 67: 'not'}], "haven't": [{65: 'have', 67: 'have'}, {65: "n't", 67: 'not'}], 
'havent': [{65: 'have', 67: 'have'}, {65: 'nt', 67: 'not'}], 'haven’t': [{65: 'have', 67: 'have'}, {65: 'n’t', 67: 'not'}], 'havin': [{65: 'havin', 67: 'having'}], "havin'": [{65: "havin'", 67: 'having'}], 'havin’': [{65: 'havin’', 67: 'having'}], "he'd": [{65: 'he', 67: 'he'}, {65: "'d", 67: "'d"}], "he'd've": [{65: 'he', 67: 'he'}, {65: "'d", 67: 'would'}, {65: "'ve", 67: 'have'}], "he'll": [{65: 'he', 67: 'he'}, {65: "'ll", 67: 'will'}], "he'll've": [{65: 'he', 67: 'he'}, {65: "'ll", 67: 'will'}, {65: "'ve", 67: 'have'}], "he's": [{65: 'he', 67: 'he'}, {65: "'s", 67: "'s"}], 'hed': [{65: 'he', 67: 'he'}, {65: 'd', 67: "'d"}], 'hedve': [{65: 'he', 67: 'he'}, {65: 'd', 67: 'would'}, {65: 've', 67: 'have'}], 'hellve': [{65: 'he', 67: 'he'}, {65: 'll', 67: 'will'}, {65: 've', 67: 'have'}], 'hes': [{65: 'he', 67: 'he'}, {65: 's'}], 'he’d': [{65: 'he', 67: 'he'}, {65: '’d', 67: "'d"}], 'he’d’ve': [{65: 'he', 67: 'he'}, {65: '’d', 67: 'would'}, {65: '’ve', 67: 'have'}], 'he’ll': [{65: 'he', 67: 'he'}, {65: '’ll', 67: 'will'}], 'he’ll’ve': [{65: 'he', 67: 'he'}, {65: '’ll', 67: 'will'}, {65: '’ve', 67: 'have'}], 'he’s': [{65: 'he', 67: 'he'}, {65: '’s', 67: "'s"}], "how'd": [{65: 'how', 67: 'how'}, {65: "'d", 67: "'d"}], "how'd've": [{65: 'how', 67: 'how'}, {65: "'d", 67: 'would'}, {65: "'ve", 67: 'have'}], "how'd'y": [{65: 'how'}, {65: "'d"}, {65: "'y", 67: 'you'}], "how'll": [{65: 'how', 67: 'how'}, {65: "'ll", 67: 'will'}], "how'll've": [{65: 'how', 67: 'how'}, {65: "'ll", 67: 'will'}, {65: "'ve", 67: 'have'}], "how're": [{65: 'how', 67: 'how'}, {65: "'re", 67: 'are'}], "how's": [{65: 'how', 67: 'how'}, {65: "'s", 67: "'s"}], "how've": [{65: 'how', 67: 'how'}, {65: "'ve"}], 'howd': [{65: 'how', 67: 'how'}, {65: 'd', 67: "'d"}], 'howdve': [{65: 'how', 67: 'how'}, {65: 'd', 67: 'would'}, {65: 've', 67: 'have'}], 'howll': [{65: 'how', 67: 'how'}, {65: 'll', 67: 'will'}], 'howllve': [{65: 'how', 67: 'how'}, {65: 'll', 67: 'will'}, {65: 've', 67: 'have'}], 'howre': [{65: 
'how', 67: 'how'}, {65: 're', 67: 'are'}], 'hows': [{65: 'how', 67: 'how'}, {65: 's'}], 'howve': [{65: 'how'}, {65: 've', 67: 'have'}], 'how’d': [{65: 'how', 67: 'how'}, {65: '’d', 67: "'d"}], 'how’d’ve': [{65: 'how', 67: 'how'}, {65: '’d', 67: 'would'}, {65: '’ve', 67: 'have'}], 'how’d’y': [{65: 'how'}, {65: '’d'}, {65: '’y', 67: 'you'}], 'how’ll': [{65: 'how', 67: 'how'}, {65: '’ll', 67: 'will'}], 'how’ll’ve': [{65: 'how', 67: 'how'}, {65: '’ll', 67: 'will'}, {65: '’ve', 67: 'have'}], 'how’re': [{65: 'how', 67: 'how'}, {65: '’re', 67: 'are'}], 'how’s': [{65: 'how', 67: 'how'}, {65: '’s', 67: "'s"}], 'how’ve': [{65: 'how', 67: 'how'}, {65: '’ve'}], "i'd": [{65: 'i', 67: 'i'}, {65: "'d", 67: "'d"}], "i'd've": [{65: 'i', 67: 'i'}, {65: "'d", 67: 'would'}, {65: "'ve", 67: 'have'}], "i'll": [{65: 'i', 67: 'i'}, {65: "'ll", 67: 'will'}], "i'll've": [{65: 'i', 67: 'i'}, {65: "'ll", 67: 'will'}, {65: "'ve", 67: 'have'}], "i'm": [{65: 'i', 67: 'i'}, {65: "'m", 67: 'am'}], "i'ma": [{65: 'i', 67: 'i'}, {65: "'m", 67: 'am'}, {65: 'a', 67: 'gonna'}], "i've": [{65: 'i', 67: 'i'}, {65: "'ve", 67: 'have'}], 'i.e.': [{65: 'i.e.'}], 'id': [{65: 'i', 67: 'i'}, {65: 'd', 67: "'d"}], 'idve': [{65: 'i', 67: 'i'}, {65: 'd', 67: 'would'}, {65: 've', 67: 'have'}], 'illve': [{65: 'i', 67: 'i'}, {65: 'll', 67: 'will'}, {65: 've', 67: 'have'}], 'im': [{65: 'i', 67: 'i'}, {65: 'm'}], 'ima': [{65: 'i', 67: 'i'}, {65: 'm', 67: 'am'}, {65: 'a', 67: 'gonna'}], "isn't": [{65: 'is', 67: 'is'}, {65: "n't", 67: 'not'}], 'isnt': [{65: 'is', 67: 'is'}, {65: 'nt', 67: 'not'}], 'isn’t': [{65: 'is', 67: 'is'}, {65: 'n’t', 67: 'not'}], "it'd": [{65: 'it', 67: 'it'}, {65: "'d", 67: "'d"}], "it'd've": [{65: 'it', 67: 'it'}, {65: "'d", 67: 'would'}, {65: "'ve", 67: 'have'}], "it'll": [{65: 'it', 67: 'it'}, {65: "'ll", 67: 'will'}], "it'll've": [{65: 'it', 67: 'it'}, {65: "'ll", 67: 'will'}, {65: "'ve", 67: 'have'}], "it's": [{65: 'it', 67: 'it'}, {65: "'s", 67: "'s"}], 'itd': [{65: 'it', 67: 'it'}, {65: 'd', 
67: "'d"}], 'itdve': [{65: 'it', 67: 'it'}, {65: 'd', 67: 'would'}, {65: 've', 67: 'have'}], 'itll': [{65: 'it', 67: 'it'}, {65: 'll', 67: 'will'}], 'itllve': [{65: 'it', 67: 'it'}, {65: 'll', 67: 'will'}, {65: 've', 67: 'have'}], 'it’d': [{65: 'it', 67: 'it'}, {65: '’d', 67: "'d"}], 'it’d’ve': [{65: 'it', 67: 'it'}, {65: '’d', 67: 'would'}, {65: '’ve', 67: 'have'}], 'it’ll': [{65: 'it', 67: 'it'}, {65: '’ll', 67: 'will'}], 'it’ll’ve': [{65: 'it', 67: 'it'}, {65: '’ll', 67: 'will'}, {65: '’ve', 67: 'have'}], 'it’s': [{65: 'it', 67: 'it'}, {65: '’s', 67: "'s"}], 'ive': [{65: 'i', 67: 'i'}, {65: 've', 67: 'have'}], 'i’d': [{65: 'i', 67: 'i'}, {65: '’d', 67: "'d"}], 'i’d’ve': [{65: 'i', 67: 'i'}, {65: '’d', 67: 'would'}, {65: '’ve', 67: 'have'}], 'i’ll': [{65: 'i', 67: 'i'}, {65: '’ll', 67: 'will'}], 'i’ll’ve': [{65: 'i', 67: 'i'}, {65: '’ll', 67: 'will'}, {65: '’ve', 67: 'have'}], 'i’m': [{65: 'i', 67: 'i'}, {65: '’m', 67: 'am'}], 'i’ma': [{65: 'i', 67: 'i'}, {65: '’m', 67: 'am'}, {65: 'a', 67: 'gonna'}], 'i’ve': [{65: 'i', 67: 'i'}, {65: '’ve', 67: 'have'}], "let's": [{65: 'let'}, {65: "'s", 67: 'us'}], 'let’s': [{65: 'let'}, {65: '’s', 67: 'us'}], 'll': [{65: 'll', 67: 'will'}], 'lovin': [{65: 'lovin', 67: 'loving'}], "lovin'": [{65: "lovin'", 67: 'loving'}], 'lovin’': [{65: 'lovin’', 67: 'loving'}], "ma'am": [{65: "ma'am", 67: 'madam'}], "mayn't": [{65: 'may', 67: 'may'}, {65: "n't", 67: 'not'}], "mayn't've": [{65: 'may', 67: 'may'}, {65: "n't", 67: 'not'}, {65: "'ve", 67: 'have'}], 'maynt': [{65: 'may', 67: 'may'}, {65: 'nt', 67: 'not'}], 'mayntve': [{65: 'may', 67: 'may'}, {65: 'nt', 67: 'not'}, {65: 've', 67: 'have'}], 'mayn’t': [{65: 'may', 67: 'may'}, {65: 'n’t', 67: 'not'}], 'mayn’t’ve': [{65: 'may', 67: 'may'}, {65: 'n’t', 67: 'not'}, {65: '’ve', 67: 'have'}], 'ma’am': [{65: 'ma’am', 67: 'madam'}], "might've": [{65: 'might', 67: 'might'}, {65: "'ve"}], "mightn't": [{65: 'might', 67: 'might'}, {65: "n't", 67: 'not'}], "mightn't've": [{65: 'might', 67: 
'might'}, {65: "n't", 67: 'not'}, {65: "'ve", 67: 'have'}], 'mightnt': [{65: 'might', 67: 'might'}, {65: 'nt', 67: 'not'}], 'mightntve': [{65: 'might', 67: 'might'}, {65: 'nt', 67: 'not'}, {65: 've', 67: 'have'}], 'mightn’t': [{65: 'might', 67: 'might'}, {65: 'n’t', 67: 'not'}], 'mightn’t’ve': [{65: 'might', 67: 'might'}, {65: 'n’t', 67: 'not'}, {65: '’ve', 67: 'have'}], 'mightve': [{65: 'might', 67: 'might'}, {65: 've'}], 'might’ve': [{65: 'might', 67: 'might'}, {65: '’ve'}], "must've": [{65: 'must', 67: 'must'}, {65: "'ve"}], "mustn't": [{65: 'must', 67: 'must'}, {65: "n't", 67: 'not'}], "mustn't've": [{65: 'must', 67: 'must'}, {65: "n't", 67: 'not'}, {65: "'ve", 67: 'have'}], 'mustnt': [{65: 'must', 67: 'must'}, {65: 'nt', 67: 'not'}], 'mustntve': [{65: 'must', 67: 'must'}, {65: 'nt', 67: 'not'}, {65: 've', 67: 'have'}], 'mustn’t': [{65: 'must', 67: 'must'}, {65: 'n’t', 67: 'not'}], 'mustn’t’ve': [{65: 'must', 67: 'must'}, {65: 'n’t', 67: 'not'}, {65: '’ve', 67: 'have'}], 'mustve': [{65: 'must', 67: 'must'}, {65: 've'}], 'must’ve': [{65: 'must', 67: 'must'}, {65: '’ve'}], "needn't": [{65: 'need', 67: 'need'}, {65: "n't", 67: 'not'}], "needn't've": [{65: 'need', 67: 'need'}, {65: "n't", 67: 'not'}, {65: "'ve", 67: 'have'}], 'neednt': [{65: 'need', 67: 'need'}, {65: 'nt', 67: 'not'}], 'needntve': [{65: 'need', 67: 'need'}, {65: 'nt', 67: 'not'}, {65: 've', 67: 'have'}], 'needn’t': [{65: 'need', 67: 'need'}, {65: 'n’t', 67: 'not'}], 'needn’t’ve': [{65: 'need', 67: 'need'}, {65: 'n’t', 67: 'not'}, {65: '’ve', 67: 'have'}], "not've": [{65: 'not'}, {65: "'ve", 67: 'have'}], 'nothin': [{65: 'nothin', 67: 'nothing'}], "nothin'": [{65: "nothin'", 67: 'nothing'}], 'nothin’': [{65: 'nothin’', 67: 'nothing'}], 'notve': [{65: 'not'}, {65: 've', 67: 'have'}], 'not’ve': [{65: 'not'}, {65: '’ve', 67: 'have'}], 'nuff': [{65: 'nuff', 67: 'enough'}], 'nuthin': [{65: 'nuthin', 67: 'nothing'}], "nuthin'": [{65: "nuthin'", 67: 'nothing'}], 'nuthin’': [{65: 'nuthin’', 67: 'nothing'}], 
"o'clock": [{65: "o'clock", 67: "o'clock"}], 'o.0': [{65: 'o.0'}], 'o.O': [{65: 'o.O'}], 'o.o': [{65: 'o.o'}], 'o_0': [{65: 'o_0'}], 'o_O': [{65: 'o_O'}], 'o_o': [{65: 'o_o'}], 'ol': [{65: 'ol', 67: 'old'}], "ol'": [{65: "ol'", 67: 'old'}], 'ol’': [{65: 'ol’', 67: 'old'}], "oughtn't": [{65: 'ought', 67: 'ought'}, {65: "n't", 67: 'not'}], "oughtn't've": [{65: 'ought', 67: 'ought'}, {65: "n't", 67: 'not'}, {65: "'ve", 67: 'have'}], 'oughtnt': [{65: 'ought', 67: 'ought'}, {65: 'nt', 67: 'not'}], 'oughtntve': [{65: 'ought', 67: 'ought'}, {65: 'nt', 67: 'not'}, {65: 've', 67: 'have'}], 'oughtn’t': [{65: 'ought', 67: 'ought'}, {65: 'n’t', 67: 'not'}], 'oughtn’t’ve': [{65: 'ought', 67: 'ought'}, {65: 'n’t', 67: 'not'}, {65: '’ve', 67: 'have'}], 'o’clock': [{65: 'o’clock', 67: "o'clock"}], 'p.m.': [{65: 'p.m.'}], "shan't": [{65: 'sha', 67: 'shall'}, {65: "n't", 67: 'not'}], "shan't've": [{65: 'sha', 67: 'shall'}, {65: "n't", 67: 'not'}, {65: "'ve", 67: 'have'}], 'shant': [{65: 'sha', 67: 'shall'}, {65: 'nt', 67: 'not'}], 'shantve': [{65: 'sha', 67: 'shall'}, {65: 'nt', 67: 'not'}, {65: 've', 67: 'have'}], 'shan’t': [{65: 'sha', 67: 'shall'}, {65: 'n’t', 67: 'not'}], 'shan’t’ve': [{65: 'sha', 67: 'shall'}, {65: 'n’t', 67: 'not'}, {65: '’ve', 67: 'have'}], "she'd": [{65: 'she', 67: 'she'}, {65: "'d", 67: "'d"}], "she'd've": [{65: 'she', 67: 'she'}, {65: "'d", 67: 'would'}, {65: "'ve", 67: 'have'}], "she'll": [{65: 'she', 67: 'she'}, {65: "'ll", 67: 'will'}], "she'll've": [{65: 'she', 67: 'she'}, {65: "'ll", 67: 'will'}, {65: "'ve", 67: 'have'}], "she's": [{65: 'she', 67: 'she'}, {65: "'s", 67: "'s"}], 'shedve': [{65: 'she', 67: 'she'}, {65: 'd', 67: 'would'}, {65: 've', 67: 'have'}], 'shellve': [{65: 'she', 67: 'she'}, {65: 'll', 67: 'will'}, {65: 've', 67: 'have'}], 'shes': [{65: 'she', 67: 'she'}, {65: 's'}], 'she’d': [{65: 'she', 67: 'she'}, {65: '’d', 67: "'d"}], 'she’d’ve': [{65: 'she', 67: 'she'}, {65: '’d', 67: 'would'}, {65: '’ve', 67: 'have'}], 'she’ll': [{65: 
'she', 67: 'she'}, {65: '’ll', 67: 'will'}], 'she’ll’ve': [{65: 'she', 67: 'she'}, {65: '’ll', 67: 'will'}, {65: '’ve', 67: 'have'}], 'she’s': [{65: 'she', 67: 'she'}, {65: '’s', 67: "'s"}], "should've": [{65: 'should', 67: 'should'}, {65: "'ve"}], "shouldn't": [{65: 'should', 67: 'should'}, {65: "n't", 67: 'not'}], "shouldn't've": [{65: 'should', 67: 'should'}, {65: "n't", 67: 'not'}, {65: "'ve", 67: 'have'}], 'shouldnt': [{65: 'should', 67: 'should'}, {65: 'nt', 67: 'not'}], 'shouldntve': [{65: 'should', 67: 'should'}, {65: 'nt', 67: 'not'}, {65: 've', 67: 'have'}], 'shouldn’t': [{65: 'should', 67: 'should'}, {65: 'n’t', 67: 'not'}], 'shouldn’t’ve': [{65: 'should', 67: 'should'}, {65: 'n’t', 67: 'not'}, {65: '’ve', 67: 'have'}], 'shouldve': [{65: 'should', 67: 'should'}, {65: 've'}], 'should’ve': [{65: 'should', 67: 'should'}, {65: '’ve'}], 'somethin': [{65: 'somethin', 67: 'something'}], "somethin'": [{65: "somethin'", 67: 'something'}], 'somethin’': [{65: 'somethin’', 67: 'something'}], "that'd": [{65: 'that', 67: 'that'}, {65: "'d", 67: "'d"}], "that'd've": [{65: 'that', 67: 'that'}, {65: "'d", 67: 'would'}, {65: "'ve", 67: 'have'}], "that'll": [{65: 'that', 67: 'that'}, {65: "'ll", 67: 'will'}], "that'll've": [{65: 'that', 67: 'that'}, {65: "'ll", 67: 'will'}, {65: "'ve", 67: 'have'}], "that's": [{65: 'that', 67: 'that'}, {65: "'s", 67: "'s"}], 'thatd': [{65: 'that', 67: 'that'}, {65: 'd', 67: "'d"}], 'thatdve': [{65: 'that', 67: 'that'}, {65: 'd', 67: 'would'}, {65: 've', 67: 'have'}], 'thatll': [{65: 'that', 67: 'that'}, {65: 'll', 67: 'will'}], 'thatllve': [{65: 'that', 67: 'that'}, {65: 'll', 67: 'will'}, {65: 've', 67: 'have'}], 'thats': [{65: 'that', 67: 'that'}, {65: 's'}], 'that’d': [{65: 'that', 67: 'that'}, {65: '’d', 67: "'d"}], 'that’d’ve': [{65: 'that', 67: 'that'}, {65: '’d', 67: 'would'}, {65: '’ve', 67: 'have'}], 'that’ll': [{65: 'that', 67: 'that'}, {65: '’ll', 67: 'will'}], 'that’ll’ve': [{65: 'that', 67: 'that'}, {65: '’ll', 67: 'will'}, 
{65: '’ve', 67: 'have'}], 'that’s': [{65: 'that', 67: 'that'}, {65: '’s', 67: "'s"}], "there'd": [{65: 'there', 67: 'there'}, {65: "'d", 67: "'d"}], "there'd've": [{65: 'there', 67: 'there'}, {65: "'d", 67: 'would'}, {65: "'ve", 67: 'have'}], "there'll": [{65: 'there', 67: 'there'}, {65: "'ll", 67: 'will'}], "there'll've": [{65: 'there', 67: 'there'}, {65: "'ll", 67: 'will'}, {65: "'ve", 67: 'have'}], "there're": [{65: 'there', 67: 'there'}, {65: "'re", 67: 'are'}], "there's": [{65: 'there', 67: 'there'}, {65: "'s", 67: "'s"}], "there've": [{65: 'there', 67: 'there'}, {65: "'ve"}], 'thered': [{65: 'there', 67: 'there'}, {65: 'd', 67: "'d"}], 'theredve': [{65: 'there', 67: 'there'}, {65: 'd', 67: 'would'}, {65: 've', 67: 'have'}], 'therell': [{65: 'there', 67: 'there'}, {65: 'll', 67: 'will'}], 'therellve': [{65: 'there', 67: 'there'}, {65: 'll', 67: 'will'}, {65: 've', 67: 'have'}], 'therere': [{65: 'there', 67: 'there'}, {65: 're', 67: 'are'}], 'theres': [{65: 'there', 67: 'there'}, {65: 's'}], 'thereve': [{65: 'there'}, {65: 've', 67: 'have'}], 'there’d': [{65: 'there', 67: 'there'}, {65: '’d', 67: "'d"}], 'there’d’ve': [{65: 'there', 67: 'there'}, {65: '’d', 67: 'would'}, {65: '’ve', 67: 'have'}], 'there’ll': [{65: 'there', 67: 'there'}, {65: '’ll', 67: 'will'}], 'there’ll’ve': [{65: 'there', 67: 'there'}, {65: '’ll', 67: 'will'}, {65: '’ve', 67: 'have'}], 'there’re': [{65: 'there', 67: 'there'}, {65: '’re', 67: 'are'}], 'there’s': [{65: 'there', 67: 'there'}, {65: '’s', 67: "'s"}], 'there’ve': [{65: 'there', 67: 'there'}, {65: '’ve'}], "these'd": [{65: 'these', 67: 'these'}, {65: "'d", 67: "'d"}], "these'd've": [{65: 'these', 67: 'these'}, {65: "'d", 67: 'would'}, {65: "'ve", 67: 'have'}], "these'll": [{65: 'these', 67: 'these'}, {65: "'ll", 67: 'will'}], "these'll've": [{65: 'these', 67: 'these'}, {65: "'ll", 67: 'will'}, {65: "'ve", 67: 'have'}], "these're": [{65: 'these', 67: 'these'}, {65: "'re", 67: 'are'}], "these've": [{65: 'these', 67: 'these'}, {65: 
"'ve"}], 'thesed': [{65: 'these', 67: 'these'}, {65: 'd', 67: "'d"}], 'thesedve': [{65: 'these', 67: 'these'}, {65: 'd', 67: 'would'}, {65: 've', 67: 'have'}], 'thesell': [{65: 'these', 67: 'these'}, {65: 'll', 67: 'will'}], 'thesellve': [{65: 'these', 67: 'these'}, {65: 'll', 67: 'will'}, {65: 've', 67: 'have'}], 'thesere': [{65: 'these', 67: 'these'}, {65: 're', 67: 'are'}], 'theseve': [{65: 'these'}, {65: 've', 67: 'have'}], 'these’d': [{65: 'these', 67: 'these'}, {65: '’d', 67: "'d"}], 'these’d’ve': [{65: 'these', 67: 'these'}, {65: '’d', 67: 'would'}, {65: '’ve', 67: 'have'}], 'these’ll': [{65: 'these', 67: 'these'}, {65: '’ll', 67: 'will'}], 'these’ll’ve': [{65: 'these', 67: 'these'}, {65: '’ll', 67: 'will'}, {65: '’ve', 67: 'have'}], 'these’re': [{65: 'these', 67: 'these'}, {65: '’re', 67: 'are'}], 'these’ve': [{65: 'these', 67: 'these'}, {65: '’ve'}], "they'd": [{65: 'they', 67: 'they'}, {65: "'d", 67: "'d"}], "they'd've": [{65: 'they', 67: 'they'}, {65: "'d", 67: 'would'}, {65: "'ve", 67: 'have'}], "they'll": [{65: 'they', 67: 'they'}, {65: "'ll", 67: 'will'}], "they'll've": [{65: 'they', 67: 'they'}, {65: "'ll", 67: 'will'}, {65: "'ve", 67: 'have'}], "they're": [{65: 'they', 67: 'they'}, {65: "'re", 67: 'are'}], "they've": [{65: 'they', 67: 'they'}, {65: "'ve", 67: 'have'}], 'theyd': [{65: 'they', 67: 'they'}, {65: 'd', 67: "'d"}], 'theydve': [{65: 'they', 67: 'they'}, {65: 'd', 67: 'would'}, {65: 've', 67: 'have'}], 'theyll': [{65: 'they', 67: 'they'}, {65: 'll', 67: 'will'}], 'theyllve': [{65: 'they', 67: 'they'}, {65: 'll', 67: 'will'}, {65: 've', 67: 'have'}], 'theyre': [{65: 'they', 67: 'they'}, {65: 're', 67: 'are'}], 'theyve': [{65: 'they', 67: 'they'}, {65: 've', 67: 'have'}], 'they’d': [{65: 'they', 67: 'they'}, {65: '’d', 67: "'d"}], 'they’d’ve': [{65: 'they', 67: 'they'}, {65: '’d', 67: 'would'}, {65: '’ve', 67: 'have'}], 'they’ll': [{65: 'they', 67: 'they'}, {65: '’ll', 67: 'will'}], 'they’ll’ve': [{65: 'they', 67: 'they'}, {65: '’ll', 67: 
'will'}, {65: '’ve', 67: 'have'}], 'they’re': [{65: 'they', 67: 'they'}, {65: '’re', 67: 'are'}], 'they’ve': [{65: 'they', 67: 'they'}, {65: '’ve', 67: 'have'}], "this'd": [{65: 'this', 67: 'this'}, {65: "'d", 67: "'d"}], "this'd've": [{65: 'this', 67: 'this'}, {65: "'d", 67: 'would'}, {65: "'ve", 67: 'have'}], "this'll": [{65: 'this', 67: 'this'}, {65: "'ll", 67: 'will'}], "this'll've": [{65: 'this', 67: 'this'}, {65: "'ll", 67: 'will'}, {65: "'ve", 67: 'have'}], "this's": [{65: 'this', 67: 'this'}, {65: "'s", 67: "'s"}], 'thisd': [{65: 'this', 67: 'this'}, {65: 'd', 67: "'d"}], 'thisdve': [{65: 'this', 67: 'this'}, {65: 'd', 67: 'would'}, {65: 've', 67: 'have'}], 'thisll': [{65: 'this', 67: 'this'}, {65: 'll', 67: 'will'}], 'thisllve': [{65: 'this', 67: 'this'}, {65: 'll', 67: 'will'}, {65: 've', 67: 'have'}], 'thiss': [{65: 'this', 67: 'this'}, {65: 's'}], 'this’d': [{65: 'this', 67: 'this'}, {65: '’d', 67: "'d"}], 'this’d’ve': [{65: 'this', 67: 'this'}, {65: '’d', 67: 'would'}, {65: '’ve', 67: 'have'}], 'this’ll': [{65: 'this', 67: 'this'}, {65: '’ll', 67: 'will'}], 'this’ll’ve': [{65: 'this', 67: 'this'}, {65: '’ll', 67: 'will'}, {65: '’ve', 67: 'have'}], 'this’s': [{65: 'this', 67: 'this'}, {65: '’s', 67: "'s"}], "those'd": [{65: 'those', 67: 'those'}, {65: "'d", 67: "'d"}], "those'd've": [{65: 'those', 67: 'those'}, {65: "'d", 67: 'would'}, {65: "'ve", 67: 'have'}], "those'll": [{65: 'those', 67: 'those'}, {65: "'ll", 67: 'will'}], "those'll've": [{65: 'those', 67: 'those'}, {65: "'ll", 67: 'will'}, {65: "'ve", 67: 'have'}], "those're": [{65: 'those', 67: 'those'}, {65: "'re", 67: 'are'}], "those've": [{65: 'those', 67: 'those'}, {65: "'ve"}], 'thosed': [{65: 'those', 67: 'those'}, {65: 'd', 67: "'d"}], 'thosedve': [{65: 'those', 67: 'those'}, {65: 'd', 67: 'would'}, {65: 've', 67: 'have'}], 'thosell': [{65: 'those', 67: 'those'}, {65: 'll', 67: 'will'}], 'thosellve': [{65: 'those', 67: 'those'}, {65: 'll', 67: 'will'}, {65: 've', 67: 'have'}], 'thosere': 
[{65: 'those', 67: 'those'}, {65: 're', 67: 'are'}], 'thoseve': [{65: 'those'}, {65: 've', 67: 'have'}], 'those’d': [{65: 'those', 67: 'those'}, {65: '’d', 67: "'d"}], 'those’d’ve': [{65: 'those', 67: 'those'}, {65: '’d', 67: 'would'}, {65: '’ve', 67: 'have'}], 'those’ll': [{65: 'those', 67: 'those'}, {65: '’ll', 67: 'will'}], 'those’ll’ve': [{65: 'those', 67: 'those'}, {65: '’ll', 67: 'will'}, {65: '’ve', 67: 'have'}], 'those’re': [{65: 'those', 67: 'those'}, {65: '’re', 67: 'are'}], 'those’ve': [{65: 'those', 67: 'those'}, {65: '’ve'}], 'v.s.': [{65: 'v.s.'}], 'v.v': [{65: 'v.v'}], 'v_v': [{65: 'v_v'}], 'vs.': [{65: 'vs.'}], 'w/o': [{65: 'w/o', 67: 'without'}], "wasn't": [{65: 'was', 67: 'was'}, {65: "n't", 67: 'not'}], 'wasnt': [{65: 'was', 67: 'was'}, {65: 'nt', 67: 'not'}], 'wasn’t': [{65: 'was', 67: 'was'}, {65: 'n’t', 67: 'not'}], "we'd": [{65: 'we', 67: 'we'}, {65: "'d", 67: "'d"}], "we'd've": [{65: 'we', 67: 'we'}, {65: "'d", 67: 'would'}, {65: "'ve", 67: 'have'}], "we'll": [{65: 'we', 67: 'we'}, {65: "'ll", 67: 'will'}], "we'll've": [{65: 'we', 67: 'we'}, {65: "'ll", 67: 'will'}, {65: "'ve", 67: 'have'}], "we're": [{65: 'we', 67: 'we'}, {65: "'re", 67: 'are'}], "we've": [{65: 'we', 67: 'we'}, {65: "'ve", 67: 'have'}], 'wed': [{65: 'we', 67: 'we'}, {65: 'd', 67: "'d"}], 'wedve': [{65: 'we', 67: 'we'}, {65: 'd', 67: 'would'}, {65: 've', 67: 'have'}], 'wellve': [{65: 'we', 67: 'we'}, {65: 'll', 67: 'will'}, {65: 've', 67: 'have'}], "weren't": [{65: 'were', 67: 'were'}, {65: "n't", 67: 'not'}], 'werent': [{65: 'were', 67: 'were'}, {65: 'nt', 67: 'not'}], 'weren’t': [{65: 'were', 67: 'were'}, {65: 'n’t', 67: 'not'}], 'weve': [{65: 'we', 67: 'we'}, {65: 've', 67: 'have'}], 'we’d': [{65: 'we', 67: 'we'}, {65: '’d', 67: "'d"}], 'we’d’ve': [{65: 'we', 67: 'we'}, {65: '’d', 67: 'would'}, {65: '’ve', 67: 'have'}], 'we’ll': [{65: 'we', 67: 'we'}, {65: '’ll', 67: 'will'}], 'we’ll’ve': [{65: 'we', 67: 'we'}, {65: '’ll', 67: 'will'}, {65: '’ve', 67: 'have'}], 'we’re': 
[{65: 'we', 67: 'we'}, {65: '’re', 67: 'are'}], 'we’ve': [{65: 'we', 67: 'we'}, {65: '’ve', 67: 'have'}], "what'd": [{65: 'what', 67: 'what'}, {65: "'d", 67: "'d"}], "what'd've": [{65: 'what', 67: 'what'}, {65: "'d", 67: 'would'}, {65: "'ve", 67: 'have'}], "what'll": [{65: 'what', 67: 'what'}, {65: "'ll", 67: 'will'}], "what'll've": [{65: 'what', 67: 'what'}, {65: "'ll", 67: 'will'}, {65: "'ve", 67: 'have'}], "what're": [{65: 'what', 67: 'what'}, {65: "'re", 67: 'are'}], "what's": [{65: 'what', 67: 'what'}, {65: "'s", 67: "'s"}], "what've": [{65: 'what', 67: 'what'}, {65: "'ve"}], 'whatd': [{65: 'what', 67: 'what'}, {65: 'd', 67: "'d"}], 'whatdve': [{65: 'what', 67: 'what'}, {65: 'd', 67: 'would'}, {65: 've', 67: 'have'}], 'whatll': [{65: 'what', 67: 'what'}, {65: 'll', 67: 'will'}], 'whatllve': [{65: 'what', 67: 'what'}, {65: 'll', 67: 'will'}, {65: 've', 67: 'have'}], 'whatre': [{65: 'what', 67: 'what'}, {65: 're', 67: 'are'}], 'whats': [{65: 'what', 67: 'what'}, {65: 's'}], 'whatve': [{65: 'what'}, {65: 've', 67: 'have'}], 'what’d': [{65: 'what', 67: 'what'}, {65: '’d', 67: "'d"}], 'what’d’ve': [{65: 'what', 67: 'what'}, {65: '’d', 67: 'would'}, {65: '’ve', 67: 'have'}], 'what’ll': [{65: 'what', 67: 'what'}, {65: '’ll', 67: 'will'}], 'what’ll’ve': [{65: 'what', 67: 'what'}, {65: '’ll', 67: 'will'}, {65: '’ve', 67: 'have'}], 'what’re': [{65: 'what', 67: 'what'}, {65: '’re', 67: 'are'}], 'what’s': [{65: 'what', 67: 'what'}, {65: '’s', 67: "'s"}], 'what’ve': [{65: 'what', 67: 'what'}, {65: '’ve'}], "when'd": [{65: 'when', 67: 'when'}, {65: "'d", 67: "'d"}], "when'd've": [{65: 'when', 67: 'when'}, {65: "'d", 67: 'would'}, {65: "'ve", 67: 'have'}], "when'll": [{65: 'when', 67: 'when'}, {65: "'ll", 67: 'will'}], "when'll've": [{65: 'when', 67: 'when'}, {65: "'ll", 67: 'will'}, {65: "'ve", 67: 'have'}], "when're": [{65: 'when', 67: 'when'}, {65: "'re", 67: 'are'}], "when's": [{65: 'when', 67: 'when'}, {65: "'s", 67: "'s"}], "when've": [{65: 'when', 67: 'when'}, {65: 
"'ve"}], 'whend': [{65: 'when', 67: 'when'}, {65: 'd', 67: "'d"}], 'whendve': [{65: 'when', 67: 'when'}, {65: 'd', 67: 'would'}, {65: 've', 67: 'have'}], 'whenll': [{65: 'when', 67: 'when'}, {65: 'll', 67: 'will'}], 'whenllve': [{65: 'when', 67: 'when'}, {65: 'll', 67: 'will'}, {65: 've', 67: 'have'}], 'whenre': [{65: 'when', 67: 'when'}, {65: 're', 67: 'are'}], 'whens': [{65: 'when', 67: 'when'}, {65: 's'}], 'whenve': [{65: 'when'}, {65: 've', 67: 'have'}], 'when’d': [{65: 'when', 67: 'when'}, {65: '’d', 67: "'d"}], 'when’d’ve': [{65: 'when', 67: 'when'}, {65: '’d', 67: 'would'}, {65: '’ve', 67: 'have'}], 'when’ll': [{65: 'when', 67: 'when'}, {65: '’ll', 67: 'will'}], 'when’ll’ve': [{65: 'when', 67: 'when'}, {65: '’ll', 67: 'will'}, {65: '’ve', 67: 'have'}], 'when’re': [{65: 'when', 67: 'when'}, {65: '’re', 67: 'are'}], 'when’s': [{65: 'when', 67: 'when'}, {65: '’s', 67: "'s"}], 'when’ve': [{65: 'when', 67: 'when'}, {65: '’ve'}], "where'd": [{65: 'where', 67: 'where'}, {65: "'d", 67: "'d"}], "where'd've": [{65: 'where', 67: 'where'}, {65: "'d", 67: 'would'}, {65: "'ve", 67: 'have'}], "where'll": [{65: 'where', 67: 'where'}, {65: "'ll", 67: 'will'}], "where'll've": [{65: 'where', 67: 'where'}, {65: "'ll", 67: 'will'}, {65: "'ve", 67: 'have'}], "where're": [{65: 'where', 67: 'where'}, {65: "'re", 67: 'are'}], "where's": [{65: 'where', 67: 'where'}, {65: "'s", 67: "'s"}], "where've": [{65: 'where', 67: 'where'}, {65: "'ve"}], 'whered': [{65: 'where', 67: 'where'}, {65: 'd', 67: "'d"}], 'wheredve': [{65: 'where', 67: 'where'}, {65: 'd', 67: 'would'}, {65: 've', 67: 'have'}], 'wherell': [{65: 'where', 67: 'where'}, {65: 'll', 67: 'will'}], 'wherellve': [{65: 'where', 67: 'where'}, {65: 'll', 67: 'will'}, {65: 've', 67: 'have'}], 'wherere': [{65: 'where', 67: 'where'}, {65: 're', 67: 'are'}], 'wheres': [{65: 'where', 67: 'where'}, {65: 's'}], 'whereve': [{65: 'where'}, {65: 've', 67: 'have'}], 'where’d': [{65: 'where', 67: 'where'}, {65: '’d', 67: "'d"}], 'where’d’ve': 
[{65: 'where', 67: 'where'}, {65: '’d', 67: 'would'}, {65: '’ve', 67: 'have'}], 'where’ll': [{65: 'where', 67: 'where'}, {65: '’ll', 67: 'will'}], 'where’ll’ve': [{65: 'where', 67: 'where'}, {65: '’ll', 67: 'will'}, {65: '’ve', 67: 'have'}], 'where’re': [{65: 'where', 67: 'where'}, {65: '’re', 67: 'are'}], 'where’s': [{65: 'where', 67: 'where'}, {65: '’s', 67: "'s"}], 'where’ve': [{65: 'where', 67: 'where'}, {65: '’ve'}], "who'd": [{65: 'who', 67: 'who'}, {65: "'d", 67: "'d"}], "who'd've": [{65: 'who', 67: 'who'}, {65: "'d", 67: 'would'}, {65: "'ve", 67: 'have'}], "who'll": [{65: 'who', 67: 'who'}, {65: "'ll", 67: 'will'}], "who'll've": [{65: 'who', 67: 'who'}, {65: "'ll", 67: 'will'}, {65: "'ve", 67: 'have'}], "who're": [{65: 'who', 67: 'who'}, {65: "'re", 67: 'are'}], "who's": [{65: 'who', 67: 'who'}, {65: "'s", 67: "'s"}], "who've": [{65: 'who', 67: 'who'}, {65: "'ve"}], 'whod': [{65: 'who', 67: 'who'}, {65: 'd', 67: "'d"}], 'whodve': [{65: 'who', 67: 'who'}, {65: 'd', 67: 'would'}, {65: 've', 67: 'have'}], 'wholl': [{65: 'who', 67: 'who'}, {65: 'll', 67: 'will'}], 'whollve': [{65: 'who', 67: 'who'}, {65: 'll', 67: 'will'}, {65: 've', 67: 'have'}], 'whos': [{65: 'who', 67: 'who'}, {65: 's'}], 'whove': [{65: 'who'}, {65: 've', 67: 'have'}], 'who’d': [{65: 'who', 67: 'who'}, {65: '’d', 67: "'d"}], 'who’d’ve': [{65: 'who', 67: 'who'}, {65: '’d', 67: 'would'}, {65: '’ve', 67: 'have'}], 'who’ll': [{65: 'who', 67: 'who'}, {65: '’ll', 67: 'will'}], 'who’ll’ve': [{65: 'who', 67: 'who'}, {65: '’ll', 67: 'will'}, {65: '’ve', 67: 'have'}], 'who’re': [{65: 'who', 67: 'who'}, {65: '’re', 67: 'are'}], 'who’s': [{65: 'who', 67: 'who'}, {65: '’s', 67: "'s"}], 'who’ve': [{65: 'who', 67: 'who'}, {65: '’ve'}], "why'd": [{65: 'why', 67: 'why'}, {65: "'d", 67: "'d"}], "why'd've": [{65: 'why', 67: 'why'}, {65: "'d", 67: 'would'}, {65: "'ve", 67: 'have'}], "why'll": [{65: 'why', 67: 'why'}, {65: "'ll", 67: 'will'}], "why'll've": [{65: 'why', 67: 'why'}, {65: "'ll", 67: 'will'}, {65: 
"'ve", 67: 'have'}], "why're": [{65: 'why', 67: 'why'}, {65: "'re", 67: 'are'}], "why's": [{65: 'why', 67: 'why'}, {65: "'s", 67: "'s"}], "why've": [{65: 'why', 67: 'why'}, {65: "'ve"}], 'whyd': [{65: 'why', 67: 'why'}, {65: 'd', 67: "'d"}], 'whydve': [{65: 'why', 67: 'why'}, {65: 'd', 67: 'would'}, {65: 've', 67: 'have'}], 'whyll': [{65: 'why', 67: 'why'}, {65: 'll', 67: 'will'}], 'whyllve': [{65: 'why', 67: 'why'}, {65: 'll', 67: 'will'}, {65: 've', 67: 'have'}], 'whyre': [{65: 'why', 67: 'why'}, {65: 're', 67: 'are'}], 'whys': [{65: 'why', 67: 'why'}, {65: 's'}], 'whyve': [{65: 'why'}, {65: 've', 67: 'have'}], 'why’d': [{65: 'why', 67: 'why'}, {65: '’d', 67: "'d"}], 'why’d’ve': [{65: 'why', 67: 'why'}, {65: '’d', 67: 'would'}, {65: '’ve', 67: 'have'}], 'why’ll': [{65: 'why', 67: 'why'}, {65: '’ll', 67: 'will'}], 'why’ll’ve': [{65: 'why', 67: 'why'}, {65: '’ll', 67: 'will'}, {65: '’ve', 67: 'have'}], 'why’re': [{65: 'why', 67: 'why'}, {65: '’re', 67: 'are'}], 'why’s': [{65: 'why', 67: 'why'}, {65: '’s', 67: "'s"}], 'why’ve': [{65: 'why', 67: 'why'}, {65: '’ve'}], "won't": [{65: 'wo', 67: 'will'}, {65: "n't", 67: 'not'}], "won't've": [{65: 'wo', 67: 'will'}, {65: "n't", 67: 'not'}, {65: "'ve", 67: 'have'}], 'wont': [{65: 'wo', 67: 'will'}, {65: 'nt', 67: 'not'}], 'wontve': [{65: 'wo', 67: 'will'}, {65: 'nt', 67: 'not'}, {65: 've', 67: 'have'}], 'won’t': [{65: 'wo', 67: 'will'}, {65: 'n’t', 67: 'not'}], 'won’t’ve': [{65: 'wo', 67: 'will'}, {65: 'n’t', 67: 'not'}, {65: '’ve', 67: 'have'}], "would've": [{65: 'would', 67: 'would'}, {65: "'ve"}], "wouldn't": [{65: 'would', 67: 'would'}, {65: "n't", 67: 'not'}], "wouldn't've": [{65: 'would', 67: 'would'}, {65: "n't", 67: 'not'}, {65: "'ve", 67: 'have'}], 'wouldnt': [{65: 'would', 67: 'would'}, {65: 'nt', 67: 'not'}], 'wouldntve': [{65: 'would', 67: 'would'}, {65: 'nt', 67: 'not'}, {65: 've', 67: 'have'}], 'wouldn’t': [{65: 'would', 67: 'would'}, {65: 'n’t', 67: 'not'}], 'wouldn’t’ve': [{65: 'would', 67: 'would'}, {65: 
'n’t', 67: 'not'}, {65: '’ve', 67: 'have'}], 'wouldve': [{65: 'would', 67: 'would'}, {65: 've'}], 'would’ve': [{65: 'would', 67: 'would'}, {65: '’ve'}], 'xD': [{65: 'xD'}], 'xDD': [{65: 'xDD'}], "y'all": [{65: "y'", 67: 'you'}, {65: 'all'}], 'yall': [{65: 'y', 67: 'you'}, {65: 'all'}], "you'd": [{65: 'you', 67: 'you'}, {65: "'d", 67: "'d"}], "you'd've": [{65: 'you', 67: 'you'}, {65: "'d", 67: 'would'}, {65: "'ve", 67: 'have'}], "you'll": [{65: 'you', 67: 'you'}, {65: "'ll", 67: 'will'}], "you'll've": [{65: 'you', 67: 'you'}, {65: "'ll", 67: 'will'}, {65: "'ve", 67: 'have'}], "you're": [{65: 'you', 67: 'you'}, {65: "'re", 67: 'are'}], "you've": [{65: 'you', 67: 'you'}, {65: "'ve", 67: 'have'}], 'youd': [{65: 'you', 67: 'you'}, {65: 'd', 67: "'d"}], 'youdve': [{65: 'you', 67: 'you'}, {65: 'd', 67: 'would'}, {65: 've', 67: 'have'}], 'youll': [{65: 'you', 67: 'you'}, {65: 'll', 67: 'will'}], 'youllve': [{65: 'you', 67: 'you'}, {65: 'll', 67: 'will'}, {65: 've', 67: 'have'}], 'youre': [{65: 'you', 67: 'you'}, {65: 're', 67: 'are'}], 'youve': [{65: 'you', 67: 'you'}, {65: 've', 67: 'have'}], 'you’d': [{65: 'you', 67: 'you'}, {65: '’d', 67: "'d"}], 'you’d’ve': [{65: 'you', 67: 'you'}, {65: '’d', 67: 'would'}, {65: '’ve', 67: 'have'}], 'you’ll': [{65: 'you', 67: 'you'}, {65: '’ll', 67: 'will'}], 'you’ll’ve': [{65: 'you', 67: 'you'}, {65: '’ll', 67: 'will'}, {65: '’ve', 67: 'have'}], 'you’re': [{65: 'you', 67: 'you'}, {65: '’re', 67: 'are'}], 'you’ve': [{65: 'you', 67: 'you'}, {65: '’ve', 67: 'have'}], 'y’all': [{65: 'y’', 67: 'you'}, {65: 'all'}], '\xa0': [{65: '\xa0', 67: '  '}], '¯\\(ツ)/¯': [{65: '¯\\(ツ)/¯'}], '°C.': [{65: '°'}, {65: 'C'}, {65: '.'}], '°F.': [{65: '°'}, {65: 'F'}, {65: '.'}], '°K.': [{65: '°'}, {65: 'K'}, {65: '.'}], '°c.': [{65: '°'}, {65: 'c'}, {65: '.'}], '°f.': [{65: '°'}, {65: 'f'}, {65: '.'}], '°k.': [{65: '°'}, {65: 'k'}, {65: '.'}], 'ä.': [{65: 'ä.'}], 'ö.': [{65: 'ö.'}], 'ü.': [{65: 'ü.'}], 'ಠ_ಠ': [{65: 'ಠ_ಠ'}], 'ಠ︵ಠ': [{65: 'ಠ︵ಠ'}], '—': [{65: 
'—'}], '‘S': [{65: '‘S', 67: "'s"}], '‘s': [{65: '‘s', 67: "'s"}], '’': [{65: '’'}], '’Cause': [{65: '’Cause', 67: 'because'}], '’Cos': [{65: '’Cos', 67: 'because'}], '’Coz': [{65: '’Coz', 67: 'because'}], '’Cuz': [{65: '’Cuz', 67: 'because'}], '’S': [{65: '’S', 67: "'s"}], '’bout': [{65: '’bout', 67: 'about'}], '’cause': [{65: '’cause', 67: 'because'}], '’cos': [{65: '’cos', 67: 'because'}], '’coz': [{65: '’coz', 67: 'because'}], '’cuz': [{65: '’cuz', 67: 'because'}], '’d': [{65: '’d'}], '’em': [{65: '’em', 67: 'them'}], '’ll': [{65: '’ll', 67: 'will'}], '’nuff': [{65: '’nuff', 67: 'enough'}], '’re': [{65: '’re', 67: 'are'}], '’s': [{65: '’s', 67: "'s"}], '’’': [{65: '’’'}]}
class kazu.utils.spacy_pipeline.SpacyPipelines[source]

Bases: object

Wraps spaCy pipelines into a singleton, so multiple can be accessed from different locations without additional memory overhead.

In addition, due to a known memory issue, each pipeline is reloaded after a fixed number of calls, given by reload_at.

Note

In order for the Garbage Collector to remove old spaCy Vocab objects, users should ensure that spacy.Doc objects are de-referenced as soon as possible (i.e. that you don’t keep the results of process_single() or process_batch() around for a long time).
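The reload behaviour described above can be pictured with a small standard-library sketch. This is not kazu's implementation: the class name ReloadingRegistry, its instance methods, and the tiny reload_at value are all assumptions made for illustration, and a plain Python callable stands in for a spaCy Language.

```python
from typing import Callable


class ReloadingRegistry:
    """Toy singleton: one shared registry of loaders and loaded models."""

    _instance = None
    reload_at = 3  # hypothetical small value so a reload is easy to observe

    def __new__(cls):
        # Always hand back the same object, so state is shared everywhere.
        if cls._instance is None:
            cls._instance = super().__new__(cls)
            cls._instance._loaders = {}
            cls._instance._models = {}
            cls._instance._calls = {}
        return cls._instance

    def add_from_func(self, name: str, func: Callable[[], Callable]) -> None:
        # Keep the loader so the model can be rebuilt later.
        self._loaders[name] = func
        self._models[name] = func()
        self._calls[name] = 0

    def process_single(self, text: str, model_name: str):
        # Reload once the model has served reload_at calls, then process.
        if self._calls[model_name] >= self.reload_at:
            self._models[model_name] = self._loaders[model_name]()
            self._calls[model_name] = 0
        self._calls[model_name] += 1
        return self._models[model_name](text)


loads = []


def make_model() -> Callable[[str], str]:
    loads.append(1)  # record every (re)load
    return str.upper  # stand-in for a spaCy pipeline


registry = ReloadingRegistry()
registry.add_from_func("upper", make_model)
results = [registry.process_single(t, "upper") for t in ["a", "b", "c", "d"]]
```

In this sketch the fourth call triggers a second load, and any object still holding a result from the old model would keep that model alive, which is the reason the note asks callers to drop Doc references promptly.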

__init__()[source]
Return type:

None

static add_from_func(name, func)[source]

Add a spaCy model from a callable.

Parameters:
  • name – key under which the model is registered

  • func – callable that returns the spaCy model

Return type:

None

static add_from_path(name, path)[source]

Add a spaCy model from a path.

Convenience function to call add_from_func() with a wrapped version of spacy.load with the relevant path argument.

Parameters:
  • name – key under which the model is registered

  • path – path passed to spacy.load

Return type:

None

static add_reload_callback_func(name, func)[source]

Add a callback when a model is reloaded.

If you use spaCy components outside the context of a Language, they also need to be reloaded when the underlying model is reloaded. To do this, provide a zero-argument callable that returns None. If the callback needs to modify a field of an object, we recommend building it with functools.partial().

Parameters:
  • name – key of the model the callback is attached to

  • func – zero-argument callable that returns None

Return type:

None
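
A minimal sketch of the functools.partial() pattern recommended above. MatcherHolder and rebuild_matcher are hypothetical names, and strings stand in for real spaCy components. With kazu, the resulting callback would be passed to add_reload_callback_func(); here it is called directly so the example runs on its own.

```python
from functools import partial


class MatcherHolder:
    """Hypothetical object caching something built from a spaCy model."""

    def __init__(self) -> None:
        self.matcher = "built from old vocab"


def rebuild_matcher(holder: MatcherHolder) -> None:
    # Stand-in for rebuilding a spaCy component against the reloaded model.
    holder.matcher = "built from new vocab"


holder = MatcherHolder()
# partial binds the argument now, leaving a zero-argument callable
# that returns None, which is the shape the callback must have.
callback = partial(rebuild_matcher, holder)
result = callback()
```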

static get_model(name)[source]

Get the underlying Language for a given model key.

Parameters:

name (str)

Return type:

Language

process_batch(texts: Iterable[tuple[str | Doc, Any]], model_name: str, as_tuples: Literal[True], **kwargs: Any) → Iterable[tuple[Doc, Any]][source]
process_batch(texts: Iterable[str | Doc], model_name: str, as_tuples: Literal[False] = False, **kwargs: Any) → Iterable[Doc]

Process an iterable of spacy.Doc or strings with a given spaCy model.

Parameters:
  • texts – either an iterable of ‘texts’ (spacy.Docs or strs) if as_tuples=False (the default), or an iterable of length 2 tuples (text, context) where context is an arbitrary python object that is provided back in the output alongside its matching text.

  • model_name – spaCy model to process texts with

  • as_tuples – If set to True (the default is False), texts are paired with a ‘context’ inside a tuple both in input and output. See the description of the texts argument and the output, and read the type information.

  • kwargs – passed to Language.pipe.

Returns:

Iterable of spacy.Docs if as_tuples=False (the default). If as_tuples=True, the result is an Iterable of (doc, context) tuples.
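
The two input and output shapes selected by as_tuples can be illustrated without spaCy. The function below, fake_process_batch, is a hypothetical stand-in that upper-cases strings instead of producing Docs. It only mirrors the contract described above and is not how kazu or spaCy implement it.

```python
from typing import Any, Iterable, Iterator, Tuple, Union


def fake_process_batch(
    texts: Iterable[Union[str, Tuple[str, Any]]], as_tuples: bool = False
) -> Iterator[Any]:
    """Stand-in that upper-cases text instead of running a spaCy model."""
    for item in texts:
        if as_tuples:
            text, context = item  # each input is a (text, context) pair
            yield text.upper(), context  # context comes back unchanged
        else:
            yield item.upper()


plain = list(fake_process_batch(["aspirin", "ibuprofen"]))
paired = list(
    fake_process_batch(
        [("aspirin", {"id": 1}), ("ibuprofen", {"id": 2})], as_tuples=True
    )
)
```

The context slot is useful for carrying an identifier alongside each text, so results can be matched back to their source after batching.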

process_single(text, model_name, **kwargs)[source]

Process a single Doc or text with a given spaCy model.

Parameters:
  • text (str | Doc)

  • model_name (str)

  • kwargs (Any) – passed to the call method of Language

Return type:

Doc

reload_model(model_name)[source]

Reload a model, clearing the spaCy vocab.

Parameters:

model_name (str)

Return type:

None

reload_at

The interval (number of calls) at which spaCy models are reloaded.

Normally set within the kazu model pack config using the environment variable KAZU_SPACY_RELOAD_INTERVAL. If it is not set (either in the config or as a normal environment variable), it defaults to 1000.

Note

As this class is a singleton, modifying this will change the reload value for all spaCy pipelines (i.e. globally).

Type:

int
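
The fallback described above can be sketched in plain Python. Kazu itself resolves the value through its model pack config, so the helper resolve_reload_interval below is a hypothetical illustration of the "environment variable, else 1000" rule rather than the library's code.

```python
import os
from typing import Mapping

DEFAULT_RELOAD_INTERVAL = 1000  # default stated in the description above


def resolve_reload_interval(environ: Mapping[str, str] = os.environ) -> int:
    # Use the environment variable when present, otherwise the default.
    raw = environ.get("KAZU_SPACY_RELOAD_INTERVAL")
    return int(raw) if raw is not None else DEFAULT_RELOAD_INTERVAL


unset_value = resolve_reload_interval({})
set_value = resolve_reload_interval({"KAZU_SPACY_RELOAD_INTERVAL": "250"})
```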

kazu.utils.spacy_pipeline.basic_spacy_pipeline()[source]

A basic spaCy pipeline with a sentence splitter and a customised tokenizer.

Return type:

Language