lookout.style.typos.research.create_typos

Module Contents

lookout.style.typos.research.create_typos.letters
lookout.style.typos.research.create_typos.rand_insert(string:str)

Add random letter inside a string

lookout.style.typos.research.create_typos.rand_delete(string:str)

Delete random symbol from a string

lookout.style.typos.research.create_typos.rand_substitution(string)

Substitute random symbol with a letter inside a string

lookout.style.typos.research.create_typos.rand_swap(string)

Swap two random consequent symbols

lookout.style.typos.research.create_typos.rand_typo(string)

Make random typo in a string

lookout.style.typos.research.create_typos.corrupt(data_file, typo_probability, add_typo_probability, out_file)

Augment some of identifiers from dataframe with TYPO_PROBABILITY, consequent typos in the same word happen with ADD_TYPO_PROBABILITY each

lookout.style.typos.research.create_typos.corrupt_splits(data_file, typo_probability, add_typo_probability, out_file, repeats:int=1)
lookout.style.typos.research.create_typos.train_test_split(data_file, test_portion)

Randomly split data on train and test

lookout.style.typos.research.create_typos.create_typos(args)