PerSpellData: An Exhaustive Parallel Spell Dataset For Persian
NSURL 2021, Trento, Italy, 2021
PerSpellData is a comprehensive parallel dataset developed for the task of spell-checking in Persian. It covers both non-word and real-word errors and can be used to develop an encoder-decoder model that detects and corrects such errors.
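The distinction between the two error types can be illustrated with a minimal sketch. A non-word error produces a token that is not in the vocabulary, while a real-word error produces a valid word that is wrong in context. The lexicon, the English example sentences, and the function name `classify_errors` below are hypothetical stand-ins for illustration only. They are not taken from PerSpellData, and the sketch assumes the noisy and clean sentences are already aligned token by token.

```python
# Toy illustration only: this lexicon is hypothetical and is not part of PerSpellData.
LEXICON = {"i", "want", "a", "piece", "peace", "of", "cake"}


def classify_errors(noisy: str, clean: str, lexicon: set[str]) -> list[tuple[str, str, str]]:
    """Label each differing token pair as a non-word or real-word error.

    Assumes both sentences are whitespace-tokenized and token-aligned,
    i.e. they contain the same number of tokens.
    """
    noisy_tokens = noisy.lower().split()
    clean_tokens = clean.lower().split()
    if len(noisy_tokens) != len(clean_tokens):
        raise ValueError("sentences must be token-aligned")

    labels = []
    for wrong, right in zip(noisy_tokens, clean_tokens):
        if wrong == right:
            continue  # token is unchanged, so there is no error to label
        # A misspelling that is still a valid word is a real-word error;
        # one that falls outside the lexicon is a non-word error.
        kind = "real-word" if wrong in lexicon else "non-word"
        labels.append((wrong, right, kind))
    return labels


if __name__ == "__main__":
    clean = "i want a piece of cake"
    print(classify_errors("i want a piec of cake", clean, LEXICON))
    print(classify_errors("i want a peace of cake", clean, LEXICON))
```

In a parallel dataset of this kind, each noisy sentence paired with its clean counterpart would serve as a source and target pair for an encoder-decoder model. A lexicon lookup alone cannot catch the second case, which is why real-word errors generally need sentence context.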