PerSpellData: An Exhaustive Parallel Spell Dataset For Persian

NSURL 2021, Trento, Italy, 2021

PerSpellData is a comprehensive parallel dataset developed for spell-checking in Persian. It covers both non-word and real-word errors and can be used to train an encoder-decoder model that detects and corrects such errors.
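
The distinction between the two error types can be illustrated with a minimal Python sketch. It assumes a parallel pair is an (erroneous, correct) pair of tokens and that a reference vocabulary is available; the function name, the pair format, and the toy English vocabulary are hypothetical and are not taken from the dataset itself.

```python
def classify_error(wrong, correct, vocabulary):
    """Label a parallel (wrong, correct) token pair by error type.

    Hypothetical helper for illustration only; it does not reflect
    the dataset's actual format or the authors' tooling.
    """
    if wrong == correct:
        return "no-error"
    # A real-word error is a valid word that is wrong in its context.
    if wrong in vocabulary:
        return "real-word"
    # Otherwise the erroneous token is not a valid word at all.
    return "non-word"


# Toy English vocabulary and pairs, chosen only to show the idea.
vocabulary = {"their", "there", "house", "is"}
pairs = [("thier", "their"), ("there", "their"), ("house", "house")]

labels = [classify_error(w, c, vocabulary) for w, c in pairs]
print(labels)
```

A non-word error can be detected by a vocabulary lookup alone, whereas a real-word error needs the surrounding context, which is why a parallel corpus suits a sequence-to-sequence model.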