Optimus is the missing framework for cleaning and pre-processing data in a distributed fashion with pyspark.