Spotless provides a web service for cleansing data. It's designed to be easy to incorporate into your applications so that you always have clean data to work from. It offers many rules for validating data and several options for handling invalid data.
Spotless works using rules, plans, and jobs. Files are cleansed in jobs, each of which applies a series of rules to a file according to a plan specified by the user.
To run Spotless, create some rules (or choose from the publicly available rules), make a plan (a cleansing profile that describes the file to be uploaded), and then submit one or more jobs for processing.
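The relationship between rules, plans, and jobs can be pictured with a small local sketch. This is not the Spotless API: the Rule and Plan classes, the run_job function, and the not_empty and is_integer rules are all hypothetical names invented for illustration, and the code runs entirely on your own machine. It only shows the idea of a job applying a plan's rules to each record in a file.

```python
import csv
import io
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Tuple


@dataclass
class Rule:
    """A named check on a single field value (hypothetical shape, not Spotless's)."""
    name: str
    is_valid: Callable[[str], bool]


@dataclass
class Plan:
    """Describes the file by mapping column names to the rule for that column."""
    column_rules: Dict[str, Rule] = field(default_factory=dict)


def run_job(plan: Plan, text: str) -> List[Tuple[int, str, str]]:
    """Apply the plan's rules to every record.

    Returns a list of (record number, column, rule name) for each failed check.
    Record numbers start at 1 and do not count the header line.
    """
    failures = []
    reader = csv.DictReader(io.StringIO(text))
    for record_number, record in enumerate(reader, start=1):
        for column, rule in plan.column_rules.items():
            if not rule.is_valid(record.get(column) or ""):
                failures.append((record_number, column, rule.name))
    return failures


# Two illustrative rules and a plan that assigns them to columns.
not_empty = Rule("not_empty", lambda value: value.strip() != "")
is_integer = Rule("is_integer", lambda value: value.strip().lstrip("-").isdigit())
plan = Plan({"name": not_empty, "age": is_integer})

data = "name,age\nAlice,30\n,forty\n"
failures = run_job(plan, data)
for record_number, column, rule_name in failures:
    print(f"record {record_number}: column '{column}' failed rule '{rule_name}'")
```

In this sketch the plan is the only thing that knows about the file's columns, so the same rules can be reused across plans for differently shaped files.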
Spotless is typically used to cleanse multiple records in a file. It can process any kind of text file. The default format is CSV; to process a TSV file, set the csv_delimiter parameter on the plan to \t (the tab character).
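To see why the delimiter matters, the following sketch parses the same tab-separated text locally with Python's standard csv module. It does not call Spotless; read_records is a hypothetical helper, and only the csv_delimiter name is taken from the plan parameter described above.

```python
import csv
import io
from typing import List


def read_records(text: str, csv_delimiter: str = ",") -> List[List[str]]:
    """Split text into records and fields using the given delimiter.

    The default mirrors CSV; pass "\t" for tab-separated files.
    (Local illustration only, not how Spotless parses files internally.)
    """
    return list(csv.reader(io.StringIO(text), delimiter=csv_delimiter))


tsv_text = "name\tcity\nAlice\tParis, France\n"

# With the tab delimiter, each record splits into the intended two fields.
records = read_records(tsv_text, csv_delimiter="\t")

# With the default comma, the fields are split in the wrong place.
misparsed = read_records(tsv_text)
```

Parsing a TSV file with the default comma delimiter splits fields at any commas in the data rather than at the tabs, which is the kind of error that setting csv_delimiter avoids.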
Typical use cases are: