Data Cleaning

As well as identifying data quality issues, Spotless can clean dirty data and replacing invalid fields with corrected data.

Supported rules are:

  • Fixing dates and times to a specific format
  • Correcting numbers to a given number of decimal places
  • Setting unique foreign key references that have been truncated or corrupted
  • Adding missing data using our machine learning lookalike model
  • Updating common spelling mistakes with a lookup rule

    Blog posts about Data Cleaning

    Dec. 4, 2017, 9:33 a.m.
    A sign taken from a ski slope in the Alps illustrates the new and improved Spotless version 16 release. We are delighted to announce the release of the new improved version 16 of Spotless Data's solution to all your rogue data issues through a data cleaning API easily accessible through your web browser. There are four new areas where we have improved our machine learning filters whic...
    Read More
    Nov. 14, 2017, 11:40 a.m.
    Miniature cleaners cleaning a laptop symbolises the importance of Data Cleaning. The need for data cleaning has never been so important in a world dominated by big data, which are of no use at all unless they have been suitably cleaned, and by Artificial Intelligence, which is about as smart as a wet blanket if it is working with data which have not undergone a thorough data cleaning process...
    Read More
    Dec. 18, 2017, 7:15 a.m.
    When doctors and healthcare IT specialists have clean, quality data to work with then the data can help save lives and cure illnesses. Modern technologies that involve large quantities of data are increasingly used in medicine and healthcare to help us all live longer and healthier lives, while helping cure intractable and horrible illnesses like cancer and Parkinson's. However, getting ...
    Read More
    Dec. 11, 2017, 7:27 a.m.
    By using big data in manufacturing processes, such as a tablet monitoring what is happening in a factory, manufacturers ensure greater efficiency and success. Manufacturing has always needed large quantities of data to produce the complicated goods typical of a modern industrial society while using data for the time management of workers and general efficiency within a factory stretches back...
    Read More
    Dec. 1, 2017, 7:17 a.m.
    Data quality is now achievable for the financial sector at minimum effort with Spotless Data's data cleaning solution The financial sector has always needed to ingest and analyse large quantities of both structured and unstructured data to make the often complex decisions required within highly competitive markets where millions can be made or lost depending on the decisions taken. The c...
    Read More
    Sept. 29, 2017, 5:16 a.m.
    Spotless Data's Machine Learning Filters creating Artificial Intelligence for Data Quality. What inspired us to set up Spotless Data as a new start-up, in November 2015, using Machine Learning filters for data cleaning to reach the holy grail of Data Quality you can trust, and which can do not merely what they were designed to do but new things as well, has been seeing time and again wit...
    Read More
    Sept. 15, 2017, 4:27 a.m.
    Spotless is a web-based API that filters data coming into your systems so rogue data can never get into your data platforms. What is Rogue Data? Also known as dirty data, rogue data are essentially any corrupted, mismatched or inaccurate data which, if they get into your data platforms, will affect either your final product and thus are used and/or viewed by your customers, or will affect...
    Read More
    Sept. 15, 2017, 6:45 a.m.
    Our Machine Learning Filters eliminating rogue data thus ensuring you have spotless data quality. A simple definition of filters is that they filter out information or data which are not wanted. A more complex definition is that filters take one list of information/data, made of one or several columns, and convert it into another modified list. It does this by examining the content of the li...
    Read More
    Aug. 18, 2017, 7:59 a.m.
    Recruiting a data science team to manage big data is no easy task but easier when you already use Spotless Data's Machine Learning Filters. Managing your big data is no longer a task that only tech companies specialising in Artificial Intelligence require. Until recently big data managers only really existed in big IT companies with billions of dollars of resources while simple data mana...
    Read More
    Aug. 4, 2017, 9:04 a.m.
    The best way to trust your data again is to ensure they are quality data by using Spotless Machine Learning Filters. At the heart of Spotless Data Quality API solution lie our newly developed Machine Learning filters for data cleaning, filtering out the dirty data containing mismatches, duplications, and other corruptions. Many of these dirty data issues are predictable and are caused by a m...
    Read More