Spotless Data's machine learning filters creating Artificial Intelligence for Data Quality.
What inspired us to set up Spotless Data as a new start-up in November 2015 was seeing, time and again, that the clients we were working with in our other business enterprises needed a level of data quality they simply did not have. Instead, they were plagued with problems of rogue data. Our aim was to use machine learning filters for data cleaning to reach the holy grail of data quality you can trust, with filters that can do not merely what they were designed to do but new things as well.
We ended up having to help these clients, mostly large corporations with a great deal of big data, including wasted dark data which they were failing to use to maximise their business advantage. We enabled them to set the processes in motion so that they could clean their data as the first step towards being able to use them in a whole range of areas within their organisations: displaying information on their websites, creating their products, designing their latest data-driven marketing and advertising campaigns, and internal business reporting on the state of their business, including monitoring how the most recent sales campaign was going and the financial health of their companies.
We realised it would be much more cost-effective, not merely for our clients but for businesses everywhere, to use a service such as Spotless Data to help with their data integration rather than to engage in the time-consuming and expensive process of setting up their own data cleaning from scratch. We also realised that by using artificial intelligence, in the form of machine learning filters, we could set up an easy-to-use Python API that our customers could work with simply by opening a web browser on any device from which they could also access their data. These filters would learn from their own experience and thus become better at what they were doing over time, becoming highly competent in dealing with a wide range of rogue data and data validation issues. We could, therefore, offer a service that was both needed by and useful to our established and potential clients.
Not only is our API easy to use, but it also has another brilliant advantage over data quality solutions that involve downloading a software app and installing it on one's PC or other device: those solutions have no real way of dealing with failures in the data cleaning process. At Spotless we absolutely do not believe that a failure to clean a particular batch of data satisfactorily means that our solution isn't fit for purpose, or that we should pretend such failures don't occur. On the contrary, we consider these so-called failures to be a high priority and a real opportunity for our machine learning filters to learn something new with the help of our thoroughly experienced data science team.
What we do is quarantine the problematic data and get in touch with you via the email address you provide, so that between us we can discuss the issue and quickly come to a resolution. We are all too aware that businesses need their newly acquired data cleaned not tomorrow but today, right now, in real time. Except for the very largest files, the entire process of filtering out dirty data and producing spotlessly clean data takes roughly 5 minutes, as illustrated in these two YouTube videos: data cleaning an EPG file and data cleaning a genre column within an EPG file.
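To picture the idea of quarantining problematic data, here is a minimal Python sketch. It is not Spotless Data's implementation or API: the function name, the list of allowed genres and the sample EPG rows are all hypothetical, chosen only to show rows that fail a validation rule being set aside for review instead of being silently dropped or passed through.

```python
import csv
import io

# Hypothetical rule set: these allowed genre values are illustrative only.
ALLOWED_GENRES = {"news", "sport", "drama", "comedy"}


def split_clean_and_quarantined(rows, column, allowed):
    """Separate rows whose `column` value passes validation from those that fail."""
    clean, quarantined = [], []
    for row in rows:
        value = (row.get(column) or "").strip().lower()
        if value in allowed:
            # Normalise the accepted value so downstream consumers see one spelling.
            clean.append({**row, column: value})
        else:
            # Keep the original row untouched so a person can review it later.
            quarantined.append(row)
    return clean, quarantined


# Made-up sample data standing in for an EPG file with a genre column.
SAMPLE_EPG = """title,genre
Evening Bulletin,News
Cup Final, sport
Mystery Hour,dramma
Late Laughs,
"""

rows = list(csv.DictReader(io.StringIO(SAMPLE_EPG)))
clean_rows, quarantined_rows = split_clean_and_quarantined(rows, "genre", ALLOWED_GENRES)
print(len(clean_rows), "clean;", len(quarantined_rows), "quarantined for review")
```

In this sketch the misspelt and the empty genre values end up in the quarantined list with their original contents intact, which is what makes a later conversation about how to resolve them possible.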
What we particularly like about the API solution that we have created is that you submit your data simply by uploading it to the my filters page (you need to be logged in to see this, which you can do via your email, Facebook, Google or GitHub account). There we analyse the file and give you a report on its contents, which takes about a minute. We then give you the power and responsibility of setting the variables which will clean the file. After all, you are the person who knows your data and what you want from them. This is easy to do using our drop-down boxes and simple instructions. As it may initially take a little time to get used to our solution and to get exactly the results that you want, we are offering each new customer 500MB of free data cleaning to test it. This ensures that when you go live with it on your platforms, you will be getting the results that you want.
Here is our introduction to using our API. We guarantee that your data are secure and not accessible to any third parties while they are in our care, a responsibility we take very seriously.
Here is a quick link to our FAQ. You can also check out our range of subscription packages and pricing. If you would like to contact us you can speak to one of our team by pressing on the white square icon with a smile within a blue circle, which you can find in the bottom right-hand corner of any of the web pages on our site.
If your data quality is an issue, or you know that you have sources of dirty data but your files are just too big and the problems too numerous to fix manually, please do log in and try now.