Making Data Spotless

blog

Ensuring Transparency in Machine Learning
Aug. 11, 2017, 7:13 a.m.
The recent spat between Elon Musk and Mark Zuckerberg brought the alleged dangers of Artificial Intelligence (AI) and Machine Learning (ML) to the world's attention. Musk evoked old fears of AI, explored previously by the science-ficti...


Filters
July 24, 2017, 9:46 a.m.
At Spotless, our highly experienced data science team has deep knowledge of data quality and of the problems that dirty data cause. Over the last couple of years, the team has developed a complex system of records, jobs and plans, as well as six rule types, in order to ensure ...


How to Trust Your Data Again
Aug. 4, 2017, 6:23 a.m.
At the heart of the Spotless Data Quality API solution lie our newly developed Machine Learning data cleansing filters, which filter out dirty data containing mismatches, duplications and other corruptions. Ma...


Managing your Data Lake
July 26, 2017, 5 a.m.
It is widely recognised that the simplest way to ensure data quality is to minimise the variety of sources from which your data come. As Berkeley Machine Learning professor Michael Jordan states: "data variet...


On Validating Numbers
Aug. 15, 2017, 12:08 p.m.
Validating numbers will always be important when cleansing data to ensure you have data quality you can trust. The Spotless Data Science team have therefore been working to ensure you can clean any list of numbers which may contain errors. This type ...


Exploring artificial intelligence
Aug. 3, 2017, 8:38 a.m.
Many commentators believe the great majority of businesses will adopt some form of artificial intelligence (AI) in the coming years as this once rare technology goes mainstream. Estimates forecast that 62% of enterprises will use AI by 2018, at...


Big Data Analytics Part 2
July 7, 2017, 5:36 a.m.
Continuing the blog on Big Data and Analytics. Part 1 can be found here. Different Varieties of Data: While data come from different sources and have varying file formats, an issue which is generally fixed by cleansing the data and placing them all into a single ...


Exploring Data Warehouses and Data Quality
Aug. 7, 2017, 2:46 p.m.
A data warehouse is a repository or storage area where all the data in one's company is kept in a single place. This includes data from different sources as well as both current and historical data, perhaps from a legacy platform. It can include data from the company itself, which if the company is a large one might well be spread over many departments, all of which may be using differen...


Big Data Analytics Part 1
June 23, 2017, 11:39 a.m.
With the explosion in data that the world is seeing, big data have become a normal part of modern life and are required for SMEs and large companies to thrive and fully engage with their customers, though this can only happen by converting said big data into useful analysis....


Spotless Version 10 now live
June 7, 2017, 3:22 p.m.
We are delighted to announce the release of version 10 of our unique browseable Data Quality API solution, which ensures you have data you can trust by cleansing your dirty data of corruptions and inaccuracies and leaving them spotlessly clean. Version 10 release...