Rogue Data

Blog posts about Rogue Data

Nov. 14, 2017, 11:40 a.m.
Miniature cleaners cleaning a laptop symbolises the importance of data cleaning. The need for data cleaning has never been so important in a world dominated by big data, which are of no use at all unless they have been suitably cleaned, and by artificial intelligence, which is about as smart as a wet blanket if it is working with data which have not undergone a thorough data cleaning process...
Read More
Nov. 2, 2017, 6:10 a.m.
The first step towards achieving data integrity for all the different data entering your repository is using Spotless Data's machine learning filters, to ensure trustworthy data quality every time. Data Integrity means have data quality throug data validation so that you can trust in your data throughout their life cycle, starting from their ingestion into a data repository. At Spotless ...
Read More
Sept. 29, 2017, 5:16 a.m.
Spotless Data's machine learning filters creating Artificial Intelligence for Data Quality. What inspired us to set up Spotless Data as a new start-up, in November 2015, using machine learning filters for data cleaning to reach the holy grail of data quality you can trust, and which can do not merely what they were designed to do but new things as well, has been seeing time and again wit...
Read More
Sept. 15, 2017, 4:27 a.m.
Spotless is a web-based API that filters data coming into your systems so rogue data can never get into your data platforms. What is rogue data? Also known as dirty data, rogue data are essentially any corrupted, mismatched or inaccurate data which, if they get into your data platforms, will affect either your final product and thus are used and/or viewed by your customers, or will affect...
Read More
Sept. 15, 2017, 6:45 a.m.
Our machine learning filters eliminating rogue data thus ensuring you have spotless data quality. A simple definition of filters is that they filter out information or data which are not wanted. A more complex definition is that filters take one list of information/data, made of one or several columns, and convert it into another modified list. It does this by examining the content of the li...
Read More
Dec. 22, 2016, 7:54 a.m.
A hype cycle perfectly visually illustrates Amara's law. Amara's Law is a computer saying which states: We tend to overestimate the effect of a technology in the short run and underestimate the effect in the long run. While this law, coined by the then president of the US-based Institute for the Future, Roy Amara, has been associated with both cyber attacks and nanotechnology, ...
Read More
May 12, 2017, 6:51 a.m.
Dirty or rogue data will always have a profound effect on any organisation. Spotless Data have recognised five fundamental different types of cleansing of dirty data which can be done by using our unique web-based data quality API solution. 1. Regex Regex are regular expressions, which define search patterns and identify particular strings found within data sets. For instance, if you k...
Read More
Nov. 15, 2016, 9:50 a.m.
User-generated-content is one area where data quality has never been so important which is why using Spotless Data to ensure data integrity makes so much sense. Corrupted data At Spotless Data we estimate that 5% of overall data held by companies is corrupted and lacking in data integrity, though a recent report estimated that manually entered data could contain an error rate of anywhere ...
Read More
Dec. 5, 2016, 6:52 a.m.
High quality data can transform Tableau data visualisation. Tableau, the data visualization software, is a way of illustrates data that is particularly useful when dealing with data which changes over time. It has a great mapping functionality, with a number of geographic identifiers built into their software, such as country, region and sometimes postcode. However, when using Tableau in ord...
Read More
Oct. 3, 2016, 1:41 p.m.
Getting the data just right has never been easier thanks to Spotless Data Validation Solution to Rogue Data. We’ve just been talking to a client who’s had bad data prevent their business from reporting any of their key KPIs for the last three months. All they knew is they’d implemented a change in one of their systems and suddenly all the data they were reporting on was cle...
Read More