Data validation has never been so easy thanks to Spotless Data's Machine Learning filters solution.
- Review the structure - A high-level check that a URL is well structured and clean. This can be useful when collecting manually entered URLs from contact databases or similar but does not actually validate that the website is correct
- Check it exists - a check that a URL is not only well structured and clean but also that it exists and that the contents can be retrieved. This can be useful in checking a set of links in a data file or CMS to ensure that there are no dead links on the site
- Verify the contents of the URL - the most in-depth test, this rule checks that a specific URL exists and that it contains specific text. This is useful in checking that URLs are up to date and still contain the content that they are expected to contain, for example, links to news articles or TV shows that may expire
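The three levels of check described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not Spotless Data's implementation or API: the function names (`is_well_structured`, `fetch_text`, `exists`, `contains_text`) are hypothetical, only the standard library is used, and "well structured" is assumed to mean an http or https URL with a host name and no whitespace.

```python
from http.client import HTTPException
from urllib.parse import urlparse
from urllib.request import Request, urlopen


def is_well_structured(url):
    """Level 1: structural check only. No network access is made."""
    url = url.strip()
    if any(ch.isspace() for ch in url):
        return False
    try:
        parts = urlparse(url)
        host = parts.hostname
    except ValueError:
        # Raised for some malformed URLs, e.g. a broken IPv6 literal.
        return False
    # Assumption: only http(s) URLs with a host name count as valid.
    return parts.scheme in ("http", "https") and bool(host)


def fetch_text(url, timeout=10):
    """Return the page body as text, or None if it cannot be retrieved."""
    request = Request(url, headers={"User-Agent": "url-check-sketch"})
    try:
        with urlopen(request, timeout=timeout) as response:
            return response.read().decode("utf-8", errors="replace")
    except (OSError, ValueError, HTTPException):
        # OSError covers URLError, HTTPError (4xx/5xx) and timeouts.
        return None


def exists(url, fetch=fetch_text):
    """Level 2: well structured and the contents can be retrieved."""
    return is_well_structured(url) and fetch(url) is not None


def contains_text(url, expected, fetch=fetch_text):
    """Level 3: the URL exists and its contents include the expected text."""
    if not is_well_structured(url):
        return False
    body = fetch(url)
    return body is not None and expected in body
```

The `fetch` parameter is a design choice made for this sketch: it lets the network call be swapped for a stub, so that the second and third checks can be tried out on a data file without hitting live sites.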
If data quality is an issue for you, or you have known sources of dirty data but your files are too big and the problems too numerous to fix manually, please log in and try it now.