Data testing and monitoring for SQL accessible data
Soda SQL
Data testing, monitoring, and profiling for SQL-accessible data.
Soda SQL is an open-source command-line tool. It utilizes user-defined input to prepare SQL queries that run tests on tables in a data warehouse to find invalid, missing, or unexpected data. When tests fail, they surface “bad” data that you can fix to ensure that downstream analysts are using “good” data to make decisions.
Test your data
If your organization uses data to make decisions, you should always be testing your data.
- When data comes into a system, you should test it.
- When data is transformed or otherwise manipulated to fit into an app or other database, you should test it.
- When data is about to be exported, you should test it.
- Test to make sure data is