Skip to contents

Run checks

Main entry point and individual check runners

run_dq_check()
Run a full data quality check pipeline
run_qc_checks()
Run all generic quality checks on a dataset
run_comparison_checks()
Run all version comparison checks between two dataset snapshots
run_custom_checks()
Run organisation-specific custom checks

Configuration & ingestion

Load config and read dataset files

load_config()
Load and merge dataset configuration
detect_files()
Detect current and previous dataset files
read_dataset()
Read a dataset file into a DuckDB table

Results

Construct and interpret result objects

dq_result()
Construct a data quality result object
overall_status()
Compute the worst status across a list of dq_result objects
infer_col_type()
Infer the logical type of a character column
resolve_col_type()
Resolve the effective type of a column, respecting config overrides
check_missing_rate()
QC-01: Check missing rate per column

Snapshot database & drift analysis

Query snapshot history and compare snapshots over time

read_recent_snapshots()
Read recent snapshot history from the DuckDB database
list_snapshots()
List snapshots available in the database
compare_snapshots()
Compare two snapshots from the DuckDB database

Helpers

Utility functions

col_threshold()
Look up the effective threshold for a check, with per-column fallback