Documentation Index
Fetch the complete documentation index at: https://openmetadata-feat-feat-2mbfixtestexui.mintlify.app/llms.txt
Use this file to discover all available pages before exploring further.
Column-Level Test Definitions
Column-level tests validate properties of specific columns. For table-level tests and importing instructions, see the Test Definitions Reference.Column-Level Tests
ColumnValuesToBeNotNull
Validates that a column contains no null or missing values. Parameters:column(str, required): Name of the column to validate
- Ensure required fields are populated
- Validate data completeness
- Enforce NOT NULL constraints
ColumnValuesToBeUnique
Validates that all values in a column are unique with no duplicates. Parameters:column(str, required): Name of the column to validate
- Validate primary keys
- Ensure unique identifiers
- Detect duplicate records
ColumnValuesToBeInSet
Validates that all values in a column belong to a specified set. Parameters:column(str, required): Name of the column to validateallowed_values(list[str], required): List of acceptable values
- Validate enum values
- Enforce categorical constraints
- Validate lookup values
ColumnValuesToBeNotInSet
Validates that column values do not contain any forbidden values. Parameters:column(str, required): Name of the column to validateforbidden_values(list[str], required): List of values that must not appear
- Detect test data in production
- Blacklist invalid values
- Filter out placeholder values
ColumnValuesToMatchRegex
Validates that column values match a specified regular expression pattern. Parameters:column(str, required): Name of the column to validateregex(str, required): Regular expression pattern
- Validate data format consistency
- Ensure pattern compliance
- Detect malformed data
ColumnValuesToNotMatchRegex
Validates that column values do not match a forbidden regular expression pattern. Parameters:column(str, required): Name of the column to validateregex(str, required): Regular expression pattern that values must NOT match
- Detect test data patterns
- Prevent specific formats
- Identify security risks
ColumnValuesToBeBetween
Validates that all values in a column fall within a specified numeric range. Parameters:column(str, required): Name of the column to validatemin_value(float, optional): Minimum acceptable valuemax_value(float, optional): Maximum acceptable value
- Validate numeric constraints
- Detect outliers
- Ensure value ranges
ColumnValueMaxToBeBetween
Validates that the maximum value in a column falls within a specified range. Parameters:column(str, required): Name of the column to validatemin_value(float, optional): Minimum acceptable maximum valuemax_value(float, optional): Maximum acceptable maximum value
- Monitor data ranges
- Detect upper outliers
- Validate maximum constraints
ColumnValueMinToBeBetween
Validates that the minimum value in a column falls within a specified range. Parameters:column(str, required): Name of the column to validatemin_value(float, optional): Minimum acceptable minimum valuemax_value(float, optional): Maximum acceptable minimum value
- Monitor lower bounds
- Detect lower outliers
- Validate minimum constraints
ColumnValueMeanToBeBetween
Validates that the mean (average) value falls within a specified range. Parameters:column(str, required): Name of the column to validatemin_value(float, optional): Minimum acceptable mean valuemax_value(float, optional): Maximum acceptable mean value
- Statistical validation
- Detect data drift
- Monitor averages
ColumnValueMedianToBeBetween
Validates that the median value falls within a specified range. Parameters:column(str, required): Name of the column to validatemin_value(float, optional): Minimum acceptable median valuemax_value(float, optional): Maximum acceptable median value
- Robust central tendency checks
- Detect skewed distributions
- Monitor typical values
ColumnValueStdDevToBeBetween
Validates that the standard deviation falls within a specified range. Parameters:column(str, required): Name of the column to validatemin_value(float, optional): Minimum acceptable standard deviationmax_value(float, optional): Maximum acceptable standard deviation
- Detect unexpected variability
- Monitor data consistency
- Validate distribution stability
ColumnValuesSumToBeBetween
Validates that the sum of all values falls within a specified range. Parameters:column(str, required): Name of the column to validatemin_value(float, optional): Minimum acceptable summax_value(float, optional): Maximum acceptable sum
- Validate totals
- Monitor aggregates
- Detect unexpected volumes
ColumnValuesMissingCount
Validates the count of missing or null values. Parameters:column(str, required): Name of the column to validatemissing_count_value(int, optional): Expected number of missing valuesmissing_value_match(list[str], optional): Additional strings to treat as missing
- Monitor data completeness
- Track missing data patterns
- Validate optional fields
ColumnValueLengthsToBeBetween
Validates that string lengths fall within a specified range. Parameters:column(str, required): Name of the column to validatemin_length(int, optional): Minimum acceptable string lengthmax_length(int, optional): Maximum acceptable string length
- Validate string constraints
- Prevent truncation
- Ensure format compliance
ColumnValuesToBeAtExpectedLocation
Validates that a specific value appears at an expected row position. Parameters:column(str, required): Name of the column to validateexpected_value(str, required): The exact value expectedrow_index(int, optional): Zero-based row position (default: 0)
- Validate sorted data
- Check ordered results
- Verify specific positions
Customizing Tests
All tests support customization through fluent methods:Next Steps
- Learn how to use these tests with TestRunner
- Apply tests to DataFrame Validation
- Explore Advanced Usage patterns