40 Standard Data Profiles
  • 20 Mar 2024
  • 3 Minutes to read
  • PDF

40 Standard Data Profiles

  • PDF

Article Summary

Overview

Validatar ships with 40 data profiles already configured. Each one can be modified at both the management level and definition level within a database engine.

  1.  Record Count
    • Table profile
    • Returns the record count of the table.
    • Result Format: Numeric Value
  2. Total Data MB
    • Table profile
    • Returns the total size of the table in megabytes.
    • Result Format: Numeric Value
  3. Distinct Count
    • Column profile
    • Returns the distinct number of records in the column.
    • Result Format: Numeric Value
  4. Distinct Percent
    • Column profile
    • Prerequisites: Distinct Count, Record Count
    • Returns the distinct count divided by the total number of records.
    • Result Format: Percent Value
  5. Most Common Value
    • Column profile
    • Returns the most frequently used value.
    • Result Format: String
  6. Most Common Count
    • Column profile
    • Returns the most frequently used number.
    • Result Format: Numeric Value
  7. Minimum (Numeric)
    • Column profile
    • Returns the minimum value of a numeric column.
    • Result Format: Numeric Value
  8. Maximum (Numeric)
    • Column profile
    • Returns the maximum value of a numeric column.
    • Result Format: Numeric Value
  9. Mean (Numeric)
    • Column profile
    • Returns the average value of a numeric column.
    • Result Format: Numeric Value
  10. Median (Numeric)
    • Column profile
    • Returns the median value of a numeric column.
    • Result Format: Numeric Value
  11. Lower Quartile
    • Column profile
    • Returns the middle value between the lowest value and the median of the column.
    • Result Format: Numeric Value
  12. Upper Quartile
    • Column profile
    • Returns the middle value between the median and the highest value of the column.
    • Result Format: Numeric Value
  13. Minimum (String)
    • Column profile
    • Returns the minimum value of a string column.
    • Result Format: String Value
  14. Maximum (String)
    • Column profile
    • Returns the maximum value of a string column.
    • Result Format: String Value
  15. Standard Deviation
    • Column profile
    • Returns the measure of variance in a numeric column.
    • Result Format: Numeric Value
  16. Max Length
    • Column profile
    • Returns the maximum length of a value in a string column.
    • Result Format: String Value
  17. Min Length
    • Column profile
    • Returns the minimum length of a value in a string column.
    • Result Format: String Value
  18. Mean Length
    • Column profile
    • Returns the average length of a value in a string column.
    • Result Format: String Value
  19. Length Distribution
    • Column profile
    • Returns a list of lengths and their count of occurrences in a string column.
    • Result Format: Numeric Array
  20. Null Count
    • Column profile
    • Returns the count of null values in a column.
    • Result Format: Numeric Value
  21. Null Percent
    • Column profile
    • Returns the count of null values divided by the total record count in a column.
    • Result Format: Percent Value
  22. Blank Count
    • Column profile
    • Returns the count of blank values in a column.
    • Result Format: Numeric Value
  23. Blank Percent
    • Column profile
    • Returns the count of blank values divided by the total record count in a column.
    • Result Format: Percent Value
  24. Numeric Count
    • Column profile
    • Returns the count of numeric values in a string column.
    • Result Format: Numeric Value
  25. Numeric Percent
    • Column profile
    • Returns the count of numeric values divided by the total record count in a string column.
    • Result Format: Percent Value
  26. Zero Count
    • Column profile
    • Returns the count of values that equal 0 in a numeric column.
    • Result Format: Numeric Value
  27. Zero Percent
    • Column profile
    • Returns the count of values that equal 0 divided by the total record count in a numeric column.
    • Result Format: Percent Value
  28. Negative Count
    • Column profile
    • Returns the count of values less than 0 in a numeric column.
    • Result Format: Numeric Value
  29. Negative Percent
    • Column profile
    • Returns the count of values less than 0 divided by the total record count in a numeric column.
    • Result Format: Percent Value
  30. Minimum (Date)
    • Column profile
    • Returns the earliest date in a column.
    • Result Format: Date/Time Value
  31. Maximum (Date)
    • Column profile
    • Returns the most recent date in a column.
    • Result Format: Date/Time Value
  32. Longest Value
    • Column profile
    • Returns the value with the longest length in a string column.
    • Result Format: String Value
  33. Shortest Value
    • Column profile
    • Returns the value with the shortest length in a string column.
    • Result Format: String Value
  34. Distribution (String)
    • Column profile
    • Returns a list of values and their count of occurrences in a string column.
    • Result Format: String Array
  35. Distribution (Numeric)
    • Column profile
    • Returns a list of values and their count of occurrences in a numeric column.
    • Result Format: Numeric Array
  36. Top 10 Values
    • Column profile
    • Returns the 10 most common values in a column.
    • Result Format: String Array
  37. Bottom 10 Values
    • Column profile
    • Returns the 10 least common values in a column.
    • Result Format: String Array
  38. Binned (Numeric)
    • Column profile
    • Returns the grouping of values into bins for a numeric column.
    • Result Format: Numeric Array
  39. Year Distribution
    • Column profile
    • Returns a list of years and their count of occurrences in a Date/Time column.
    • Result Format: String Array
  40. Year Month Distribution
    • Column profile
    • Returns a list of years and months and their count of occurrences in a Date/Time column.
    • Result Format: String Array

Was this article helpful?

What's Next