modellerUpdated 2026-05-15

Source Statistics

What this covers

The Source Statistics panel shows table and column profiling data collected from model sources. Tessallite uses these statistics to explain data shape, identify low-cardinality dimensions, and feed optimiser decisions for aggregates and query routing.

This page is for the Source Statistics drawer, not predictive aggregates. Predictive aggregates are recommendations; source statistics are the measured facts those recommendations can use.

What the panel shows

Area	Meaning
Source selector	Chooses the model source whose table statistics are displayed.
Recompute	Runs the profiler again for the selected source.
Table rows and bytes	Approximate size signals used for modelling and optimisation decisions.
Column null rate	Share of profiled rows where the column is null.
Distinct values	Cardinality estimate for filtering, grouping, and aggregate grain choices.
Refresh cadence	How often Tessallite should consider statistics stale for that table.

Recompute options

Use the default sample for ordinary modelling guidance.
Use a custom sample when the table is large and the default is too small to represent skewed data.
Use a full scan only when accuracy matters more than profiling cost.

How to read the output

High distinct counts are useful for identifiers but usually poor aggregate grains. Low distinct counts are often good dimension candidates, especially for status, channel, geography, product family, and calendar attributes.

A high null percentage does not automatically make a column unusable. It means analysts need to understand whether null is meaningful, missing, or caused by incomplete source modelling.

Source Statistics

What this covers

What the panel shows

Recompute options

How to read the output

Related