An experiment is a recurring run of one or more components (predictors, surprise finding, column stats, custom analyses) against a dataset or view. Experiments are how recurring analysis happens in Summand — instead of remembering to re-run a notebook, you wire up an experiment and it runs forever.Documentation Index
Fetch the complete documentation index at: https://docs.summand.com/llms.txt
Use this file to discover all available pages before exploring further.
Anatomy
| Field | Purpose |
|---|---|
experimentId | Stable identifier. |
name / description | Human-readable. |
datasetId | The source. Required. |
viewId | Optional. If set, the experiment runs against the view rather than the raw dataset. |
components[] | List of component IDs to run. At least one. |
inputs | A dict keyed by ComponentInput.name — typed values supplied to each component. |
schedulePreset / cronExpression | When to run. |
enabled | Pause without deleting. |
runCount, lastRunAt, recentRuns[] | Run history (last 3 surfaced inline; full history under Run history). |
Schedule options
Six presets cover the common cases:- Daily at 3 AM (
cron(0 3 * * ? *)) - Daily at 6 AM
- Every 6 hours
- Every 12 hours
- Weekly Monday 3 AM
- Weekly Sunday 3 AM
Components
A component is a unit of analysis with declared inputs and outputs. The component catalog defines what’s available; today it includes:- Predictors — fit an EBM, score the source, write feature importance and shape functions.
- Surprise finding — flag rows that disagree with predictions; rank by confidence.
- Column stats — per-column distributions, missingness, cardinality, correlations.
- Semantic layer pieces — UMAP, feature metadata, and similar internal artifacts.
Sources: dataset vs. view
You can target either a dataset or a view:- Dataset — components run on the curated Parquet for that dataset. The simplest case.
- View — components run on the SQL view’s output. Use this when you need to filter, join, or aggregate first, or when you want experiments to share a common definition (define the cohort once as a view, point multiple experiments at it).
Outputs
Each run writes:- A new semantic-layer version for the source dataset.
- Component outputs to
summand-task-outputskeyed by run ID. - A row in Run history with status, duration, and any error.
Run history
The experiment detail page shows the most recent runs with status (SUCCEEDED / FAILED / IN_PROGRESS), duration, and (on failure) the error message. Older runs paginate further back.
Failed runs are kept — Summand doesn’t auto-retry. You can:
- Manually re-run from the experiment detail page (a one-shot trigger; doesn’t change the schedule).
- Edit and re-save to fix the configuration; the next scheduled tick picks up the change.
- Pause if the underlying source is broken upstream and you don’t want to spam failures.