Spec Syntax

A spec file is plain Markdown. This document builds up the authoring surface from simple to complex: headings, shell blocks, doctest blocks, variables, check tables, hooks, and frontmatter.

What Makes a File a Spec

A file is a spec because of two things, not its extension:

It is reachable. specdown run starts at the configured entry and follows Markdown links to .md files; specdown trace scans every .md file under the project. Only .md targets are followed — .markdown and extension-less links are skipped.
It contains executable blocks — run:<target> blocks, check tables, or inline expect:/check: — which become its test cases. A reachable file with none simply contributes zero cases.

The recommended extension is .md. .spec.md is a legacy convention that behaves identically; the extension itself carries no meaning. A plain .md spec with an executable block runs just like a .spec.md one:

Scaffold a plain-.md project with one executable block.

rm -rf what-is-a-spec && mkdir -p what-is-a-spec/specs
printf '# Index\n\n- [Feature](feature.md)\n' > what-is-a-spec/specs/index.md
BT=$(printf '\140\140\140')
printf '%s\n' '# Feature' '' "$BT"'run:shell' '$ echo hi' 'hi' "$BT" > what-is-a-spec/specs/feature.md
printf '{"entry":"specs/index.md","adapters":[]}' > what-is-a-spec/specdown.json

run:shell

$ cd what-is-a-spec && specdown run 2>&1 | grep '^PASS' | sed 's/ in [0-9]*ms//'

PASS 2 spec(s), 1 case(s)

run:shell

Headings and Prose

Heading hierarchy (#, ##, ###, ...) is converted into a test suite hierarchy. Prose paragraphs are preserved in the HTML report but are not execution targets.

Shell Blocks

run:shell fenced blocks are executable — specdown runs them and checks for zero exit.

The parser must recognize run as the executable block kind.

info	kind	target
run:shell	run	shell

Other prefixes (e.g. verify:, test:) are not recognized and produce plain, non-executable code blocks. A spec containing only unrecognized blocks has zero cases. Unrecognized prefixes emit a warning to stderr so typos like runn:shell are caught early.

To suppress warnings for specific prefixes, add ignorePrefixes to specdown.json. Plain info strings without a colon (e.g. json, go, python) never warn.

Any executable block can be marked with !fail to indicate that failure is the expected outcome. When the adapter reports failure as expected, the case is rendered identically to a regular failure — red styling, failure stats, and red dot markers in the ToC all apply. The only difference is the exit code: a spec run exits 0 when expected failures are the only failures present. If the adapter unexpectedly succeeds, the case is a real failure. !fail blocks do not support variable capture (-> $name).

false

exit status 1

run:shell

Summary Lines

If the first line of a run: block is a comment, specdown extracts it as the block's summary line.

Consecutive comment lines at the start of a block are joined with a space into a single summary. The summary ends at the first non-comment line or blank line.

In the HTML report, blocks with a summary are rendered collapsed: only the summary text and pass/fail indicator are visible. A > marker on the right side lets readers expand the block to see the full code. Failed blocks auto-expand so failures are never hidden.

This makes specs readable for non-technical stakeholders without removing any detail for developers.

The comment prefixes recognized are # , // , and -- . Only the text after the prefix becomes the summary; the prefix itself and leading/trailing whitespace are stripped.

Blocks with doctest content ($ lines) never get summaries — they use a different rendering model with command/output pairs.

Here is an example: the following block's first line is a comment, so the report will render it collapsed with the summary "Demonstrate summary line" and a pass/fail indicator.

Demonstrate summary line

test 1 -eq 1

run:shell

Multiple comment lines are joined into one summary:

First part of the summary and the second part

test 1 -eq 1

run:shell

A block without a leading comment renders normally (not collapsed):

test 1 -eq 1

run:shell

Doctest Blocks

A run:<target> block whose content starts with $ lines is auto-detected as doctest-style. Lines starting with $ are commands; subsequent lines until the next $ or end of block are the expected output.

Commands are executed sequentially. On the first output mismatch, the block fails with expected and actual values for diffing. Commands without expected output lines are executed but only checked for exit status.

$ echo hello

hello

$ echo one two three

one two three

run:shell

A doctest-style block with no expected output still verifies the command succeeds.

$ true

run:shell

Multi-line expected output is matched exactly.

$ printf 'line1\nline2\nline3'

line1 line2 line3

run:shell

Arithmetic and pipelines work as expected.

$ echo $((2 + 3))

$ seq 3 | tr '\n' '+' | sed 's/+$//'

1+2+3

run:shell

Commands that produce no output show only the prompt line.

$ mkdir -p doctest-nooutput

$ touch doctest-nooutput/file.txt

$ test -f doctest-nooutput/file.txt

run:shell

Wildcard Matching

A line containing exactly ... in the expected output matches zero or more lines in the actual output. To match a literal ... line, escape it as \.... This is useful when output contains timestamps, PIDs, temporary paths, or other values that change between runs.

A wildcard in the middle skips variable lines:

$ echo hello && date && echo goodbye

hello

... (1 line)

Tue Jun 9 02:54:28 UTC 2026

goodbye

run:shell

Multiple wildcards can appear in a single expected block:

$ printf 'a\nb\nc\nd\ne'

... (1 line)

run:shell

Escaped wildcard matches literal ...:

$ echo '...'

...

run:shell

When no ... line is present, matching is exact (backward compatible).

$ echo hello

hello

world (expected)

run:shell

A doctest block with !fail intentionally shows the wrong expected output. The !fail modifier makes the mismatch count as a pass. On failure, passed steps render in green and the failing step renders in red with the expected value below.

$ echo hello

hello

goodbye (expected)

run:shell

When a case fails, remaining cases continue. Bindings from failed cases are discarded.

Variable Capture

A block can capture its output into one or more variables with -> $varName. Multiple captures use comma-separated names: -> $var1, $var2. Each output line is bound to the corresponding capture name in order.

info	kind	target
run:shell -> $id	run	shell

Variables captured this way are referenced in subsequent blocks and tables using ${variableName}.

When the adapter returns structured (non-string) output, the value is stored as-is and fields are accessible via dot-path syntax: ${result.field}. Nested access works to arbitrary depth: ${result.outer.inner}. Accessing a missing key or indexing into a non-object value is a compile-time error.

Scoping rules

Variables from parent sections are readable in child sections
Sibling sections at the same depth can share variables (in document order, only previously captured values)
An unresolved variable is a compile-time error

The heading tree enforces a safety property: variables never leak upward from child to parent sections.

module varscope

sig Section {
  parent: lone Section
}

sig Var {
  definedIn: one Section
}

-- heading hierarchy is a tree
fact tree {
  no s: Section | s in s.^parent
}

-- a variable is visible in its defining section and all descendants
pred visible[v: Var, s: Section] {
  v.definedIn in (s + s.^parent)
}

-- variables from child sections are never visible in ancestors
assert noUpwardLeak {
  all v: Var, s: Section |
    v.definedIn in s.^(~parent) implies not visible[v, s]
}

check noUpwardLeak for 6

pred sanityCheck {}
run sanityCheck {} for 6

alloy:model(varscope)

Variables captured in a parent section are available in child sections.

printf 'from-parent'

parentVar=from-parent

run:shell

Child section

test "from-parent" = "from-parent"

$parentVar=from-parent

run:shell

When output has fewer lines than capture names, excess captures receive empty string.

Variable escaping

To output a literal ${...}, escape it with a backslash: \${literal}.

printf 'ok'

$parentVar=from-parent

escapeTest=ok

run:shell

test "ok" = "ok"

$escapeTest=ok, $parentVar=from-parent

run:shell

Raw blocks

When a code block contains many ${...} patterns that are shell variables (not specdown bindings), escaping each one is tedious. The !raw modifier skips variable interpolation for the entire block:

```run:shell !raw
for f in *.md; do
  count=$(wc -l < "${f}")
  echo "${f}: ${count} lines"
done
```


Without `!raw`, every `${f}` and `${count}` would need to be escaped as
`\${f}` and `\${count}`. The `!raw` modifier can be combined with
captures (`-> $var`) and `!fail`.

## Check Tables

A Markdown table becomes executable when preceded by a check directive.

The directive is a blockquote of the form `> check:name`.
The first row is the header. Each subsequent row is an independent test case.
Check names are defined by the adapter, not the core.

### Cell escaping

Table cells support escape sequences that are processed by the core
before sending to the adapter.

| Sequence | Meaning |
|----------|---------|
| `\n` | newline |
| `\|` | literal pipe |
| `\\` | literal backslash |

Adapters always receive unescaped values.
The HTML report also unescapes cells, rendering `\n` as visible line breaks.

| input | expected |
| --- | --- |
| hello | hello |
| line1\nline2 | line1\nline2 |
| a\\\|b | a\\\|b |

### Check parameters

Checks can accept parameters via `(key=value)` syntax.
Parameters are passed to the adapter as `checkParams` in the `assert` message.
Multiple parameters are comma-separated: `> check:name(key1=val1, key2=val2)`.

Parameters let one check definition handle many scenarios.
Instead of registering separate checks for each endpoint or mode,
register a single generic check and pass the differences as parameters:

```markdown
> check:api(endpoint=/api/users, mode=object)
| field | expected |
| name  | alice    |

> check:api(endpoint=/api/orders, mode=array, count=2)
| index | status  |
| 0     | SUCCESS |
| 1     | PENDING |
```

The adapter reads checkParams.endpoint and checkParams.mode to decide how to fetch and validate, eliminating per-endpoint check code.

When a table column name matches a directive parameter name, the table cell value takes precedence over the directive value. This allows directive parameters to act as defaults that individual rows can override.

Parameterized check call

A check directive with parameters but no following table creates a single assertion case. The adapter receives an assert message with the check name, checkParams populated, and empty columns/cells.

This is useful for inline assertions that don't warrant a full table:

> check:check-user(field=plan, expected=STANDARD)

A check directive without parameters and without a table is a compile-time error.

Inline Elements

Prose text can contain inline executable elements embedded in backtick code spans. These are evaluated during the spec run and rendered with pass/fail status in the HTML report.

Prose variable rendering

Variables captured by earlier blocks can appear in prose text as ${name}. In the HTML report, resolved variables are displayed with their actual values highlighted in green. Referencing an undefined variable in prose is a compile-time error, just like in executable blocks.

The greeting is ${greeting} and it was captured successfully.

Inline expect

A backtick code span of the form `expect: EXPR == VALUE` creates an inline equality assertion. Both sides support ${variable} substitution. It counts as a test case and appears green (pass) or red (fail) in the HTML report.

The count is `expect: ${count} == 3` items.

For example, one plus one is 2.

When the actual value does not match the expected value, the inline assertion fails and the report shows both the actual value and the expected value.

Adding !fail at the end marks the assertion as an expected failure. The inline value renders identically to a regular failure — red background, red dot marker, and failure stats all apply. The only difference is that expected failures do not cause a non-zero exit code.

This deliberately wrong assertion is an expected failure: hellogoodbye.

Inline check call

A backtick code span of the form `check:name(key=value)` creates an inline check assertion. It sends an assert message to the adapter with the check name and checkParams populated, with empty columns/cells.

The file `check:file-check(path=/tmp/data.txt, exists=yes)` was created.

When the adapter returns an actual value in its passed response, the inline check displays the actual value as the main content with the check name shown as a small ruby annotation above it.

For example, a + b is 3jq.

Multiple inline elements can appear in the same paragraph.

Other Block Prefixes

The target in run:<target> names an adapter. The built-in shell adapter works with no configuration; custom adapters are registered in specdown.json.

Prefix	Meaning
`run:<target>`	Executable block dispatched to an adapter
`alloy:model(<name>)`	Alloy model definition (see Alloy)
`alloy:ref(<model>#<assertion>)`	Alloy model reference (see Alloy)
`mermaid`	Diagram block rendered by Mermaid (see Report)

Diagram Blocks

A fenced code block with mermaid as the info string is a diagram block. Diagram blocks are non-executable — they have no test case and are not dispatched to any adapter. In the HTML report, the content is rendered as an interactive diagram via the Mermaid library.

The info string match is case-sensitive: mermaid is recognized, but Mermaid or MERMAID falls through to plain code rendering.

```mermaid
graph LR
    A[Parse] --> B[Plan]
    B --> C[Execute]
    C --> D[Report]
```


## Setup and Teardown Hooks

Hooks run adapter commands at section boundaries.
A hook directive must be followed by an executable code block.

| Directive | Meaning |
|-----------|---------|
| `> setup` | Run once before the first case in the heading subtree |
| `> teardown` | Run once after the last case in the heading subtree |
| `> setup:each` | Run before the first case of each immediate child section |
| `> teardown:each` | Run after the last case of each immediate child section |

Hooks are not counted as test cases. Their results do not appear in the
case list, but a hook failure marks the document as failed.

### Hook variable visibility

Hooks can read variables captured by blocks in parent sections.
When a hook executes, it receives all bindings visible at the
hook's heading path — the same scoping rules that apply to
regular blocks apply to hooks.

A setup or teardown directive followed by an executable code block must parse successfully.

```run:shell
# Verify spec with setup:each hook parses successfully
printf '# Hook Test\n\n## Group\n\n> setup:each\n```run:shell\necho init\n```\n\n### Scenario A\n\nSome prose.\n' > hook-good.spec.md
printf '# T\n\n- [Hook](hook-good.spec.md)\n' > index.md
cat <<'CFG' > hook-good-cfg.json
{"entry":"index.md","adapters":[{"name":"s","command":["true"],"blocks":["run:shell"]}]}
CFG
specdown run -config hook-good-cfg.json -dry-run 2>&1
```

Frontmatter

An optional YAML frontmatter can be placed at the top of a spec file.

Key	Description
`timeout`	Per-case execution time limit in milliseconds. Overrides `defaultTimeoutMsec` from config. `0` disables the time limit
`type`	Document type for traceability (e.g. `spec`, `goal`, `feature`)
`workdir`	Working directory for all shell blocks, relative to the spec file's location. Created automatically if it does not exist

If frontmatter is absent, the global defaultTimeoutMsec from specdown.json applies (default: 30 seconds).

A spec with a timeout must still pass when the adapter responds quickly.

Verify spec with YAML frontmatter timeout parses successfully

cat <<'SPEC' > timeout.spec.md
---
timeout: 5000
---

# Timeout Test

## Quick

A simple command that completes well within the timeout.
SPEC
printf '# T\n\n- [Timeout](timeout.spec.md)\n' > index.md
cat <<'CFG' > timeout-cfg.json
{"entry":"index.md","adapters":[]}
CFG
specdown run -config timeout-cfg.json -dry-run 2>&1

run:shell

A spec with a workdir frontmatter runs all shell blocks inside that directory.

Verify spec with workdir frontmatter executes in the correct directory

bt=$(printf '\x60\x60\x60')
pw=$(printf '\x24PWD')
printf '%s\n' '---' 'workdir: mywork' '---' '' '# Workdir Test' '' '## Cwd' '' "$bt""run:shell" "basename \"$pw\"" "$bt" '' '> mywork' > workdir.spec.md
printf '# W\n\n- [Workdir](workdir.spec.md)\n' > wd-index.spec.md
printf '{"entry":"wd-index.spec.md","adapters":[]}\n' > workdir-cfg.json
specdown run -config workdir-cfg.json 2>&1

run:shell