Skip to main content

Documentation Index

Fetch the complete documentation index at: https://docs.uselayerup.com/llms.txt

Use this file to discover all available pages before exploring further.

21 — Versioning, release gates & failure modes.

Layerup releases are configuration promotions across a fixed environment ladder, gated by evals, drift, and error budgets. Every plane has a documented failure-mode matrix; failure is a first-class state of the substrate.

21.1 Version domains

DomainVersioningPin location
Ontologyrelease line + branches (§6)agent / tool / mapping
Mappingsemver per mappingtenant config
Toolsemver per signatureagent definition
Promptsemver hashagent / tool
Retrieval corpussnapshot idtool / agent
Modelid + lineage (§12)routing policy
Agentsemver per definitiontenant config
Policypolicy version (§16.6)tenant

21.2 Promotion ladder

Fig. 21.1 — Promotion ladder. Gates run automatically at each step; manual sign-off is required at UAT. Rollback is one operator action.

21.3 Release gate matrix

GateInputsPass conditionBlock action
Schemaartefact diff vs trunksemver matches change class (§6.1)auto-block
Evalcandidate vs baselinebench ≥ pack.min, regression ≤ pack.maxauto-block
Replayrecent runs sampleverdict-delta within noise bandauto-block · operator override possible with note
Adversarialprobe packno regression on probesauto-block
Drifttrailing windowno ≥ 2σ warnoperator alert · block at 3σ
Error budgetSLO burn-rateno fast-burn breachauto-block
Tenant sign-offUAT outcometenant-approvedauto-block
Security reviewmarking change · scope changesecurity primary signsauto-block

21.4 Canary algorithm

  1. Route a small percentage of eligible traffic to the candidate (default 1%).
  2. Observe per-step quality, latency, error budget, and drift sigma in 15-minute windows.
  3. Ramp to 5% → 25% → 100% only if all watched metrics stay within bounds.
  4. Any 2σ deviation pauses the ramp; any 3σ deviation rolls back.
  5. Rollback is content-hash promotion of the prior version; takes effect within one minute.

21.5 Rollback strategy

  • Every promoted artefact records its predecessor’s content hash; rollback is a one-step re-promotion of the predecessor.
  • In-flight runs at rollback continue against their pinned versions; no run is mid-air-upgraded.
  • Rollbacks are typed AuditEvents (config.rollback) on the tenant chain.

21.6 Failure-mode matrix

The platform classifies failures into 16 typed modes. Each mode has a detection rule, a containment path, a fallback, and an audit signature. This list is canonical; tenant-specific modes inherit from these classes.
#ModeDetectionContainmentFallbackAudit
1Hallucinationverifier rule pack · evidence missing for asserted propertyverifier blocks Decision · raises Exceptionhandoff to reviewer queueverify.block.hallucination
2Schema mismatchJSON-Schema validation fails at ingest or tool boundaryroute to quarantine queuetyped Exception · operator triagedata.ingest.schema_drift
3Stale dataSoR version observed < current at commit timerefuse commit · re-resolvere-stage with new pre-imageaction.stale_target
4Source-of-record downtimeadapter error rate window · circuit breakeropen breaker · pause adapterflat-file path or queue retryintegration.source_down
5Ambiguous evidenceconflicting EvidenceSpans on same Propertyverifier emits Exceptionhandoff with rationaleevidence.ambiguous
6Conflicting evidencecross-document contradiction detectedverifier blocks · Exceptionhandoff · request operator clarificationevidence.conflict
7Low-confidence extractionextractor confidence < agent thresholdverifier classifies as warn or blockhandoff or alternate extractorextract.low_confidence
8Model driftdrift sigma breach (§13.4)demote to shadow / demotedroute via fallback modelmodel.drift.breach
9Privilege escalation attemptscope check fails on dispatchPDP denies · principal flaggedraise security alert · lock if patternpolicy.deny.privilege_escalation
10Unauthorised tool invocationmarking or purpose mismatchPDP deniesoperator alert · audit onlypolicy.deny.purpose
11Duplicate effectidempotency-key collision with different payloadrefuse second commit · raise Exceptionoperator review · explicit supersession onlyaction.duplicate
12Adapter writeback failureadapter returns error after exhausted retriesaction moves to failedcompensation or replay after fixaction.adapter_error
13RAG poisoningretrieved spans inconsistent with ontology types · adversarial probe hitverifier blocks · corpus quarantinedfallback corpus · re-indexretrieval.poisoned
14Prompt injectioninjection-detector classifier on input · canary tokensrefuse to dispatch reasoning stephandoff · reviewer triagereasoning.injection
15Authority breachauthority predicate fails at PDP or commitrefuse stage / refuse commithandoff to higher authorityauthority.breach
16Audit chain breachchain verifier finds hash mismatchseal chain at breach pointstart new chain · operator + tenant security incidentaudit.tamper_detected

21.7 Incident response posture

  • Severity bands (S0–S3) per failure mode with documented response times.
  • Tenant security primary notified on S0 / S1 and on any audit-chain breach within 15 minutes.
  • Postmortems include contributory drift / eval signals and any release that crossed the gate.
  • Postmortems become typed AuditEvents (incident.summary) on the tenant chain.

21.8 Change freeze

During declared change freezes (tenant-defined windows or platform-declared post-incident windows), promotion is auto-blocked across all domains except security-critical patches. Freeze status is visible on the tenant config dashboard.