cohesix

Cohesix is an open-source high-assurance control-plane operating system built on the formally verified seL4 microkernel, designed to keep the trusted computing base intentionally small while enabling deterministic orchestration of edge GPU systems and auditable MLOps. Cohesix is "infrastructure for AGI".

View the Project on GitHub lukeb-aidev/cohesix

Cohesix Benchmarks

0.9.0-beta Benchmark Verdict (As-Built)

Hive-Gateway Worker Capacity (Milestone 25b)

Executive Summary

Benchmark Questions

  1. Are hard worker limits above 1000 truly removed in the real VM/TCP/auth path?
  2. What is the new validated hard capacity limit?
  3. At what worker count does aggressive mixed activity show first degradation?

Test Validity Controls

All reported runs used real end-to-end execution:

  1. QEMU boot success.
  2. TCP reachability and authenticated console preflight success.
  3. Gateway readiness checks (/v1/meta/*, LS /).
  4. Authenticated REST traffic through hive-gateway.

No --mock mode was used for reported results.

Runtime/Config Changes Under Test

Environment

Run Matrix

| Run ID | Purpose | Key Params | Artifact Prefix | | — | — | — | — | | RAMP-1K | Baseline (prior validated cap) | workers=8..1000, intensity=4, duration=8m | docs/bench/m25b_1k_rerun_20260213T233420Z | | FIXED-1K | Baseline hard-cap proof | workers=1000, intensity=1, duration=1m | docs/bench/m25b_1k_rerun_fixed1000_20260213T234240Z | | RAMP-1P5K | New-cap degradation mapping | workers=8..1500, intensity=4, duration=8m | docs/bench/m25b_1p5k_ramp_20260214T020554Z | | FIXED-1P5K | New hard-cap proof | workers=1500, intensity=1, duration=1m | docs/bench/m25b_1p5k_fixed1500_v2_20260214T020432Z | | FIXED-1200-I4 | Aggressive sustained check | workers=1200, intensity=4, duration=2m | docs/bench/m25b_1p5k_fixed1200_i4_20260214T021516Z |

Results

| Metric | RAMP-1K | FIXED-1K | RAMP-1P5K | FIXED-1P5K | FIXED-1200-I4 | | — | — | — | — | — | — | | worker_cap | null | null | null | null | null | | Max workers observed | 938 | 1000 | 1407 | 1500 | 1200 | | Overall ops | 61,293 | 4,222 | 72,223 | 7,356 | 27,548 | | Overall errors | 5 (0.0082%) | 1 (0.0237%) | 166 (0.2298%) | 5 (0.0680%) | 113 (0.4102%) | | Overall p95 latency | 0.0061s | 0.0045s | 0.1012s | 0.0061s | 0.1684s | | First step err_rate >= 1% | none | none | 1034 workers (2.9995%) | none | 1200 workers (1.6412%) |

Degradation Analysis

Capacity Interpretation

Graphs

RAMP-1P5K Worker/Error Graph

FIXED-1P5K Worker/Error Graph

FIXED-1200-I4 Worker/Error Graph

Evidence Index

Repro Commands

# Clean stale benchmark processes first
pkill -f "rest_perf_harness.py|qemu-system-aarch64|hive-gateway --bind" || true

# Ramp to new cap (aggressive profile)
python3 scripts/rest_perf_harness.py \
  --mode simulate \
  --qemu-run /tmp/cohesix-qemu-local-smp.sh \
  --gateway-bin target/debug/hive-gateway \
  --auth-token bootstrap \
  --request-auth-token stage4-rest-token \
  --workers-min 8 --workers-max 1500 \
  --intensity-min 4 --intensity-max 4 \
  --duration-mins 8 --base-rps 0.1 --max-inflight 64 \
  --summary-max-error-lines 2000 \
  --qemu-log logs/bench/m25b_1p5k_ramp.qemu.log \
  --gateway-log logs/bench/m25b_1p5k_ramp.gateway.log \
  --log-prefix m25b_1p5k_ramp

# Fixed hard-cap validation
python3 scripts/rest_perf_harness.py \
  --mode simulate \
  --qemu-run /tmp/cohesix-qemu-local-smp.sh \
  --gateway-bin target/debug/hive-gateway \
  --auth-token bootstrap \
  --request-auth-token stage4-rest-token \
  --workers-min 1500 --workers-max 1500 \
  --intensity-min 1 --intensity-max 1 \
  --duration-mins 1 --base-rps 0.1 --max-inflight 64 \
  --summary-max-error-lines 2000 \
  --qemu-log logs/bench/m25b_1p5k_fixed1500_v2.qemu.log \
  --gateway-log logs/bench/m25b_1p5k_fixed1500_v2.gateway.log \
  --log-prefix m25b_1p5k_fixed1500_v2

# Aggressive fixed-load check
python3 scripts/rest_perf_harness.py \
  --mode simulate \
  --qemu-run /tmp/cohesix-qemu-local-smp.sh \
  --gateway-bin target/debug/hive-gateway \
  --auth-token bootstrap \
  --request-auth-token stage4-rest-token \
  --workers-min 1200 --workers-max 1200 \
  --intensity-min 4 --intensity-max 4 \
  --duration-mins 2 --base-rps 0.1 --max-inflight 64 \
  --summary-max-error-lines 2000 \
  --qemu-log logs/bench/m25b_1p5k_fixed1200_i4.qemu.log \
  --gateway-log logs/bench/m25b_1p5k_fixed1200_i4.gateway.log \
  --log-prefix m25b_1p5k_fixed1200_i4

Large-Telemetry Reliability Gate (Milestone 25f, 0.9.0-beta)

Required Methodology (from docs/TEST_PLAN.md section 6b)

Local 0.9.0-beta Results (latest run per scenario)

| Scenario | Summary Artifact | Ops | Errors | Error Rate | p95 Latency | Error Budget | | — | — | — | — | — | — | — | | telemetry-1mb | logs/rest_bench_20260217T223323Z.summary.json | 7906 | 0 | 0.0000% | 0.0277s | PASS | | telemetry-10mb | logs/rest_bench_20260217T223635Z.summary.json | 7911 | 0 | 0.0000% | 0.0278s | PASS | | telemetry-100mb | logs/rest_bench_20260217T223949Z.summary.json | 2898 | 0 | 0.0000% | 0.0314s | PASS | | telemetry-1gb | logs/rest_bench_20260217T224303Z.summary.json | 487 | 0 | 0.0000% | 0.0317s | PASS |

Each artifact above records:

Repro Commands (mandatory matrix)

python3 scripts/rest_perf_harness.py simulate \
  --rest-url http://127.0.0.1:8080 \
  --no-retries --fast-ramp --scenario telemetry-1mb --error-budget-rate 0.01

python3 scripts/rest_perf_harness.py simulate \
  --rest-url http://127.0.0.1:8080 \
  --no-retries --fast-ramp --scenario telemetry-10mb --error-budget-rate 0.01

python3 scripts/rest_perf_harness.py simulate \
  --rest-url http://127.0.0.1:8080 \
  --no-retries --fast-ramp --scenario telemetry-100mb --error-budget-rate 0.01

python3 scripts/rest_perf_harness.py simulate \
  --rest-url http://127.0.0.1:8080 \
  --no-retries --fast-ramp --scenario telemetry-1gb --error-budget-rate 0.01

Evidence Index (25f)

Release-note corroboration: