Ingero Fleet v0.10 live dashboard, A100 and GH200 overlaid. MAD spikes on the top-right panel mark the straggler injections.
eBPF, GPU Debugging, GPU Observability, MLOps

26 Seconds to Find a Straggler: Fleet v0.10 End-to-End on A100 and GH200

Ingero Fleet v0.10 FOSS shipped this week. We ran it end-to-end on two three-node Lambda Cloud clusters, one Ampere, one Grace Hopper, injected a single straggler on each, and measured detection latency: 26 seconds on A100, ~30 seconds on arm64. Same code, same manifests, one wrinkle on GH200.