Pitwall Sprint — Trustable AI Racing Coach¶

Prove that an on-device AI system can be trusted at 130 mph.

This is the engineering documentation for the Trustable AI racing coach sprint (April–May 2026). The system coaches drivers in real time at Sonoma Raceway using a three-tier, fully on-device architecture — all inference runs on the Pixel 10 with no cloud dependency during driving.

Current Status (May 2, 2026)¶

Component	Status	Key Metric
VBO parser + data pipeline	✅ Done	183 files parsed, 535K frames, 14.9 hours
Track auto-builder	✅ Done	3 tracks: Sonoma (11 corners), Track 2 (9), Track 8 (11)
Data analysis + documentation	✅ Done	52 hot lap sessions profiled, 6 data docs written
LSTM sequence predictor (v3)	✅ Done	Speed: 3.3 km/h MAE, Brake: 2.7 bar MAE (unseen track, 1s horizon)
Sonic model v2 (LSTM-driven)	✅ Done	Delta-based coaching cues tested on Sonoma replay
Python HTTP bridge	✅ Done	`src/pitwall/__main__.py` :8765 — 56 endpoints, DuckDB-backed, CAN ingest
Rally-style coach engine	✅ Done	`RuleCoach` + `LitertCoach` (Gemma 4 E2B via LiteRT-LM), Bentley pedagogy, T-Rod voice
ADK multi-agent paddock backend	✅ Done	23 agents, 21 tools, `PitwallOrchestrator`, DuckDB tracing (ADR-019–021, phase-2 expansion 2026-05-28)
CAN pipeline	✅ Done	`python-can` + `cantools` DBC decoding, USB-CAN adapters (CANable Pro / Macchina M2)
Gold Standard (AJ) reference lap	✅ Done	Per-corner grading, time-loss decomposition, A-F scorecard
Test suite	✅ Done	358 tests passing, 51-assertion smoke test on 8273-frame VBO
Vue PWA (pitwall-web)	Design-only	38 screens specced in `docs/vue/`, implementation not started
Gemma 4 E2B sideload	Pending	`gemma-4-E2B-it.litertlm` to Pixel 10
Field test at Sonoma	May 23	—

What We're Building¶

A single Pixel 10 device replaces the laptop-in-footwell setup from V1. It acts as edge compute (Tensor G5 NPU), audio output, and CAN bus reader — coaching the driver through Pixel Earbuds while running the full analytics pipeline on-device via Termux.

graph TB
    subgraph Car Hardware
        RL[Racelogic Mini<br/>10Hz GPS + IMU]
        CAN[USB-CAN Adapter<br/>CANable Pro / Macchina M2<br/>OBD-II: brake, throttle, RPM]
    end

    subgraph "Pixel 10 (Termux)"
        subgraph "Hot Path (<50 ms)"
            SONIC[sonic_model + RuleCoach<br/>canonical phrase library<br/>no LLM, reflexive cues]
        end
        subgraph "Warm Path (2–4 s)"
            LITERT[LitertCoach<br/>brief / debrief<br/>POST → LocalLLM]
        end
        subgraph "Paddock Path (2–15 s)"
            ADK[18 ADK Agents<br/>Gemma 4 E4B via LocalLLM<br/>briefings, debriefs, Q&A]
        end
        BRIDGE[Flask Bridge :8765<br/>56 endpoints]
        DUCK[(DuckDB<br/>telemetry · laps · coaching_notes<br/>agent_traces · conversations)]
        ARB[CoachArbiter<br/>P3 safety → P2 technique → P1 strategy<br/>3 s cooldown · 5 s stale expiry]
    end

    subgraph Driver
        EARBUDS[Pixel Earbuds<br/>audio-first coaching]
    end

    RL --> BRIDGE
    CAN --> BRIDGE
    BRIDGE --> DUCK
    BRIDGE --> SONIC & LITERT & ADK
    SONIC --> ARB
    LITERT --> ARB
    ARB --> EARBUDS

Key Dates¶

Date	Milestone
April 8	Technical kickoff
April 29	Architecture review
May 23	Field test at Sonoma Raceway
May 30	Sprint wrap + documentation

Three-Tier Coaching Architecture¶

Tier	Engine	Latency	When	What
🔴 Hot	`sonic_model` + `RuleCoach`	<50 ms	Every frame (10 Hz)	Reflexive tone cues, threshold alerts, canonical pace notes
🟡 Warm	`LitertCoach` brief / debrief over LocalLLM	2–4 s brief / 8–15 s debrief	Off-track only	Pre-session brief, post-session debrief, Bentley pedagogy, T-Rod voice
🟢 Paddock	18 ADK agents (Gemma 4 E4B via LocalLLM)	2–15 s	Off-track only	Pre-briefs, post-session debriefs, multi-turn Q&A

Every LLM call — warm and paddock — goes to LocalLLM over local HTTP (ADR-024 for paddock, ADR-025 for warm; both superseding ADR-022). LocalLLM is a sibling Apache-2.0 Android APK (github.com/mlnomadpy/localllm) that hosts LiteRT-LM and exposes an OpenAI-compatible HTTP server on 127.0.0.1:8099/v1. As of 2026-05-28 it's the sole transport on both tiers — google-adk and litellm are base dependencies of apps/edge-daemon, the in-process litert_lm.Engine warm path was retired, and the PITWALL_ADK_BACKEND / dual-transport selectors are gone. The same endpoint also covers dev workstations running Ollama / LM Studio / llama.cpp / vLLM — just point PITWALL_ADK_OPENAI_URL at them. The paddock tier uses LiteLlm (litellm); the warm path uses stdlib urllib.request. Both speak OpenAI chat.completions against the same URL. No hosted API is ever dialled.

What's Different from Pitwall Open Source¶

Pitwall (Open Source)	Sprint (Google Edition)
Commodity hardware ($40–230)	Pro hardware: Racelogic Mini + USB-CAN + Pixel 10
Hot path: hardcoded rules engine	Hot path: canonical phrase library + sonic model (<50 ms)
Cold path: Gemini API via SSE	Warm path: Gemma 4 E2B via LocalLLM (HTTP → `127.0.0.1:8099`, 2–4 s brief)
No paddock intelligence	Paddock: 18 ADK agents (Gemma 4 E4B, briefings, debriefs, Q&A)
Generic coaching rules	Ross Bentley Pedagogical Vector Retrieval (structured curriculum)
Driver's personal best as baseline	Gold Standard: AJ's pro lap + T-Rod's human coaching audio
Single-user, single-tier	3 pods: Beginner / Intermediate / Pro with tuned personas
Laptop + phone + tablet + dongle	Single device: Pixel 10 (compute + CAN + audio)

What we keep from Pitwall (better than the original V1 prototype):

Confidence-annotated telemetry frame (ADR-001)
Message arbiter with priority + conflict resolution (ADR-002)
Sensor fusion engine for Racelogic + CAN (ADR-006)
Event-sourced driver profile across sessions (ADR-007)
Rule / pedagogical vector regression testing (ADR-008)
Graceful degradation protocol (ADR-009)

What we built and validated (new):

LSTM v3 sequence predictor — 3.3 km/h speed MAE at 1 second on an unseen track (272K params, 1.1 MB)
Auto track builder — GPS curvature → corner definitions. 3 tracks generated, 31 total corners with brake zones
Sonic model v2 — LSTM delta drives continuous audio cues. 78% of frames have active cues, 22% silence
Corner grader + time-loss decomposition — A-F grades per corner, weighted by Sonoma lap-time leverage
23-agent ADK paddock backend — briefings, debriefs, Q&A with DuckDB-backed tools and agent tracing (TPMS / handling balance / engine health / traction / input quality / safety domains added 2026-05-28)
CAN pipeline — USB-CAN → python-can → DBC decoding → DuckDB, with 6 adapter types supported
358-test suite — including 51-assertion end-to-end smoke test on real Sonoma VBO data

Quick Links¶

Internal Architecture — As-built backend topology with Mermaid diagrams
API Reference — 56 endpoint reference for the Flask bridge
ADK Agent Architecture — 23-agent paddock backend topology
ADK Implementation Plan — As-built status, all phases complete
Coaching Engine — Hot/warm path coaching, Bentley pedagogy
Feedback System — 3-layer coaching: sonic cues + corner grading + session review
Pedagogy — Ross Bentley curriculum with real Sonoma corner profiles
Sonoma Track Intelligence — Per-corner technique, landmarks, common mistakes
Architecture — Original sprint-design diagram (historical — see Internal Architecture for current)
ADRs — 21 architecture decisions