Why it matters
Healthcare's strict PHI constraints force local inference, auditable skill design, and open-standard integrations. Architects here see how the same agent runtime adapts when cloud API calls are restricted — a blueprint for regulated-industry deployments.
The tokenmaxxing angle
Tula runs local MedGemma inference rather than cloud APIs to avoid routing PHI to external providers — a concrete case study in routing decisions driven by compliance, not just cost. A nuanced dimension of AI FinOps often overlooked elsewhere.
From the organizers
Tula uses a self-hosted OpenClaw deployment that pulls clinical records from Epic via SMART-on-FHIR, stores them as FHIR R4 locally, and runs MedGemma for on-device inference; source code is Apache 2.0 on GitHub.