Why it matters
The path from local experiment to scaled deployment is where infrastructure choices start to drive real inference and compute costs.
The tokenmaxxing angle
Moving agents onto Kubernetes raises the FinOps questions tokenmaxxing tracks: concurrency limits, retry storms, and per-pod inference cost at scale.
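As a rough illustration of how retries feed into per-pod inference cost, here is a minimal back-of-the-envelope sketch. It is not from the session: the names (PodProfile, expected_attempts, hourly_cost_per_pod, fleet_hourly_cost) and every number are hypothetical, and it assumes each attempt fails independently and that failed attempts are billed for their full token count.

```python
from dataclasses import dataclass


@dataclass
class PodProfile:
    """Hypothetical per-pod workload profile; all fields are illustrative."""
    requests_per_hour: float    # first-attempt requests handled by one pod
    tokens_per_request: float   # prompt + completion tokens per attempt
    price_per_1k_tokens: float  # assumed price in USD, not a real quote
    failure_rate: float         # probability that a single attempt fails
    max_retries: int            # retry cap per request


def expected_attempts(failure_rate: float, max_retries: int) -> float:
    """Expected attempts per request: 1 + p + p^2 + ... + p^max_retries.

    Attempt k+1 only happens if the previous k attempts all failed.
    """
    return sum(failure_rate ** k for k in range(max_retries + 1))


def hourly_cost_per_pod(pod: PodProfile) -> float:
    """Expected hourly inference spend for one pod, retries included."""
    attempts = expected_attempts(pod.failure_rate, pod.max_retries)
    tokens = pod.requests_per_hour * attempts * pod.tokens_per_request
    return tokens / 1000 * pod.price_per_1k_tokens


def fleet_hourly_cost(pod: PodProfile, pod_count: int) -> float:
    """Expected hourly spend across identical pods (a simplification)."""
    return hourly_cost_per_pod(pod) * pod_count


if __name__ == "__main__":
    calm = PodProfile(100, 2000, 0.01, failure_rate=0.02, max_retries=2)
    storm = PodProfile(100, 2000, 0.01, failure_rate=0.5, max_retries=2)
    for label, profile in (("calm", calm), ("storm", storm)):
        print(label, round(fleet_hourly_cost(profile, pod_count=20), 2))
```

Under these assumptions, the retry cap bounds the multiplier on token spend: with a cap of 2, spend can grow to at most 3 times the no-retry baseline per request, which is one reason a retry cap and a concurrency limit are usually set together.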
From the organizers
Session held at CG Infinity, Inc. in Plano, TX, framed around the specific arc from personal-machine experimentation to Kubernetes-orchestrated deployment.