Why it matters
Bedrock Mantle raises the throughput ceiling to 100M tokens per minute (TPM) and 10K requests per minute (RPM), with async inference and stateful conversations. It is a direct infrastructure upgrade for teams hitting Bedrock's legacy limits.
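To make the two ceilings concrete, here is a rough arithmetic sketch. It assumes both quotas apply to the same workload at once and that a request's token count covers input plus output; how the service actually meters tokens is not stated here, and the helper name is illustrative only.

```python
# Quota figures as stated in this listing.
TPM_LIMIT = 100_000_000  # tokens per minute
RPM_LIMIT = 10_000       # requests per minute


def max_requests_per_minute(tokens_per_request: int) -> int:
    """Requests per minute allowed once both ceilings are applied.

    Hypothetical helper: assumes every request uses the same number of tokens.
    """
    return min(RPM_LIMIT, TPM_LIMIT // tokens_per_request)


# Average request size at which both ceilings are reached together.
break_even = TPM_LIMIT // RPM_LIMIT
print(break_even)                       # 10000
print(max_requests_per_minute(2_000))   # 10000 (request-bound)
print(max_requests_per_minute(50_000))  # 2000 (token-bound)
```

Under these assumptions, workloads averaging fewer than 10,000 tokens per request hit the request ceiling first, and larger requests hit the token ceiling first.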
The tokenmaxxing angle
Mantle's quota jump to 100M TPM and its async inference mode change the cost-per-request math for high-volume workloads. Because migration requires only a base URL swap, it is a low-friction route to better throughput economics.
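A minimal sketch of what a base-URL-only migration could look like with the OpenAI Python SDK. Both endpoint URLs and the `EXAMPLE_API_KEY` variable are hypothetical placeholders, not real Bedrock or Mantle values; the actual endpoint and authentication scheme should come from AWS documentation.

```python
import os

# Hypothetical placeholder endpoints; substitute the real ones from AWS docs.
LEGACY_BASE_URL = "https://legacy.example.invalid/v1"
MANTLE_BASE_URL = "https://mantle.example.invalid/v1"


def client_settings(base_url: str) -> dict:
    """Keyword arguments for openai.OpenAI(); only base_url varies by target."""
    return {
        "base_url": base_url,
        # Assumed environment variable name, used here for illustration only.
        "api_key": os.environ.get("EXAMPLE_API_KEY", "placeholder-key"),
    }


before = client_settings(LEGACY_BASE_URL)
after = client_settings(MANTLE_BASE_URL)

# The only setting that differs between the two configurations.
changed = sorted(key for key in before if before[key] != after[key])
print(changed)  # ['base_url']

# With the openai package installed, the client would be constructed as:
#   from openai import OpenAI
#   client = OpenAI(**after)
```

Keeping the base URL in one setting means the rest of the calling code stays unchanged, which is the sense in which the migration is low-friction.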
From the organizers
At WeWork One Glenwood in Raleigh, speaker Luis Salcido (Sr. Technical Account Manager, AWS) covers Mantle's 100M TPM / 10K RPM quota tiers and demonstrates migration using the OpenAI SDK.