Events / Raleigh

AWS Bedrock Mantle — Distributed Inference Engine in Amazon Bedrock

Lunchtime technical deep-dive in Raleigh on AWS Bedrock Mantle, Amazon's next-gen distributed inference engine with OpenAI-compatible endpoints, async processing, and enhanced quota tiers.

Wed, Jun 17, 4:00 PMWeWork One Glenwood · Raleigh · NC

Why it matters

Bedrock Mantle raises the throughput ceiling to 100M TPM and 10K RPM with async inference and stateful conversations — a direct infrastructure upgrade for teams hitting Bedrock's legacy limits.

The tokenmaxxing angle

Mantle's quota jump to 100M TPM and async inference mode changes the cost-per-request math for high-volume workloads. The migration path (base URL swap only) makes it a low-friction route to better throughput economics.

From the organizers

Speaker Luis Salcido (Sr. Technical Account Manager, AWS) covers Mantle's 100M TPM / 10K RPM quota tiers and demonstrates migration using the OpenAI SDK at WeWork One Glenwood, Raleigh.