Discussion about this post

Mikey B

Policy "Tuna" and Chocolate Milk is tomorrow's lunch. Only high performance, thank you.

Kaitlin S

New to the infra side, so genuine question: if cognition/agent design enforces stable prompts, strict output schemas, and routing to small models for easy tasks, does that meaningfully boost the same wins you’re getting from batching/graphs/quant (cache hits, shorter contexts, fewer retries)? Curious where you’ve seen that complement your Tier-1/2 work.

