MLOps
Cost Spikes from Quiet Prompt Changes: Monitoring LLM Features in Production
After deploying an LLM feature, monitoring for cost spikes from subtle prompt changes is critical. Learn how to detect and mitigate these risks in production systems.
Thoughts on AI engineering, backend, frontend, and building modern software.
MLOps
After deploying an LLM feature, monitoring for cost spikes from subtle prompt changes is critical. Learn how to detect and mitigate these risks in production systems.
LLMs
Structured outputs ensure reliable AI behavior in production. Learn how schemas, validation, and retries create predictable model behavior without relying on clever prompts.
Software Architecture
Learn real-world software architecture lessons from a senior engineer building AI systems. Explore tradeoffs, failure modes, and design decisions that shape production-ready tech stacks.
MLOps
Master MLOps challenges with real-world strategies and engineering judgment. Learn from my experiences building production AI systems.
LLMs
Building production systems with large language models requires balancing speed, cost, and reliability. Learn concrete strategies for deployment, optimization, and avoiding common pitfalls.
RAG & Retrieval
Real-world challenges and design decisions in implementing Retrieval-Augmented Generation for production AI systems
Embeddings & Vector Search
Building robust AI systems requires more than models—here’s how to design, optimize, and debug embeddings and vector search in real-world scenarios.
Data Engineering
Master real-world data engineering challenges with actionable insights from building scalable AI infrastructure. Learn how to avoid common pitfalls and build robust data pipelines.
AI Engineering
Learn real-world tradeoffs, failure modes, and design decisions from a senior engineer building AI systems at scale.