Backend Engineering
Why RAG Systems Feel Dumb: The Hidden Cost of Poor API Boundaries
RAG systems often feel slow, inconsistent, or unreliable. This post explains how poor API boundaries around model calls create these issues and how to fix them.
Thoughts on AI engineering, backend, frontend, and building modern software.
Backend Engineering
RAG systems often feel slow, inconsistent, or unreliable. This post explains how poor API boundaries around model calls create these issues and how to fix them.
AI Engineering
How to turn experimental prompts into reliable AI services with structured outputs and fallback strategies
AI Agents
How tool-call permissions and approval boundaries shape reliable AI agent systems using embeddings as the new database index
Software Architecture
How to architect AI systems with modular design, internal APIs, and clear boundaries for reliable, maintainable production deployments
Security
Scoped API keys limit access to specific AI features, reducing risk from unauthorized usage and misconfigurations in production systems.
Prompt Engineering
How structured prompt contracts prevent ambiguity in AI systems, ensuring reliable and predictable outputs in production environments
MLOps
A senior engineer's guide to tracking subtle prompt tweaks that inflate cloud costs and break production systems without raising alarms
Data Engineering
Data freshness is critical for AI systems. Learn how to avoid stale inputs and prevent hallucinations in production pipelines.
Cloud & Infrastructure
AI applications often hide infrastructure costs in deployment boundaries. Learn how to manage GPU, storage, and networking costs with practical strategies for production systems.
Backend Engineering
How to design robust backend APIs for AI models with clear boundaries, timeouts, and error handling.