Learn the difference between vertical and horizontal scaling, when each approach makes sense, and why stateless services are essential for scalable system design.
April 13, 2026
March 27, 2026
You Need Storage. But Not Every Storage Is the Same.
February 20, 2026
February 18, 2026
A platform for processing events reliably.
February 12, 2026
Privia
February 9, 2026
Document Intelligence API with FastAPI-based backend project using AWS
January 29, 2026
Most RAG chatbots don’t fail because of bad models — they fail because the system was built in the wrong order. This post explains the correct build sequence that actually survives production
December 30, 2025
Most Retrieval-Augmented Generation (RAG) systems look impressive in demos — and then quietly fail in production. This post explains why, and what actually breaks after the first real users arrive.
FastAPI (or Django) + Next.js, Sanity for CMS, Groq/Llama-3 or OpenAI for LLMs, and a retrieval layer that I can benchmark, observe, and replace.
July 25, 2025
Why I moved my blog under my name, what I’ll write about (AI engineering, RAG, agents, shipping products, and content), and how I’ll keep it honest and useful.