Writing and mumblings

I write about a mix of consulting, open source, personal work, and applying LLMs. I won't email you more than twice a month. Not every post I write is worth sharing, but I'll do my best to share the most interesting stuff, including my own writing, thoughts, and experiences.

Subscribe to my Newsletter
Follow me on X

For posts about RAG (Retrieval-Augmented Generation) or LLMs (Large Language Models), check out the category labels in the sidebar. Here are some of my best posts on these topics:

Personal Stories

Advice for Young People: Tips and insights for those starting their journey
Losing My Hands: My experience with a career-changing injury

RAG and LLM Insights

Future of RAG: What's next for RAG?
RAG: More Than Just Embeddings: Understanding the full scope of RAG systems
RAG Complexity Levels: Breaking down the layers of RAG implementation
Improving Your RAG System: Steps to enhance RAG performance
Common RAG Mistakes: What not to do when building a RAG system
Easy Wins for RAG: Simple ways to boost your RAG system
RAG Feedback Loop: Creating a self-improving RAG system
RAG Search Metrics: How to measure RAG search quality

Consulting and Tech Advice

Tools for Consulting: Essential tech for consultants
Solo Consulting Guide: Tips for independent consultants
Consulting 101: Key lessons from my consulting experience
Building AI App MVPs: How to launch a basic AI application
Common Engineering Errors: Mistakes to avoid in software development

Talks and Interviews

Pydantic Keynote: Why Pydantic is crucial for Python developers
Weaviate Podcast: Discussion on vector databases
AI Development Podcast: Insights on building with AI
Dagshub Interview: Exploring data science tools
Talking Heads Podcast: Thoughts on AI and tech trends

2025/06/19
5 min read

RAG for Coding Agents Lightning Series

I find this to be a pretty interesting topic because I personally believe that coding agents are probably operating at the frontier of agentic RAG systems.

The world of autonomous coding agents is rapidly evolving, with fundamental disagreements emerging about the best approaches to building reliable, high-performance systems. This Lightning Series brings together the minds behind some of the most successful coding agents—from SWE-Bench champions to billion-dollar products—to debate the core architectural decisions shaping the future of AI-powered development.

Quick Links

If you just want to sign up, you'll have to open each of these links and sign up for each one.

RAG in the Age of Agents: SWE-Bench as a Case Study from Colin Flaherty of Augment Code
Lessons on Retrieval for Autonomous Coding Agents from Nik Pash of Cline
Why Devin Does Not Use Multi-Agents from Walden Yan of Cognition AI

2025/06/12
5 min read

Lovable, Monetization, and the Vibe Coder Economy

These are all just notes from a 30-minute conversation I had with somebody. A fun little exercise, as you will see.

When people ask me for a hot take, here's mine: more agent tools and AI tools should be pricing on outcomes and trying hard to figure out what that means. This aligns with my broader thoughts on pricing AI tools as headcount alternatives.

The question hit me personally as a small investor in Lovable and a consultant focused on value-based pricing: Why am I not building my consulting business, my courses, and my job board on Lovable instead of spreading them across Stripe, Maven, Circle, Kit, and Podia? It's because I could only possibly pay $100/month, and for that, they could not possibly offer me the features I need.

2025/06/11
12 min read

RAG Anti-Patterns with Skylar Payne

I hosted a Lightning Lesson with Skylar Payne, an experienced AI practitioner who's worked at companies like Google and LinkedIn over the past decade. Skylar shared valuable insights on common RAG (Retrieval-Augmented Generation) anti-patterns he's observed across multiple client engagements, providing practical advice for improving AI systems through better data handling, retrieval, and evaluation practices.

2025/06/09
9 min read

How to invest in AI w/ MCPS and Data Analytics

tl;dr: You should build a system that lets you discover value before you commit resources.

Key Takeaways

Before asking what to build, start with a simple chatbot to discover what users are interested in. There's no need to reach for a complex agent or workflow before we see real user demand.

Leverage tools like Kura to understand broad user behavior patterns. The sooner we start collecting real user data, the better.

This week, I had conversations with several VPs of AI at large enterprises, and I kept hearing the same story: XX teams experimenting with AI, a CEO demanding results for the next board meeting, sales conference, or quarterly review, and no clear path from pilot to production.

These conversations happen because I built improvingrag.com, a resource that helps teams build better RAG systems, which has led me into many conversations with researchers, engineers, and executives. But the questions aren't about RAG techniques. They're about strategy: "How do we go from experiments to production?" "How do we know what to invest in?" "How do we show ROI?"

2025/05/29
11 min read

Systematically Improving RAG with Raindrop and Oleve

I hosted a lightning lesson featuring Ben from Raindrop and Sid from Oleve to discuss AI monitoring, production testing, and data analysis frameworks. This session explored how to effectively identify issues in AI systems, implement structured monitoring, and develop frameworks for improving AI products based on real user data.

2025/05/20
4 min read

Pricing AI Agents, Headcount, and the Economic Reality

Today I spoke to an executive about SaaS products, and they told me something that shifted my perspective entirely: AI agents need to be compared against the budgets companies set aside for headcount, not the budgets they set aside for tooling.

This is one of those insights that seems obvious in retrospect, but completely changes how you think about positioning AI tools in today's market—especially in this era of widespread tech layoffs and economic uncertainty.

2025/05/19
8 min read

There Are Only 6 RAG Evals

The world of RAG evaluation feels needlessly complex. Everyone's building frameworks, creating metrics, and generating dashboards that make you feel like you need a PhD just to know if your system is working.

2025/05/09
5 min read

Free Lightning Lessons to Advance Your RAG Implementations

I'll be hosting industry experts to share practical techniques for enhancing your Retrieval-Augmented Generation (RAG) systems.

2025/04/04
in Consulting
10 min read

Creating Content That Converts: My Guide for AI Consultants

These are some of the notes I've taken for learnindieconsulting.com.

Why I Prioritize Content (And Why You Should Too)

Let me share something I wish I'd understood sooner: consistent content creation isn't just a marketing tactic—it's the foundation of a thriving consulting business.

When I started my consulting journey, I was stuck in the time-for-money trap. I'd jump on Zoom calls with prospects, explain the same concepts repeatedly, and wonder why scaling was so difficult. Then I had a realization that changed everything: what if I could have these conversations at scale?

Now I extract blog post ideas from every client call. Every Friday, I review about 17 potential topics from the week's conversations. I test them with social posts, see which ones get traction (some get 700 views, others 200,000), and develop the winners into comprehensive content.

Here's why this approach has transformed my business:

2025/03/18
6 min read

Version Control for the Vibe Coder (Part 1)

Imagine this: you open Cursor, ask it to build a feature in YOLO-mode, and let it rip. You flip back to Slack, reply to a few messages, check your emails, and return...

It's still running.

What the hell is going on? .sh files appear, there's a fresh Makefile, and a mysterious .gitignore. Anxiety creeps in. Should you interrupt it? Could you accidentally trash something critical?

Relax—you're not alone. This anxiety is common, especially among developers newer to powerful agents like Cursor's. Fortunately, Git is here to save the day.