Available NowNo Starch Press

Stop building
toy AI demos.

The Developer's Guide to AI is the field manual for engineering production-grade LLM applications, RAG pipelines, and autonomous agents.

Fig. 001

Read Chapter 1 & 2 Free

FormatsPrint, E-Book, Kindle

Length320 Pages, 5 Parts

FocusEngineering, Not Theory

Fig. 001

Free Evaluation

Read Chapters 1 & 2.
Judge for yourself.

28 pages. 15 minute read. Actual code samples.

✓ Understanding LLMs internally
✓ The companion code repo

From the Manuscript

Real decisions.
Real code.

Four excerpts, pulled directly from the book. A maxim, two production code samples, and a decision framework. This is the voice of the book - unedited.

01The MaximCh. 8 · RAG02Vector DatabaseCh. 8 · Code03Tool AgentsCh. 14 · Code04Trade-offsCh. 6 · Prompts

Chapter 8 · Designing a RAG System · p. 190

"Better data beats
bigger prompts."

Context

Every retrieval trick in this chapter - better chunking, smarter retrieval, re-ranking, citations - depends on having clean, current, and trustworthy sources. Treat data quality like a first-class feature of your pipeline, not an afterthought.

"When answers are wrong, observability is how you debug the pipeline - not the prompt."

Chapter 8 · Vector Databases · p. 180

A production vector database requires robust data ingestion. Here we build a document store that handles chunking, deterministic IDs, and metadata preservation, ensuring we can trace every retrieved snippet back to its source.

# From: part3/rag/store.py
class MultiDocumentVectorStore:
    ❶ def __init__(self, documents: List[Document]):
        embedding_function = OllamaEmbeddingFunction(
            url="http://localhost:11434",
            model_name="mxbai-embed-large"
        )
        client = chromadb.Client()
        self.collection: Collection = client.create_collection(
            name="examples_readme",
            embedding_function=embedding_function,
        )

        ❷ for doc_idx, document in enumerate(documents):
            splitter = RecursiveCharacterTextSplitter.from_language(
                language="markdown", chunk_size=1500)
            document_chunks = splitter.split_text(document.content)

            for chunk_idx, document_chunk in enumerate(document_chunks):
                self.collection.add(
                    documents=document_chunk,
                    ❸ ids=f"doc_{doc_idx + 1}_{chunk_idx + 1}",
                    ❹ metadatas={"source_url": document.source_url}
                )

    def query(self, question: str):
        results = self.collection.query(query_texts=[question], n_results=3)
        document_chunk_results = results.get('documents')[0]
        ❺ document_chunk_metadatas = results.get('metadatas')[0]
        documents: List[Document] = []

        ❻ for result_idx, document_chunk in enumerate(document_chunk_results):
            documents.append(Document(
                ❼ source_url=document_chunk_metadatas[result_idx].get('source_url'),
                content=document_chunk
            ))

        return documents

❶Initializes a local Ollama embedding function and ChromaDB collection.
❷Loops through documents and chunks them using a Markdown-aware text splitter.
❸Assigns a deterministic composite ID based on the document and chunk indices.
❹Attaches metadata to track the original source URL for each chunk.
❺Retrieves the metadata array from the vector search results.
❻Loops over the query results to reconstruct the objects.
❼Preserves the original source URL metadata into the retrieved objects.

Chapter 14 · Extending Agents with Tools · p. 258

Here's how the book teaches an agent to do things, not just talk about them. The @tool decorator registers a Python function - and the docstring is what the LLM reads to decide when and how to call it.

# From: part5/agents/tools.py
from langchain_core.tools import tool
from langgraph.prebuilt import create_react_agent

❶ @tool
❷ def save_file(data: str) -> None:
    ❸ """
    Save the data to a file.

    Args:
     data: The data to save to the file.
    """
    ❹ with open("investments.txt", "w") as f:
        f.write(data)

❺ tools = [save_file]
agent = create_react_agent(model, tools)

❶The @tool decorator registers the function with the agent runtime.
❷Type hints matter - the agent uses them to validate inputs.
❸The docstring is the instruction manual. The LLM reads it.
❹A real filesystem operation - the agent can now persist state.
❺The tool is passed to the agent. It can now decide when to call it.

Chapter 6 · Prompt Engineering in Code · p. 80

Every AI codebase has to pick an abstraction level - raw SDK, lightweight wrapper, or full framework. The book's trade-off curve:

↑Higher Abstraction

Buys speed of delivery and shared conventions.

LangChain · LlamaIndex · Haystack · CrewAI

↓Lower Abstraction

Buys performance, debuggability, and long-term freedom.

OpenAI SDK · Anthropic SDK · Ollama client

"Choose the level that removes your biggest current bottleneck - and be ready to slide up or down the stack as those bottlenecks change."

These are four excerpts out of 320 pages.

Get Chapters 1 & 2 in your inbox

What You'll Build

Five parts.
One journey from prompts to agents.

This isn't a collection of essays. It's a sequenced build. Each part extends the last, so by the time you get to agents, you've already written the LLM calls, the prompts, the retrieval, and the fine-tuning underneath them.

Part · 3 chapters

Getting Started with AI

Ship an LLM-powered app before you finish the part. Ollama running locally, a Node or Python server hitting it, streaming responses to the UI. No OpenAI key required. Build on your own hardware first.

Covers

·Local LLMs with Ollama
·Streaming with Express
·Python + FastAPI essentials

Part · 3 chapters

Prompt Engineering

Prompt techniques that survive production, not parlor tricks. How to structure prompts inside real codebases, handle failure modes, and write prompts that behave the same way on the hundredth call as on the first.

Covers

·Fundamentals
·Advanced techniques
·Prompts in code

III

Part · 2 chapters

Vector Databases & RAG

Retrieval-Augmented Generation from scratch. Vector databases without the hype, chunking strategies, retrieval design, and how to tell when RAG is actually the right answer vs. a context-window trick.

Covers

·Vector DBs in practice
·Designing a RAG system

Part · 3 chapters

Adapting Models

Fine-tuning when it's worth it, and skipping it when it's not. Data preparation, the real cost curves, and the decisions senior engineers make about prompt engineering vs. fine-tuning vs. a custom model.

Covers

·Why (and when)
·Data preparation
·Fine-tuning in practice

Part · 3 chapters

Building Agentic Systems

From workflow to autonomous agent. How agents actually work, how to build one, and how to extend them with real tools that do real work, not another 'agent writes poetry' demo.

Covers

·Workflows to agents
·Building an agent
·Extending with tools

See the full table of contents

Table of Contents

Every chapter, every page.

Tap any part to expand.

IGetting Started with AI3 chapters

1Understanding Large Language Models3
2Building Your First LLM-Powered Application15
3Python Essentials for LLMs and APIs29

IIPrompt Engineering3 chapters

4Fundamentals of Prompt Engineering43
5Prompt Engineering Techniques61
6Prompt Engineering in Code77

IIIVector Databases and RAG2 chapters

7Vector Databases in Practice131
8Designing a Retrieval-Augmented Generation System161

IVAdapting Models to Real-World Tasks3 chapters

9Why and When to Customize a Model197
10Preparing Data for Fine-Tuning207
11Fine-Tuning Models in Practice219

VBuilding Agentic Systems3 chapters

12From Workflows to Autonomous Agents247
13Building an Autonomous Agent253
14Extending Agents with Tools259

Praise

What senior engineers are saying.

Going through "The Developer’s Guide to AI", with fellow developers in our Commit Your Code Discord, leading chapter discussions over several months, helped me turn scattered AI concepts into something clear and practical. It broke down topics like RAG, prompt engineering, and agents in a way that actually made sense to apply. The experience went beyond reading, it gave me real confidence in using AI in development. This is a book that truly meets you where you are. If you’re looking to move from learning AI to actually building with it, this is a great place to start.

David Kea, Jr.

Enterprise Technology Business Specialist

“This book demystified AI under the hood for me. The explanations are clear, the companion repo is hands-on and keeps improving, and the coverage of RAG, tokens, prompt engineering, and fine-tuning gave me real foundations to build on. Even though the AI landscape changes almost every day, these fundamentals don't.”

Sara Baqla

AI Native Engineer

“Even as a developer who thought I understood LLMs well, this book delivered new, practical insights. It bridges theory and real-world application, showing how to use LLMs at the application layer without a data science background. A high-value, must-read for any developer building with AI”

Preet Katari

Cloud Solutions Architect

Audience

This book is for, and not for, specific people.

✓For You If

01You've played with ChatGPT and Claude. Now you want to build with them.
02You're comfortable reading TypeScript or Python. You've shipped real code.
03You're tired of AI demos that don't survive contact with production.
04You want to understand when to use RAG, fine-tuning, prompts, or agents, and why.
05You'd rather build five real systems than read five theoretical papers.

×Probably Not For You If

01You've never written a line of code. Start somewhere else, then come back.
02You want a cookbook of copy-paste prompts. This teaches systems thinking, not incantations.
03You want ML theory, transformer math, or training models from scratch.
04You're looking for speculation about AGI. This book is about shipping working software.

The Authors

Three engineers.
Over fifty years combined in the field.

Danny Thompson

From frying chicken at a gas station to Fortune 500 and beyond.

Accomplished software developer with a proven track record of success in the tech industry. Former Director, Senior Developer, Community Leader, and now Author. One of the most recognized community voices in growing your technical skills in the field of software engineering. Host of the globally top-ranked The Programming Podcast. Organizer of the Commit Your Code Conference for Charity and the Dallas Software Developers meetup. Keynote speaker at international conferences.

dthompsondev.com

Notable

·The Programming Podcast
·Commit Your Code organizer
·Delivered 200+ technical talks

Jacob Orshalick

Software architect, independent consultant, and long-time Java EE voice.

Over two decades in software development. Former PhD research assistant, now 16+ years as an independent consultant. Has built solutions for startups, Fortune 500s, and everything in between. Co-author of Seam Framework: Experience the Evolution of Java EE. Award-winning open source contributor. Speaks regularly at conferences.

focus.dev

Notable

·Prior book: Seam Framework
·Conference speaker
·20+ years building software

Jerry M. Reghunadh

Senior director, systems architect, and relentless curiosity-driven engineer.

Over two decades spanning QA automation, product engineering, and architectural leadership. Senior director at a global product organization, where he architects complex solutions and ships products at scale. International conference speaker. Firm believer that writing quality code on bare-bones hardware is an art form. Avid Formula 1 fan.

jerrymannel.me

Notable

·Senior director
·International speaker
·20+ years engineering

Questions

Everything you'd want to know
before you buy.

01What programming languages does the book use?

Primarily JavaScript/TypeScript (Node + Express) and Python (FastAPI). If you can read either one, you'll be fine. Most concepts are language-agnostic, and the book shows the same idea in both where it matters.

02Do I need prior AI or ML experience?

No. The book assumes you're a working developer who's used ChatGPT or Claude and written real software. It does not assume you've studied transformer math or trained models. Chapter 1 covers the mental model of LLMs from first principles; you ramp up from there.

03Which AI providers and models does the book cover?

The book teaches patterns that work across providers. You'll run local models via Ollama, hit hosted APIs (OpenAI-compatible patterns that also work with Anthropic, Google, and open-source providers), and see where provider-specific decisions actually matter.

04Does it cover RAG (Retrieval-Augmented Generation)?

Yes, Part III is entirely about RAG. Two chapters: one on vector databases in practice, one on designing a full RAG system. You build a working system, not just read about the concept.

05Does it cover agents and agentic systems?

Yes, Part V is three chapters on agents. From the workflow-to-autonomous-agent progression, through building your first agent, to extending agents with tools that do real work.

06Is the source code available?

Yes. All code from the book is available in a companion repository. You get free access when you buy the book. Beta-test signups also receive repo access.

07Paperback or ebook: which should I get?

If you want to annotate and reference it on a shelf, paperback. If you want to copy-paste code samples and read on multiple devices, ebook. Many readers buy both, and No Starch sometimes bundles them.

08Won't this be outdated quickly given how fast AI moves?

The book is deliberately built around durable patterns like prompt engineering fundamentals, RAG architecture, fine-tuning trade-offs, and agent design, not model-of-the-week tutorials. The specific providers and models will shift; the engineering principles stay.

09Is this suitable for senior or staff engineers?

Yes, and we've heard this specifically from early readers. If you're senior and new to AI, this is the fastest way to get production-ready context. If you're already shipping AI features, the RAG, fine-tuning, and agent chapters fill in the gaps between tutorials and real systems.

10How does this compare to an online course or YouTube tutorial?

Courses move fast and get out of date. This is a reference: something you open to Chapter 8 when you need to design a retrieval system, or Chapter 12 when you're deciding between a workflow and an agent. It's the book you keep on the desk, not the one you binge once.

11Who are the authors?

Jacob Orshalick is a software architect with 20+ years of experience and co-author of Seam Framework. Jerry M. Reghunadh is a senior director with two decades in engineering and international speaking. Danny Thompson is a senior developer advocate, host of The Programming Podcast, and organizer of the Commit Your Code conference. Technical review by Nikhil Kapoor (16+ years in AI/ML).

12Where can I buy the book?

Directly from No Starch Press (best price, supports the publisher and authors), Amazon (widest availability), Barnes & Noble, or most technical booksellers. Ebook formats are available through No Starch and most major ebook retailers.

Get the Book

Available now.
Pick your preferred retailer.

Paperback, ebook, and Kindle formats. Pick your retailer.

No Starch Press Quality Guarantee

Recommended

Prime & Kindle

Amazon

From $59.99

Paperback and Kindle formats, widest availability

Buy on Amazon

Direct from publisher

No Starch Press

From $59.99

Paperback + ebook bundle, best for authors

Buy Direct

Online & in-store

Barnes & Noble

From $59.99

Paperback and Nook ebook

Buy at B&N

Published by No Starch Press · San Francisco · 2026

The Developer's Guide to AI is the field manual for engineering production-grade LLM applications, RAG pipelines, and autonomous agents.

Read Chapters 1 & 2. Judge for yourself.

Real decisions.Real code.

Five parts.One journey from prompts to agents.

Getting Started with AI

Prompt Engineering

Vector Databases & RAG

Adapting Models

Building Agentic Systems

Every chapter, every page.

What senior engineers are saying.

This book is for, and not for, specific people.

Three engineers.Over fifty years combined in the field.

Danny Thompson

Jacob Orshalick

Jerry M. Reghunadh

Everything you'd want to knowbefore you buy.

Available now.Pick your preferred retailer.

Amazon

No Starch Press

Barnes & Noble

Get AI Engineeringinsights delivered.

Read Chapters 1 & 2.
Judge for yourself.

Real decisions.
Real code.

Five parts.
One journey from prompts to agents.

Three engineers.
Over fifty years combined in the field.

Everything you'd want to know
before you buy.

Available now.
Pick your preferred retailer.

Get AI Engineering
insights delivered.