What is The Developer's Guide to AI?
A hands-on technical book for software engineers teaching how to build real AI systems, published by No Starch Press.
Who are the authors?
Jacob Orshalick, Jerry M. Reghunadh, and Danny Thompson.
What core technologies are covered?
Large Language Models (LLMs), Prompt Engineering, Vector Databases, Retrieval-Augmented Generation (RAG), Fine-Tuning, and Autonomous Agents.
Who is the target audience?
Working developers proficient in JavaScript, TypeScript, or Python who want to move past API wrappers and build scalable AI architecture.
Available NowNo Starch Press

Stop building
toy AI demos.

The Developer's Guide to AI is the field manual for engineering production-grade LLM applications, RAG pipelines, and autonomous agents.

The Developer's Guide to AI Book Cover
Fig. 001
Read Chapter 1 & 2 Free
FormatsPrint, E-Book, Kindle
Length320 Pages, 5 Parts
FocusEngineering, Not Theory

Free Evaluation

Read Chapters 1 & 2.
Judge for yourself.

28 pages. 15 minute read. Actual code samples.

  • Understanding LLMs internally
  • The companion code repo

Zero spam. One-click unsubscribe.

Trusted by production engineers
Such a great and fun book! No prior knowledge needed. Readers will develop the skills and understanding to be AI thought-leaders on their teams.
Erik Weibust, Sr Engineering Manager
I was skeptical reading the book but it helped out significantly when it came to discussions with my manager and team! Highly recommend.
Isaac Arcoss Huicochea, Sr Business Analyst
I finally understand the buzzwords people always talk about! The narrative flow of the book is amazing and makes complex topics easy to digest.
Walker Laury, Sr IT Engineer

From the Manuscript

Real decisions.
Real code.

Four excerpts, pulled directly from the book. A maxim, two production code samples, and a decision framework. This is the voice of the book - unedited.

Chapter 8 · Designing a RAG System · p. 190
"Better data beats
bigger prompts."
Context

Every retrieval trick in this chapter - better chunking, smarter retrieval, re-ranking, citations - depends on having clean, current, and trustworthy sources. Treat data quality like a first-class feature of your pipeline, not an afterthought.

"When answers are wrong, observability is how you debug the pipeline - not the prompt."

Chapter 8 · Vector Databases · p. 180

A production vector database requires robust data ingestion. Here we build a document store that handles chunking, deterministic IDs, and metadata preservation, ensuring we can trace every retrieved snippet back to its source.

# From: part3/rag/store.py
class MultiDocumentVectorStore:
     def __init__(self, documents: List[Document]):
        embedding_function = OllamaEmbeddingFunction(
            url="http://localhost:11434",
            model_name="mxbai-embed-large"
        )
        client = chromadb.Client()
        self.collection: Collection = client.create_collection(
            name="examples_readme",
            embedding_function=embedding_function,
        )

         for doc_idx, document in enumerate(documents):
            splitter = RecursiveCharacterTextSplitter.from_language(
                language="markdown", chunk_size=1500)
            document_chunks = splitter.split_text(document.content)

            for chunk_idx, document_chunk in enumerate(document_chunks):
                self.collection.add(
                    documents=document_chunk,
                     ids=f"doc_{doc_idx + 1}_{chunk_idx + 1}",
                     metadatas={"source_url": document.source_url}
                )

    def query(self, question: str):
        results = self.collection.query(query_texts=[question], n_results=3)
        document_chunk_results = results.get('documents')[0]
         document_chunk_metadatas = results.get('metadatas')[0]
        documents: List[Document] = []

         for result_idx, document_chunk in enumerate(document_chunk_results):
            documents.append(Document(
                 source_url=document_chunk_metadatas[result_idx].get('source_url'),
                content=document_chunk
            ))

        return documents
  • Initializes a local Ollama embedding function and ChromaDB collection.
  • Loops through documents and chunks them using a Markdown-aware text splitter.
  • Assigns a deterministic composite ID based on the document and chunk indices.
  • Attaches metadata to track the original source URL for each chunk.
  • Retrieves the metadata array from the vector search results.
  • Loops over the query results to reconstruct the objects.
  • Preserves the original source URL metadata into the retrieved objects.
Chapter 14 · Extending Agents with Tools · p. 258

Here's how the book teaches an agent to do things, not just talk about them. The @tool decorator registers a Python function - and the docstring is what the LLM reads to decide when and how to call it.

# From: part5/agents/tools.py
from langchain_core.tools import tool
from langgraph.prebuilt import create_react_agent

 @tool
 def save_file(data: str) -> None:
     """
    Save the data to a file.

    Args:
     data: The data to save to the file.
    """
     with open("investments.txt", "w") as f:
        f.write(data)

 tools = [save_file]
agent = create_react_agent(model, tools)
  • The @tool decorator registers the function with the agent runtime.
  • Type hints matter - the agent uses them to validate inputs.
  • The docstring is the instruction manual. The LLM reads it.
  • A real filesystem operation - the agent can now persist state.
  • The tool is passed to the agent. It can now decide when to call it.
Chapter 6 · Prompt Engineering in Code · p. 80

Every AI codebase has to pick an abstraction level - raw SDK, lightweight wrapper, or full framework. The book's trade-off curve:

Higher Abstraction

Buys speed of delivery and shared conventions.

LangChain · LlamaIndex · Haystack · CrewAI

Lower Abstraction

Buys performance, debuggability, and long-term freedom.

OpenAI SDK · Anthropic SDK · Ollama client

"Choose the level that removes your biggest current bottleneck - and be ready to slide up or down the stack as those bottlenecks change."

These are four excerpts out of 320 pages.

Get Chapters 1 & 2 in your inbox

What You'll Build

Five parts.
One journey from prompts to agents.

This isn't a collection of essays. It's a sequenced build. Each part extends the last, so by the time you get to agents, you've already written the LLM calls, the prompts, the retrieval, and the fine-tuning underneath them.

I
Part · 3 chapters

Getting Started with AI

Ship an LLM-powered app before you finish the part. Ollama running locally, a Node or Python server hitting it, streaming responses to the UI. No OpenAI key required. Build on your own hardware first.

Covers
  • ·Local LLMs with Ollama
  • ·Streaming with Express
  • ·Python + FastAPI essentials
II
Part · 3 chapters

Prompt Engineering

Prompt techniques that survive production, not parlor tricks. How to structure prompts inside real codebases, handle failure modes, and write prompts that behave the same way on the hundredth call as on the first.

Covers
  • ·Fundamentals
  • ·Advanced techniques
  • ·Prompts in code
III
Part · 2 chapters

Vector Databases & RAG

Retrieval-Augmented Generation from scratch. Vector databases without the hype, chunking strategies, retrieval design, and how to tell when RAG is actually the right answer vs. a context-window trick.

Covers
  • ·Vector DBs in practice
  • ·Designing a RAG system
IV
Part · 3 chapters

Adapting Models

Fine-tuning when it's worth it, and skipping it when it's not. Data preparation, the real cost curves, and the decisions senior engineers make about prompt engineering vs. fine-tuning vs. a custom model.

Covers
  • ·Why (and when)
  • ·Data preparation
  • ·Fine-tuning in practice
V
Part · 3 chapters

Building Agentic Systems

From workflow to autonomous agent. How agents actually work, how to build one, and how to extend them with real tools that do real work, not another 'agent writes poetry' demo.

Covers
  • ·Workflows to agents
  • ·Building an agent
  • ·Extending with tools

Table of Contents

Every chapter, every page.

Tap any part to expand.

IGetting Started with AI3 chapters
  • 1Understanding Large Language Models3
  • 2Building Your First LLM-Powered Application15
  • 3Python Essentials for LLMs and APIs29
IIPrompt Engineering3 chapters
  • 4Fundamentals of Prompt Engineering43
  • 5Prompt Engineering Techniques61
  • 6Prompt Engineering in Code77
IIIVector Databases and RAG2 chapters
  • 7Vector Databases in Practice131
  • 8Designing a Retrieval-Augmented Generation System161
IVAdapting Models to Real-World Tasks3 chapters
  • 9Why and When to Customize a Model197
  • 10Preparing Data for Fine-Tuning207
  • 11Fine-Tuning Models in Practice219
VBuilding Agentic Systems3 chapters
  • 12From Workflows to Autonomous Agents247
  • 13Building an Autonomous Agent253
  • 14Extending Agents with Tools259

Praise

What senior engineers are saying.

Going through "The Developer’s Guide to AI", with fellow developers in our Commit Your Code Discord, leading chapter discussions over several months, helped me turn scattered AI concepts into something clear and practical. It broke down topics like RAG, prompt engineering, and agents in a way that actually made sense to apply. The experience went beyond reading, it gave me real confidence in using AI in development. This is a book that truly meets you where you are. If you’re looking to move from learning AI to actually building with it, this is a great place to start.
David Kea, Jr.
Enterprise Technology Business Specialist
This book demystified AI under the hood for me. The explanations are clear, the companion repo is hands-on and keeps improving, and the coverage of RAG, tokens, prompt engineering, and fine-tuning gave me real foundations to build on. Even though the AI landscape changes almost every day, these fundamentals don't.
Sara Baqla
AI Native Engineer
Even as a developer who thought I understood LLMs well, this book delivered new, practical insights. It bridges theory and real-world application, showing how to use LLMs at the application layer without a data science background. A high-value, must-read for any developer building with AI
Preet Katari
Cloud Solutions Architect

Audience

This book is for, and not for, specific people.

For You If
  • 01You've played with ChatGPT and Claude. Now you want to build with them.
  • 02You're comfortable reading TypeScript or Python. You've shipped real code.
  • 03You're tired of AI demos that don't survive contact with production.
  • 04You want to understand when to use RAG, fine-tuning, prompts, or agents, and why.
  • 05You'd rather build five real systems than read five theoretical papers.
×Probably Not For You If
  • 01You've never written a line of code. Start somewhere else, then come back.
  • 02You want a cookbook of copy-paste prompts. This teaches systems thinking, not incantations.
  • 03You want ML theory, transformer math, or training models from scratch.
  • 04You're looking for speculation about AGI. This book is about shipping working software.

The Authors

Three engineers.
Over fifty years combined in the field.

Portrait of Danny Thompson, co-author of The Developer's Guide to AI book

Danny Thompson

From frying chicken at a gas station to Fortune 500 and beyond.

Accomplished software developer with a proven track record of success in the tech industry. Former Director, Senior Developer, Community Leader, and now Author. One of the most recognized community voices in growing your technical skills in the field of software engineering. Host of the globally top-ranked The Programming Podcast. Organizer of the Commit Your Code Conference for Charity and the Dallas Software Developers meetup. Keynote speaker at international conferences.

Notable
  • ·The Programming Podcast
  • ·Commit Your Code organizer
  • ·Delivered 200+ technical talks
Portrait of Jacob Orshalick, co-author of The Developer's Guide to AI book

Jacob Orshalick

Software architect, independent consultant, and long-time Java EE voice.

Over two decades in software development. Former PhD research assistant, now 16+ years as an independent consultant. Has built solutions for startups, Fortune 500s, and everything in between. Co-author of Seam Framework: Experience the Evolution of Java EE. Award-winning open source contributor. Speaks regularly at conferences.

Notable
  • ·Prior book: Seam Framework
  • ·Conference speaker
  • ·20+ years building software
Portrait of Jerry M. Reghunadh, co-author of The Developer's Guide to AI book

Jerry M. Reghunadh

Senior director, systems architect, and relentless curiosity-driven engineer.

Over two decades spanning QA automation, product engineering, and architectural leadership. Senior director at a global product organization, where he architects complex solutions and ships products at scale. International conference speaker. Firm believer that writing quality code on bare-bones hardware is an art form. Avid Formula 1 fan.

Notable
  • ·Senior director
  • ·International speaker
  • ·20+ years engineering

Questions

Everything you'd want to know
before you buy.

01What programming languages does the book use?
Primarily JavaScript/TypeScript (Node + Express) and Python (FastAPI). If you can read either one, you'll be fine. Most concepts are language-agnostic, and the book shows the same idea in both where it matters.
02Do I need prior AI or ML experience?
No. The book assumes you're a working developer who's used ChatGPT or Claude and written real software. It does not assume you've studied transformer math or trained models. Chapter 1 covers the mental model of LLMs from first principles; you ramp up from there.
03Which AI providers and models does the book cover?
The book teaches patterns that work across providers. You'll run local models via Ollama, hit hosted APIs (OpenAI-compatible patterns that also work with Anthropic, Google, and open-source providers), and see where provider-specific decisions actually matter.
04Does it cover RAG (Retrieval-Augmented Generation)?
Yes, Part III is entirely about RAG. Two chapters: one on vector databases in practice, one on designing a full RAG system. You build a working system, not just read about the concept.
05Does it cover agents and agentic systems?
Yes, Part V is three chapters on agents. From the workflow-to-autonomous-agent progression, through building your first agent, to extending agents with tools that do real work.
06Is the source code available?
Yes. All code from the book is available in a companion repository. You get free access when you buy the book. Beta-test signups also receive repo access.
07Paperback or ebook: which should I get?
If you want to annotate and reference it on a shelf, paperback. If you want to copy-paste code samples and read on multiple devices, ebook. Many readers buy both, and No Starch sometimes bundles them.
08Won't this be outdated quickly given how fast AI moves?
The book is deliberately built around durable patterns like prompt engineering fundamentals, RAG architecture, fine-tuning trade-offs, and agent design, not model-of-the-week tutorials. The specific providers and models will shift; the engineering principles stay.
09Is this suitable for senior or staff engineers?
Yes, and we've heard this specifically from early readers. If you're senior and new to AI, this is the fastest way to get production-ready context. If you're already shipping AI features, the RAG, fine-tuning, and agent chapters fill in the gaps between tutorials and real systems.
10How does this compare to an online course or YouTube tutorial?
Courses move fast and get out of date. This is a reference: something you open to Chapter 8 when you need to design a retrieval system, or Chapter 12 when you're deciding between a workflow and an agent. It's the book you keep on the desk, not the one you binge once.
11Who are the authors?
Jacob Orshalick is a software architect with 20+ years of experience and co-author of Seam Framework. Jerry M. Reghunadh is a senior director with two decades in engineering and international speaking. Danny Thompson is a senior developer advocate, host of The Programming Podcast, and organizer of the Commit Your Code conference. Technical review by Nikhil Kapoor (16+ years in AI/ML).
12Where can I buy the book?
Directly from No Starch Press (best price, supports the publisher and authors), Amazon (widest availability), Barnes & Noble, or most technical booksellers. Ebook formats are available through No Starch and most major ebook retailers.

Get the Book

Available now.
Pick your preferred retailer.

Paperback, ebook, and Kindle formats. Pick your retailer.

No Starch Press Quality Guarantee
Direct from publisher

No Starch Press

From $59.99

Paperback + ebook bundle, best for authors

Buy Direct
Online & in-store

Barnes & Noble

From $59.99

Paperback and Nook ebook

Buy at B&N
Published by No Starch Press · San Francisco · 2026

Stay in the loop

Get AI Engineering
insights delivered.

New chapters, code drops, and field notes - straight to your inbox.

Zero spam. One-click unsubscribe.