How Large Language Models Work

Edward Raff, Drew Farris, Stella Biderman

1st edition

Paperback (22 Oct 2025)

$53.31

Pre-order

Includes delivery to the United States

Publisher's Synopsis

Learn how large language models like GPT and Gemini work under the hood in plain English.

How Large Language Models Work translates years of expert research on large language models into a readable, focused introduction to working with these amazing systems. It explains clearly how LLMs function, introduces the optimization techniques used to fine-tune them, and shows how to create pipelines and processes to ensure your AI applications are efficient and error-free.

In How Large Language Models Work you will learn how to: test and evaluate LLMs; use human feedback, supervised fine-tuning, and retrieval-augmented generation (RAG); reduce the risk of bad outputs, high-stakes errors, and automation bias; design human-computer interaction systems; and combine LLMs with traditional ML.

How Large Language Models Work is written by some of the best machine learning researchers at Booz Allen Hamilton, including researcher Stella Biderman, Director of AI/ML Research Drew Farris, and Director of Emerging AI Edward Raff. In clear and simple terms, these experts lay out the foundational concepts of LLMs, the technology's opportunities and limitations, and best practices for incorporating AI into your organization.