
Building Machine Learning Systems with a Feature Store: Batch, Real-Time, and LLM Systems
Author(s): Jim Dowling (Author)
- Publisher: O'Reilly Media
- Publication Date: December 16, 2025
- Edition: 1st
- Language: English
- Print length: 506 pages
- ISBN-10: 1098165233
- ISBN-13: 9781098165239
Book Description
Get up to speed on a new unified approach to building machine learning (ML) systems with a feature store. Using this practical book, data scientists and ML engineers will learn in detail how to develop and operate batch, real-time, and agentic ML systems.
Author Jim Dowling introduces fundamental principles and practices for developing, testing, and operating ML and AI systems at scale. You’ll see how any AI system can be decomposed into independent feature, training, and inference pipelines connected by a shared data layer. Through example ML systems, you’ll tackle the hardest part of ML systems–the data, learning how to transform data into features and embeddings, and how to design a data model for AI.
-
Develop batch ML systems at any scale
-
Develop real-time ML systems by shifting left or shifting right feature computation
-
Develop agentic ML systems that use LLMs, tools, and retrieval-augmented generation
-
Understand and apply MLOps principles when developing and operating ML systems
Editorial Reviews
Review
“This book shows how modern feature engineering is really done. It bridges the gap between research and production. A must-read for anyone serious about building efficient, real-world ML systems”
– Ritchie Vink, Creator of Polars, CEO & Founder Polars Inc
“Jim does a great job explaining the crucial systems aspects to ML and gives a lot of practical tips on how to navigate production ML deployments”.
– Hannes Mühleisen, Co-Creator of DuckDB, CEO of DuckDB Labs.
“Building machine learning systems in production has historically involved a lot of black magic and undocumented learnings. Jim Dowling is doing a great service to ML practitioners by sharing the best practices and putting together a clear step-by-step guide.”
– Erik Bernhardsson, Inventor of Luigi and Modal. Founder and CEO at Modal.
“The truly hard part of ML is building the scalable, reliable data systems that power them. Jim is one of the few people who can explain system level challenges with exceptional clarity. This book is the definitive, practical guide for bridging the gap from research to real world production grade systems.”
– Willem Pienaar, Inventor of Feast Feature Store
“Jim’s the closest thing we have to a world-class expert. Read this book if you want a detailed, practical, re-usable manual on how to get a good-quality running system – as an SRE, I especially appreciate his attention to observability and debugging. The detailed case studies are crunchy icing on a filling cake.”
– Niall Murphy, O’Reilly Author, SRE legend
“A must-read for AI/ML practitioners looking to match use cases to the right ML platforms and tools. “
– Lalith Suresh, Co-Creator of Feldera.
“Nobody has captured before the essentials of building AI apps using modern data streaming systems like Flink. Jim’s book shows the way!”.
– Paris Carbone, Apache Flink SIGMOD Winner
“I witnessed the rise of feature stores at Uber, where ML-powered products operated on batch and real-time data. Jim Dowling helped define the category, and this book gives every engineer a practical playbook for shipping production-grade ML systems that matter.”
– Vinoth Chandar, Creator of Apache Hudi
Wow! eBook


