
Building Generative AI Services with FastAPI:A Practical Approach to Developing Context-Rich Generative AI Applications
by: Alireza Parandeh (Author)
Publisher: O'Reilly Media
Edition: 1st
Publication Date: 2025-05-20
Language: English
Print Length: 528 pages
ISBN-10: 1098160304
ISBN-13: 9781098160302
Book Description
Ready to build production-grade applications with generative AI? This practical guide takes you through designing and deploying AI services using the FastAPI web framework. Learn how to integrate models that process text, images, audio, and video while seamlessly interacting with databases, filesystems, websites, and APIs. Whether you're a web developer, data scientist, or DevOps engineer, this book equips you with the tools to build scalable, real-time AI applications.Author Alireza Parandeh provides clear explanations and hands-on examples covering authentication, concurrency, caching, and retrieval-augmented generation (RAG) with vector databases. You'll also explore best practices for testing AI outputs, optimizing performance, and securing microservices. With containerized deployment using Docker, you'll be ready to launch AI-powered applications confidently in the cloud.Build generative AI services that interact with databases, filesystems, websites, and APIsManage concurrency in AI workloads and handle long-running tasksStream AI-generated outputs in real time via WebSocket and server-sent eventsSecure services with authentication, content filtering, throttling, and rate limitingOptimize AI performance with caching, batch processing, and fine-tuning techniquesVisit the Book's Website.
Editorial Reviews
Ready to build production-grade applications with generative AI? This practical guide takes you through designing and deploying AI services using the FastAPI web framework. Learn how to integrate models that process text, images, audio, and video while seamlessly interacting with databases, filesystems, websites, and APIs. Whether you're a web developer, data scientist, or DevOps engineer, this book equips you with the tools to build scalable, real-time AI applications.Author Alireza Parandeh provides clear explanations and hands-on examples covering authentication, concurrency, caching, and retrieval-augmented generation (RAG) with vector databases. You'll also explore best practices for testing AI outputs, optimizing performance, and securing microservices. With containerized deployment using Docker, you'll be ready to launch AI-powered applications confidently in the cloud.Build generative AI services that interact with databases, filesystems, websites, and APIsManage concurrency in AI workloads and handle long-running tasksStream AI-generated outputs in real time via WebSocket and server-sent eventsSecure services with authentication, content filtering, throttling, and rate limitingOptimize AI performance with caching, batch processing, and fine-tuning techniquesVisit the Book's Website.
Wow! eBook

