Optimizing LLM Performance: Framework-Agnostic Techniques for Speed, Scalability, and Cost-Efficient Inference Across PyTorch, ONNX, VLLM, and More

Peter E Poisson

Language: English

Cover: Paperback

Published: 2025-07-26

Are you struggling to scale your large language models (LLMs) without breaking the bank or sacrificing latency? This book offers a clear roadmap to optimize inference, reduce costs, and scale seamlessly across platforms like PyTorch, ONNX, vLLM, and more.

Description

Optimizing LLM Performance is your hands-on guide to boosting the efficiency of large language models in production environments. Whether you're building chatbots, document summarizers, or enterprise AI tools, this book teaches proven methods to accelerate inference while maintaining accuracy. It dives deep into hardware-aware optimizations, quantization, model pruning, compiler acceleration, and memory-efficient runtime strategies without locking you into any single framework.

Written with clarity and real-world use in mind, the book features practical case studies, side-by-side performance comparisons, and up-to-date techniques from the cutting edge of AI deployment. If you're building, serving, or scaling LLMs in 2025, this is the performance engineering guide you've been waiting for.

Key Features:
- Framework-agnostic optimization techniques using PyTorch, ONNX Runtime, vLLM, llama.cpp, and more
- Deep dive into quantization (INT8/4-bit), distillation, pruning, and KV caching
- Hands-on examples with FastAPI, Hugging Face Transformers, and serverless deployment
- Covers performance profiling, streaming, batching, and cost-efficient scaling
- Future-proof insights on compiler-aware models, LoRA 2.0, and edge inference

Ready to build LLM systems that are faster, cheaper, and more scalable?
Grab your copy of Optimizing LLM Performance today and deploy smarter.

Additional information

Author	Peter E Poisson
Publisher	Amazon Digital Services LLC - Kdp
Year of publication	2025
Cover type	Paperback
EAN	9798294338459
