Artificial Intelligence: AI Engineer's Cheatsheet: Silicon Edition (Ultra-large scale LLM training and inference)

★★★★★ 4.5 72 reviews

$39.00
Price when purchased online
Free shipping Free 30-day returns

Sold and shipped by jobs.innov.ma
We aim to show you accurate product information. Manufacturers, suppliers and others provide what you see here.
$39.00
Price when purchased online
Free shipping Free 30-day returns

How do you want your item?
You get 30 days free! Choose a plan at checkout.
Shipping
Arrives May 13
Free
Pickup
Check nearby
Delivery
Not available

Sold and shipped by jobs.innov.ma
Free 30-day returns Details

Product details

Management number 220491367 Release Date 2026/05/03 List Price $15.60 Model Number 220491367
Category

Ultimate guide to understand and design Large-scale LLM training and inference system for AI gigafactory era."Artificial Intelligence: AI Engineer’s Cheatsheet - Silicon Edition" is a comprehensive technical companion for mastering AI systems engineering and large-scale LLM (Large Language Model) optimizations for the era of AI gigafactories.This book bridges the gap between high-level machine learning theory and low-level silicon-aware implementation - empowering engineers to design, train and deploy state-of-the-art (SOTA) models efficiently on modern hardware architectures.Unlike traditional ML/AI textbooks, this edition distills the knowledge required to build and optimize the same scale of systems powering organizations like OpenAI, Anthropic, DeepSeek and Google DeepMind. If you can understand the concepts in this book, you will not only understand how today’s AI systems work - you will be ready to work at leading AI labs.What You will Learn?This book systematically covers the essential layers of modern AI engineering:AI System Design (MLSys & Serving):End-to-end design concepts of AI serving systems such as vLLM, SGLang, and TensorRT-LLM. Explore scheduling, batching and optimizations like Continuous Batching - the same strategies enabling OpenAI to serve hundreds of millions of user queries weekly.Core LLM Architecture and Operations:A deep dive into Transformer-based architectures, including attention and their optimized variants such as FlashAttention, FlexAttention and memory-efficient decoding pipelines.Quantization Engineering:Understand BF16, FP8 (E4M3/E5M2) and quantization-aware training techniques for compute and memory optimization across GPUs, TPUs and custom accelerators.AI Hardware Architecture:A silicon-level exploration of GPUs and CPU backends (x86, ARM). Learn how hardware characteristics such as memory hierarchy and interconnect bandwidth impact model performance.Software-Hardware co-design:And much more.By the end of this book, you will be able to:Engineer and optimize end-to-end AI systems for large-scale training and inference.Evaluate trade-offs between model accuracy, latency, throughput and cost.Design quantization and parallelization strategies suitable for real deployments.Perform back-of-the-envelope calculations for compute, bandwidth, and memory requirements.Engage meaningfully in technical discussions on AI architecture, geopolitics and the emerging compute economy.Who This Book Is For?Students and developers preparing for machine learning, deep learning, and GenAI interviews.Engineers and researchers seeking to solidify their understanding of large-scale AI system design.Professionals transitioning into AI infrastructure, compiler or hardware optimization roles.Independent learners aiming to conduct research or replicate open-source SOTA systems.AI will not replace you - but engineers who understand AI systems from algorithm to silicon will.Start with this book, and redefine your position in the AI era.Book: Artificial Intelligence: AI Engineer's Cheatsheet: Silicon editionAuthor: Seymour PapermasterUpdated: 1 November 2025 (v1.15)Pages: 206Table of contents:Global AI race...Decode-Maximal BatchingContinuous BatchingP/D disaggregationCollective Communication primitivesMemory components in LLMTensor Parallelism [TP]...INT8 QuantizationGPU Workload Parallelizationand much more Read more

ISBN13 979-8267517096
Language English
Publisher Independently published
Dimensions 6 x 0.48 x 9 inches
Book 1 of 1 Ultra-large scale LLM training and inference
Item Weight 13.3 ounces
Print length 209 pages
Publication date September 28, 2025

Correction of product information

If you notice any omissions or errors in the product information on this page, please use the correction request form below.

Correction Request Form

Customer ratings & reviews

4.5 out of 5
★★★★★
72 ratings | 30 reviews
How item rating is calculated
View all reviews
5 stars
83% (60)
4 stars
4% (3)
3 stars
2% (1)
2 stars
1% (1)
1 star
10% (7)
Sort by

There are currently no written reviews for this product.