LLM Inference Engineering: Quantization, KV-Cache Optimization, and High-Throughput Serving: A Production Engineer's Guide to INT4/INT8 ... Speculative Decoding, Cost Optimization

Priser från
100,42

Utvalda

JäMFöR ALLA WEBBUTIKER (2)

Beskrivning

Amazon LLM Inference Engineering: Quantization, KV-Cache Optimization, and High-Throughput Serving: A Production Engineer's Guide to INT4/INT8 Quantization, ... Speculative Decoding, and Cost Optimization

Jämför webbutiker (2)

Shop
Pris
100,42 kr
100,42 kr
Beskrivning (1)

LLM Inference Engineering: Quantization, KV-Cache Optimization, and High-Throughput Serving: A Production Engineer's Guide to INT4/INT8 Quantization, ... Speculative Decoding, and Cost Optimization


Produktspecifikationer

Märke Independently Published
EAN
  • 9798180985187

Priser uppdaterades senast:

Utvalt Val
100,42 kr
Til butik