The AI compute stack, in crawlable form
This explainer maps the layers behind modern AI systems: power, fabs, chips, server systems, CUDA and kernels, frameworks, model weights, inference serving, and applications.
- Hardware: power, semiconductor process, packaging, HBM, GPUs, accelerators, and networking.
- Software: kernels, CUDA, PyTorch, JAX, TensorRT-LLM, vLLM, SGLang, and serving runtimes.
- Bottlenecks: memory bandwidth, KV cache, packaging capacity, cluster networking, and power interconnection.