dstack is a streamlined alternative to Kubernetes and Slurm, designed specifically for AI. It simplifies container orchestration for AI workloads in the cloud and on-prem, speeding up the development, training, and deployment of AI models. dstack works with any cloud provider as well as on-prem servers, and it supports NVIDIA GPUs, AMD GPUs, and Google Cloud TPUs out of the box. With its latest release, you can now use NVIDIA NIM with dstack to deploy LLMs to any cloud or on-prem, with no Kubernetes required.