Last changed
Contact our team to test out this image for free. Please also indicate any other images you would like to evaluate.
vLLM is a high-throughput and memory-efficient inference engine for Large Language Models (LLMs). This FIPS-validated variant provides OpenSSL FIPS 140-3 compliance for secure, production LLM deployments.
Chainguard Containers are regularly-updated, secure-by-default container images.
For those with access, this container image is available on cgr.dev:
Be sure to replace the ORGANIZATION placeholder with the name used for your organization's private repository within the Chainguard Registry.
This image ships with a validated redistribution of the OpenSSL FIPS provider module. For more information on FIPS support in Chainguard Images, consult the guide on FIPS-enabled Chainguard Images on Chainguard Academy.
Chainguard's vLLM FIPS image is comparable to the vllm/vllm-openai image, with several key differences:
The following packages have been modified or removed compared to upstream:
EnvTensorAllocator not being enabled during startup. This is an optional TVM optimization and does not affect vLLM functionalitypip install lmcache at runtime. See vLLM LMCache Examples for configurationFor Expert Parallel deployment with Mixture-of-Experts (MoE) models like DeepSeek-V2/V3, the EP kernels are not pre-installed in this image. Unlike upstream which ships pre-built EP kernels, Chainguard provides a build script due to upstream's specific version pinning requirements and custom patches for components like NVSHMEM, pplx-kernels, and DeepEP.
The image includes /vllm-workspace/install_python_libraries.sh to build the required components. Before running, ensure you have:
TORCH_CUDA_ARCH_LIST for your GPU (e.g., "8.0;9.0" for A100/H100)The script builds:
Build time is approximately 10-20 minutes depending on hardware. The built kernels persist in the mounted volume for reuse.
After building once, mount the volume when running vLLM:
See the Expert Parallel Deployment Guide for detailed configuration options.
Set the following environment variable to the name of your organization:
Start the vLLM OpenAI-compatible server with a model:
The server exposes OpenAI-compatible endpoints at http://localhost:8000.
Once the server is running, test it with curl:
For optimal performance, configure shared memory size with --shm-size:
If you're running an older CUDA driver version not supported by the container, you can use CUDA forward compatibility:
Refer to NVIDIA's CUDA Compatibility documentation for details on installing compatibility packages.
Chainguard's free tier of Starter container images are built with Wolfi, our minimal Linux undistro.
All other Chainguard Containers are built with Chainguard OS, Chainguard's minimal Linux operating system designed to produce container images that meet the requirements of a more secure software supply chain.
The main features of Chainguard Containers include:
For cases where you need container images with shells and package managers to build or debug, most Chainguard Containers come paired with a development, or -dev, variant.
In all other cases, including Chainguard Containers tagged as :latest or with a specific version number, the container images include only an open-source application and its runtime dependencies. These minimal container images typically do not contain a shell or package manager.
Although the -dev container image variants have similar security features as their more minimal versions, they include additional software that is typically not necessary in production environments. We recommend using multi-stage builds to copy artifacts from the -dev variant into a more minimal production image.
To improve security, Chainguard Containers include only essential dependencies. Need more packages? Chainguard customers can use Custom Assembly to add packages, either through the Console, chainctl, or API.
To use Custom Assembly in the Chainguard Console: navigate to the image you'd like to customize in your Organization's list of images, and click on the Customize image button at the top of the page.
Refer to our Chainguard Containers documentation on Chainguard Academy. Chainguard also offers VMs and Libraries — contact us for access.
This software listing is packaged by Chainguard. The trademarks set forth in this offering are owned by their respective companies, and use of them does not imply any affiliation, sponsorship, or endorsement by such companies.
Chainguard's container images contain software packages that are direct or transitive dependencies. The following licenses were found in the "latest" tag of this image:
Apache-2.0
Artistic-1.0-Perl
BSD-1-Clause
BSD-2-Clause
BSD-3-Clause
BSD-3-Clause-Open-MPI
BSD-4-Clause-UC
For a complete list of licenses, please refer to this Image's SBOM.
Software license agreementThis is a FIPS validated image for FedRAMP compliance.
This image is STIG hardened and scanned against the DISA General Purpose Operating System SRG with reports available.
Learn more about STIGsGet started with STIGs