Why is my self-hosted Engine instance running on the CPU?
Reasons why the Engine does not recognize a GPU
Compose File does not specify NVIDIA runtime
    # The speech engine service.
    engine:
      image: quay.io/deepgram/onprem-engine:release-<version>
      # Utilize a GPU, if available.
      runtime: nvidia

    # The speech engine service.
    engine:
      image: quay.io/deepgram/onprem-engine:release-<version>
      # Utilize a GPU, if available.
      devices:
        - nvidia.com/gpu=all

CUDA version incompatibilities
No NVIDIA Container Runtime available
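One way to check this condition on a Docker host is to ask the daemon which runtimes it has registered. The sketch below is a minimal illustration, not an official diagnostic. It assumes Docker (not Podman) is in use, that the docker CLI is on PATH, and that the runtime is registered under the name "nvidia", matching the runtime: nvidia setting in the Compose file. The function names are hypothetical.

```python
import json
import shutil
import subprocess


def has_nvidia_runtime(runtimes_json):
    """Return True if the JSON object of Docker runtimes has an 'nvidia' key."""
    runtimes = json.loads(runtimes_json)
    return isinstance(runtimes, dict) and "nvidia" in runtimes


def docker_has_nvidia_runtime():
    """Return True or False from `docker info`, or None if Docker cannot be queried."""
    if shutil.which("docker") is None:
        return None  # The docker CLI is not installed or not on PATH.
    result = subprocess.run(
        ["docker", "info", "--format", "{{json .Runtimes}}"],
        capture_output=True,
        text=True,
        check=False,
    )
    if result.returncode != 0:
        return None  # The daemon is unreachable or the command failed.
    try:
        return has_nvidia_runtime(result.stdout)
    except ValueError:
        return None  # The output was not valid JSON.


if __name__ == "__main__":
    status = docker_has_nvidia_runtime()
    if status is None:
        print("Could not query Docker for its registered runtimes.")
    elif status:
        print("Docker reports an 'nvidia' runtime.")
    else:
        print("Docker does not report an 'nvidia' runtime.")
```

A False result suggests the NVIDIA container runtime is not installed or not configured for the Docker daemon. Podman setups that use the nvidia.com/gpu=all device entry are not covered by this check.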
Missing drivers
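A quick host-side check for missing drivers is to see whether nvidia-smi exists and can reach the driver. The following is a minimal sketch under the assumption that the NVIDIA driver package on the host ships the nvidia-smi utility. The function names are hypothetical and not part of any Deepgram tooling.

```python
import shutil
import subprocess


def parse_driver_versions(csv_text):
    """Return one driver version string per non-empty line of nvidia-smi CSV output."""
    return [line.strip() for line in csv_text.splitlines() if line.strip()]


def query_driver_versions():
    """Return per-GPU driver versions, or None if nvidia-smi is missing or fails."""
    if shutil.which("nvidia-smi") is None:
        return None  # Driver utilities are not installed or not on PATH.
    result = subprocess.run(
        ["nvidia-smi", "--query-gpu=driver_version", "--format=csv,noheader"],
        capture_output=True,
        text=True,
        check=False,
    )
    if result.returncode != 0:
        return None  # nvidia-smi exists but could not query the driver.
    return parse_driver_versions(result.stdout)


if __name__ == "__main__":
    versions = query_driver_versions()
    if versions is None:
        print("No working NVIDIA driver detected on this host.")
    else:
        for index, version in enumerate(versions):
            print(f"GPU {index}: driver {version}")
```

If the script reports no working driver on the host, the Engine container cannot see a GPU either, regardless of how the Compose file is configured.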
Detecting Common NVIDIA Issues with Docker/Podman
Google Kubernetes Engine Bug from Oct 2023 - Jan 2024
How can I verify my self-hosted deployment is using the GPU?
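One possible way to verify GPU use is to list the processes that currently hold GPU memory and look for the Engine among them. The sketch below assumes nvidia-smi is available on the host and is run there while the Engine container is up. Run from inside a container, nvidia-smi typically does not list processes. The function names are hypothetical, and the Engine's exact process name is not specified here, so the output has to be inspected by hand.

```python
import shutil
import subprocess


def parse_compute_apps(csv_text):
    """Parse 'pid, process_name, used_memory' CSV rows into dictionaries."""
    apps = []
    for line in csv_text.splitlines():
        fields = [field.strip() for field in line.split(",")]
        if len(fields) != 3:
            continue  # Skip blank or unexpected lines.
        pid, name, memory = fields
        apps.append({"pid": pid, "process_name": name, "used_memory": memory})
    return apps


def list_gpu_processes():
    """Return the processes using the GPU, or None if nvidia-smi is unavailable."""
    if shutil.which("nvidia-smi") is None:
        return None
    result = subprocess.run(
        [
            "nvidia-smi",
            "--query-compute-apps=pid,process_name,used_memory",
            "--format=csv,noheader",
        ],
        capture_output=True,
        text=True,
        check=False,
    )
    if result.returncode != 0:
        return None
    return parse_compute_apps(result.stdout)


if __name__ == "__main__":
    processes = list_gpu_processes()
    if processes is None:
        print("nvidia-smi is not available, so GPU use cannot be checked this way.")
    elif not processes:
        print("No process is using the GPU. The Engine may be running on the CPU.")
    else:
        for proc in processes:
            print(f"{proc['pid']}  {proc['process_name']}  {proc['used_memory']}")
```

An empty list while the Engine is serving requests is a strong hint that it has fallen back to the CPU.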