EnterpriseAI Platform

Memory and compute oversubscription

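Oversubscription lets workloads on a shared GPU request more memory and compute in total than the device physically provides, on the expectation that they will not all reach peak demand at the same time. This raises GPU utilization in exchange for a controlled risk of out-of-memory (OOM) failures and contention. It is suitable for development, inference, and lightweight training workloads whose memory peaks do not overlap.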
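The "peaks do not overlap" condition can be checked against recorded usage before co-locating workloads. The following is a minimal sketch, not a platform API: the function names, the sample data, and the assumption that each workload's memory use is sampled at the same timestamps are all hypothetical.

```python
from typing import Dict, List


def combined_peak(usage_gib: Dict[str, List[float]]) -> float:
    """Return the highest total memory use (GiB) across aligned samples.

    Assumes every workload was sampled at the same timestamps, so that
    index i in each list refers to the same moment in time.
    """
    series = list(usage_gib.values())
    if not series or not series[0]:
        return 0.0
    length = len(series[0])
    if any(len(s) != length for s in series):
        raise ValueError("all series must have the same number of samples")
    # Sum across workloads at each timestamp, then take the worst moment.
    return max(sum(sample) for sample in zip(*series))


def sum_of_peaks(usage_gib: Dict[str, List[float]]) -> float:
    """Return the memory needed if every workload peaked simultaneously."""
    return sum(max(s) for s in usage_gib.values() if s)


def fits(usage_gib: Dict[str, List[float]], capacity_gib: float) -> bool:
    """True if the observed combined peak stays within physical capacity."""
    return combined_peak(usage_gib) <= capacity_gib


# Hypothetical samples: two workloads that peak at different times.
samples = {
    "notebook": [4.0, 18.0, 4.0, 4.0],
    "inference": [4.0, 4.0, 4.0, 18.0],
}
print(sum_of_peaks(samples))   # demand if the peaks coincided
print(combined_peak(samples))  # demand actually observed
print(fits(samples, capacity_gib=24.0))
```

In this example the individual peaks add up to more than a 24 GiB device, yet the observed combined peak stays below it, which is the situation oversubscription relies on. Past samples do not guarantee future behavior, so a check like this informs the decision rather than removing the risk.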

Guidance

  • Enable it first in a test namespace.
  • Monitor out-of-memory (OOM) events, peak memory usage, and task failure rates.
  • Keep conservative quotas for critical production workloads.
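
The monitoring guidance above can be turned into a simple gate that decides whether a test-namespace trial is healthy enough to continue. This is a sketch under stated assumptions: the `WindowStats` fields, the `rollback_reasons` helper, and every threshold value are hypothetical and would need to be mapped to whatever metrics and limits your deployment actually uses.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class WindowStats:
    """Metrics collected over one observation window (hypothetical shape)."""
    tasks_total: int
    tasks_failed: int
    oom_events: int
    peak_memory_gib: float


def rollback_reasons(
    stats: WindowStats,
    capacity_gib: float,
    max_failure_rate: float = 0.02,   # assumed threshold, tune per workload
    max_oom_events: int = 0,          # assumed: any OOM is a red flag
    max_peak_fraction: float = 0.95,  # assumed headroom below capacity
) -> List[str]:
    """Return the reasons to disable oversubscription; empty means healthy."""
    reasons = []
    if stats.oom_events > max_oom_events:
        reasons.append("oom events: %d" % stats.oom_events)
    if stats.tasks_total > 0:
        failure_rate = stats.tasks_failed / stats.tasks_total
        if failure_rate > max_failure_rate:
            reasons.append("failure rate: %.1f%%" % (failure_rate * 100))
    if stats.peak_memory_gib > capacity_gib * max_peak_fraction:
        reasons.append("peak memory: %.1f GiB" % stats.peak_memory_gib)
    return reasons


# Hypothetical windows from a test namespace on a 24 GiB device.
healthy = WindowStats(tasks_total=200, tasks_failed=1, oom_events=0,
                      peak_memory_gib=20.5)
unhealthy = WindowStats(tasks_total=200, tasks_failed=10, oom_events=3,
                        peak_memory_gib=23.8)
print(rollback_reasons(healthy, capacity_gib=24.0))
print(rollback_reasons(unhealthy, capacity_gib=24.0))
```

Returning the list of reasons rather than a bare boolean makes it easier to see which signal tripped the gate. For critical production workloads, stricter thresholds or leaving oversubscription off entirely is consistent with the conservative-quota guidance.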