EnterpriseAI Platform
Memory and compute oversubscription
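Oversubscription raises GPU utilization by allocating more memory and compute to workloads than is physically available, accepting a controlled risk that they contend for resources. It is suitable for development, inference, and lightweight training workloads whose memory peaks do not overlap.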
Guidance
- Enable it first in a test namespace.
- Monitor out-of-memory (OOM) events, peak memory usage, and task failure rates.
- Keep quotas conservative for critical production workloads.
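The monitoring guidance above can be turned into a simple gate that decides whether oversubscription should be extended beyond the test namespace. This is an illustrative sketch only: the `WindowStats` fields, the `safe_to_expand` function, and the threshold values (1% failure rate, 90% peak memory) are assumptions chosen for the example, not platform settings.

```python
from dataclasses import dataclass


@dataclass
class WindowStats:
    """Metrics collected over one observation window (hypothetical schema)."""
    oom_events: int              # number of OOM events seen
    tasks_total: int             # tasks that ran in the window
    tasks_failed: int            # tasks that failed for any reason
    peak_memory_fraction: float  # peak used memory / physical memory


def safe_to_expand(stats: WindowStats,
                   max_failure_rate: float = 0.01,
                   max_peak_fraction: float = 0.90) -> bool:
    """Return True only if the window shows no sign of memory pressure.

    Thresholds are example values and should be tuned per environment.
    """
    if stats.tasks_total == 0:
        return False  # no evidence either way, so do not expand
    failure_rate = stats.tasks_failed / stats.tasks_total
    return (
        stats.oom_events == 0
        and failure_rate <= max_failure_rate
        and stats.peak_memory_fraction <= max_peak_fraction
    )


# Example: a quiet window passes, a window with an OOM event does not.
quiet = WindowStats(oom_events=0, tasks_total=200, tasks_failed=1,
                    peak_memory_fraction=0.82)
pressured = WindowStats(oom_events=2, tasks_total=200, tasks_failed=1,
                        peak_memory_fraction=0.97)

print(safe_to_expand(quiet), safe_to_expand(pressured))
```

Treating an empty window as "not safe" keeps the gate conservative: the rollout advances only on positive evidence from the test namespace.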