EnterpriseAI Platform
Restart-free auto scaling
Restart-free scaling adjusts GPU allocation according to workload runtime state and reduces interruption for training or inference services.
Use cases
- Resource adjustment for long-running training jobs.
- Resource scaling for inference traffic changes.
- Dynamic resource reclamation in development environments.