Skip to main content
EnterpriseAI Platform

Restart-free auto scaling

Restart-free scaling adjusts GPU allocation according to workload runtime state and reduces interruption for training or inference services.

Use cases

  • Resource adjustment for long-running training jobs.
  • Resource scaling for inference traffic changes.
  • Dynamic resource reclamation in development environments.