Skip to main content

Manage Deployments

View, monitor, and manage all your model deployments from the Deployments dashboard. This page provides centralized control over deployment lifecycle operations including stopping, restarting, editing, and deleting deployments.

Deployment List

The deployment list displays all your deployments in a table with filtering, searching, and quick actions.


Deploy List


Search and Filter

You can narrow down the deployment list using:

  • Keyword Search — Type a keyword in the search bar and press Enter or click the search icon to filter by deployment name, model name, or ID.
  • Status Filter — Use the status dropdown to filter by a specific status: Downloading, Deploying, Ready, Failed, Stopping, or Stopped. Select All Status to clear the filter.
  • Refresh — Click the refresh icon to reload the list with the latest data.

Status Reference

Each deployment transitions through the following statuses during its lifecycle:

StatusDescription
DownloadingThe system is downloading model files from the source repository.
DeployingThe system is provisioning GPU resources, starting the container, and running the deployment command.
ReadyThe deployment is online and ready to accept inference requests.
FailedThe deployment encountered an error and could not start. Hover over the status badge to see the error message.
StoppingThe service is shutting down and releasing GPU resources.
StoppedThe service is offline and not consuming any GPU resources.
Failed Deployments

When a deployment is in Failed status, an error icon appears next to the status badge. Hover over it to view the specific error message. Common causes include insufficient GPU resources, model download failures, or configuration errors.

View Deployment Details

Click the eye icon in the Actions column to open the deployment detail page.


deployment-details


Code Examples Section:

Each code example is pre-filled with your deployment's endpoint URL and can be copied to clipboard with one click.

Edit Deployment

Click the edit icon in the Actions column to modify a deployment. Editing is only available when the deployment is in Ready, Failed, or Stopped status.

Edit Scope

Currently, only the Display Name can be modified after deployment. To change GPU type, replicas, or other resource settings, you need to delete and recreate the deployment.

Stop a Deployment

Click the pause icon in the Actions column to stop a running deployment. Stopping is available when the deployment is in Ready or Deploying status.

When you stop a deployment:

  • The status transitions to Stopping, then to Stopped.
  • All GPU resources are released and billing stops.
  • The deployment configuration is preserved and can be restarted later.

Restart a Deployment

Click the restart icon in the Actions column to restart a stopped or failed deployment. The system re-provisions GPU resources and starts the deployment process.

Delete a Deployment

Click the delete icon in the Actions column to permanently remove a deployment. Deleting is only available when the deployment is in Failed or Stopped status.

Delete Restrictions

You cannot delete a deployment that is currently in Ready, Deploying, Downloading, or Stopping status. Stop the deployment first before deleting.

Next Steps

Create Deployment

Learn how to deploy models from the Model Gallery or your own custom models.

Deploy from Hugging Face

Deploy models directly from Hugging Face with intelligent GPU recommendations.