> ## Documentation Index
> Fetch the complete documentation index at: https://docs.gmicloud.ai/llms.txt
> Use this file to discover all available pages before exploring further.

# Dedicated Endpoint

Dedicated Endpoints provide a customizable environment for deploying AI models tailored to specific requirements.

## Create Your Dedicated Inference Endpoint

### Deploy a Dedicated Inference Model

Select a model from the list.

Click the "Dedicated" button to start deployment:

<img src="https://mintcdn.com/gmicloud/eGTC6bV7G8ZZ053N/inference-engine/marketplace/dedicated-click-deploy-1.png?fit=max&auto=format&n=eGTC6bV7G8ZZ053N&q=85&s=b41f17a672154dfdb30d796ec0b0f8c3" alt="Dedicated button" width="2498" height="1889" data-path="inference-engine/marketplace/dedicated-click-deploy-1.png" />

Alternatively, you can also click on the model card and then click the "Deploy" button on the top right:

<img src="https://mintcdn.com/gmicloud/eGTC6bV7G8ZZ053N/inference-engine/marketplace/dedicated-click-deploy-2.png?fit=max&auto=format&n=eGTC6bV7G8ZZ053N&q=85&s=c4cb8f45c0c06f26a900e882f8c0ec6f" alt="Dedicated deploy from model card" width="1903" height="1468" data-path="inference-engine/marketplace/dedicated-click-deploy-2.png" />

### Review Configurations

Confirm your GPU type, deployment name, auto-scaling policy, and other system configurations:

<img src="https://mintcdn.com/gmicloud/eGTC6bV7G8ZZ053N/inference-engine/marketplace/dedicated-review-configurations.png?fit=max&auto=format&n=eGTC6bV7G8ZZ053N&q=85&s=ef35af6b5d1fc41d785cce1a2eb3b546" alt="image.png" width="2304" height="1828" data-path="inference-engine/marketplace/dedicated-review-configurations.png" />

Then click "Deploy".

### View Deployment Status

To view your deployment status click the "Deployment" tab on the top right.

| **Status**    | **Description**                                                                                                            |
| ------------- | -------------------------------------------------------------------------------------------------------------------------- |
| **Queued**    | The deployment task has been added to the queue. It will start once all higher-priority tasks have been processed.         |
| **Deploying** | The system is allocating hardware resources and initializing the model endpoint.                                           |
| **Running**   | Deployment is complete, and the endpoint is active and ready for production use.                                           |
| **Stopped**   | The deployment has been manually stopped by the user. It can be restarted at any time.                                     |
| **Archived**  | The deployment has been terminated permanently. It cannot be restarted, but historical records are retained for reference. |

You will only be billed for the period of time in "Running" status.

<img src="https://mintcdn.com/gmicloud/eGTC6bV7G8ZZ053N/inference-engine/marketplace/review-deployment-status.png?fit=max&auto=format&n=eGTC6bV7G8ZZ053N&q=85&s=90508794635101fc61040a8b8c16b913" alt="Dedicated View Deployment Pn" width="1903" height="1767" data-path="inference-engine/marketplace/review-deployment-status.png" />

### Invoke API Endpoint

Once deployment is in "Running" status, click the "\<>" symol to access endpoint URL:

<img src="https://mintcdn.com/gmicloud/3cVlY2De5-ZFaDjN/assets/dedicated-access-url.png?fit=max&auto=format&n=3cVlY2De5-ZFaDjN&q=85&s=d0f0f51e98106246ac57b1e63f540668" alt="Dedicated Access Url Pn" width="2091" height="1364" data-path="assets/dedicated-access-url.png" />

You can then use this URL to send API requests. An example is provided. Remember to replace "API\_KEY" with your real API key.

<img src="https://mintcdn.com/gmicloud/3cVlY2De5-ZFaDjN/assets/dedicated-send-curl-api-request.png?fit=max&auto=format&n=3cVlY2De5-ZFaDjN&q=85&s=6298077c15686dbcf9953a65f6d3e934" alt="Dedicated Send Curl Api Request Pn" width="1917" height="1250" data-path="assets/dedicated-send-curl-api-request.png" />
