Create a dedicated inference endpoint once or on a schedule
Deploy a Dedicated inference model
Select a model from the list. Click on the button labeled “Dedicated”:
Set compute resources
Select a GPU type, ram, and other system specifications:
Set task details
Set a task name, file path, and other inference details:
Review configuration
Review your configurations to ensure they are correct. After confirmation, click the “Launch” button to launch task.
Active task
In the task list, locate the task and then click button