# Serverless Endpoint

We offer a range of serverless endpoints for popular open-source models.

## Access a Serverless inference model

Select a model from the list.

Click on the button labeled *"Serverless"*:

<img src="https://mintcdn.com/gmicloud/dRe7Q1smc5i4wdJ_/assets/gmi-select-serverless.png?fit=max&auto=format&n=dRe7Q1smc5i4wdJ_&q=85&s=ab708325e15b7a59b6887341b66b1bd0" alt="Serverless button" width="2101" height="1132" data-path="assets/gmi-select-serverless.png" srcset="https://mintcdn.com/gmicloud/dRe7Q1smc5i4wdJ_/assets/gmi-select-serverless.png?w=280&fit=max&auto=format&n=dRe7Q1smc5i4wdJ_&q=85&s=cc2d7f7b8064377e9a9326623870c696 280w, https://mintcdn.com/gmicloud/dRe7Q1smc5i4wdJ_/assets/gmi-select-serverless.png?w=560&fit=max&auto=format&n=dRe7Q1smc5i4wdJ_&q=85&s=a7639bc7bb730d20d0d2d71b3fce70fb 560w, https://mintcdn.com/gmicloud/dRe7Q1smc5i4wdJ_/assets/gmi-select-serverless.png?w=840&fit=max&auto=format&n=dRe7Q1smc5i4wdJ_&q=85&s=0234fd57bbf76b83bb0687cce7682d50 840w, https://mintcdn.com/gmicloud/dRe7Q1smc5i4wdJ_/assets/gmi-select-serverless.png?w=1100&fit=max&auto=format&n=dRe7Q1smc5i4wdJ_&q=85&s=1ee2a0211d97185bf02a3ddf1f148cd7 1100w, https://mintcdn.com/gmicloud/dRe7Q1smc5i4wdJ_/assets/gmi-select-serverless.png?w=1650&fit=max&auto=format&n=dRe7Q1smc5i4wdJ_&q=85&s=8afce732611c5486df1c4efb2d9e5bd8 1650w, https://mintcdn.com/gmicloud/dRe7Q1smc5i4wdJ_/assets/gmi-select-serverless.png?w=2500&fit=max&auto=format&n=dRe7Q1smc5i4wdJ_&q=85&s=01d9e614634c0690561a4fd87b020f74 2500w" data-optimize="true" data-opv="2" />

### Model Details

To access the connection details for your model, click on *"Model Details"*:

<img src="https://mintcdn.com/gmicloud/dRe7Q1smc5i4wdJ_/assets/gmi-model-details.png?fit=max&auto=format&n=dRe7Q1smc5i4wdJ_&q=85&s=c393f6fa91359ea18dd88b7c3309785a" alt="Model details" width="2101" height="1132" data-path="assets/gmi-model-details.png" srcset="https://mintcdn.com/gmicloud/dRe7Q1smc5i4wdJ_/assets/gmi-model-details.png?w=280&fit=max&auto=format&n=dRe7Q1smc5i4wdJ_&q=85&s=3fcfb27e24dc6d62138b097d5930a46a 280w, https://mintcdn.com/gmicloud/dRe7Q1smc5i4wdJ_/assets/gmi-model-details.png?w=560&fit=max&auto=format&n=dRe7Q1smc5i4wdJ_&q=85&s=a7f4f070a614f3ef34bbc391b2358085 560w, https://mintcdn.com/gmicloud/dRe7Q1smc5i4wdJ_/assets/gmi-model-details.png?w=840&fit=max&auto=format&n=dRe7Q1smc5i4wdJ_&q=85&s=d12d35547454c4a9cc9cc2c9e4506092 840w, https://mintcdn.com/gmicloud/dRe7Q1smc5i4wdJ_/assets/gmi-model-details.png?w=1100&fit=max&auto=format&n=dRe7Q1smc5i4wdJ_&q=85&s=46f9e23e10974d3f075bbd501236d225 1100w, https://mintcdn.com/gmicloud/dRe7Q1smc5i4wdJ_/assets/gmi-model-details.png?w=1650&fit=max&auto=format&n=dRe7Q1smc5i4wdJ_&q=85&s=09f8e774d3562e543c845d3b8be12976 1650w, https://mintcdn.com/gmicloud/dRe7Q1smc5i4wdJ_/assets/gmi-model-details.png?w=2500&fit=max&auto=format&n=dRe7Q1smc5i4wdJ_&q=85&s=d449c1ed49099087e4e53a6b4e8edfc2 2500w" data-optimize="true" data-opv="2" />

### Playground

To access the Playground for your model, click on *"Playground"*:

<img src="https://mintcdn.com/gmicloud/dRe7Q1smc5i4wdJ_/assets/gmi-playground.png?fit=max&auto=format&n=dRe7Q1smc5i4wdJ_&q=85&s=edf948af5dbd8afbd21f517e89ed76c8" alt="Playground" width="2101" height="1132" data-path="assets/gmi-playground.png" srcset="https://mintcdn.com/gmicloud/dRe7Q1smc5i4wdJ_/assets/gmi-playground.png?w=280&fit=max&auto=format&n=dRe7Q1smc5i4wdJ_&q=85&s=41d08cd2a884ae1b13334cce8dae4071 280w, https://mintcdn.com/gmicloud/dRe7Q1smc5i4wdJ_/assets/gmi-playground.png?w=560&fit=max&auto=format&n=dRe7Q1smc5i4wdJ_&q=85&s=17e3c74c50b91fe5eeb3838208070a1c 560w, https://mintcdn.com/gmicloud/dRe7Q1smc5i4wdJ_/assets/gmi-playground.png?w=840&fit=max&auto=format&n=dRe7Q1smc5i4wdJ_&q=85&s=b44453d4dc97d3b9af4b44ff08e68501 840w, https://mintcdn.com/gmicloud/dRe7Q1smc5i4wdJ_/assets/gmi-playground.png?w=1100&fit=max&auto=format&n=dRe7Q1smc5i4wdJ_&q=85&s=fa879bc4bbedd02cdaad93abd4fb26ad 1100w, https://mintcdn.com/gmicloud/dRe7Q1smc5i4wdJ_/assets/gmi-playground.png?w=1650&fit=max&auto=format&n=dRe7Q1smc5i4wdJ_&q=85&s=61fc22a7353f25347e3e618c62eb8330 1650w, https://mintcdn.com/gmicloud/dRe7Q1smc5i4wdJ_/assets/gmi-playground.png?w=2500&fit=max&auto=format&n=dRe7Q1smc5i4wdJ_&q=85&s=86509197066e8a237caff72e50165965 2500w" data-optimize="true" data-opv="2" />

The options outlined serve to customize and control the text generation process of the API:

* Temperature:  Temperature allows you to configure how much randomness you want in the generated text. A higher temperature leads to more “creative” results. On the other hand, setting a temperature of 0 will allow you to generate deterministic results which is useful for testing and debugging.
* Max Tokens:  Max Tokens defines the maximum number of tokens the model can generate, with a default of 4096. If the combined token count (prompt + output) exceeds the model’s limit, it automatically reduces the number of generated tokens to fit within the allowed context.
* Top K:  Top-K is another sampling method where the k most probable tokens are filtered and the probability mass is redistributed among tokens.
* Top P:  Top-P (also called nucleus sampling) is an alternative to sampling with temperature, where the model considers the results of the tokens with top\_p probability mass. So 0.1 means only the tokens comprising the top 10% probability mass are considered.
* Frequency Penalty:  Frequency penalty reduces repetition of the same words/phrases.A higher frequency penalty reduces the likelihood of the model generating tokens that have already appeared in the output. This helps create more varied and engaging text by preventing redundancy.
* Presence Penalty:  Presence penalty encourages the introduction of new ideas/concepts.A higher presence penalty encourages the model to introduce new ideas or concepts rather than reiterating previously mentioned ones. This can enhance the richness of the generated content by promoting the introduction of fresh topics.
* Stream:  Steam enables output to be processed and displayed incrementally, meaning that outputs are sent back to the user in real time.
* System Prompt:  System prompt serves as a high-level instruction or context-setting mechanism that guides the model's behavior, tone, and responses throughout the interaction.

### Add API Key

To add an API key, click *"Add API Key"* and enter your key in the field:

<img src="https://mintcdn.com/gmicloud/dRe7Q1smc5i4wdJ_/assets/gmi-api-key.png?fit=max&auto=format&n=dRe7Q1smc5i4wdJ_&q=85&s=d74ef941580125bcc47f52a26ee1d786" alt="API Key" width="2101" height="1132" data-path="assets/gmi-api-key.png" srcset="https://mintcdn.com/gmicloud/dRe7Q1smc5i4wdJ_/assets/gmi-api-key.png?w=280&fit=max&auto=format&n=dRe7Q1smc5i4wdJ_&q=85&s=677343a96ded87068ac521a524f7e8e5 280w, https://mintcdn.com/gmicloud/dRe7Q1smc5i4wdJ_/assets/gmi-api-key.png?w=560&fit=max&auto=format&n=dRe7Q1smc5i4wdJ_&q=85&s=586930eda4867a56def0f6d400385f5e 560w, https://mintcdn.com/gmicloud/dRe7Q1smc5i4wdJ_/assets/gmi-api-key.png?w=840&fit=max&auto=format&n=dRe7Q1smc5i4wdJ_&q=85&s=c67617fe790ae35747502ba5d8940423 840w, https://mintcdn.com/gmicloud/dRe7Q1smc5i4wdJ_/assets/gmi-api-key.png?w=1100&fit=max&auto=format&n=dRe7Q1smc5i4wdJ_&q=85&s=fcc3c248705d4f9cac3b5569f3c41957 1100w, https://mintcdn.com/gmicloud/dRe7Q1smc5i4wdJ_/assets/gmi-api-key.png?w=1650&fit=max&auto=format&n=dRe7Q1smc5i4wdJ_&q=85&s=1dbe52c4d333b6bb935f2cb45f3daee3 1650w, https://mintcdn.com/gmicloud/dRe7Q1smc5i4wdJ_/assets/gmi-api-key.png?w=2500&fit=max&auto=format&n=dRe7Q1smc5i4wdJ_&q=85&s=e2a0c96c6c6244d6c5a052580a8d1fbb 2500w" data-optimize="true" data-opv="2" />