Wire GMI Cloud’s models into Dify’s DeepResearch template to spin up a multi-step research agent in about five minutes. You’ll swap Dify’s default LLM and reasoning nodes for GMI-served models (GLM-4.6 + Qwen3 235B Thinking), then run a real query end-to-end.Documentation Index
Fetch the complete documentation index at: https://docs.gmicloud.ai/llms.txt
Use this file to discover all available pages before exploring further.

Prerequisites
- A GMI Cloud account at console.gmicloud.ai
- A Dify account at dify.ai
- About 5 minutes
Step 1. Get your GMI Cloud API key
- Sign in to the API Key Management page.
- Click Create API Key, name it, and set Scope to Inference.
- Copy the key now, it won’t be shown again.

Step 2. Install the GMI plugin in Dify
Open the Dify plugin marketplace, search GMI Cloud, and install.
Step 3. Configure the plugin
- In Dify, open Settings → Model Provider.
- Find GMI Cloud and click Setup.
- Paste your API key. Custom endpoint is optional; default is
https://api.gmi-serving.com/v1. - Save. Dify hits
/v1/modelsto validate.


Step 4. Build the workflow
From Dify’s home, click Create from Template and pick DeepResearch.



- LLM node → swap
gpt-4ofor GLM-4.6 (model card).

- Reasoning Model node → swap for Qwen3 235B A22B Thinking 2507 FP8 (model card).

Step 5. Run it
Open the workflow app. Set Depth to control how many search rounds the agent runs -2 is a good default.
Sample prompt:


Next steps
- Try a different agent framework: Hermes Agent or OpenClaw.
- Swap models from the Text catalog.
- Stuck? Email support@gmicloud.ai.