iApp vLLM Gateway

4 models, 0 active, 3374 total, 17h 35m

Qwen3.6-35B-A3B-FP8 (Qwen/Qwen3.6-35B-A3B-FP8)
Healthy, load 0 / 24 active
qwen3.6-35b-multi, GPU 6,7, 750 req

Qwen3.5-35B-A3B-FP8 (Qwen/Qwen3.5-35B-A3B-FP8)
Healthy, load 0 / 32 active
qwen3.5-35b-multi, GPU 3, 320 req

Qwen3-Reranker-8B (Qwen/Qwen3-Reranker-8B)
Healthy, load 0 / 64 active
qwen3-reranker-8b, GPU 5, 1149 req

Qwen3-Embedding-8B (Qwen/Qwen3-Embedding-8B)
Healthy, load 0 / 64 active
qwen3-embedding-8b, GPU 4, 1155 req
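
The headline figures can be cross-checked against the per-model cards. The sketch below is only an illustration: the numbers are transcribed by hand from the cards, and reading "0 / 24" as "active requests / slot limit" and "req" as "requests served" is an assumption, not something the page states. The names `CARDS` and `summarize` are hypothetical.

```python
# Figures transcribed from the model cards above.
# Each tuple: (identifier from the card's last line,
#              active requests, assumed slot limit, requests served)
CARDS = [
    ("qwen3.6-35b-multi", 0, 24, 750),
    ("qwen3.5-35b-multi", 0, 32, 320),
    ("qwen3-reranker-8b", 0, 64, 1149),
    ("qwen3-embedding-8b", 0, 64, 1155),
]


def summarize(cards):
    """Aggregate the per-model figures into dashboard-style totals."""
    return {
        "models": len(cards),
        "active": sum(card[1] for card in cards),
        "capacity": sum(card[2] for card in cards),
        "total": sum(card[3] for card in cards),
    }


summary = summarize(CARDS)
print(summary)
```

Under that reading, the per-model request counts add up to the 3374 total shown in the header, and the four load limits sum to 184 concurrent slots.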
Models endpoint /v1/models
Reload config POST /gateway/reload
Status API /gateway/status
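
A minimal sketch of calling the three endpoints listed above from Python, using only the standard library. Several things here are assumptions rather than facts from the page: the base URL `http://localhost:8000`, the use of GET for `/v1/models` and `/gateway/status` (only the reload endpoint is shown with an explicit method), the empty POST body for reload, and JSON response bodies. The names `ENDPOINTS`, `build_request`, and `call` are hypothetical helpers.

```python
import json
import urllib.request

# Assumption: the gateway address is not shown on the page.
BASE_URL = "http://localhost:8000"

# Paths come from the page; only the reload method (POST) is stated there.
# GET for the other two is an assumption.
ENDPOINTS = {
    "models": ("GET", "/v1/models"),
    "reload": ("POST", "/gateway/reload"),
    "status": ("GET", "/gateway/status"),
}


def build_request(name, base_url=BASE_URL):
    """Build a urllib Request for one of the named gateway endpoints."""
    method, path = ENDPOINTS[name]
    # Assumption: reload takes no payload, so send an empty body.
    data = b"" if method == "POST" else None
    url = base_url.rstrip("/") + path
    return urllib.request.Request(url, data=data, method=method)


def call(name, base_url=BASE_URL, timeout=10.0):
    """Send the request and decode the body as JSON (assumed format)."""
    request = build_request(name, base_url)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        body = response.read()
    return json.loads(body) if body else None
```

For example, `call("status")` would fetch the status document and `call("reload")` would ask the gateway to re-read its configuration, provided the gateway is reachable at the assumed address and needs no authentication.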