What this error usually means
NVIDIA returning 408/504 / timeout should not be judged from the message alone. Cross-check key shape, Base URL, balance, model permission, region/IP, and visible model list.
- Provider: NVIDIA
- Error type: timeout
- Status code: 408/504
Next action
timeout should not only show failed. The console should return the next move: refill balance, change Base URL, switch model, add permission, change egress IP, hold listing, or monitor.
- Read-only check. Detection data burns after 5 minutes.
- global provider route
- Llama 3.1 Nemotron 70B Instruct · Llama 3.1 Nemotron Ultra 253B v1 · Llama 3.3 Nemotron Super 49B V1.5