TestKey.ai logo
TestKey.ai
KEY CHECKER & MODEL MARKET
You are hereHome
Model limit probe

NousResearch: Hermes 2 Pro - Llama-3 8B | TPM limit

NousResearch: Hermes 2 Pro - Llama-3 8B TPM limit decides whether a key can enter a production route. TestKey reads model ID nousresearch/hermes-2-pro-llama-3-8b, provider Nousresearch, catalog context, real-key headers, 429 errors, and region signals together.

Model
nousresearch/hermes-2-pro-llama-3-8b
NousResearch: Hermes 2 Pro - Llama-3 8B
Provider
Nousresearch
6 models in catalog
Limit dimension
TPM limit
tpm-limit
Visible signal
real key probe required
Context window: 8,192
Limit matrix summary
Model
nousresearch/hermes-2-pro-llama-3-8b
Limit dimension
tpm-limit
Visible signal
real key probe required
Read-only check. Detection data burns after 5 minutes.
Read-only check. Detection data burns after 5 minutes.

Why this limit matters

NousResearch: Hermes 2 Pro - Llama-3 8B TPM limit decides whether a key can enter a production route. TestKey reads model ID nousresearch/hermes-2-pro-llama-3-8b, provider Nousresearch, catalog context, real-key headers, 429 errors, and region signals together.

  • Model: nousresearch/hermes-2-pro-llama-3-8b
  • Provider: Nousresearch
  • Limit dimension: TPM limit

How to prove it

Read-only check. Detection data burns after 5 minutes.

  • Start with the visible signal: real key probe required, then read headers and error bodies with read-only requests.
  • TPM limit must bind model ID nousresearch/hermes-2-pro-llama-3-8b; limits from another model at the same provider cannot be reused.
  • TPM limit · real key probe required · 8,192

Operator action

NousResearch: Hermes 2 Pro - Llama-3 8B TPM limit is not just a number. It should become route throttling, sale tags, headroom alerts, fallback model suggestions, and price protection.

  • Read-only check. Detection data burns after 5 minutes.
  • Visible signal: real key probe required
  • Context window: 8,192