这页解决什么
DeepSeek R1 Distill Llama 70B is a distilled large language model based on Llama-3.3-70B-Instruct, using outputs from DeepSeek R1. The model combines advanced distillation techniques to achieve high performance across...
- DeepSeek · deepseek/deepseek-r1-distill-llama-70b
- text->text · 中国模型路线
- 131,072 context · US$0.70 input