Trang này trả lời điều gì
DeepSeek R1 Distill Llama 70B is a distilled large language model based on Llama-3.3-70B-Instruct, using outputs from DeepSeek R1. The model combines advanced distillation techniques to achieve high performance across...
- DeepSeek · deepseek/deepseek-r1-distill-llama-70b
- text->text · tuyến model Trung Quốc
- 131.072 context · 0,70 US$ input