Merge d7a382f7e1746d30a9e520d7bf023b635fc19e71 into 45b89c6cb13cf6b01da05ef9a7379f13f8d3baf2

fix, Update README.md
fix
2025-05-02 13:46:30 +02:00 · 2025-02-18 22:57:22 +07:00 · 2025-01-31 17:04:48 +08:00
1 changed files with 1 additions and 1 deletions
--- a/README.md
+++ b/README.md
@ -60,7 +60,7 @@ To support the research community, we have open-sourced DeepSeek-R1-Zero, DeepSe
 **Distillation: Smaller Models Can Be Powerful Too**

 -  We demonstrate that the reasoning patterns of larger models can be distilled into smaller models, resulting in better performance compared to the reasoning patterns discovered through RL on small models. The open source DeepSeek-R1, as well as its API, will benefit the research community to distill better smaller models in the future. 
- Using the reasoning data generated by DeepSeek-R1, we fine-tuned several dense models that are widely used in the research community. The evaluation results demonstrate that the distilled smaller dense models perform exceptionally well on benchmarks. We open-source distilled 1.5B, 7B, 8B, 14B, 32B, and 70B checkpoints based on Qwen2.5 and Llama3 series to the community.
+- Using the reasoning data generated by DeepSeek-R1, we fine-tuned several dense models that are widely used in the research community. The evaluation results demonstrate that the distilled smaller dense models perform exceptionally well on benchmarks. We have open-sourced distilled 1.5B, 7B, 8B, 14B, 32B, and 70B checkpoints based on Qwen2.5 and Llama3 series to the community.

 ## 3. Model Downloads
Author	SHA1	Message	Date
Dr. Artificial曾小健	4675f216ed	Merge d7a382f7e1746d30a9e520d7bf023b635fc19e71 into 45b89c6cb13cf6b01da05ef9a7379f13f8d3baf2	2025-02-18 22:57:22 +07:00
Dr. Artificial曾小健	d7a382f7e1	fix, Update README.md fix	2025-01-31 17:04:48 +08:00