mirror of
https://github.com/deepseek-ai/DeepSeek-R1.git
synced 2025-04-28 18:09:23 +00:00
Compare commits
3 Commits
9103a8eeb4
...
8a57f65fb5
Author | SHA1 | Date | |
---|---|---|---|
|
8a57f65fb5 | ||
|
95aaec702f | ||
|
152c1f3d73 |
18
CITATION.cff
18
CITATION.cff
@ -1,18 +0,0 @@
|
||||
cff-version: 1.2.0
|
||||
message: "If you use this work, please cite it using the following metadata."
|
||||
title: "DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning"
|
||||
authors:
|
||||
- name: "DeepSeek-AI"
|
||||
year: 2025
|
||||
identifiers:
|
||||
- type: doi
|
||||
value: 10.48550/arXiv.2501.12948
|
||||
- type: arXiv
|
||||
value: 2501.12948
|
||||
url: "https://arxiv.org/abs/2501.12948"
|
||||
categories:
|
||||
- "cs.CL"
|
||||
repository-code: "https://github.com/deepseek-ai/DeepSeek-R1"
|
||||
license: "MIT"
|
||||
abstract: >
|
||||
We introduce our first-generation reasoning models, DeepSeek-R1-Zero and DeepSeek-R1. DeepSeek-R1-Zero, a model trained via large-scale reinforcement learning (RL) without supervised fine-tuning (SFT) as a preliminary step, demonstrates remarkable reasoning capabilities. Through RL, DeepSeek-R1-Zero naturally emerges with numerous powerful and intriguing reasoning behaviors. However, it encounters challenges such as poor readability, and language mixing. To address these issues and further enhance reasoning performance, we introduce DeepSeek-R1, which incorporates multi-stage training and cold-start data before RL. DeepSeek-R1 achieves performance comparable to OpenAI-o1-1217 on reasoning tasks. To support the research community, we open-source DeepSeek-R1-Zero, DeepSeek-R1, and six dense models (1.5B, 7B, 8B, 14B, 32B, 70B) distilled from DeepSeek-R1 based on Qwen and Llama.
|
59
website
Normal file
59
website
Normal file
@ -0,0 +1,59 @@
|
||||
<!DOCTYPE html>
|
||||
<html lang="en">
|
||||
<head>
|
||||
<meta charset="UTF-8">
|
||||
<meta name="viewport" content="width=device-width, initial-scale=1.0">
|
||||
<title>Your Business Name</title>
|
||||
<link rel="stylesheet" href="styles.css">
|
||||
</head>
|
||||
<body>
|
||||
<header>
|
||||
<h1>Welcome to Your Business Name</h1>
|
||||
<nav>
|
||||
<ul>
|
||||
<li><a href="#about">About Us</a></li>
|
||||
<li><a href="#services">Services</a></li>
|
||||
<li><a href="#contact">Contact</a></li>
|
||||
</ul>
|
||||
</nav>
|
||||
</header>
|
||||
|
||||
<main>
|
||||
<section id="about">
|
||||
<h2>About Us</h2>
|
||||
<p>Your business description goes here. Describe what you do and what makes you unique.</p>
|
||||
</section>
|
||||
|
||||
<section id="services">
|
||||
<h2>Our Services</h2>
|
||||
<ul>
|
||||
<li>Service 1: Description</li>
|
||||
<li>Service 2: Description</li>
|
||||
<li>Service 3: Description</li>
|
||||
</ul>
|
||||
</section>
|
||||
|
||||
<section id="contact">
|
||||
<h2>Contact Us</h2>
|
||||
<form action="#" method="post">
|
||||
<label for="name">Name:</label>
|
||||
<input type="text" id="name" name="name" required>
|
||||
|
||||
<label for="email">Email:</label>
|
||||
<input type="email" id="email" name="email" required>
|
||||
|
||||
<label for="message">Message:</label>
|
||||
<textarea id="message" name="message" required></textarea>
|
||||
|
||||
<button type="submit">Send Message</button>
|
||||
</form>
|
||||
</section>
|
||||
</main>
|
||||
|
||||
<footer>
|
||||
<p>© 2025 Your Business Name. All rights reserved.</p>
|
||||
</footer>
|
||||
|
||||
<script src="script.js"></script>
|
||||
</body>
|
||||
</html>
|
Loading…
x
Reference in New Issue
Block a user