Commit Graph

  • 4cc6253d5c
    Merge pull request #666 from codinglover222/deepseek-doc-fix main Xingkai Yu 2025-04-09 09:50:40 +08:00
  • 57d7bd45df
    Merge pull request #736 from shihaobai/main Huang Panpan 2025-04-08 22:18:33 +08:00
  • 88d6547df2
    Merge pull request #816 from KPCOFGS/main Xingkai Yu 2025-04-08 17:27:09 +08:00
  • 741b06ebca
    Merge pull request #720 from xiaokongkong/main Xingkai Yu 2025-04-08 17:20:37 +08:00
  • a5d2ad229e
    Update README.md Shixian Sheng 2025-03-26 08:58:35 -04:00
  • a878eada08
    Delete DeepSeek_V3.pdf DeepSeekDDM 2025-03-16 23:42:21 +08:00
  • 98e67a71f4
    Update paper link DeepSeekDDM 2025-03-16 23:41:52 +08:00
  • 408e6e188a
    Update README.md shihaobai 2025-03-03 20:16:37 +08:00
  • 73f2954fa8 polish shihaobai 2025-03-03 20:10:18 +08:00
  • 1ab09c8780 Docs: add LightLLM as supported engine shihaobai 2025-03-03 19:23:08 +08:00
  • d29a967601 modify the explanation of MLA huxuedan 2025-02-26 17:06:54 +08:00
  • 592fd5daf8
    Delete CITATION.cff DeepSeekDDM 2025-02-24 11:50:20 +08:00
  • c9353aba6c
    Update bib info DeepSeekDDM 2025-02-24 11:25:44 +08:00
  • f09f5fa321
    Merge pull request #616 from Konano/chore-readme Huang Panpan 2025-02-18 18:04:06 +08:00
  • 4a65fd9221 fix an args description. oyzh 2025-02-15 11:02:28 +08:00
  • 1398800ebf
    fix scores mask Xingkai Yu 2025-02-14 20:26:45 +08:00
  • f07bccc49e
    fix: resolve center alignment issue in preview Konano 2025-02-14 12:12:16 +08:00
  • 0866cab5f9
    chore: update README.md to improve layout and image attributes Konano 2025-02-14 12:02:10 +08:00
  • e15f67af1c
    chore: update README.md to improve layout and image attributes Konano 2025-02-08 18:28:40 +08:00
  • 2f7b80eece
    Merge pull request #611 from Konano/chore-stale Huang Panpan 2025-02-08 16:10:06 +08:00
  • 76d8d39560
    chore: add stale issue management configuration Konano 2025-02-08 15:12:09 +08:00
  • 5ee97a83f0
    fix comment Xingkai Yu 2025-02-07 16:42:55 +08:00
  • 1d7d440461
    Merge pull request #432 from luislh-dev/main Xingkai Yu 2025-02-05 16:53:53 +08:00
  • 09d108620a
    Merge pull request #440 from spenserblack/main Xingkai Yu 2025-02-05 16:50:03 +08:00
  • d0f8c4fca3
    Merge pull request #528 from WSL0809/main Xingkai Yu 2025-02-05 16:33:18 +08:00
  • 87a01053e4
    Merge pull request #556 from XxAlonexX/main Xingkai Yu 2025-02-05 16:23:02 +08:00
  • a157077c61
    Merge pull request #408 from fitzjalen/refactor Huang Panpan 2025-02-05 12:03:02 +08:00
  • c32c957fb0
    Merge pull request #364 from Dhie-boop/feature/table-of-content Huang Panpan 2025-02-05 11:39:08 +08:00
  • 6a30b43249 Fix Linear Layer Bias Initialization XxAlonexX 2025-02-04 10:38:45 +05:30
  • 97b35f1fca docs: remove redundant asterisks in note luislopez-developer 2025-02-03 15:02:04 -05:00
  • d5c08b384b
    Update README.md wangsl 2025-02-02 02:34:59 +08:00
  • 760d22821f
    Add syntax highlighting to requirements code block Spenser Black 2025-01-28 18:07:15 -05:00
  • 6784e1976d Fix TOC links to correctly link to headings in Markdown Dhieu 2025-01-28 17:14:35 +03:00
  • 2756e130c2 clarify assertion error Roman Fitzjalen 2025-01-28 13:16:54 +01:00
  • ddc501b80e Add table of contents to README Dhieu 2025-01-27 14:18:17 +03:00
  • b5d872ead0
    Merge pull request #341 from enochkan/main Huang Panpan 2025-01-26 09:29:50 +08:00
  • 53d8dc9966 docs: Update system requirements with GitHub Markdown callout enoch kan 2025-01-25 22:29:54 +00:00
  • 722e6885ef docs: Improve system requirements section formatting enoch kan 2025-01-25 22:26:48 +00:00
  • 53b055bc1e docs: Add system requirements for DeepSeek-Infer demo enoch kan 2025-01-25 22:21:51 +00:00
  • ee4c4ea32b
    Merge pull request #234 from wangfuchun-fc/patch-1 Xingkai Yu 2025-01-07 17:53:28 +08:00
  • 25109d2ccd
    Merge pull request #230 from jacksonpradolima/main Huang Panpan 2025-01-07 14:05:15 +08:00
  • fdbd5be754
    Merge pull request #193 from enochkan/main Huang Panpan 2025-01-07 14:02:11 +08:00
  • 3779a89770
    fix: fix readme doc typo. wangfuchun-fc 2025-01-06 22:00:32 +08:00
  • c070549279 Add CITATION.cff to provide citation metadata Jackson Antonio do Prado Lima 2025-01-05 21:46:37 -03:00
  • bc77f22afc Updated model.py docstrings enoch kan 2025-01-05 18:24:31 +00:00
  • a1296f099e Enhance documentation and update .gitignore for model conversion scripts enoch kan 2025-01-05 18:18:18 +00:00
  • fd011c11aa torch rmsnorm GeeeekExplorer 2025-01-05 14:33:48 +08:00
  • 9b288b86cc
    Update README.md Xingkai Yu 2025-01-03 15:30:48 +08:00
  • 0d16ea24c8
    Merge pull request #206 from kutt/patch-1 Huang Panpan 2025-01-03 09:48:03 +08:00
  • 21bc231f32
    use alert formatting for notes in readme kutt 2025-01-02 15:02:52 +01:00
  • 8710ec2ecb
    require model-parallel in convert.py Xingkai Yu 2024-12-31 18:05:55 +08:00
  • 7c2466b310
    Update issue templates Huang Panpan 2024-12-31 14:49:05 +08:00
  • 1b8e18cc29
    Merge pull request #21 from eltociear/patch-1 Huang Panpan 2024-12-30 15:03:30 +08:00
  • 94410f8d58
    Merge pull request #33 from zhyncs/main Haswell Iris 2024-12-30 14:37:38 +08:00
  • 68d0061937 upd zhyncs 2024-12-30 14:25:28 +08:00
  • 2fc98d1cdf upd zhyncs 2024-12-30 14:21:00 +08:00
  • a1edf4138e upd zhyncs 2024-12-30 14:18:00 +08:00
  • 8638950ec2 docs: update SGLang usage zhyncs 2024-12-30 14:13:27 +08:00
  • 83dd18eda4
    Update README.md DeepSeekDDM 2024-12-30 11:04:14 +08:00
  • 710c8b8b6e
    docs: update README.md Ikko Eltociear Ashimine 2024-12-29 00:43:11 +09:00
  • 8f1c9488b5
    handle missing scale_inv_name (#2) Yang Wang 2024-12-27 09:34:38 +08:00
  • c8087bd8b8
    Merge pull request #9 from simon-mo/vllm Huang Panpan 2024-12-27 09:16:09 +08:00
  • e2c15caf04 add version simon-mo 2024-12-26 17:11:31 -08:00
  • cf47874d8e Docs: add vLLM as supported engine simon-mo 2024-12-26 17:10:33 -08:00
  • 4c2fdb8f55 Release DeepSeek-V3 stack-heap-overflow 2024-12-26 19:01:57 +08:00
  • 4b58dc6bfc
    Initial commit stack-heap-overflow 2024-12-26 17:52:41 +08:00