ModelLink2/examples/mcore
RuanZhiXiang 11a7ccabbd !1576 fix: chatglm3 arguments
Merge pull request !1576 from RuanZhiXiang/load-chatglm
2024-08-29 11:24:31 +00:00
..
chatglm3 !1576 fix: chatglm3 arguments 2024-08-29 11:24:31 +00:00
deepseek2 !1476 【mcore-LLM大模型】新增类deepseekv2模型:支持MLA,YaRN,DeepSeekMoE模型结构 2024-08-06 09:35:36 +00:00
deepseek2_lite !1541 [mcore-llm]新增类deepseekv2-lite模型预训练 2024-08-22 02:21:33 +00:00
gemma !1570 gemma-2b mcore适配 2024-08-28 07:58:32 +00:00
grok1 !1453 修改grok1的双机配置 2024-07-25 07:02:28 +00:00
llama2 !1452 adapt llama2 to mcore 2024-08-02 01:09:46 +00:00
llama31 !1532 新增llama3.1-70b模型 2024-08-23 02:33:55 +00:00
mistral !1455 prompt-type推理适配 2024-08-13 06:53:26 +00:00
mixtral !1447 增加mixtral_mcore,增加mcore的mc2以及优化dispatcher亲和化操作argsort 2024-07-22 14:01:20 +00:00
qwen2 !1477 新模型Qwen2_72B适配mcore分支 2024-08-27 12:19:55 +00:00
qwen15 !1543 添加Qwen1.5-110B适配 2024-08-29 00:55:47 +00:00