Commit Graph

8 Commits

Author SHA1 Message Date
RuanZhiXiang
c258380729 !1799 refactor: use MegatronAdaptation to support adaptation registration, application and execution.
Merge pull request !1799 from RuanZhiXiang/refactor-megatron-adaptor
2024-10-26 08:13:07 +00:00
RuanZhiXiang
e93769afd2 !1511 refactor: support Deepseek Specification
Merge pull request !1511 from RuanZhiXiang/refactor-deepseek
2024-10-21 07:57:37 +00:00
商元义
bf0ebba09d !1707 添加新模型Qwen2-57B-A14B
Merge pull request !1707 from 商元义/master
2024-09-24 14:40:52 +00:00
jzh
eccb540801 !1637 [mcore-llm]新增 deepseek-lite 预训练、推理和微调以及相关数据处理的脚本
Merge pull request !1637 from jzh/master
2024-09-23 01:45:32 +00:00
曲玥泽
86b7a099ec !1680 aquila、baichuan、baichuan2权重转换脚本切换新框架
Merge pull request !1680 from 曲玥泽/master
2024-09-19 06:52:00 +00:00
xiongliangcheng
2b84f8f52a !1658 【mcore】添加baichuan2-13B和deepseek2-coder适配
Merge pull request !1658 from xiongliangcheng/deepseek-coder
2024-09-12 01:04:21 +00:00
glhyy
1f4ab545cc !1597 新增权重转换ut模板和mixtral用例,支持legacy和mcore互转
Merge pull request !1597 from glhyy/master
2024-09-10 07:40:15 +00:00
sunjunjie
58e8133311 !1622 权重转换代码位置优化&修复反向依赖
Merge pull request !1622 from sunjunjie/ckpt_position
2024-09-09 06:37:36 +00:00