RuanZhiXiang
|
c258380729
|
!1799 refactor: use MegatronAdaptation to support adaptation registration, application and execution.
Merge pull request !1799 from RuanZhiXiang/refactor-megatron-adaptor
|
2024-10-26 08:13:07 +00:00 |
|
RuanZhiXiang
|
e93769afd2
|
!1511 refactor: support Deepseek Specification
Merge pull request !1511 from RuanZhiXiang/refactor-deepseek
|
2024-10-21 07:57:37 +00:00 |
|
商元义
|
bf0ebba09d
|
!1707 添加新模型Qwen2-57B-A14B
Merge pull request !1707 from 商元义/master
|
2024-09-24 14:40:52 +00:00 |
|
jzh
|
eccb540801
|
!1637 [mcore-llm]新增 deepseek-lite 预训练、推理和微调以及相关数据处理的脚本
Merge pull request !1637 from jzh/master
|
2024-09-23 01:45:32 +00:00 |
|
曲玥泽
|
86b7a099ec
|
!1680 aquila、baichuan、baichuan2权重转换脚本切换新框架
Merge pull request !1680 from 曲玥泽/master
|
2024-09-19 06:52:00 +00:00 |
|
xiongliangcheng
|
2b84f8f52a
|
!1658 【mcore】添加baichuan2-13B和deepseek2-coder适配
Merge pull request !1658 from xiongliangcheng/deepseek-coder
|
2024-09-12 01:04:21 +00:00 |
|
glhyy
|
1f4ab545cc
|
!1597 新增权重转换ut模板和mixtral用例,支持legacy和mcore互转
Merge pull request !1597 from glhyy/master
|
2024-09-10 07:40:15 +00:00 |
|
sunjunjie
|
58e8133311
|
!1622 权重转换代码位置优化&修复反向依赖
Merge pull request !1622 from sunjunjie/ckpt_position
|
2024-09-09 06:37:36 +00:00 |
|