ModelLink2/examples
商元义 baf8f2237f !1349 Fix Qwen1.5 errors
Merge pull request !1349 from 商元义/master
2024-06-20 07:29:12 +00:00
..
aquila !1329 Upgrade ModelLink to match megatron core 0.6.0 2024-06-11 07:53:57 +00:00
aquila2 !1329 Upgrade ModelLink to match megatron core 0.6.0 2024-06-11 07:53:57 +00:00
baichuan !1329 Upgrade ModelLink to match megatron core 0.6.0 2024-06-11 07:53:57 +00:00
baichuan2 !1329 Upgrade ModelLink to match megatron core 0.6.0 2024-06-11 07:53:57 +00:00
bloom !1329 Upgrade ModelLink to match megatron core 0.6.0 2024-06-11 07:53:57 +00:00
chatglm3 !1345 chatglm3 performance optimization / add fine-tuning support 2024-06-18 04:03:01 +00:00
codellama !1329 Upgrade ModelLink to match megatron core 0.6.0 2024-06-11 07:53:57 +00:00
gemma !1329 Upgrade ModelLink to match megatron core 0.6.0 2024-06-11 07:53:57 +00:00
gpt3 !1329 Upgrade ModelLink to match megatron core 0.6.0 2024-06-11 07:53:57 +00:00
intern !1329 Upgrade ModelLink to match megatron core 0.6.0 2024-06-11 07:53:57 +00:00
llama !1356 Weight conversion feature guarding: megatron format to megatron format 2024-06-20 04:18:47 +00:00
llama2 !1356 Weight conversion feature guarding: megatron format to megatron format 2024-06-20 04:18:47 +00:00
llama3 !1334 Fix download paths for LLAMA3-8B pretrained weights and vocabulary 2024-06-17 02:50:37 +00:00
mistral !1342 Remove overlap-param-gather from scripts; remove invalid URL links and invalid markdown 2024-06-12 12:42:28 +00:00
mixtral !1329 Upgrade ModelLink to match megatron core 0.6.0 2024-06-11 07:53:57 +00:00
qwen !1329 Upgrade ModelLink to match megatron core 0.6.0 2024-06-11 07:53:57 +00:00
qwen15 !1349 Fix Qwen1.5 errors 2024-06-20 07:29:12 +00:00
yi !1329 Upgrade ModelLink to match megatron core 0.6.0 2024-06-11 07:53:57 +00:00