.. |
aquila
|
!1329 ModelLink配套升级到megatron core 0.6.0
|
2024-06-11 07:53:57 +00:00 |
aquila2
|
!1329 ModelLink配套升级到megatron core 0.6.0
|
2024-06-11 07:53:57 +00:00 |
baichuan
|
!1329 ModelLink配套升级到megatron core 0.6.0
|
2024-06-11 07:53:57 +00:00 |
baichuan2
|
!1329 ModelLink配套升级到megatron core 0.6.0
|
2024-06-11 07:53:57 +00:00 |
bloom
|
!1364 修复 bloom 精度问题
|
2024-06-22 07:39:37 +00:00 |
chatglm3
|
!1345 chatglm3性能优化/增加微调功能
|
2024-06-18 04:03:01 +00:00 |
codellama
|
!1329 ModelLink配套升级到megatron core 0.6.0
|
2024-06-11 07:53:57 +00:00 |
gemma
|
!1329 ModelLink配套升级到megatron core 0.6.0
|
2024-06-11 07:53:57 +00:00 |
gpt3
|
!1329 ModelLink配套升级到megatron core 0.6.0
|
2024-06-11 07:53:57 +00:00 |
intern
|
!1329 ModelLink配套升级到megatron core 0.6.0
|
2024-06-11 07:53:57 +00:00 |
llama
|
!1356 权重转换特性看护 megatron格式转megatron格式
|
2024-06-20 04:18:47 +00:00 |
llama2
|
!1356 权重转换特性看护 megatron格式转megatron格式
|
2024-06-20 04:18:47 +00:00 |
llama3
|
!1334 修复LLAMA3-8B 的 预训练权重和词表下载路径
|
2024-06-17 02:50:37 +00:00 |
mistral
|
!1342 脚本内删除overlap-param-gather,删除无效网址链接和无效markdown
|
2024-06-12 12:42:28 +00:00 |
mixtral
|
!1363 修改mixtral预训练脚本的global-batch-size参数
|
2024-06-22 06:34:16 +00:00 |
qwen
|
!1329 ModelLink配套升级到megatron core 0.6.0
|
2024-06-11 07:53:57 +00:00 |
qwen15
|
!1361 修复Qwen1.5问题
|
2024-06-24 02:34:45 +00:00 |
yi
|
!1329 ModelLink配套升级到megatron core 0.6.0
|
2024-06-11 07:53:57 +00:00 |