546 Commits

Author SHA1 Message Date
wucong
04ed031500 !1311 Add OWNER and grant codeCheck view permission
Merge pull request !1311 from wucong/addReviewer
2024-05-27 02:35:08 +00:00
guoxinjie
62c40eef76 !1240 Clean up GPT3-175B and check it into the repository
Merge pull request !1240 from guoxinjie/gelu
2024-05-27 02:34:20 +00:00
zhangbin
371a159c4d !1310 Update llama3 readme
Merge pull request !1310 from zhangbin/master
2024-05-25 07:52:58 +00:00
zhangbin
331a563ded !1286 Add vpp weight conversion
Merge pull request !1286 from zhangbin/master
2024-05-25 06:27:38 +00:00
yuhui
bf2342dad8 !1302 Add Gemma-2B model adaptation
Merge pull request !1302 from yuhui/master
2024-05-24 06:17:28 +00:00
黄宇豪
f19ce463a8 !1299 feat: add Aquila2-34B model adaptation
Merge pull request !1299 from 黄宇豪/master
2024-05-23 09:31:32 +00:00
fengliangjun
7aedb57347 !1304 Add deterministic computation feature
Merge pull request !1304 from fengliangjun/master
2024-05-23 08:30:18 +00:00
商元义
19d3b157ff !1291 Add Qwen1.5-14B adaptation
Merge pull request !1291 from 商元义/14B
2024-05-23 02:57:38 +00:00
leiguodong
0ddad3f9c8 !1295 Fix codellama-34B UT failure
Merge pull request !1295 from leiguodong/master
2024-05-23 02:22:44 +00:00
yuhui
ae2ef167f9 !1300 Fix fused_swiglu reference
Merge pull request !1300 from yuhui/dev
2024-05-22 06:44:13 +00:00
黄宇豪
4151c53e20 !1296 fix: fix Aquila2-7B UT failure
Merge pull request !1296 from 黄宇豪/master
2024-05-21 13:07:03 +00:00
changlei
e16f08ae1e !1293 Add inference screenshot for Qwen1.5-7B and fix UT error
Merge pull request !1293 from changlei/master
2024-05-21 13:04:19 +00:00
yuhui
60b3b077db !1269 Add Gemma-7B model adaptation
Merge pull request !1269 from yuhui/master
2024-05-21 12:59:27 +00:00
shishaoyu
78c4397f6c !1257 Support Mistral 7B 32K long-sequence model
Merge pull request !1257 from shishaoyu/master
2024-05-17 08:09:23 +00:00
changlei
0141fec762 !1281 Add Qwen1.5-7B adaptation
Merge pull request !1281 from changlei/master
2024-05-17 07:48:44 +00:00
glhyy
b6d946d835 !1287 Fix readme typo
Merge pull request !1287 from glhyy/master
2024-05-17 06:47:52 +00:00
fengliangjun
4a683f8dbe !1285 Update mixtral-moe model to 32K
Merge pull request !1285 from fengliangjun/master
2024-05-17 01:32:15 +00:00
wwzhuo
cf6e8f4a9c !1283 Correct some llama2 7b parameters
Merge pull request !1283 from wwzhuo/master
2024-05-16 11:53:22 +00:00
glhyy
dd86f13dc0 !1280 Add data cache detection and generation on non-master nodes when storage is not shared
Merge pull request !1280 from glhyy/master
2024-05-16 07:40:36 +00:00
黄宇豪
a1f7e94b22 !1282 fix: correct the wrong fine-tuning dataset input path
Merge pull request !1282 from 黄宇豪/master
2024-05-15 11:30:31 +00:00
黄宇豪
ce19d7e3a2 !1273 feat: add Aquila2-7B adaptation
Merge pull request !1273 from 黄宇豪/master
2024-05-14 09:17:09 +00:00
leiguodong
85943f047c !1261 Add codellama-34B adaptation
Merge pull request !1261 from leiguodong/master
2024-05-14 08:19:47 +00:00
xiongliangcheng
6ac3959ae0 !1268 Add yi-34B model adaptation
Merge pull request !1268 from xiongliangcheng/master
2024-05-14 03:54:20 +00:00
zhangbin
1d6d2d354d !1278 Update llama3 readme
Merge pull request !1278 from zhangbin/master
2024-05-14 03:26:27 +00:00
wucong
03211525c0 !1264 Unify readme format (llama2)
Merge pull request !1264 from wucong/dev8
2024-05-07 02:20:30 +00:00
wucong
a8bf1c55c5 !1263 Unify readme format (llama_en + qwen)
Merge pull request !1263 from wucong/dev7
2024-05-07 02:20:14 +00:00
wucong
3fd657ad6e !1262 Unify readme format (llama)
Merge pull request !1262 from wucong/dev6
2024-05-07 02:20:00 +00:00
wucong
dc6db1f858 !1260 Unify readme format (chatglm3 + intern)
Merge pull request !1260 from wucong/dev5
2024-05-07 02:19:45 +00:00
wucong
297fe8b01b !1265 Unify readme format (llama3 + mixtral)
Merge pull request !1265 from wucong/dev9
2024-05-07 02:16:49 +00:00
guoxinjie
2ae8749f4a !1252 Unify readme format (aquila)
Merge pull request !1252 from guoxinjie/readme
2024-04-30 07:50:27 +00:00
wucong
ae21a622b8 !1254 Unify readme format (baichuan2 + bloom)
Merge pull request !1254 from wucong/dev2
2024-04-30 07:39:33 +00:00
Liuchang
9a3d5641f5 !1255 Improve chat feature; add Llama3 chat script and instructions
Merge pull request !1255 from Liuchang/master
2024-04-30 02:58:39 +00:00
wucong
4e62972ecd !1253 Unify readme format (baichuan)
Merge pull request !1253 from wucong/dev1
2024-04-29 03:52:08 +00:00
wwzhuo
b2915bd2ab !1238 Update best-performance configuration for llama2 7b/13b
Merge pull request !1238 from wwzhuo/master
2024-04-26 08:43:14 +00:00
Liuchang
a9f905b63f !1251 Update Llama3 readme
Merge pull request !1251 from Liuchang/master
2024-04-26 07:27:08 +00:00
fengliangjun
791677c135 !1246 Update baichuan2-13B performance to 1668
Merge pull request !1246 from fengliangjun/master
2024-04-26 01:47:52 +00:00
Liuchang
4109f95dfd !1242 Add Llama3-8B and 70B models
Merge pull request !1242 from Liuchang/master
2024-04-25 01:24:31 +00:00
guhangsong
d17e1da6b5 !1244 Fix error raised after applying the patch
Merge pull request !1244 from guhangsong/bugfix
2024-04-25 01:16:09 +00:00
guhangsong
39d6fd7336 !1218 Migrate megatron patch
Merge pull request !1218 from guhangsong/patch
2024-04-23 01:57:03 +00:00
fengliangjun
464131283f !1239 Remove some redundant shape transformations in the FA adaptation to improve performance
Merge pull request !1239 from fengliangjun/master
2024-04-18 01:42:50 +00:00
glhyy
75a81f58f9 !1233 Update known issues in README
Merge pull request !1233 from glhyy/master
2024-04-16 02:22:33 +00:00
LeiZhenzhen
5ad4ceddd4 !1231 Add partial_rope support for chatglm3
Merge pull request !1231 from LeiZhenzhen/master
2024-04-15 13:11:56 +00:00
liuyanghan
760c3c42cb !1236 Fix bug in the new padding feature for weight conversion
Merge pull request !1236 from liuyanghan/master
2024-04-15 09:49:15 +00:00
liuyanghan
a5fe9c9f9e !1230 Add padding feature to weight conversion
Merge pull request !1230 from liuyanghan/master
2024-04-15 03:33:05 +00:00
LeiZhenzhen
ab22271e13 !1227 Add chatglm3 pretraining, inference, and evaluation baselines
Merge pull request !1227 from LeiZhenzhen/master
2024-04-11 03:23:33 +00:00
guoxinjie
2f32c76be2 !1224 Remove megatron from ModelLink and add supplementary notes to the readme
Merge pull request !1224 from guoxinjie/remove_megatron
2024-04-09 07:44:00 +00:00
LeiZhenzhen
8524ea2735 !1225 Add chatglm3 weight conversion
Merge pull request !1225 from LeiZhenzhen/master
2024-04-09 06:05:25 +00:00
zhangshengdong29
721ce18db6 !1223 Change peft import to lazy loading
Merge pull request !1223 from zhangshengdong29/master
2024-04-08 11:04:46 +00:00
guoxinjie
3ee4b9fa94 !1213 Rewrite the unittest tests in the CI gate as pytest, making it easier to add test cases to the gate later
Merge pull request !1213 from guoxinjie/ut_pytest
2024-04-03 02:14:09 +00:00
黄宇豪
e23e1e354b !1215 fix: align Mixtral-README with the pretraining template
Merge pull request !1215 from 黄宇豪/master
2024-04-03 02:08:55 +00:00