Commit Graph

485 Commits

Author SHA1 Message Date
shengjiayi@huawei.com
620dd31e4f Update llama33B adaptation 2023-12-04 19:06:47 +08:00
wenjiang2357
de248d084c Merge branch 'master' of https://gitee.com/ascend/AscendSpeed into master 2023-12-04 17:24:22 +08:00
wenjiang2357
9c594efe8e Update llama2-7B readme 2023-12-04 17:23:55 +08:00
i-robot
96f103fcda
!352 Aquila7B model meets accuracy and performance targets; updated the AscendSpeed master README with Aquila information
Merge pull request !352 from yaojia2021/master
2023-12-04 06:57:38 +00:00
i-robot
f18cbbb0d9
!334 add function of save checkpoint to cpu tensor
Merge pull request !334 from xqiangx1991/cpu-checkpoint
2023-12-04 06:19:18 +00:00
19952409173
476131b265 modify llama-7B/13B readme for Q4
2023-12-04 11:29:28 +08:00
i-robot
f1a7f4daa5
!351 FineTune Internlm
Merge pull request !351 from l00636998/master
2023-12-04 02:58:33 +00:00
xqiangx1991
0ef6c33cda Merge branch 'master' of gitee.com:ascend/AscendSpeed into cpu-checkpoint
Signed-off-by: xqiangx1991 <joeltsui@gmail.com>
2023-12-04 02:07:32 +00:00
i-robot
f10cdf1267
!343 Fix lora functionality after the script merge
Merge pull request !343 from zhangshengdong29/master
2023-12-04 02:05:15 +00:00
y00546703
3ba91a9d58 add aquila-7B model, modified AscendSpeed master README accordingly. 2023-12-04 09:59:20 +08:00
lizekai
99af579fe5 FineTune Internlm 2023-12-02 18:59:14 +08:00
liuyanghan
fe1c7d7869 Merge the InternLM 65B model 2023-12-02 10:39:47 +08:00
zhansghengdong
2c516db866 Fix lora functionality after the script merge 2023-12-01 16:16:01 +08:00
i-robot
680c0fbe65
!333 Finetune Internlm
Merge pull request !333 from l00636998/master
2023-12-01 06:38:03 +00:00
lizekai
9415637bed FineTune Internlm 2023-12-01 10:50:46 +08:00
xqiangx1991
79ee48d108 add function of save checkpoint to cpu tensor 2023-12-01 10:15:26 +08:00
i-robot
a5d76b70b1
!330 Support adaptive selective recomputation: optimize the recomputation strategy selection algorithm and add UT
Merge pull request !330 from Gou Jicheng/dev_from_master_only_recompute
2023-11-30 11:15:49 +00:00
goujicheng
8ca3cfe00a Support adaptive selective recomputation: optimize the recomputation strategy selection algorithm and add UT 2023-11-30 14:12:08 +08:00
mhh001
2e6003abf3 Fix several issues after the script merge 2023-11-30 09:41:55 +08:00
i-robot
49aac6e5c4
!305 Delete llama_model.py and merge the llama model into transformer.py
Merge pull request !305 from mhh001/master
2023-11-28 11:19:01 +00:00
mhh001
1454fa076d Delete llama_model.py and merge the llama model into transformer.py 2023-11-28 17:26:17 +08:00
fengliangjun
a71f138128 update me 2023-11-28 11:50:59 +08:00
i-robot
cc8f80bda7
!315 Add bloom model evaluation script
Merge pull request !315 from huangyiming/master
2023-11-24 09:18:57 +00:00
matrixssy
eb8973ce0b llama2-7b full-pipeline scripts and readme 2023-11-24 16:52:16 +08:00
huangyiming
b9f2d96a5d 1. Add bloom model evaluation script
2. Fix Human_eval code path issue
2023-11-24 16:19:18 +08:00
ningbenzhe1
f801b32e15 Upload the llama2 training and inference scripts and the README. 2023-11-24 12:15:46 +08:00
i-robot
569e1cc7b3
!299 Adapt bloom to lora and get the bloom7B full pipeline working
Merge pull request !299 from ningbenzhe1/master
2023-11-24 01:20:30 +00:00
ningbenzhe1
8a282348ca Adapt to lora and get through the whole process. 2023-11-24 09:16:02 +08:00
19952409173
f36b2414dc Update llama-7B and llama-13B README.md; add llama-13B 16p parameter script
2023-11-23 20:40:36 +08:00
fengliangjun
0b79c57e1b add chinese read me 2023-11-22 20:17:28 +08:00
i-robot
b04f782575
!288 Fix spelling errors in the README file
Merge pull request !288 from xiongliangcheng/master
2023-11-22 06:15:25 +00:00
liulinfeng
e57e6e0b2b Adapt the Bloom model to FA and enable pretraining based on the megatron framework 2023-11-22 10:39:42 +08:00
i-robot
8757d302a5
!291 Add Chinese bloom readme
Merge pull request !291 from fengliangjun/master
2023-11-22 01:29:17 +00:00
i-robot
90bcaf0e48
!285 Develop the adaptive selective recomputation feature
Merge pull request !285 from 刘蛟龙/dev_from_master_only_recompute
2023-11-21 12:09:05 +00:00
goujicheng
fcbc94aa51 Support adaptive selective recomputation 2023-11-21 19:18:30 +08:00
i-robot
3dbf8233cb
!278 Submit the mixed-precision training memory reuse O1 algorithm (removing redundant FP32 gradient copies)
Merge pull request !278 from Kingsleyandher/master
2023-11-20 11:54:20 +00:00
fengliangjun
540651a4fa add chinese bloom readme 2023-11-20 19:33:09 +08:00
l00619700
70412752da Submit the equivalent algorithm for removing redundant FP32 gradient copies 2023-11-20 16:40:23 +08:00
fengliangjun
eea45231c2 add baichuan chinese readme 2023-11-17 13:58:19 +08:00
xiongliangcheng
1ce6eee7de modified: examples/baichuan/README.md 2023-11-17 13:56:37 +08:00
i-robot
7067138248
!274 Add Chinese alpaca readme
Merge pull request !274 from fengliangjun/master
2023-11-16 01:28:45 +00:00
chantcalf
b6d0d67f3c Update LLama2-70B scripts and readme following the transformer framework v3 upgrade 2023-11-15 18:40:06 +08:00
fengliangjun
79eb5e89ef add alpaca chinese read 2023-11-14 21:59:02 +08:00
mhh001
ad6f867d94 Upgrade the transformer framework 2023-11-14 20:44:46 +08:00
i-robot
711911f9fc
!259 Add readme for LLama2-70B
Merge pull request !259 from Liuchang/master
2023-11-13 08:33:08 +00:00
chantcalf
eb38c34a8c Add readme for LLama2-70B 2023-11-13 16:13:29 +08:00
i-robot
63ee9d5ca3
!253 Online inference interface supports multi-batch input
Merge pull request !253 from shishaoyu/master
2023-11-13 08:04:26 +00:00
matrixssy
934614b039 Online inference supports batch size >= 1 2023-11-10 15:58:07 +08:00
robert
a14f76b18e Fix OOM when launching 176B in INF_NAN mode 2023-11-07 22:32:57 +08:00
i-robot
ac98107b42
!248 LLama model weight conversion supports custom configurations and GQA
Merge pull request !248 from Liuchang/master
2023-11-07 13:31:04 +00:00
chantcalf
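df8ab5e07c LLama model weight conversion supports custom configurations, GQA, and MLP fusion
1. Support custom model parameters: when type is set to a non-preset model, num_layers, num_heads, and hidden_size can be customized
2. Support GQA: add the num_kv_heads parameter
3. Support merge-mlp: when enabled, the gate_proj and up_proj weights are merged
4. Fix a bug in the positional-encoding check in arguments
5. Fix several bugs in convert_weights_when_tp_pp_change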
2023-11-07 11:36:07 +08:00
i-robot
5ff0d1d4b9
!247 Pre-inspection fixes: add security hardening notes to the README
Merge pull request !247 from 郭鑫杰1/master
2023-11-06 11:04:12 +00:00
xqiangx1991
3588767539 !250 fix pip install megatron
* fix pip install megatron
2023-11-06 03:16:21 +00:00
g00841271
877ade85dd README security hardening description and Clean code 2023-11-04 18:03:37 +08:00
i-robot
8e06ed062f
!223 (PART 3) Add ST cases and examples for the online inference framework
Merge pull request !223 from shishaoyu/master
2023-11-03 02:52:56 +00:00
i-robot
f2928ea58a
!246 Update the baichuan13B weight link
Merge pull request !246 from xiongliangcheng/master
2023-11-02 13:47:47 +00:00
i-robot
a7ef12d257
!210 Add GQA and GQA FA features to the LLama2 model
Merge pull request !210 from Liuchang/master
2023-11-02 02:13:17 +00:00
matrixssy
b8f718cc3d 1. Add example code and ST cases 2023-11-01 11:58:52 +08:00
liuyanghan
d56c249a43 Update broken huggingface links in the README 2023-11-01 11:47:40 +08:00
xiongliangcheng
146bfd7012 modify examples/baichuan/README.md 2023-11-01 10:07:20 +08:00
i-robot
1aefef58e0
!218 Add dynamic padding to the instruction dataset
Merge pull request !218 from ningbenzhe1/master
2023-11-01 01:04:16 +00:00
chantcalf
38c83b2a74 Add GQA and GQA-FA features to the LLama model
Add GQA to the rotary branch of the LLama model
Add a FlashAttention interface to the LLama model
Add llama2-70B launch script
2023-10-31 22:17:02 +08:00
i-robot
02cb18d70d
!236 Tune hyperparameters of the LLama7B/13B and InternLM models to improve performance
Merge pull request !236 from Kingsleyandher/master
2023-10-31 08:15:54 +00:00
ningbenzhe1
a8c0549085 Added dynamic padding function. 2023-10-31 11:59:22 +08:00
i-robot
b5dc9c8b58
!241 Update the baichuan7B weight conversion script and readme
Merge pull request !241 from xiongliangcheng/master
2023-10-30 15:00:13 +00:00
i-robot
d6e6722f6a
!227 add llama-33b script
Merge pull request !227 from chenzhihong/llama-33b
2023-10-30 07:19:58 +00:00
xiongliangcheng
97de2c58f5 Update the README file for the Baichuan model 2023-10-28 17:51:20 +08:00
l00619700
5ec2c196d4 Tune model hyperparameters to improve performance 2023-10-28 17:37:06 +08:00
liuyanghan
adb069216d Update the release package link 2023-10-28 09:12:27 +08:00
liuyanghan
6801afd180 Add a notice to the README; fix a bug in the evaluation code 2023-10-27 15:11:00 +08:00
Zhihong Chen
8e6a9c0574 add llama-33b script and readme update 2023-10-26 15:38:32 +08:00
l00619700
a511221b5c Add custom selective recomputation for llama 13b, compatible with all previous logic. 2023-10-24 19:59:03 +08:00
i-robot
5e27f2e9b6
!157 Add alpaca 13B performance and device info to README.md
Merge pull request !157 from stacey/master
2023-10-20 01:41:09 +00:00
19952409173
a4367fb11b add performance and device information

move images to source/images/alpaca, add inference
2023-10-20 09:31:36 +08:00
liuyanghan
c703b968e3 1 Move the ./examples/bloom/images folder to ./sources/images/bloom
2 Move the ./examples/baichuan/images folder to ./sources/images/baichuan
3 Fix a bug in the evaluation code
4 Remove redundant steps from the README
2023-10-19 15:36:39 +08:00
i-robot
16baa028f4
!176 llama-65b readme update
Merge pull request !176 from chenzhihong/master
2023-10-19 06:19:30 +00:00
l00619700
76609dec8c Submit llama13b scripts and readme file 2023-10-18 14:23:53 +08:00
Zhihong Chen
e6b3b937b4 llama-65b readme update 2023-10-18 12:11:13 +08:00
i-robot
45434eead6
!167 Add bloom weight conversion and inference scripts; improve the bloom readme
Merge pull request !167 from ningbenzhe1/master
2023-10-18 01:38:15 +00:00
ningbenzhe1
010747ebb8 Provide bloom inference scripts and refine the bloom readme. 2023-10-17 19:06:12 +08:00
i-robot
602eb623e3
!149 Add selective recomputation to the llama model; update scripts
Merge pull request !149 from 胡晓波/master
2023-10-17 09:56:10 +00:00
胡晓波
5bd2363a7b llama model supports selective recomputation; update training scripts 2023-10-17 15:57:16 +08:00
mhh001
6fe54d95d5 The llama2 model now uses the llama model code throughout; delete unrelated llama2 code. 2023-10-17 15:13:37 +08:00
l00562083
1609a0fa25 add llama-2-7b model 2023-10-14 04:11:12 +08:00
l00619700
58a3865fc9 Submit InternLM readme 2023-10-13 14:04:42 +08:00
liulinfeng
c747d4ceeb Fix the Bloom weight conversion script failing to launch correctly and update the Bloom README 2023-10-12 17:07:53 +08:00
i-robot
b9b1feadc1
!158 Changes to ./README.md
Merge pull request !158 from liuyanghan/master
2023-10-11 09:47:40 +00:00
i-robot
bb09ab2962
!155 Pre-inspection fixes: correct codecheck blank-line and indentation issues
Merge pull request !155 from chenzhihong/codecheck
2023-10-11 08:21:53 +00:00
Zhihong Chen
9d5c53521f codecheck update 2023-10-11 11:19:20 +08:00
i-robot
a22bd383ca
!148 Update the baichuan weight conversion script and readme
Merge pull request !148 from 高楼大熊/master
2023-10-11 02:01:45 +00:00
liuyanghan
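d19f47f4be Changes to ./README.md:
1. Make link labels explicit, e.g. Baichuan-7B/13B -> Baichuan-[README:7B/13B]
2. On ARM, install torch from a downloaded torch package (installing directly with pip causes the torch_npu import to fail)
3. Correct wrong links; revise the link layout

Changes to examples/alpaca/README.md:
1. On ARM, install torch from a downloaded torch package (installing directly with pip causes the torch_npu import to fail)

examples/llama/pretrain_llama_7B_zero_8p.sh:
1. Fix the overflow issue under mixed precision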
2023-10-10 21:05:41 +08:00
xiongliangcheng
4946f364a6 add readme 2023-10-10 17:40:33 +08:00
xqiangx1991
393f78a3a3 remove unused functions and files. 2023-10-10 10:40:00 +08:00
i-robot
4a42cf21f2
!120 add readme for llama-65b
Merge pull request !120 from chenzhihong/master
2023-10-09 11:34:21 +00:00
Zhihong Chen
9efb449178 add readme for llama-65b 2023-10-09 18:33:33 +08:00
i-robot
a9d3f05550
!98 add finetune chinese llama alpaca scripts and readme file
Merge pull request !98 from stacey/master
2023-10-08 09:10:24 +00:00
19952409173
fb12bc05ee add finetune chinese llama alpaca scripts and readme file
modify readme bugs

modify readme
2023-10-08 15:38:53 +08:00
gitee_code_template
db2ef8b3fc add baichuan readme 2023-10-08 11:20:27 +08:00
gitee_code_template
23e0424ea8 add baichuan readme 2023-10-08 09:17:14 +08:00
gitee_code_template
e46a197248 add baichuan readme 2023-10-08 09:07:13 +08:00
gitee_code_template
88af8f0d1d add baichuan readme 2023-10-07 23:20:26 +08:00
gitee_code_template
4a04dc7355 add baichuan readme 2023-10-07 22:54:15 +08:00
gitee_code_template
d1d42a7fc9 add baichuan readme 2023-10-07 22:43:16 +08:00
gitee_code_template
b2159ae221 add readme baichuan 2023-10-07 21:20:00 +08:00
gitee_code_template
1ad58c3e2d add readme baichuan 2023-10-07 20:48:05 +08:00
gitee_code_template
062a639d1d add baichuan readme 2023-10-07 19:37:43 +08:00
gitee_code_template
6c8a4f3dc4 Update baichuan readme 2023-09-28 19:59:36 +08:00
gitee_code_template
36ce0d2c37 add readme for baichuan 2023-09-28 18:07:38 +08:00
i-robot
20e090b93d
!91 change llama65B batch size
Merge pull request !91 from Jializheng/master
2023-09-28 09:21:19 +00:00
gitee_code_template
6eb65d6dc9 baichuan13B model 2023-09-28 10:30:20 +08:00
jializheng
b1de673886 change llama65B batch size 2023-09-28 09:35:30 +08:00
fengliangjun
7a21f0bf58 up 2023-09-26 14:30:47 +08:00
machangjun
8e436e3a9a add ffts mode
del torch_trans

del torch_trans and resolve bloom ckpt and add bloom ffts+

replace fused_adam to adam

del unused code
2023-07-25 14:14:28 +08:00
Mrtutu
4532812837 Update bloom README: bloom7b training on oscar-1G with a single node and 8 cards 2023-07-26 14:10:51 +08:00
fengliangjun
260e8eea8f create megatron core 2023-07-24 15:00:57 +08:00
chenzomi
92c27d5e2a add a llama2 branch. 2023-07-21 15:20:25 +08:00
fengliangjun
db9c25bdd9 llama modify 2023-07-19 10:20:40 +08:00
liulinfeng
f6d7982b02 Address review comments 2023-07-14 15:34:07 +08:00
liulinfeng
243bfe5cfa Bloom adaptation code for SP 2023-07-14 14:18:07 +08:00
chenzomi
4455b80650 change the readme format. 2023-07-14 10:54:42 +08:00
liulinfeng
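36f787bc89 Author: 刘林峰
Change notes:
1. Submit the implementation of weight loading and inference text generation
2. Fix codecheck issues
3. Fix the hang when resuming training from a checkpoint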
2023-07-07 15:03:48 +08:00
kingsleyandher
3afb525a97 Submit the SP algorithm 2023-07-10 14:44:42 +08:00
kingsleyandher
21609f3083 Submit zeroshot adaptation code for the llama 33B/65B models; submit the README.md file 2023-07-05 14:25:29 +08:00
wiyr
d87e921410 added trick 2023-06-30 11:00:38 +08:00
kingsleyandher
2c104a087e Adapt llama-zeroshot task accuracy to match the results in the original paper. 2023-06-25 09:34:42 +08:00
wiyr
6304cab765 remove useless code 2023-06-20 16:54:12 +08:00
machangjun
2d8c6fee9d add bloom st and adapt new data load method
modify bloom st run

modify times

add new pretrain_bloom.py

add st
2023-06-17 17:36:17 +08:00
kingsleyandher
4e3b7cd992 Adapt LlamaTokenizer and update the pretraining scripts 2023-06-13 12:33:33 +08:00
wiyr
2f826f7351 can run with bloom7b and pass ci 2023-06-12 14:42:29 +08:00
fengliangjun
37ba281c40 readme update 2023-06-10 11:26:55 +08:00
chenzomi
37cc0b949d change megatron to ascendspeed 2023-06-10 21:26:01 +08:00
fengliangjun
106a415556 initial AscendSpeed 2023-06-09 16:15:23 +08:00
wangyixian
d55d341fe1 Adapt the bloom 7.1b model to the AscendSpeed framework, which is jointly completed by liulinfeng and wangyixian 2023-06-06 22:30:19 +08:00
chenzomi
ce6af59f73 remove unused parameter and models. 2023-05-26 10:53:07 +08:00
chenzomi
e4a120a662 fork megatron-deepspeed code. 2023-05-25 14:49:59 +08:00