ModelLink2/examples/legacy/qwen
RuanZhiXiang d77ddc5e17 !1806 Optim: llama3 qwen系列模型 预训练性能提升
Merge pull request !1806 from RuanZhiXiang/optimization-llama3-qwen
2024-11-18 08:29:28 +00:00
..
convert_ckpt_qwen_hf2legacy.sh !1760 整理主线分支 README 2024-10-25 01:33:31 +00:00
convert_ckpt_qwen_legacy2hf.sh !1760 整理主线分支 README 2024-10-25 01:33:31 +00:00
data_convert_qwen_instruction_pack.sh !1806 Optim: llama3 qwen系列模型 预训练性能提升 2024-11-18 08:29:28 +00:00
data_convert_qwen_instruction.sh !1760 整理主线分支 README 2024-10-25 01:33:31 +00:00
data_convert_qwen_pretrain_pack.sh !1806 Optim: llama3 qwen系列模型 预训练性能提升 2024-11-18 08:29:28 +00:00
data_convert_qwen_pretrain.sh !1760 整理主线分支 README 2024-10-25 01:33:31 +00:00
evaluate_qwen_7b_ptd.sh !1760 整理主线分支 README 2024-10-25 01:33:31 +00:00
evaluate_qwen_14b_ptd.sh !1760 整理主线分支 README 2024-10-25 01:33:31 +00:00
evaluate_qwen_72b_ptd.sh !1760 整理主线分支 README 2024-10-25 01:33:31 +00:00
generate_qwen_7b_ptd.sh !1760 整理主线分支 README 2024-10-25 01:33:31 +00:00
generate_qwen_14b_ptd.sh !1760 整理主线分支 README 2024-10-25 01:33:31 +00:00
generate_qwen_72b_ptd.sh !1760 整理主线分支 README 2024-10-25 01:33:31 +00:00
pretrain_qwen_7b_ptd_pack.sh !1806 Optim: llama3 qwen系列模型 预训练性能提升 2024-11-18 08:29:28 +00:00
pretrain_qwen_7b_ptd.sh !1888 fix: 限制batch_p2p_comm参数关闭条件 2024-11-14 03:15:44 +00:00
pretrain_qwen_14b_ptd.sh !1888 fix: 限制batch_p2p_comm参数关闭条件 2024-11-14 03:15:44 +00:00
pretrain_qwen_72b_ptd.sh !1888 fix: 限制batch_p2p_comm参数关闭条件 2024-11-14 03:15:44 +00:00