ModelLink2/tasks
2023-07-24 15:00:57 +08:00
..
ckpt_convert 1. llama_model.py attention实现回退 2023-07-06 10:33:21 +08:00
eval_harness create megatron core 2023-07-24 15:00:57 +08:00
glue change megatron to ascendspeed 2023-06-10 21:26:01 +08:00
orqa change megatron to ascendspeed 2023-06-10 21:26:01 +08:00
race change megatron to ascendspeed 2023-06-10 21:26:01 +08:00
vision create megatron core 2023-07-24 15:00:57 +08:00
zeroshot create megatron core 2023-07-24 15:00:57 +08:00
data_utils.py fork megatron-deepspeed code. 2023-05-25 14:49:59 +08:00
ensemble_classifier.py fork megatron-deepspeed code. 2023-05-25 14:49:59 +08:00
eval_utils.py create megatron core 2023-07-24 15:00:57 +08:00
finetune_utils.py create megatron core 2023-07-24 15:00:57 +08:00
main.py llama-zeroshot任务精度适配,对齐源论文中的效果。 2023-06-25 09:34:42 +08:00