Commit Graph

118 Commits

Author SHA1 Message Date
x54-729
f319b5bce1 torch ddp 的测试用例 2022-04-26 05:49:32 +00:00
x54-729
df109316e5 small 2022-04-26 05:33:37 +00:00
x54-729
705deeaea9 small 2022-04-26 03:35:25 +00:00
x54-729
cf65da1332 paddle 单卡测试例调整 2022-04-25 12:58:03 +00:00
x54-729
bb10410ccd torch 单卡的测试例 2022-04-25 12:57:17 +00:00
x54-729
c74da391c7 跟进paddle backend关于all_gather的改动 2022-04-20 10:27:09 +00:00
x54-729
fcc45dbf4a PaddleFleetDriver添加all_gather和broadcast_object函数 2022-04-20 10:17:19 +00:00
x54-729
1c0e331bad 完成fastnlp_paddle_all_gather和fastnlp_paddle_broadcast_object函数及测试例 2022-04-20 09:56:44 +00:00
yh_cc
368c17fe73 Merge branch 'dev0.8.0' of github.com:fastnlp/fastNLP into dev0.8.0 2022-04-19 22:38:55 +08:00
yh_cc
aa95513055 1.merge ModelCheckPointCallback和TrainerCheckpointCallback;2.新增MoreEvaluateCallback 2022-04-19 22:38:35 +08:00
x54-729
514415e9d4 完成了paddle fleet的save load函数测试 2022-04-16 15:47:18 +00:00
x54-729
9b13fd313a small 2022-04-16 08:39:38 +00:00
x54-729
aa2f678507 Merge branch 'dev0.8.0' of github.com:fastnlp/fastNLP into dev0.8.0 2022-04-16 07:23:18 +00:00
yh_cc
d813f31f9a 1.删除sampler中num_consumed_samples_array,转为通过batch_size等进行换算。2.在Metric中新增all_gather_object接口,方便评测 2022-04-16 15:22:44 +08:00
x54-729
6bfdb39c2f 完善paddle fleet set_dist_repro_dataloader的测试例 2022-04-16 06:42:29 +00:00
x54-729
a25a73394b Merge branch 'dev0.8.0' of github.com:fastnlp/fastNLP into dev0.8.0 2022-04-16 05:40:35 +00:00
x54-729
de544707d9 paddle fleet set_dist_repro_dataloader的测试例 2022-04-16 05:40:23 +00:00
yh_cc
d758a36d4a update metric的实现 2022-04-16 12:40:29 +08:00
yh_cc
048e409233 Merge branch 'dev0.8.0' of github.com:fastnlp/fastNLP into dev0.8.0 2022-04-16 00:33:32 +08:00
yh_cc
aaa4ccc3e0 1.Metric的aggregate_when_get_metric默认值修改为None;将根据Evaluator中Sampelr的状态最终决定;2.增加fastnlp_bo_syn_context关闭fastnlp中的多卡同步操作 2022-04-16 00:32:52 +08:00
x54-729
32d8e27472 small 2022-04-15 16:10:10 +00:00
x54-729
26c80d620c paddle单卡加载fp16的测试 2022-04-15 16:01:21 +00:00
x54-729
8e9a47cf00 Merge branch 'dev0.8.0' of github.com:fastnlp/fastNLP into dev0.8.0 2022-04-15 13:48:23 +00:00
MorningForest
a2956b697e Merge branch 'dev0.8.0' of github.com:fastnlp/fastNLP into dev0.8.0 2022-04-15 20:04:53 +08:00
MorningForest
665d79a3ed 增加paddle单卡的accuracy测试用例 2022-04-15 20:03:44 +08:00
x54-729
9e97155312 PaddleSingleDriver的save load函数测试 2022-04-15 10:44:49 +00:00
x54-729
16cec4bd99 删除不必要的测试文件 2022-04-15 09:07:35 +00:00
x54-729
cf19062fb2 set_dist_repro_dataloader测试例的完善 2022-04-15 09:06:22 +00:00
x54-729
cca265f99c Merge branch 'dev0.8.0' of github.com:fastnlp/fastNLP into dev0.8.0 2022-04-15 08:10:19 +00:00
x54-729
d0f26c7c34 将validate_step替换为evaluate_step 2022-04-15 08:10:01 +00:00
yh_cc
7c70874b4a 删除core.sampelrs.sampler.py;增加torch的clipgradient和warmupcallback 2022-04-15 16:04:43 +08:00
MorningForest
3ea74b52d2 Merge remote-tracking branch 'refs/remotes/origin/dev0.8.0' into dev0.8.0 2022-04-15 14:00:39 +08:00
yh_cc
f27d53261c 修改部分测试用例中validate_dataloader为evaluate_dataloader 2022-04-15 12:30:20 +08:00
x54-729
f6f489dc90 多卡 set_dist_repro_dataloader 的测试例 2022-04-14 16:17:20 +00:00
x54-729
98644d2d0b fix conflict 2022-04-14 16:05:53 +00:00
x54-729
b97962b8dd 简化paddle trainer的单卡测试例 2022-04-14 16:04:56 +00:00
x54-729
5c29cd384a 调整num_consumed_batches在driver save中的逻辑 2022-04-14 16:04:12 +00:00
yh_cc
a4b2e0fac5 修复若干bug 2022-04-15 00:01:29 +08:00
YWMditto
2924e2117f 删除了 driver 中的 **_step,使用 model_call 和 get_model_call_fn 来代替;删除了 driver 中的所有 dataloaders 2022-04-14 23:34:35 +08:00
MorningForest
4cad7f548d Merge remote-tracking branch 'refs/remotes/origin/dev0.8.0' into dev0.8.0 2022-04-14 19:37:02 +08:00
MorningForest
4be26c5620 修改torch fdl 2022-04-14 19:36:50 +08:00
yh_cc
16a467393c 1.montior允许传入callable的对象进行选择; 2.解决Sampler中存在的循环引用问题 2022-04-14 16:02:41 +08:00
YWMditto
2f23d80ccc 修改了 trainer 中的 validate 的调用的逻辑 2022-04-14 00:45:17 +08:00
YWMditto
8c0b0b8cd0 Merge branch 'dev0.8.0' of github.com:fastnlp/fastNLP into dev0.8.0 2022-04-13 19:09:34 +08:00
YWMditto
b9b0b53430 将 Events 修改为小写 2022-04-13 19:09:27 +08:00
yh_cc
31c6d5d02a merge 2022-04-13 17:05:16 +08:00
yh_cc
9d71170bef 解决Trainer在断点重训的时候无法实现准确load和保存的问题 2022-04-13 17:04:33 +08:00
x54-729
c2575ab357 Merge branch 'dev0.8.0' of github.com:fastnlp/fastNLP into dev0.8.0 2022-04-13 07:33:40 +00:00
x54-729
3ab93b2fae paddle driver单卡和utils的pytest测试,添加了断点重训的测试 2022-04-13 07:33:27 +00:00
YWMditto
eb8d761a05 Merge branch 'dev0.8.0' of github.com:fastnlp/fastNLP into dev0.8.0 2022-04-13 14:27:08 +08:00