Commit Graph

85 Commits

Author SHA1 Message Date
FengZiYjun
27e9453d19 * fix processor.py
* add code comments
* merge *_saver.py & *_loader.py in io/
* (ancient codes) rename Loss into LossFromTorch
2018-12-06 19:28:27 +08:00
yunfan
2aaa381827 refine git commits 2018-11-27 22:43:29 +08:00
yunfan
d643a7a894 update set_target, batch's as_numpy 2018-11-27 22:23:55 +08:00
FengZiYjun
090f7aef5b * fixing unit tests 2018-11-27 22:22:19 +08:00
yh
1d5bb0a3b6 bug fix“ 2018-11-27 22:21:22 +08:00
yh
8906155ca2 为api建立一个Analyzer 2018-11-27 22:21:18 +08:00
FengZiYjun
e9d7074ba1 * delete readme_example.py because it is oooooooout of date.
* rename preprocess.py into utils.py, because nothing about preprocess in it
* anything in loader/ and saver/ is moved directly into io/
* corresponding unit tests are moved to /test/io
* delete fastnlp.py, because we have new and better APIs
* rename Biaffine_parser/run_test.py to Biaffine_parser/main.py; Otherwise, test will fail.
* A looooooooooot of ancient codes to be refined...........
2018-11-27 22:17:41 +08:00
yunfan
b6a0d33cb1 add parser api 2018-11-27 22:14:22 +08:00
yh
77786509df pos与cws开发上传 2018-11-27 22:13:20 +08:00
yh
7d97e9365d 增加新的processor“ 2018-11-27 22:11:09 +08:00
yh
1496031182 新增pos output processor 2018-11-27 22:11:08 +08:00
yh
d5afffee73 新增端到端pos处理到parser的过度代码 2018-11-27 22:11:08 +08:00
yh_cc
10379e9c74 当前为segapp的方式,但是貌似准确率不行,尝试修改为crf 4tag模式试一试 2018-11-27 22:10:52 +08:00
yunfan
822aaf6286 fix and update tester, trainer, seq_model, add parser pipeline builder 2018-11-27 22:07:20 +08:00
FengZiYjun
4be15a5b43 保存pos tag 脚本 2018-11-27 21:53:05 +08:00
yh
9667c524a4 基本完善了cws的predict 2018-11-11 15:53:33 +08:00
yh
9fc20ac7b8 增加infer的pipeline 2018-11-11 12:55:30 +08:00
yh
dc7f8ef8d4 bug fix 2018-11-11 12:42:05 +08:00
yh_cc
7df33b23ea Merge branch 'dataset' of github.com:yhcc/fastNLP into dataset 2018-11-11 00:40:10 +08:00
FengZiYjun
e2b14ed33d Merge remote-tracking branch 'origin/dataset' into dataset 2018-11-10 21:20:34 +08:00
FengZiYjun
5dd0f74d6d - 添加pos_tagger API, pipeline跑通
- 修复processor的bug
- 更新core/的若干组件, 去除batch的冗余参数
- CRF有个打字错误?已修复
- 更新pos tag 训练脚本
2018-11-10 21:20:16 +08:00
yh_cc
3e50ca8a72 创建了一个测试context 2018-11-10 20:37:48 +08:00
yh_cc
de3feeaf5a 调整CWS函数的位置 2018-11-10 20:10:13 +08:00
yh_cc
752efc57fd Merge branch 'dataset' of github.com:yhcc/fastNLP into dataset 2018-11-10 19:59:40 +08:00
yh_cc
ea1c8c1100 当前版本分词准确率已达正常分词分数 2018-11-10 19:59:32 +08:00
FengZiYjun
ec9fd32d60 improve trainer: log mean and std of model params, and sum of gradients 2018-11-10 18:49:22 +08:00
FengZiYjun
cd68d78d50 Merge remote-tracking branch 'origin/dataset' into dataset
# Conflicts:
#	fastNLP/api/pipeline.py
#	fastNLP/api/pos_tagger.py
#	fastNLP/api/processor.py
#	fastNLP/modules/decoder/CRF.py
2018-11-10 17:02:58 +08:00
FengZiYjun
26e3abdf58 - 修改pos tag训练脚本,可以跑
- 在api中创建converter.py
- Pipeline添加初始化方法,方便一次性添加processors
- 删除pos_tagger.py
- 优化整体code style
2018-11-10 16:58:27 +08:00
yh_cc
10bb2810ab Merge branch 'dataset' of github.com:yhcc/fastNLP into dataset 2018-11-10 15:34:21 +08:00
yh_cc
73ba3b5eec bug fix for pipeline 2018-11-10 15:17:58 +08:00
yunfan
a6ab34fd38 fix crf 2018-11-10 14:53:50 +08:00
yh_cc
3cb98ddcf2 Sampler中增加了一个BucketSampler, CWS的训练基本可以实现 2018-11-10 14:46:38 +08:00
yh_cc
69a138eb18 修改了遇到的若干问题,增加了分词任务的一些方法 2018-11-10 13:41:19 +08:00
yh_cc
dc0124cf02 修改model到models 2018-11-10 11:10:14 +08:00
yh_cc
25a53ac5c9 修改processor适配昨天的sao操作 2018-11-10 10:56:28 +08:00
yh
d818e91380 增加dataset自动创建对应的array 2018-11-09 22:11:26 +08:00
yh
515e4f4987 移动processor到processor.py 2018-11-09 22:02:10 +08:00
yh
89ce85b6ed Merge branch 'dataset' of https://github.com/yhcc/fastNLP into dataset 2018-11-09 20:23:11 +08:00
yh
38aa207ea2 新增cws converter, io 2018-11-09 20:23:05 +08:00
FengZiYjun
12e9a93b52 Merge remote-tracking branch 'origin/dataset' into dataset 2018-11-09 19:53:08 +08:00
FengZiYjun
79105381f5 - add interfaces for pos_tagging API
- update predictor.py to remove unused methods
- update model_loader.py & model_saver.py to support entire model saving & loading
- update pos tagging training script
2018-11-09 19:52:31 +08:00
yh
1b9daa1985 新增CWS的部分功能 2018-11-09 19:25:18 +08:00
yunfan
053249420f update parser, fix bugs varrnn & vocab 2018-11-09 10:59:36 +08:00
yunfan
3192c9ac66 update trainer 2018-11-08 22:15:58 +08:00
yunfan
c14d9f4d66 update biaffine 2018-11-08 22:13:47 +08:00
yunfan
830d223344 add transformer 2018-11-08 22:12:13 +08:00
yunfan
102259df39 update biaffine parser 2018-11-08 22:12:13 +08:00
yunfan
a4c9786ca4 update dataset & loader 2018-10-17 09:59:56 +08:00
yunfan
637c37d62b add new model, new module, fix bugs 2018-10-10 16:49:17 +08:00
FengZiYjun
fb806163c3 remove unused codes; add more tests 2018-10-07 15:03:00 +08:00