Commit Graph

1996 Commits

Author SHA1 Message Date
yh
b899b1edd8 Modify bucket sampler, add URL download feature 2018-11-11 20:25:47 +08:00
yh
9667c524a4 Mostly complete the CWS predict 2018-11-11 15:53:33 +08:00
FFTYYY
3cadd5a325 fix an iterant loss function, and some errors in comments 2018-11-11 13:47:54 +08:00
yh
9fc20ac7b8 Add a pipeline for infer 2018-11-11 12:55:30 +08:00
yh
0a8a76f769 Resolve conflicts 2018-11-11 12:43:16 +08:00
yh
dc7f8ef8d4 bug fix 2018-11-11 12:42:05 +08:00
yunfan
82f4351540 add index to word processor 2018-11-11 12:41:20 +08:00
yh_cc
7df33b23ea Merge branch 'dataset' of github.com:yhcc/fastNLP into dataset 2018-11-11 00:40:10 +08:00
FFTYYY
07fb61efdc Update test_loss 2018-11-10 23:21:26 +08:00
FengZiYjun
e2b14ed33d Merge remote-tracking branch 'origin/dataset' into dataset 2018-11-10 21:20:34 +08:00
FengZiYjun
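5dd0f74d6d - Add pos_tagger API; the pipeline now runs end to end
- Fix a bug in processor
- Update several components in core/, remove redundant parameters of batch
- CRF had a typo? Fixed
- Update the pos tag training script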
2018-11-10 21:20:16 +08:00
yh_cc
3e50ca8a72 Create a test context 2018-11-10 20:37:48 +08:00
yh_cc
de3feeaf5a Adjust the location of the CWS functions 2018-11-10 20:10:13 +08:00
yh_cc
752efc57fd Merge branch 'dataset' of github.com:yhcc/fastNLP into dataset 2018-11-10 19:59:40 +08:00
yh_cc
ea1c8c1100 Word segmentation accuracy of the current version has reached the normal segmentation score 2018-11-10 19:59:32 +08:00
FengZiYjun
ec9fd32d60 improve trainer: log mean and std of model params, and sum of gradients 2018-11-10 18:49:22 +08:00
FengZiYjun
5e84ca618e merge and update 2018-11-10 17:04:37 +08:00
FengZiYjun
cd68d78d50 Merge remote-tracking branch 'origin/dataset' into dataset
# Conflicts:
#	fastNLP/api/pipeline.py
#	fastNLP/api/pos_tagger.py
#	fastNLP/api/processor.py
#	fastNLP/modules/decoder/CRF.py
2018-11-10 17:02:58 +08:00
FengZiYjun
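26e3abdf58 - Modify the pos tag training script; it now runs
- Create converter.py in api
- Add an initialization method to Pipeline so processors can be added all at once
- Delete pos_tagger.py
- Improve overall code style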
2018-11-10 16:58:27 +08:00
yunfan
64a9bacbc2 fix crf 2018-11-10 16:50:56 +08:00
yh_cc
10bb2810ab Merge branch 'dataset' of github.com:yhcc/fastNLP into dataset 2018-11-10 15:34:21 +08:00
yunfan
3ae12e2c13 fix processor 2018-11-10 15:32:06 +08:00
yh_cc
73ba3b5eec bug fix for pipeline 2018-11-10 15:17:58 +08:00
yunfan
1806bbdbec fix dataset 2018-11-10 15:13:53 +08:00
yunfan
b7aab90157 init parser api 2018-11-10 15:00:25 +08:00
yunfan
a6ab34fd38 fix crf 2018-11-10 14:53:50 +08:00
yh_cc
3cb98ddcf2 Add a BucketSampler to Sampler; CWS training mostly works 2018-11-10 14:46:38 +08:00
yh_cc
69a138eb18 Fix several issues encountered, add some methods for the word segmentation task 2018-11-10 13:41:19 +08:00
yh_cc
dc0124cf02 Rename model to models 2018-11-10 11:10:14 +08:00
yh_cc
25a53ac5c9 Update processor to adapt to yesterday's fancy changes 2018-11-10 10:56:28 +08:00
yh_cc
ae0cc9a46b Modify the api.load() function 2018-11-10 10:31:45 +08:00
xuyige
dff4cdf6a7 update API 2018-11-09 22:20:12 +08:00
yh
d818e91380 Make dataset automatically create the corresponding array 2018-11-09 22:11:26 +08:00
yh
217cab94d1 Merge branch 'dataset' of https://github.com/yhcc/fastNLP into dataset 2018-11-09 22:03:32 +08:00
yunfan
dd0bb0d791 add data iter 2018-11-09 22:02:34 +08:00
yh
515e4f4987 Move processor to processor.py 2018-11-09 22:02:10 +08:00
FFTYYY
1f15b52216 update readme, for requirements changed 2018-11-09 21:21:57 +08:00
FFTYYY
2cd2dae251 update loss 2018-11-09 21:20:06 +08:00
yunfan
d7e78faf4b Merge branch 'dataset' of https://github.com/yhcc/fastNLP into dataset 2018-11-09 20:42:53 +08:00
yunfan
f90861d7a5 fix fieldarray, dataset 2018-11-09 20:42:33 +08:00
yh
89ce85b6ed Merge branch 'dataset' of https://github.com/yhcc/fastNLP into dataset 2018-11-09 20:23:11 +08:00
yh
38aa207ea2 Add cws converter, io 2018-11-09 20:23:05 +08:00
yunfan
ff6d99bcb2 add dataset support for sampler, update batch 2018-11-09 20:16:08 +08:00
yunfan
0cbbfd5221 update dataset 2018-11-09 20:06:56 +08:00
xuyige
ba51bf4cb5 update requirements 2018-11-09 19:58:15 +08:00
FengZiYjun
12e9a93b52 Merge remote-tracking branch 'origin/dataset' into dataset 2018-11-09 19:53:08 +08:00
FengZiYjun
79105381f5 - add interfaces for pos_tagging API
- update predictor.py to remove unused methods
- update model_loader.py & model_saver.py to support entire model saving & loading
- update pos tagging training script
2018-11-09 19:52:31 +08:00
yh
1b9daa1985 Add some CWS functionality 2018-11-09 19:25:18 +08:00
yh
fcf5af93d8 Modify batch, add interfaces for pipeline and processor 2018-11-09 18:35:18 +08:00
yunfan
8fae3bc2e7 Merge branch 'dev' into dataset 2018-11-09 18:23:40 +08:00