milvus

mirror of https://gitee.com/milvus-io/milvus.git synced 2024-12-02 11:59:00 +08:00

Author	SHA1	Message	Date
Chun Han	c46c401112	fix: refine handling type for segment pruner(#34923 ) (#34925 ) related: #34923 Signed-off-by: MrPresent-Han <chun.han@gmail.com> Co-authored-by: MrPresent-Han <chun.han@gmail.com>	2024-07-25 13:57:45 +08:00
congqixia	2ac7164c39	enhance: Remove useless ops when there is no write (#34767 ) Related to #33235 THe querynode pipeline will make map & call ProcessInsert when there is no write messages. So querynodes will have high CPU usage even when there is no workload. This PR check msg length before composing data struct and calling method Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-07-19 14:31:42 +08:00
zhagnlu	804dd5409a	enhance: mark duplicated pk as deleted (#34586 ) fix #34247 Signed-off-by: luzhang <luzhang@zilliz.com> Co-authored-by: luzhang <luzhang@zilliz.com>	2024-07-16 14:25:39 +08:00
congqixia	531092c031	enhance: Add lint rule to forbid gogo protobuf (#34594 ) github.com/gogo/protobuf is deprecated and could be error prune after upgrade protobuf message to v2. Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-07-12 10:19:35 +08:00
jaime	3b62138c5c	fix: unstable UT for level0 deletion (#34524 ) issue: #34533 Signed-off-by: jaime <yun.zhang@zilliz.com>	2024-07-11 10:02:56 +08:00
congqixia	d60e628aed	enhance: Avoid use concrete segment type in segments interfaces (#34521 ) See also #34519 --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-07-10 10:18:12 +08:00
wei liu	eeb03a0d6a	fix: Query may return deleted records (#34501 ) issue: #34500 cause the sort in `GetLevel0Deletions` will broken the corresponed order between pks and tss, then the pks and tss will be sorted in segment.Delete() interface. This PR remove this uncessary and incorrect sort progress to avoid query may return deleted records. Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-07-09 10:46:11 +08:00
Chun Han	8af187f673	fix: lose partitionIDs when scalar pruning and refine segment prune ratio metrics(#30376 ) (#34477 ) related: #30376 fix: paritionIDs lost when no setting paritions enhance: refine metrics for segment prune Signed-off-by: MrPresent-Han <chun.han@zilliz.com>	2024-07-08 19:54:15 +08:00
Chun Han	fcafdb6d5f	enhance: reconstruct scalar part's code for segment-pruner(#30376 ) (#34346 ) related: #30376 1. support more complex expr 2. add more ut test for unrelated fields Signed-off-by: MrPresent-Han <chun.han@gmail.com> Co-authored-by: MrPresent-Han <chun.han@gmail.com>	2024-07-04 16:36:09 +08:00
Chun Han	34bec2ea5e	enhance: add metrics for segment prune latnecy(#30376 ) (#34094 ) related: #30376 Signed-off-by: MrPresent-Han <chun.han@gmail.com> Co-authored-by: MrPresent-Han <chun.han@gmail.com>	2024-07-03 10:04:07 +08:00
wei liu	b49862d4f3	enhance: Optimize grow slice cost during query (#34253 ) issue: #32252 This PR try to pre-allocate FieldData for Reduce operations in the Query chain using typeutil.PrepareResultFieldData to avoid the overhead of dynamically growing the slice during appendFieldData process. --------- Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-07-01 15:18:11 +08:00
wei liu	45203425fd	enhance: Avoid search querynode return nil status in response (#34100 ) Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-06-26 11:50:11 +08:00
jaime	9630974fbb	enhance: move rocksmq from internal to pkg module (#33881 ) issue: #33956 Signed-off-by: jaime <yun.zhang@zilliz.com>	2024-06-25 21:18:15 +08:00
wayblink	f9a0f7bb25	Add an option to enable/disable vector field clustering key (#34097 ) #30633 Signed-off-by: wayblink <anyang.wang@zilliz.com>	2024-06-25 18:52:04 +08:00
congqixia	fd922d921a	enhance: Add nilness linter and fix some small issues (#34049 ) Add `nilness` for govet linter and fixed some detected issues Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-06-24 14:52:03 +08:00
Chun Han	ca7ef26e4b	fix: sync part stats task cannot be finished(#30376 ) (#34027 ) related: #30376 also: refine log output for query_coord task by rephrasing action string Signed-off-by: MrPresent-Han <chun.han@gmail.com> Co-authored-by: MrPresent-Han <chun.han@gmail.com>	2024-06-24 10:16:02 +08:00
chyezh	259a682673	enhance: async search and retrieve in cgo (#33228 ) issue: #30926, #33132 related pr: #33133 --------- Signed-off-by: chyezh <chyezh@outlook.com>	2024-06-22 09:38:02 +08:00
smellthemoon	2a1356985d	enhance: support null in go payload (#32296 ) #31728 --------- Signed-off-by: lixinguo <xinguo.li@zilliz.com> Co-authored-by: lixinguo <xinguo.li@zilliz.com>	2024-06-19 17:08:00 +08:00
Gao	a789c60380	enhance: autoindex for multi data type (#33868 ) issue: #22837 contain https://github.com/milvus-io/milvus/pull/33625 https://github.com/milvus-io/milvus/pull/33867 https://github.com/milvus-io/milvus/pull/33911 which already merged to 2.4 branch Signed-off-by: chasingegg <chao.gao@zilliz.com> Co-authored-by: foxspy <xianliang.li@zilliz.com>	2024-06-18 21:34:01 +08:00
congqixia	3fdaae8792	fix: Return record with largest timestamp for entires with same PK (#33936 ) See also #33883 --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-06-18 15:55:59 +08:00
cqy123456	32f685ff12	enhance: growing segment support mmap (#32633 ) issue: https://github.com/milvus-io/milvus/issues/32984 Signed-off-by: cqy123456 <qianya.cheng@zilliz.com>	2024-06-18 14:42:00 +08:00
congqixia	ec64499536	fix: Check nodeID wildcard when removing pkOracle (#33895 ) See also #33894 Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-06-18 14:11:58 +08:00
congqixia	2a04b0929a	fix: Prevent use captured iteration variable partitionID (#33906 ) See also #33902 Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-06-17 19:11:59 +08:00
chyezh	9b69601dfb	fix: load operation when segment is on releasing (#31340 ) issue: #30857 --------- Signed-off-by: chyezh <chyezh@outlook.com>	2024-06-14 15:35:56 +08:00
wei liu	4987067375	enhance: Execute bloom filter apply in parallel to speed up segment predict (#33792 ) issue: #33610 Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-06-14 11:37:56 +08:00
wei liu	ab93d9c23d	enhance: Use BatchPkExist to reduce bloom filter func call cost (#33611 ) issue:#33610 Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-06-13 17:57:56 +08:00
chyezh	8ca5ced821	fix: async warmup will be blocked by state lock (#33686 ) issue: #33685 Signed-off-by: chyezh <chyezh@outlook.com>	2024-06-10 21:59:53 +08:00
wayblink	a1232fafda	feat: Major compaction (#33620 ) #30633 Signed-off-by: wayblink <anyang.wang@zilliz.com> Co-authored-by: MrPresent-Han <chun.han@zilliz.com>	2024-06-10 21:34:08 +08:00
yihao.dai	3540eee977	enhance: Support L0 import (#33514 ) issue: https://github.com/milvus-io/milvus/issues/33157 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-06-07 14:17:20 +08:00
jaime	8858fcb40a	fix: fix loaded entity num is inaccurate (#33521 ) issue: #33520 Signed-off-by: jaime <yun.zhang@zilliz.com>	2024-06-04 20:09:54 +08:00
wei liu	34c6a989ab	enhance: Avoid load bf in delegator when qn worker has no more memory (#33557 ) query coord send load request to delegator, delegator load bf first, then forward load request to qn worker. but when qn worker has no more memory, it will return load failed immediatelly. then delegator roll back the loaded bf. query coord wil retry the load request, and delegator will load and roll back bf again and again. this PR delay the loading bf step until load segment succeed in worker. Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-06-03 19:23:45 +08:00
wei liu	c6a1c49e02	enhance: Use Blocked Bloom Filter instead of basic bloom fitler impl. (#33405 ) issue: #32995 To speed up the construction and querying of Bloom filters, we chose a blocked Bloom filter instead of a basic Bloom filter implementation. WARN: This PR is compatible with old version bf impl, but if fall back to old milvus version, it may causes bloom filter deserialize failed. In single Bloom filter test cases with a capacity of 1,000,000 and a false positive rate (FPR) of 0.001, the blocked Bloom filter is 5 times faster than the basic Bloom filter in both querying and construction, at the cost of a 30% increase in memory usage. - Block BF construct time {"time": "54.128131ms"} - Block BF size {"size": 3021578} - Block BF Test cost {"time": "55.407352ms"} - Basic BF construct time {"time": "210.262183ms"} - Basic BF size {"size": 2396308} - Basic BF Test cost {"time": "192.596229ms"} In multi Bloom filter test cases with a capacity of 100,000, an FPR of 0.001, and 100 Bloom filters, we reuse the primary key locations for all Bloom filters to avoid repeated hash computations. As a result, the blocked Bloom filter is also 5 times faster than the basic Bloom filter in querying. - Block BF TestLocation cost {"time": "529.97183ms"} - Basic BF TestLocation cost {"time": "3.197430181s"} --------- Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-05-31 17:49:45 +08:00
Jiquan Long	0c5d8660aa	feat: support inverted index for array (#33452 ) issue: https://github.com/milvus-io/milvus/issues/27704 --------- Signed-off-by: longjiquan <jiquan.long@zilliz.com>	2024-05-31 09:47:47 +08:00
Chun Han	416a2cf507	fix: query iterator lack results(#33137 ) (#33422 ) related: #33137 adding has_more_result_tag for various level's reduce to rectify reduce_stop_for_best Signed-off-by: MrPresent-Han <chun.han@zilliz.com>	2024-05-30 17:51:44 +08:00
jaime	0d3272ed6d	enhance: refine logs of cgo pool (#33373 ) Signed-off-by: jaime <yun.zhang@zilliz.com>	2024-05-27 19:06:11 +08:00
aoiasd	59a7a46904	enhance: Merge query stream result for reduce delete task (#32855 ) relate: https://github.com/milvus-io/milvus/issues/32854 --------- Signed-off-by: aoiasd <zhicheng.yue@zilliz.com>	2024-05-27 18:15:43 +08:00
SimFG	cb99e3db34	enhance: add the includeCurrentMsg param for the Seek method (#33326 ) /kind improvement - issue: #33325 Signed-off-by: SimFG <bang.fu@zilliz.com>	2024-05-27 10:31:41 +08:00
jaime	58ee613fea	enhance: remove repeated stats of loaded entity (#33255 ) Signed-off-by: jaime <yun.zhang@zilliz.com>	2024-05-27 01:49:41 +08:00
yihao.dai	760223f80a	fix: use seperate warmup pool and disable warmup by default (#33348 ) 1. use a small warmup pool to reduce the impact of warmup 2. change the warmup pool to nonblocking mode 3. disable warmup by default 4. remove the maximum size limit of 16 for the load pool issue: https://github.com/milvus-io/milvus/issues/32772 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com> Co-authored-by: xiaofanluan <xiaofan.luan@zilliz.com>	2024-05-27 01:25:40 +08:00
Bingyi Sun	370562b4ec	fix: fix partition loaded num metric (#33316 ) issue: https://github.com/milvus-io/milvus/issues/32108 Signed-off-by: sunby <sunbingyi1992@gmail.com>	2024-05-24 15:31:42 +08:00
wei liu	39f56678a0	enhance: Reduce bloom filter lock contention between insert and delete in query coord (#32643 ) issue: #32530 cause ProcessDelete need to check whether pk exist in bloom filter, and ProcessInsert need to update pk to bloom filter, when execute ProcessInsert and ProcessDelete in parallel, it will cause race condition in segment's bloom filter This PR execute ProcessInsert and ProcessDelete in serial to avoid block each other Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-05-22 19:11:40 +08:00
Xiaofan	3d105fcb4d	enhance: Remove l0 delete cache (#32990 ) fix #32979 remove l0 cache and build delete pk and ts everytime. this reduce the memory and also increase the code readability Signed-off-by: xiaofanluan <xiaofan.luan@zilliz.com>	2024-05-21 22:53:40 +08:00
Bingyi Sun	0f8c6f49ff	enhance: mmap load raw data if scalar index does not have raw data (#33175 ) Signed-off-by: sunby <sunbingyi1992@gmail.com>	2024-05-21 11:53:39 +08:00
wei liu	f1c9986974	enhance: Skip return data distribution if no change happen (#32814 ) issue: #32813 --------- Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-05-17 10:11:37 +08:00
Jiquan Long	dd9919a7dc	fix: two-phase retrieval on lru-segment (#32945 ) issue: #31822 --------- Signed-off-by: longjiquan <jiquan.long@zilliz.com>	2024-05-15 17:53:34 +08:00
cai.zhang	6ea7633bd5	enhance: Add memory size for binlog (#33025 ) issue: #33005 1. add `MemorySize` field for insert binlog. 2. `LogSize` means the file size in the storage object. 3. `MemorySize` means the size of the data in the memory. --------- Signed-off-by: Cai Zhang <cai.zhang@zilliz.com> Signed-off-by: cai.zhang <cai.zhang@zilliz.com>	2024-05-15 12:59:34 +08:00
SimFG	1d48d0aeb2	enhance: use different value to get related data size according to segment type (#33017 ) issue: #30436 Signed-off-by: SimFG <bang.fu@zilliz.com>	2024-05-14 14:59:33 +08:00
Cai Yudong	4fc7915c70	enhance: unify data generation test APIs (#32955 ) Issue: #22837 Signed-off-by: Cai Yudong <yudong.cai@zilliz.com>	2024-05-14 14:33:33 +08:00
chyezh	96489b814d	fix: remove busy log (#33042 ) issue: #32963 Signed-off-by: chyezh <chyezh@outlook.com>	2024-05-14 14:20:32 +08:00
foxspy	f6777267e3	enhance: add score compute consistency config for knowhere (#32997 ) issue: https://github.com/milvus-io/milvus/issues/32583 related: #32584 Signed-off-by: xianliang.li <xianliang.li@zilliz.com>	2024-05-13 14:21:31 +08:00

1 2 3 4 5 ...

504 Commits