issue: #32995
To speed up the construction and querying of Bloom filters, we chose a
blocked Bloom filter instead of a basic Bloom filter implementation.
WARN: This PR is backward compatible with the old Bloom filter implementation, but if you
roll back to an older Milvus version, Bloom filter deserialization may fail.
In single Bloom filter test cases with a capacity of 1,000,000 and a
false positive rate (FPR) of 0.001, the blocked Bloom filter is 5 times
faster than the basic Bloom filter in both querying and construction, at
the cost of a 30% increase in memory usage.
- Block BF construct time {"time": "54.128131ms"}
- Block BF size {"size": 3021578}
- Block BF Test cost {"time": "55.407352ms"}
- Basic BF construct time {"time": "210.262183ms"}
- Basic BF size {"size": 2396308}
- Basic BF Test cost {"time": "192.596229ms"}
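For context, a blocked Bloom filter maps each key to a single cache-line-sized block and sets all of its bits within that block, so an insert or query touches one block instead of scattering accesses across the whole bit array; that locality is where the speedup above comes from. Below is a minimal, self-contained Go sketch of the idea (a toy illustration, not the implementation used in this PR):

```go
package main

import (
	"fmt"
	"hash/fnv"
)

const (
	blockBits  = 512 // one 64-byte cache line per block
	blockWords = blockBits / 64
	numHashes  = 8 // bits set per key inside a block
)

// BlockedBloom is a toy blocked Bloom filter: each key selects one block,
// then sets/tests numHashes bits derived from the key within that block.
type BlockedBloom struct {
	blocks [][blockWords]uint64
}

func NewBlockedBloom(numBlocks int) *BlockedBloom {
	return &BlockedBloom{blocks: make([][blockWords]uint64, numBlocks)}
}

func hash64(key []byte) uint64 {
	h := fnv.New64a()
	h.Write(key)
	return h.Sum64()
}

func (bf *BlockedBloom) Add(key []byte) {
	h := hash64(key)
	block := &bf.blocks[h%uint64(len(bf.blocks))]
	for i := 0; i < numHashes; i++ {
		// derive per-bit positions from the same 64-bit hash (double hashing)
		bit := ((h >> 32) + uint64(i)*(h&0xffffffff)) % blockBits
		block[bit/64] |= 1 << (bit % 64)
	}
}

func (bf *BlockedBloom) Test(key []byte) bool {
	h := hash64(key)
	block := &bf.blocks[h%uint64(len(bf.blocks))]
	for i := 0; i < numHashes; i++ {
		bit := ((h >> 32) + uint64(i)*(h&0xffffffff)) % blockBits
		if block[bit/64]&(1<<(bit%64)) == 0 {
			return false
		}
	}
	return true
}

func main() {
	bf := NewBlockedBloom(1 << 14)
	bf.Add([]byte("pk-42"))
	fmt.Println(bf.Test([]byte("pk-42")), bf.Test([]byte("pk-43")))
}
```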
In multi Bloom filter test cases with a capacity of 100,000, an FPR of
0.001, and 100 Bloom filters, we reuse the primary key locations for all
Bloom filters to avoid repeated hash computations. As a result, the
blocked Bloom filter is also 5 times faster than the basic Bloom filter
in querying.
- Block BF TestLocation cost {"time": "529.97183ms"}
- Basic BF TestLocation cost {"time": "3.197430181s"}
---------
Signed-off-by: Wei Liu <wei.liu@zilliz.com>
related: #33137
Add has_more_result_tag to the reduce at various levels to rectify
reduce_stop_for_best.
Signed-off-by: MrPresent-Han <chun.han@zilliz.com>
1. use a small warmup pool to reduce the impact of warmup
2. change the warmup pool to nonblocking mode (a sketch of a nonblocking submit follows below)
3. disable warmup by default
4. remove the maximum size limit of 16 for the load pool
issue: https://github.com/milvus-io/milvus/issues/32772
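For illustration, a small pool with a non-blocking submit could look like the hypothetical sketch below (`WarmupPool`/`Submit` are illustrative names, not the real conc-pool API): when the queue is full the task is dropped instead of blocking the caller, so warmup never stalls foreground work.

```go
package main

import (
	"fmt"
	"runtime"
	"time"
)

// WarmupPool is a hypothetical small worker pool with non-blocking submit:
// if all workers are busy and the queue is full, the task is simply dropped.
type WarmupPool struct {
	tasks chan func()
}

func NewWarmupPool(workers, queue int) *WarmupPool {
	p := &WarmupPool{tasks: make(chan func(), queue)}
	for i := 0; i < workers; i++ {
		go func() {
			for task := range p.tasks {
				task()
			}
		}()
	}
	return p
}

// Submit returns false instead of blocking when the pool is saturated.
func (p *WarmupPool) Submit(task func()) bool {
	select {
	case p.tasks <- task:
		return true
	default:
		return false
	}
}

func main() {
	// keep the warmup pool small, e.g. a fraction of the CPU cores
	pool := NewWarmupPool(runtime.NumCPU()/8+1, 16)
	ok := pool.Submit(func() { fmt.Println("warming up segment chunk...") })
	fmt.Println("submitted:", ok)
	time.Sleep(100 * time.Millisecond)
}
```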
---------
Signed-off-by: bigsheeper <yihao.dai@zilliz.com>
Co-authored-by: xiaofanluan <xiaofan.luan@zilliz.com>
issue: #32530
Because ProcessDelete needs to check whether a pk exists in the bloom filter,
and ProcessInsert needs to add pks to the bloom filter, executing
ProcessInsert and ProcessDelete in parallel causes a race condition on the
segment's bloom filter.
This PR executes ProcessInsert and ProcessDelete serially to avoid the race;
a sketch of the ordering follows below.
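A hedged, simplified illustration of the ordering change (the types and function names below are stand-ins, not the real querynode code):

```go
package main

import "fmt"

// pkStats stands in for a segment's primary-key bloom filter/stats
// (a hypothetical simplification for this sketch).
type pkStats map[int64]struct{}

func processInsert(stats pkStats, pks []int64) {
	for _, pk := range pks {
		stats[pk] = struct{}{} // writes to the pk stats
	}
}

func processDelete(stats pkStats, pks []int64) {
	for _, pk := range pks {
		if _, hit := stats[pk]; hit { // reads the pk stats
			fmt.Println("forward delete for pk", pk)
		}
	}
}

func main() {
	stats := pkStats{}
	inserts := []int64{1, 2, 3}
	deletes := []int64{2}

	// Before the fix (conceptually), the two steps ran concurrently and raced
	// on the same stats structure. After the fix they run one after the other:
	processInsert(stats, inserts)
	processDelete(stats, deletes)
}
```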
Signed-off-by: Wei Liu <wei.liu@zilliz.com>
fix #32979
Remove the L0 cache and build the delete pk and ts every time. This reduces
memory usage and also improves code readability.
Signed-off-by: xiaofanluan <xiaofan.luan@zilliz.com>
issue: #33005
1. Add a `MemorySize` field for the insert binlog (sketched below).
2. `LogSize` means the file size of the binlog in object storage.
3. `MemorySize` means the size of the data in memory.
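Roughly, the two fields relate as in the hypothetical Go struct below (field names mirror the description above, not the actual proto definition):

```go
package main

import "fmt"

// Binlog metadata illustrating the two size fields described above
// (hypothetical struct for clarity, not the real binlog proto).
type Binlog struct {
	LogPath    string
	LogSize    int64 // size of the encoded file in object storage
	MemorySize int64 // size of the decoded data once loaded into memory
}

func main() {
	b := Binlog{LogPath: "insert_log/example", LogSize: 4 << 20, MemorySize: 6 << 20}
	fmt.Printf("on-disk %d bytes, in-memory %d bytes\n", b.LogSize, b.MemorySize)
}
```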
---------
Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>
Signed-off-by: cai.zhang <cai.zhang@zilliz.com>
issue: #32206, #32801
- Search failures on some assertions: segment not loaded and resource
insufficient.
- Segment leak when querying segments.
---------
Signed-off-by: chyezh <chyezh@outlook.com>
issue: #32899
This PR fixes the wrong metric value for load index, introduced by
PR #32567, which used the wrong time unit for the load-index metrics.
Signed-off-by: Wei Liu <wei.liu@zilliz.com>
issue: #32530
When trying to match a pk against segment bloom filters, we can reuse the hash
locations. This PR maintains the maximum number of hash functions across segments
and computes the hash locations once for all segments; reusing the hash locations
speeds up bloom filter access, as sketched below.
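Conceptually, the pk is hashed once with the largest k among all segment filters, and each segment's filter then only tests the precomputed positions. A toy Go sketch of that pattern (not the actual Milvus bloom filter API):

```go
package main

import (
	"fmt"
	"hash/fnv"
)

// locations derives k candidate bit positions for a key using double hashing.
// The idea: compute them once per pk with the maximum k across all segments,
// and let each segment's filter reuse the prefix of positions it needs.
func locations(key []byte, k uint) []uint64 {
	f := fnv.New64a()
	f.Write(key)
	h := f.Sum64()
	h1, h2 := h>>32, h&0xffffffff
	locs := make([]uint64, k)
	for i := uint(0); i < k; i++ {
		locs[i] = h1 + uint64(i)*h2
	}
	return locs
}

// bloom is a toy filter that tests precomputed locations instead of rehashing.
type bloom struct {
	bits []uint64
	k    uint
}

func (b *bloom) add(locs []uint64) {
	for _, l := range locs[:b.k] {
		pos := l % uint64(len(b.bits)*64)
		b.bits[pos/64] |= 1 << (pos % 64)
	}
}

func (b *bloom) testLocations(locs []uint64) bool {
	for _, l := range locs[:b.k] {
		pos := l % uint64(len(b.bits)*64)
		if b.bits[pos/64]&(1<<(pos%64)) == 0 {
			return false
		}
	}
	return true
}

func main() {
	segments := []*bloom{
		{bits: make([]uint64, 1024), k: 7},
		{bits: make([]uint64, 2048), k: 9},
	}
	maxK := uint(9)

	pk := []byte("pk-100")
	locs := locations(pk, maxK) // hash once...
	segments[0].add(locs)

	for i, seg := range segments {
		// ...then reuse the same locations against every segment's filter
		fmt.Printf("segment %d may contain pk: %v\n", i, seg.testLocations(locs))
	}
}
```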
---------
Signed-off-by: Wei Liu <wei.liu@zilliz.com>
See also #32748
This PR:
- Add a `metautil.Channel` utility which converts a virtual channel name into
the physical channel name, collectionID, and shard idx (parsing sketched below)
- Add a channel mapper interface & implementation to convert the limited set of
physical channel names into integer indexes
- Apply `metautil.Channel` filtering in the querynode segment manager logic
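Assuming a virtual channel layout of the form `<physicalChannel>_<collectionID>v<shardIdx>` (an assumption made here only for illustration), the conversion could look like the simplified sketch below; the real `metautil.Channel` is more involved:

```go
package main

import (
	"fmt"
	"strconv"
	"strings"
)

// parseVChannel splits a virtual channel name of the assumed form
// "<physicalChannel>_<collectionID>v<shardIdx>" into its parts.
// Hypothetical helper, sketching the idea behind metautil.Channel.
func parseVChannel(vchannel string) (physical string, collectionID int64, shardIdx int, err error) {
	idx := strings.LastIndex(vchannel, "_")
	if idx < 0 {
		return "", 0, 0, fmt.Errorf("invalid vchannel: %s", vchannel)
	}
	physical = vchannel[:idx]
	suffix := vchannel[idx+1:]

	parts := strings.Split(suffix, "v")
	if len(parts) != 2 {
		return "", 0, 0, fmt.Errorf("invalid vchannel suffix: %s", suffix)
	}
	if collectionID, err = strconv.ParseInt(parts[0], 10, 64); err != nil {
		return "", 0, 0, err
	}
	if shardIdx, err = strconv.Atoi(parts[1]); err != nil {
		return "", 0, 0, err
	}
	return physical, collectionID, shardIdx, nil
}

func main() {
	p, coll, shard, err := parseVChannel("by-dev-rootcoord-dml_0_123456v1")
	fmt.Println(p, coll, shard, err) // by-dev-rootcoord-dml_0 123456 1 <nil>
}
```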
---------
Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>
issue: #32663
- Use a new param to control the resource-request timeout for lazy load.
- Remove the timeout parameter of `Do` and remove `DoWait`; use `context`
to control the timeout.
- Use `VersionedNotifier` to avoid lost notify events on broadcast, and
remove the redundant goroutine in the cache (see the sketch below).
related dev pr: #32684
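A rough sketch of the versioned-notifier idea (a simplified stand-in, not the exact `VersionedNotifier` implementation): every Notify bumps a version, and a listener compares against the last version it saw, so an event fired between registering and waiting is never lost.

```go
package main

import (
	"fmt"
	"sync"
	"time"
)

// VersionedNotifier sketch: Notify bumps a version and broadcasts; a Listener
// remembers the last version it consumed, so no wakeups are lost.
type VersionedNotifier struct {
	mu      sync.Mutex
	cond    *sync.Cond
	version int64
}

func NewVersionedNotifier() *VersionedNotifier {
	vn := &VersionedNotifier{}
	vn.cond = sync.NewCond(&vn.mu)
	return vn
}

func (vn *VersionedNotifier) Notify() {
	vn.mu.Lock()
	vn.version++
	vn.mu.Unlock()
	vn.cond.Broadcast()
}

type Listener struct {
	vn   *VersionedNotifier
	seen int64
}

func (vn *VersionedNotifier) Listen() *Listener {
	vn.mu.Lock()
	defer vn.mu.Unlock()
	return &Listener{vn: vn, seen: vn.version}
}

// Wait blocks until the notifier's version moves past the last seen one.
func (l *Listener) Wait() {
	l.vn.mu.Lock()
	defer l.vn.mu.Unlock()
	for l.vn.version == l.seen {
		l.vn.cond.Wait()
	}
	l.seen = l.vn.version
}

func main() {
	vn := NewVersionedNotifier()
	listener := vn.Listen()
	vn.Notify() // fires before Wait, but is not lost
	done := make(chan struct{})
	go func() { listener.Wait(); close(done) }()
	select {
	case <-done:
		fmt.Println("notification observed")
	case <-time.After(time.Second):
		fmt.Println("timed out")
	}
}
```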
Signed-off-by: chyezh <chyezh@outlook.com>
issue: #19095, #29655, #31718
- Change `ListWithPrefix` to `WalkWithPrefix` on the object storage, running
in a pipeline mode (see the sketch below).
- File garbage collection is performed in a separate goroutine.
- Segment index recycling cleans up index files as well.
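Sketch of the pipeline shape (hypothetical signatures, only to illustrate the list-vs-walk difference): instead of materializing the full key list in memory, the walk streams keys to a callback while garbage collection consumes them in its own goroutine.

```go
package main

import (
	"context"
	"fmt"
)

// walkWithPrefix streams object keys under a prefix to a callback instead of
// returning the full listing at once (hypothetical signature; the real method
// belongs to the ChunkManager interface).
func walkWithPrefix(ctx context.Context, prefix string, walkFunc func(key string) bool) error {
	// stand-in for paging through the object storage listing
	for _, key := range []string{prefix + "/a", prefix + "/b", prefix + "/c"} {
		select {
		case <-ctx.Done():
			return ctx.Err()
		default:
		}
		if !walkFunc(key) {
			return nil // walker asked to stop early
		}
	}
	return nil
}

func main() {
	ctx := context.Background()
	keys := make(chan string, 16)

	// producer: walk object storage and feed keys into the pipeline
	go func() {
		defer close(keys)
		_ = walkWithPrefix(ctx, "files/insert_log", func(key string) bool {
			keys <- key
			return true
		})
	}()

	// consumer: garbage collection runs as a separate stage/goroutine
	done := make(chan struct{})
	go func() {
		defer close(done)
		for key := range keys {
			fmt.Println("checking & possibly removing", key)
		}
	}()
	<-done
}
```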
---------
Signed-off-by: chyezh <chyezh@outlook.com>
`applyDelete` used to apply delete entries to each segment serially. This PR
makes it run in parallel with an errgroup to improve performance; a sketch follows below.
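Simplified sketch of the parallelization (stand-in types, not the actual delegator code), using `errgroup` to fan out per-segment deletes and propagate the first error:

```go
package main

import (
	"context"
	"fmt"

	"golang.org/x/sync/errgroup"
)

type segment struct{ id int64 }

// deleteOnSegment stands in for applying delete entries to a single segment.
func deleteOnSegment(ctx context.Context, seg *segment, pks []int64) error {
	fmt.Printf("segment %d: applied %d deletes\n", seg.id, len(pks))
	return nil
}

// applyDelete dispatches the per-segment work through an errgroup so the
// segments are processed concurrently instead of one after another.
func applyDelete(ctx context.Context, segments []*segment, pks []int64) error {
	eg, ctx := errgroup.WithContext(ctx)
	for _, seg := range segments {
		seg := seg // capture loop variable (pre-Go 1.22 semantics)
		eg.Go(func() error {
			return deleteOnSegment(ctx, seg, pks)
		})
	}
	return eg.Wait() // first non-nil error, if any
}

func main() {
	segs := []*segment{{id: 1}, {id: 2}, {id: 3}}
	if err := applyDelete(context.Background(), segs, []int64{7, 8}); err != nil {
		fmt.Println("applyDelete failed:", err)
	}
}
```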
---------
Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>
issue: #30361
- Deletes may be lost when a segment is not in data-loaded status in the LRU
cache; skip the filtering to fix it.
- `stats_` and `variable_fields_avg_size_` should be reset on
`ReleaseData`.
- Remove the repeated load-delta-log operation in the LRU cache.
---------
Signed-off-by: chyezh <chyezh@outlook.com>
fix https://github.com/milvus-io/milvus/issues/32059
This PR fixes two issues:
- The offset is not handled correctly when no limit is specified.
- reduceStopForBest does not guarantee returning `limit` results, even if more
results exist, when there is a small segment.
Signed-off-by: xiaofanluan <xiaofan.luan@zilliz.com>
issue: #31479, #31797
Milvus adds released segments to the excluded info and filters out their
streaming data in filter_node. But for data buffered in insert_node's
channel, if it belongs to a growing segment that has already been released,
it will bring the whole growing segment back again.
This PR maintains `excluded segments` in the delegator and checks the excluded
segments before creating a new growing segment; a sketch follows below.
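A hedged sketch of the delegator-side bookkeeping (hypothetical type and method names): released segments are recorded together with a timestamp, and buffered insert data at or below that timestamp is dropped instead of recreating the growing segment.

```go
package main

import (
	"fmt"
	"sync"
)

// excludedSegments sketches the bookkeeping: segment ID -> timestamp up to
// which its streaming data must be dropped (hypothetical simplification).
type excludedSegments struct {
	mu       sync.RWMutex
	segments map[int64]uint64
}

func newExcludedSegments() *excludedSegments {
	return &excludedSegments{segments: make(map[int64]uint64)}
}

// Add records a released segment and the ts below which its data is stale.
func (e *excludedSegments) Add(segmentID int64, ts uint64) {
	e.mu.Lock()
	defer e.mu.Unlock()
	e.segments[segmentID] = ts
}

// ShouldDrop reports whether buffered insert data for this segment/ts must be
// dropped instead of spawning the growing segment again.
func (e *excludedSegments) ShouldDrop(segmentID int64, ts uint64) bool {
	e.mu.RLock()
	defer e.mu.RUnlock()
	excludeTs, ok := e.segments[segmentID]
	return ok && ts <= excludeTs
}

func main() {
	excluded := newExcludedSegments()
	excluded.Add(1001, 50) // segment 1001 released at ts 50

	// insert_node checks before creating a new growing segment:
	for _, ts := range []uint64{40, 60} {
		fmt.Printf("segment 1001 data at ts %d dropped: %v\n", ts, excluded.ShouldDrop(1001, ts))
	}
}
```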
---------
Signed-off-by: Wei Liu <wei.liu@zilliz.com>
related: #31959
1. Reset the segment index status to lazyload=true after eviction.
2. Reset num_rows to null_opt.
Signed-off-by: MrPresent-Han <chun.han@zilliz.com>