milvus

mirror of https://gitee.com/milvus-io/milvus.git synced 2024-11-30 19:08:30 +08:00

Author	SHA1	Message	Date
congqixia	2a04b0929a	fix: Prevent use captured iteration variable partitionID (#33906 ) See also #33902 Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-06-17 19:11:59 +08:00
wei liu	4987067375	enhance: Execute bloom filter apply in parallel to speed up segment predict (#33792 ) issue: #33610 Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-06-14 11:37:56 +08:00
wei liu	ab93d9c23d	enhance: Use BatchPkExist to reduce bloom filter func call cost (#33611 ) issue:#33610 Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-06-13 17:57:56 +08:00
wayblink	a1232fafda	feat: Major compaction (#33620 ) #30633 Signed-off-by: wayblink <anyang.wang@zilliz.com> Co-authored-by: MrPresent-Han <chun.han@zilliz.com>	2024-06-10 21:34:08 +08:00
wei liu	34c6a989ab	enhance: Avoid load bf in delegator when qn worker has no more memory (#33557 ) query coord send load request to delegator, delegator load bf first, then forward load request to qn worker. but when qn worker has no more memory, it will return load failed immediatelly. then delegator roll back the loaded bf. query coord wil retry the load request, and delegator will load and roll back bf again and again. this PR delay the loading bf step until load segment succeed in worker. Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-06-03 19:23:45 +08:00
wei liu	c6a1c49e02	enhance: Use Blocked Bloom Filter instead of basic bloom fitler impl. (#33405 ) issue: #32995 To speed up the construction and querying of Bloom filters, we chose a blocked Bloom filter instead of a basic Bloom filter implementation. WARN: This PR is compatible with old version bf impl, but if fall back to old milvus version, it may causes bloom filter deserialize failed. In single Bloom filter test cases with a capacity of 1,000,000 and a false positive rate (FPR) of 0.001, the blocked Bloom filter is 5 times faster than the basic Bloom filter in both querying and construction, at the cost of a 30% increase in memory usage. - Block BF construct time {"time": "54.128131ms"} - Block BF size {"size": 3021578} - Block BF Test cost {"time": "55.407352ms"} - Basic BF construct time {"time": "210.262183ms"} - Basic BF size {"size": 2396308} - Basic BF Test cost {"time": "192.596229ms"} In multi Bloom filter test cases with a capacity of 100,000, an FPR of 0.001, and 100 Bloom filters, we reuse the primary key locations for all Bloom filters to avoid repeated hash computations. As a result, the blocked Bloom filter is also 5 times faster than the basic Bloom filter in querying. - Block BF TestLocation cost {"time": "529.97183ms"} - Basic BF TestLocation cost {"time": "3.197430181s"} --------- Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-05-31 17:49:45 +08:00
SimFG	cb99e3db34	enhance: add the includeCurrentMsg param for the Seek method (#33326 ) /kind improvement - issue: #33325 Signed-off-by: SimFG <bang.fu@zilliz.com>	2024-05-27 10:31:41 +08:00
Xiaofan	3d105fcb4d	enhance: Remove l0 delete cache (#32990 ) fix #32979 remove l0 cache and build delete pk and ts everytime. this reduce the memory and also increase the code readability Signed-off-by: xiaofanluan <xiaofan.luan@zilliz.com>	2024-05-21 22:53:40 +08:00
wei liu	5038036ece	enhance: Reuse hash locations during access bloom fitler (#32642 ) issue: #32530 when try to match segment bloom filter with pk, we can reuse the hash locations. This PR maintain the max hash Func, and compute hash location once for all segment, reuse hash location can speed up bf access --------- Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-05-07 06:13:47 -07:00
congqixia	40728ce83d	enhance: Add `metautil.Channel` to convert string compare to int (#32749 ) See also #32748 This PR: - Add `metautil.Channel` utiltiy which convert virtual name to physical channel name, collectionID and shard idx - Add channel mapper interface & implementation to convert limited physical channel name into int index - Apply `metautil.Channel` filter in querynode segment manager logic --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-05-07 19:13:35 +08:00
Bingyi Sun	fecd9c21ba	feat: LRU cache implementation (#32567 ) issue: https://github.com/milvus-io/milvus/issues/32783 This pr is the implementation of lru cache on branch lru-dev. Signed-off-by: sunby <sunbingyi1992@gmail.com> Co-authored-by: chyezh <chyezh@outlook.com> Co-authored-by: MrPresent-Han <chun.han@zilliz.com> Co-authored-by: Ted Xu <ted.xu@zilliz.com> Co-authored-by: jaime <yun.zhang@zilliz.com> Co-authored-by: wayblink <anyang.wang@zilliz.com>	2024-05-06 20:29:30 +08:00
Chun Han	ac82cef04d	enhance: disable reload partstats by config (#32702 ) Signed-off-by: MrPresent-Han <chun.han@zilliz.com>	2024-04-29 19:11:26 +08:00
chyezh	2586c2f1b3	enhance: use WalkWithPrefix api for oss, enable piplined file gc (#31740 ) issue: #19095,#29655,#31718 - Change `ListWithPrefix` to `WalkWithPrefix` of OOS into a pipeline mode. - File garbage collection is performed in other goroutine. - Segment Index Recycle clean index file too. --------- Signed-off-by: chyezh <chyezh@outlook.com>	2024-04-25 20:41:27 +08:00
congqixia	faa559592d	enhance: Make applyDelete work in paralell in segment level (#32291 ) `applyDelete` used to be serial for delete entries on each segments. This PR make it work in parallel with errgroup to improve performance --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-04-24 17:01:24 +08:00
yihao.dai	281a583eda	fix: Correct the negative queryable num entities metric (#32361 ) issue: https://github.com/milvus-io/milvus/issues/32281 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-04-24 15:55:24 +08:00
wei liu	68dec7dcd4	fix: Use correct ts to avoid exclude segment list leak (#31991 ) issue: #31990 Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-04-12 10:39:19 +08:00
wei liu	1a98ce39f5	enhance: Remove useless logic about FromShardLeader (#32029 ) issue: #32047 Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-04-10 20:11:19 +08:00
wei liu	df208d538c	fix: Check exclude segment before add new growing segment (#31803 ) issue: #31479 #31797 milvus will add released segment to excluded info, and filter out it's stream data in filter_node. but for data buffered in insert_node's channel, if it belongs to growing segment which already be released, then it will all the growing segment back again. This PR maintain `excluded segments` in delegator, and check excluded segment before new growing segment. --------- Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-04-10 15:29:17 +08:00
aoiasd	5b693c466d	fix: delegator filter out all partition's delete msg when loading segment (#31585 ) May cause deleted data queryable a period of time. relate: https://github.com/milvus-io/milvus/issues/31484 https://github.com/milvus-io/milvus/issues/31548 --------- Signed-off-by: aoiasd <zhicheng.yue@zilliz.com>	2024-04-09 15:21:24 +08:00
zhenshan.cao	089c805e0a	enhance:Refactor hybrid search (#32020 ) issue: https://github.com/milvus-io/milvus/issues/25639 https://github.com/milvus-io/milvus/issues/31368 Signed-off-by: zhenshan.cao <zhenshan.cao@zilliz.com>	2024-04-09 14:21:18 +08:00
jaime	bd853be8c7	enhance: Add db label for some usual metrics (#30956 ) issue: #31782 Signed-off-by: jaime <yun.zhang@zilliz.com>	2024-04-02 14:27:13 +08:00
Chun Han	bd44bd5ae2	enhance: add default value config for segment prune filterRatio(#31003 ) (#31580 ) related: #31003 Signed-off-by: MrPresent-Han <chun.han@zilliz.com>	2024-03-27 16:13:10 +08:00
Chun Han	c3264ca3e3	feat: support segment pruner (#31003 ) related: #30376	2024-03-22 13:57:06 +08:00
congqixia	a647b84f3e	enhance: Add AllPartitionsID const to replace InvalidPartitionID (#31438 ) "-1" as `InvalidPartitionID` previously used as All partition place holder in delete cases. It's confusing and hard to maintain when a const var has more than one meaning. This PR add `AllPartitionsID` to replace these usages in delete scenarios. --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-03-20 19:01:05 +08:00
Xiaofan	a63b4cedcf	fix: remove some unnecessary unrecoverable errors (#31327 ) use retry.handle when request is not able to service but don't throw unrecoverable erros fix #31323 Signed-off-by: xiaofanluan <xiaofan.luan@zilliz.com>	2024-03-20 11:35:07 +08:00
chyezh	8e293dc1ce	enhance: add resource usage estimate for segment interface (#31050 ) issue: #30931 - move resource estimate function outside from segment loader. - add load info and collection to base segment. - add resource usage method for sealed segment. Signed-off-by: chyezh <chyezh@outlook.com>	2024-03-19 11:53:05 +08:00
congqixia	243e311515	fix: Report offline info in `GetDataDistribition` (#31347 ) See also #31345 Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-03-18 14:51:04 +08:00
congqixia	ba8197fafb	fix: Filter channel level zero segments when build level delete cache (#31129 ) See also #31125 Delegator shall build level zero delete cache from l0 segments belongs to it. Previously it build cache from all existing level zero segments in the querynode which may lead to high memory usage and even panicking when pk types are not matched --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-03-07 23:01:02 +08:00
smellthemoon	a4f3e01a3a	fix: add the range search params check in proxy (#30423 ) if check in Segcore, will not do the it when not insert data. so, check "radius" and "range_filter" in proxy. related with #30365 Signed-off-by: lixinguo <xinguo.li@zilliz.com> Co-authored-by: lixinguo <xinguo.li@zilliz.com>	2024-02-28 11:24:58 +08:00
xige-16	060c8603a3	fix: Support mvcc with hybrid serach (#30114 ) issue: https://github.com/milvus-io/milvus/issues/29656 /kind bug Signed-off-by: xige-16 <xi.ge@zilliz.com> --------- Signed-off-by: xige-16 <xi.ge@zilliz.com>	2024-02-01 16:03:03 +08:00
congqixia	7c086a4608	enhance: Set delete scope for LoadSegment streaming data (#30245 ) See also #29474 --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-01-30 11:13:02 +08:00
chyezh	d300bc7bcb	fix: querynode num entity metric is broken by illegal label (#29948 ) issue: #29766 also see pr: #29825 Signed-off-by: chyezh <ye.zhen@zilliz.com>	2024-01-14 10:23:00 +08:00
congqixia	adf0c8885c	enhance: add trace span for wait tsafe (#29911 ) Add tracing span for search/query operation waiting tsafe duration Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-01-12 14:03:04 +08:00
yah01	d357139064	fix: the entities num metric may be contributed more than once (#29767 ) the growing segments contribute to this metric while inserting and putting into the manager, but the current impl inserts data before putting the segments into manager, which leads to double contributions fix: #29766 Signed-off-by: yah01 <yah2er0ne@outlook.com>	2024-01-10 10:00:51 +08:00
zhenshan.cao	60e88fb833	fix: Restore the MVCC functionality. (#29749 ) When the TimeTravel functionality was previously removed, it inadvertently affected the MVCC functionality within the system. This PR aims to reintroduce the internal MVCC functionality as follows: 1. Add MvccTimestamp to the requests of Search/Query and the results of Search internally. 2. When the delegator receives a Query/Search request and there is no MVCC timestamp set in the request, set the delegator's current tsafe as the MVCC timestamp of the request. If the request already has an MVCC timestamp, do not modify it. 3. When the Proxy handles Search and triggers the second phase ReQuery, divide the ReQuery into different shards and pass the MVCC timestamp to the corresponding Query requests. issue: #29656 Signed-off-by: zhenshan.cao <zhenshan.cao@zilliz.com>	2024-01-09 11:38:48 +08:00
congqixia	fe47deebf3	fix: Set & Return correct SegmentLevel in querynode segment manager (#29740 ) See also #27349 The segment level label in querynode used `Legacy` before segment level was correctly passed in Load request. Now this attribute is still using legacy so the metrics does not look right. This PR add paramter for `NewSegment` and passes corrent values for each invocation. Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-01-08 14:16:48 +08:00
yah01	a0cec4047a	fix: make the entity num metric accurate (#29643 ) fix #29642 Signed-off-by: yah01 <yang.cen@zilliz.com>	2024-01-05 18:24:47 +08:00
congqixia	da7c3cbd88	enhance: make delegator delete buffer holding all delete from cp (#29626 ) See also #29625 This PR: - Add a new implemention of `DeleteBuffer`: listDeleteBuffer - holds cacheBlock slice - `Put` method append new delete data into last block - when a block is full, append a new block into the list - Add `TryDiscard` method for `DeleteBuffer` interface - For doubleCacheBuffer, do nothing - For listDeleteBuffer, try to evict "old" blocks, which are blocks before the first block whose start ts is behind provided ts - Add checkpoint field for `UpdateVersion` sync action, which shall be used to discard old cache delete block --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-01-04 17:02:46 +08:00
MrPresent-Han	ed644983e2	enhance: add param for bloomfilter(#29388 ) (#29490 ) related: #29388 Signed-off-by: MrPresent-Han <chun.han@zilliz.com>	2023-12-28 18:10:46 +08:00
congqixia	b251c3a682	enhance: add ctx for HandleCStatus and callers (#29517 ) See also #29516 Make `HandleCStatus` print trace id for better logging Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2023-12-27 16:10:47 +08:00
congqixia	02bc0d0dd5	fix: Add scope limit for querynode DeleteRequest (#29474 ) See also #27515 When Delegator processes delete data, it forwards delete data with only segment id specified. When two segments has same segment id but one is growing and the other is sealed, the delete will be applied to both segments which causes delete data out of order when concurrent load segment occurs. Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2023-12-26 14:28:47 +08:00
congqixia	ac95c52171	enhance: change protection to RLock for loadStreamDelete (#29450 ) See also: #29332 Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2023-12-25 23:27:02 +08:00
congqixia	1eacdc591b	fix: delegator may mark segment offline by mistake (#29343 ) See also #29332 The segment may be released before or during the request when delegator tries to forward delete request to yet. Currently, these two situation returns different error code. In this particular case, `ErrSegmentNotLoaded` and `ErrSegmentNotFound` shall both be ignored preventing return search service unavailable by mistake. --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2023-12-20 21:22:43 +08:00
yah01	9b3e06ae86	enhance: add more metrics for level zero segments (#29029 ) - Add SegmentNum metric for level zero segments - Add level zero segments size metirc --------- Signed-off-by: yah01 <yah2er0ne@outlook.com>	2023-12-07 14:34:35 +08:00
Gao	3e77365de5	fix: correct autoindex segment num (#28387 ) Fix #28386 Current code snippet ``` // get delegator sd, ok := node.delegators.Get(channel) if !ok { err := merr.WrapErrChannelNotFound(channel) log.Warn("Query failed, failed to get shard delegator for search", zap.Error(err)) return nil, err } req, err = node.optimizeSearchParams(ctx, req, sd) if err != nil { log.Warn("failed to optimize search params", zap.Error(err)) return nil, err } // do search results, err := sd.Search(searchCtx, req) ``` We could move these into `ShardDelegator`, and directly use sealed segment num in `Search` methods, also segment num got outside could be wrong when we specify partitions. Signed-off-by: chasingegg <chao.gao@zilliz.com>	2023-11-22 11:12:22 +08:00
yah01	cc952e0486	enhance: optimize forwarding level0 deletions by respecting partition (#28456 ) - Cache the level 0 deletions after loading level0 segments - Divide the level 0 deletions by partition related: #27349 --------- Signed-off-by: yah01 <yah2er0ne@outlook.com>	2023-11-21 18:24:22 +08:00
yah01	d20ea061d6	Fix panic while forwarding empty deletions to growing segment (#28213 ) Signed-off-by: yah01 <yah2er0ne@outlook.com>	2023-11-08 16:42:21 +08:00
yah01	ece592a42f	Deliver L0 segments delete records (#27722 ) Signed-off-by: yah01 <yah2er0ne@outlook.com>	2023-11-07 01:44:18 +08:00
wei liu	ecec5dfcfd	fix retry on offline node (#28079 ) Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2023-11-03 10:14:16 +08:00
congqixia	e4fdf5e68e	Refine offline segments logic in shard delegator (#28073 ) Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2023-11-01 20:04:24 +08:00

1 2 3

115 Commits