milvus

mirror of https://gitee.com/milvus-io/milvus.git synced 2024-12-04 21:09:06 +08:00

Author	SHA1	Message	Date
congqixia	de8a266d8a	enhance: Enable linux code checker (#35084 ) See also #34483 --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-07-30 15:53:51 +08:00
wei liu	c45f38aa61	enhance: Update protobuf-go to protobuf-go v2 (#34394 ) issue: #34252 Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-07-29 11:31:51 +08:00
congqixia	4ee6c69217	enhance: Add Segment Level in milvus segment info APIs (#34763 ) See also #34746 This PR adds a segment level field to the responses of `GetPersistentSegmentInfo` and `GetQuerySegmentInfo` --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-07-26 10:01:46 +08:00
wei liu	166fc902b0	enhance: Limit collection's normal balance speed (#34810 ) issue: #34798 After we removed the task priority on query coord, we limit the number of balance tasks in each round so that load/release segment tasks are not blocked by too many balance tasks. At the same time, we reduce the balance interval to trigger balance more frequently. Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-07-24 19:11:44 +08:00
wei liu	92de49e38c	fix: Segment may bounce between delegator and worker (#34830 ) issue: #34595 pr#34596 added an overloaded factor to segments on the delegator, which causes the same segment to get different scores on the delegator and on a worker, so the segment may bounce between them. This PR uses the average score to compute the delegator overloaded factor, to avoid segments bouncing between delegator and worker. --------- Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-07-23 15:57:49 +08:00
wei liu	40e39ef7c9	fix: Avoid segment lack caused by deduplicate segment task (#34782 ) issue: #34781 When a balance segment task hasn't finished yet, query coord may find 2 loaded copies of a segment and generate a task to deduplicate them, which may cancel the balance task. The old copy has then been released while the new copy was canceled before it became ready, so search fails with segment lack. This PR sets the deduplicate segment task's priority to low, to avoid the balance segment task being canceled by the deduplicate task. Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-07-22 16:35:43 +08:00
wei liu	acb33bba4d	enhance: Preserve fixed-size memory in delegator node for growing segment. (#34596 ) issue: #34595 When consuming insert data on the delegator node, QueryCoord will move out some sealed segments to manage its memory usage. After the growing segment gets flushed, some sealed segments from other workers will be moved back to the delegator node. To avoid the frequent movement of segments, we estimate the maximum growing row count and preserve a fixed-size memory in the delegator node. --------- Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-07-15 20:51:46 +08:00
wei liu	9b37d3f517	enhance: Enable setting the replica number and resource group during collection creation (#34403 ) issue: #30040 --------- Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-07-10 10:20:13 +08:00
congqixia	b284b81a47	fix: Check partition in current target when observing partition load status (#34282 ) See also #34234 `LoadPartitions` does not guarantee the current target has loading partitions if there are some partitions already loaded before. This PR checks that the current target contains the partition to load before advancing the loading percentage to 100. Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-07-01 17:40:07 +08:00
wei liu	f7ecafe77d	enhance: Skip update index for L0 segment (#34099 ) Trying to update the index for an L0 segment fails with `index not found`. This PR skips updating the index for L0 segments. Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-07-01 10:26:06 +08:00
jaime	0426390f06	enhance: improve check health (#33800 ) issue: #34264 Signed-off-by: jaime <yun.zhang@zilliz.com>	2024-07-01 10:16:06 +08:00
wei liu	8123bea1ae	enhance: Avoid assign too much segment/channels to new querynode (#34096 ) issue: #34095 When a new query node comes online, the segment_checker, channel_checker, and balance_checker simultaneously attempt to allocate segments to it. If this occurs during the execution of a load task and the distribution of the new query node hasn't been updated, the query coordinator may mistakenly view the new query node as empty. As a result, it assigns segments or channels to it, potentially overloading the new query node with more segments or channels than expected. This PR measures the workload of the executing tasks on the target query node to prevent assigning an excessive number of segments to it. --------- Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-06-27 19:06:05 +08:00
jaime	9630974fbb	enhance: move rocksmq from internal to pkg module (#33881 ) issue: #33956 Signed-off-by: jaime <yun.zhang@zilliz.com>	2024-06-25 21:18:15 +08:00
congqixia	07c25a19d9	fix: Make querycoord panic when rg metastore sync fail (#34106 ) See also #34047 This covers the case where `unassignNode` fails to sync the resource group with the node removed. Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-06-24 21:38:02 +08:00
Chun Han	ca7ef26e4b	fix: sync part stats task cannot be finished(#30376 ) (#34027 ) related: #30376 also: refine log output for query_coord task by rephrasing action string Signed-off-by: MrPresent-Han <chun.han@gmail.com> Co-authored-by: MrPresent-Han <chun.han@gmail.com>	2024-06-24 10:16:02 +08:00
wei liu	935bc1fb71	fix: Fix GetReplicas API return nil status (#33715 ) issue: #33702 Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-06-20 14:40:15 +08:00
wayblink	5fac2fa1d2	fix: Panic if ProcessActiveStandBy returns error (#33369 ) #33368 Signed-off-by: wayblink <anyang.wang@zilliz.com>	2024-06-19 11:16:00 +08:00
wei liu	02945959d9	enhance: Avoid to iterate whole segment list for each task's process (#33943 ) When querycoord processes a segment task, it iterates the whole segment list to check whether the segment is loaded, which costs too much CPU if there are thousands of segments. Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-06-19 10:19:58 +08:00
Chun Han	f7af323d1e	fix: sync partition stats blocking balance task(#33741 ) (#33742 ) related: #33741 Signed-off-by: MrPresent-Han <chun.han@zilliz.com>	2024-06-11 14:21:56 +08:00
wayblink	a1232fafda	feat: Major compaction (#33620 ) #30633 Signed-off-by: wayblink <anyang.wang@zilliz.com> Co-authored-by: MrPresent-Han <chun.han@zilliz.com>	2024-06-10 21:34:08 +08:00
yihao.dai	3540eee977	enhance: Support L0 import (#33514 ) issue: https://github.com/milvus-io/milvus/issues/33157 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-06-07 14:17:20 +08:00
SimFG	ecee7d90d4	enhance: try to speed up the loading of small collections (#33570 ) - issue: #33569 Signed-off-by: SimFG <bang.fu@zilliz.com>	2024-06-07 08:25:53 +08:00
wei liu	b13932bb55	enhance: Enable database level replica num and resource groups for loading collection (#33052 ) issue: #30040 This PR introduces two database level props: 1. database.replica.number 2. database.resource_groups Users can set these two database props via the AlterDatabase API and then load a collection without specifying replica_num and resource groups; the database level load params are used when loading the collection. Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-05-29 10:59:43 +08:00
wei liu	6275c75013	fix: Watch channel task may stuck forever until qn become offline (#33394 ) issue: #32901 pr #32814 introduced a compatibility issue: after upgrading to the latest Milvus, query coord may skip updating dist because lastModifyTs doesn't change. But for an old-version querynode, the lastModifyTs in GetDataDistributionResponse is always 0, which makes qc skip updating dist, so qc keeps retrying the watch channel task again and again. This PR adds compatibility with old-version querynodes: when lastModifyTs is 0, qc will update its data distribution. Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-05-27 15:01:42 +08:00
wei liu	303470fc35	fix: Clean offline node from resource group after qc restart (#33232 ) issue: #33200 #33207 pr#33104 causes offline nodes to be kept in the resource group after qc recovers. An offline node may then be assigned to a new replica as an rwNode, and requests sent to those nodes fail with NodeNotFound. Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-05-22 10:03:40 +08:00
wei liu	33bd6eed28	fix: Clean offline node from replica after qc recover (#33213 ) issue: #33200 #33207 pr#33104 removed this logic by mistake, which causes offline nodes to be kept in the replica after qc recovers, so requests sent to an offline qn get a NodeNotFound error. Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-05-21 15:41:39 +08:00
wei liu	2013d97243	enhance: Enable to dynamic update balancer policy in querycoord (#33037 ) issue: #33036 This PR enables updating the balancer policy dynamically, without restarting querycoord. --------- Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-05-21 14:29:39 +08:00
jaime	0d99db23b8	fix: metrics leak on the coord nodes (#33075 ) issue: #32980 Signed-off-by: jaime <yun.zhang@zilliz.com>	2024-05-20 22:03:39 +08:00
wei liu	a7f6193bfc	fix: query node may stuck at stopping progress (#33104 ) issue: #33103 When doing stopping balance for a stopping query node, the balancer gets the node list from replica.GetNodes and checks whether a node is stopping; if so, stopping balance is triggered for this replica. After the replica refactor, replica.GetNodes only returns rwNodes, while the stopping node is kept in roNodes, so the balancer can't find the replica that contains the stopping node and stopping balance for that replica is never triggered. The query node then gets stuck forever because its segments/channels are never moved out. --------- Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-05-20 10:21:38 +08:00
wei liu	f1c9986974	enhance: Skip return data distribution if no change happen (#32814 ) issue: #32813 --------- Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-05-17 10:11:37 +08:00
cai.zhang	6ea7633bd5	enhance: Add memory size for binlog (#33025 ) issue: #33005 1. add `MemorySize` field for insert binlog. 2. `LogSize` means the file size in the storage object. 3. `MemorySize` means the size of the data in the memory. --------- Signed-off-by: Cai Zhang <cai.zhang@zilliz.com> Signed-off-by: cai.zhang <cai.zhang@zilliz.com>	2024-05-15 12:59:34 +08:00
SimFG	1d48d0aeb2	enhance: use different value to get related data size according to segment type (#33017 ) issue: #30436 Signed-off-by: SimFG <bang.fu@zilliz.com>	2024-05-14 14:59:33 +08:00
congqixia	861977ab60	fix: Start `LeaderCacheObserver` before `SyncAll` (#33035 ) Related to #33033 Signed-off-by: Congqi Xia <congqi.xia@zilliz.com> Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-05-14 13:25:32 +08:00
wei liu	cba2c7a3be	enhance: clean channel node info in meta store (#32988 ) issue: #32910 see also: #32911 When channel exclusive mode is enabled, the replica records channel node info in the meta store. If the balance policy changes so that channel exclusive mode is disabled, we should clean up the channel node info in the meta store and stop balancing nodes between channels. Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-05-14 10:05:40 +08:00
chyezh	293f14a8b9	fix: remove redundant replica recover (#32985 ) issue: #22288 - replica recover should be only triggered by replica recover Signed-off-by: chyezh <chyezh@outlook.com>	2024-05-13 15:25:32 +08:00
Xiaofan	b044e5503e	enhance:Improve load speed (#32898 ) fix #32897 add memory check when load collection Signed-off-by: xiaofanluan <xiaofan.luan@zilliz.com>	2024-05-11 10:29:31 +08:00
chyezh	1c84a1c9b6	fix: lru related issue fixup patch (#32916 ) issue: #32206, #32801 - search failure with some assertion, segment not loaded and resource insufficient. - segment leak when query segments --------- Signed-off-by: chyezh <chyezh@outlook.com>	2024-05-10 19:17:30 +08:00
wei liu	e2332bdc17	enhance: Enable channel exclusive balance policy (#32911 ) issue: #32910 * split replica's node list to channels when create replicas * balance nodes among channels when node change happens * implement channel level balance, let balance happens in channel level Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-05-10 17:27:31 +08:00
wei liu	04a8ec69f6	fix: Segment on stopping query node can't be release successfully (#32929 ) issue: #32901 A release segment request needs to be sent to the delegator, and finding the segment's delegator requires the replica info. But the stopping query node is marked as read only in the replica, and `replica.Contains()` only returns true for rwNodes in the replica, so the replica info can't be found via the stopping query node and releasing the segment is blocked. This PR makes `replica.Contains()` return true for both roNode and rwNode. Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-05-10 14:33:30 +08:00
Bingyi Sun	b7ef8da360	fix: set channel checkpoint to delta position (#32878 ) issue: https://github.com/milvus-io/milvus/issues/32853 --------- Signed-off-by: sunby <sunbingyi1992@gmail.com>	2024-05-10 11:51:30 +08:00
congqixia	efa58ae423	enhance: Utilize coll2replica mapping when getting rg by collection (#32892 ) See also #32165 The old `GetResourceGroupByCollection` implementation iterates over all replicas to match the collection id, which is slow and consumes CPU time. This PR makes it use the coll2Replicas mapping by calling `GetByCollection` and mapping the replicas into resource groups. Signed-off-by: Congqi Xia <congqi.xia@zilliz.com> Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-05-09 19:37:30 +08:00
congqixia	acb0417a9f	enhance: Avoid iteration over channel results when update leaderview (#32887 ) See also #32165 Cache channel name to channel info to avoid iteration over channel results when updating leader view version. Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-05-09 15:41:30 +08:00
wei liu	fad8f0afa5	enhance: enable stopping balance after balance has been suspended (#32812 ) issue: #32811 Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-05-08 10:15:29 +08:00
wei liu	ba02d54a30	enhance: update shard leader cache when leader location changed (#32470 ) issue: #32466 This PR updates the proxy's shard leader cache when the shard location changes, so that in a query node failover case the proxy can detect that the replica has recovered. --------- Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-05-08 10:05:29 +08:00
yihao.dai	9db3aa18bc	enhance: Remove deprecated EnableIndex (#32704 ) /kind improvement Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-05-07 17:11:30 +08:00
chyezh	b904c8d377	enhance: resource group unittest refactory (#32739 ) issue: #30647 Signed-off-by: chyezh <chyezh@outlook.com>	2024-05-06 10:17:34 +08:00
wei liu	d900e68440	fix: fix GetShardLeaders return empty node list (#32685 ) issue: #32449 To avoid GetShardLeaders returning an empty node list, this PR adds a node list check on both the client side and the server side. Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-04-29 14:19:26 +08:00
chyezh	ef4c875d4c	fix: resource group ut may failure (#32688 ) issue: https://github.com/milvus-io/milvus/issues/30647 Signed-off-by: chyezh <chyezh@outlook.com>	2024-04-29 14:17:26 +08:00
wei liu	c0555d4b45	fix: Remove read only node from replica immediately after node down (#32666 ) issue: #32665 Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-04-28 20:25:25 +08:00
congqixia	4cdf6c3c41	fix: Check partition nil before observe load progress (#32659 ) See also #32441 #32615 Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-04-28 16:29:25 +08:00


523 Commits