milvus

mirror of https://gitee.com/milvus-io/milvus.git synced 2024-11-29 18:38:44 +08:00

Author	SHA1	Message	Date
jaime	9d16b972ea	feat: add tasks page into management WebUI (#37002 ) issue: #36621 1. Add API to access task runtime metrics, including: - build index task - compaction task - import task - balance (including load/release of segments/channels and some leader tasks on querycoord) - sync task 2. Add a debug model to the webpage by using debug=true or debug=false in the URL query parameters to enable or disable debug mode. Signed-off-by: jaime <yun.zhang@zilliz.com>	2024-10-28 10:13:29 +08:00
cai.zhang	ac8c5fcd5d	enhance: Remove pre-marking segments as L2 during clustering compaction (#36799 ) issue: #36686 This pr will remove pre-marking segments as L2 during clustering compaction in version 2.5, and ensure compatibility with version 2.4. The core of this change is to ensure that the many-to-many lineage derivation logic is correct, making sure that both the parent and child cannot simultaneously exist in the target segment view. feature: - Clustering compaction no longer marks the input segments as L2. - Add a new field `is_invisible` to `segmentInfo`, and mark segments that have completed clustering but have not yet built indexes as `is_invisible` to prevent them from being loaded prematurely." - Do not mark the input segment as `Dropped` before the clustering compaction is completed. - After compaction fails, only the result segment needs to be marked as Dropped. compatibility: - If the upgraded task has not failed, there are no compatibility issues. - If the status after the upgrade is `MetaSaved`, then skip the stats task based on whether TmpSegments is empty. - If the failure occurs before `MetaSaved`: - there are no ResultSegments, and InputSegments have not been marked as dropped yet. - the level of input segments need to revert to LastLevel - If the failure occurs after `MetaSaved`: - ResultSegments have already been generated, and InputSegments have been marked as Dropped. At this point, simply make the ResultSegments visible. - the level of ResultSegments needs to be set to L1（in order to participate in mixCompaction） --------- Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>	2024-10-23 17:15:28 +08:00
yihao.dai	d230b91bd1	enhance: Add PreallocatedSegmentIDs for the compaction task (#36734 ) Add `PreallocatedSegmentIDs` field to the compaction task, allowing the `ResultSegments` in the compaction task to represent the final segments produced by the compaction. issue: https://github.com/milvus-io/milvus/issues/36733 also related: https://github.com/milvus-io/milvus/issues/36686 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-10-13 17:59:21 +08:00
XuanYang-cn	4e0ea39235	fix: Remove neighbors if compactTo is unindexed (#36503 ) See also: #36360 --------- Signed-off-by: yangxuan <xuan.yang@zilliz.com>	2024-10-08 14:15:19 +08:00
aoiasd	139787371e	feat: support embedding bm25 sparse vector and flush bm25 stats log (#36036 ) relate: https://github.com/milvus-io/milvus/issues/35853 --------- Signed-off-by: aoiasd <zhicheng.yue@zilliz.com>	2024-09-19 10:57:12 +08:00
yihao.dai	a61668c77e	feat: Introduce stats task for import (#35868 ) This PR introduce stats task for import: 1. Define new `Stats` and `IndexBuilding` states for importJob 2. Add new stats step to the import process: trigger the stats task and wait for its completion 3. Abort stats task if import job failed issue: https://github.com/milvus-io/milvus/issues/33744 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-09-15 15:17:08 +08:00
Jiquan Long	89bf226f0b	feat: support keyword text match (#35923 ) fix: #35922 --------- Signed-off-by: longjiquan <jiquan.long@zilliz.com>	2024-09-10 15:11:08 +08:00
cai.zhang	2c9bb4dfa3	feat: Support stats task to sort segment by PK (#35054 ) issue: #33744 This PR includes the following changes: 1. Added a new task type to the task scheduler in datacoord: stats task, which sorts segments by primary key. 2. Implemented segment sorting in indexnode. 3. Added a new field `FieldStatsLog` to SegmentInfo to store token index information. --------- Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>	2024-09-02 14:19:03 +08:00
yihao.dai	1413ffe9b1	enhance: Rename preAllocatedSegments (#35871 ) Rename `preAllocatedSegments` to `preAllocatedSegmentIDs` to avoid confusion. Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-09-01 17:09:01 +08:00
XuanYang-cn	323400c190	enhance: Enable to write multiple segments in mix compactor (#35705 ) Prevent segments to be written larger than maxSize * expansionRate See also: #35584 Signed-off-by: yangxuan <xuan.yang@zilliz.com> --------- Signed-off-by: yangxuan <xuan.yang@zilliz.com>	2024-08-30 11:29:01 +08:00
yihao.dai	a4439cc911	enhance: Implement flusher in streamingNode (#34942 ) - Implement flusher to: - Manage the pipelines (creation, deletion, etc.) - Manage the segment write buffer - Manage sync operation (including receive flushMsg and execute flush) - Add a new `GetChannelRecoveryInfo` RPC in DataCoord. - Reorganize packages: `flushcommon` and `datanode`. issue: https://github.com/milvus-io/milvus/issues/33285 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-08-02 18:30:23 +08:00
wayblink	81773bfadf	enhance: add commit time in partitionStats proto (#35125 ) fix: #35110 Signed-off-by: wayblink <anyang.wang@zilliz.com>	2024-08-02 16:16:14 +08:00
chyezh	1cff55381d	enhance: add manual alloc segment rpc for datacoord (#35002 ) issue: #33285 - segment allocation will move to streamingnode, so a manual alloc segment rpc is required Signed-off-by: chyezh <chyezh@outlook.com>	2024-07-26 10:15:46 +08:00
wayblink	c79d1af390	enhance: Add compaction task slot usage logic (#34581 ) #34544 Signed-off-by: wayblink <anyang.wang@zilliz.com>	2024-07-18 10:27:41 +08:00
yihao.dai	ca758c36cc	enhance: Pre-allocate ids for compaction (#34187 ) This PR removes the dependency of compaction on the ID allocator by pre-allocating the logID and segmentID. issue: https://github.com/milvus-io/milvus/issues/33957 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-07-17 13:23:42 +08:00
wayblink	da56880d0f	fix: Avoid datarace in clustering compaction (#34288 ) #34289 Signed-off-by: wayblink <anyang.wang@zilliz.com>	2024-07-03 19:08:09 +08:00
yihao.dai	8537f3daeb	enhance: Rename Compaction to CompactionV2 (#33858 ) Due to the removal of injection and syncSegments from the compaction, we need to ensure that no compaction is successfully executed during the rolling upgrade. This PR renames Compaction to CompactionV2, with the following effects: - New datacoord + old datanode: Utilizes the CompactionV2 interface, resulting in the datanode error "CompactionV2 not implemented," causing compaction to fail; - Old datacoord + new datanode: Utilizes the CompactionV1 interface, resulting in the datanode error "CompactionV1 not implemented," causing compaction to fail. issue: https://github.com/milvus-io/milvus/issues/32809 Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-06-16 22:07:57 +08:00
wayblink	a1232fafda	feat: Major compaction (#33620 ) #30633 Signed-off-by: wayblink <anyang.wang@zilliz.com> Co-authored-by: MrPresent-Han <chun.han@zilliz.com>	2024-06-10 21:34:08 +08:00
yihao.dai	3540eee977	enhance: Support L0 import (#33514 ) issue: https://github.com/milvus-io/milvus/issues/33157 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-06-07 14:17:20 +08:00
cai.zhang	feeb869ff9	enhance: Remove compaction plans on the datanode (#33548 ) issue: #33546 --------- Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>	2024-06-05 20:27:51 +08:00
zhenshan.cao	ac4f3997ce	enhance: Reconstructing Compaction to possess persistence capability (#33265 ) issue #33586 Signed-off-by: zhenshan.cao <zhenshan.cao@zilliz.com>	2024-06-05 10:17:50 +08:00
cai.zhang	77637180fa	enhance: Periodically synchronize segments to datanode watcher (#33420 ) issue: #32809 --------- Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>	2024-05-30 13:37:44 +08:00
yihao.dai	7730b910b9	enhance: Decouple compaction from shard (#33138 ) Decouple compaction from shard, remove dependencies on shards (e.g. SyncSegments, injection). issue: https://github.com/milvus-io/milvus/issues/32809 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-05-24 09:07:41 +08:00
yihao.dai	32560263fa	enhance: Query slot for compaction task (#32881 ) Query slot of compaction in datanode, and transfer the control logic for limiting compaction tasks from datacoord to the datanode. issue: https://github.com/milvus-io/milvus/issues/32809 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-05-17 18:19:38 +08:00
cai.zhang	6ea7633bd5	enhance: Add memory size for binlog (#33025 ) issue: #33005 1. add `MemorySize` field for insert binlog. 2. `LogSize` means the file size in the storage object. 3. `MemorySize` means the size of the data in the memory. --------- Signed-off-by: Cai Zhang <cai.zhang@zilliz.com> Signed-off-by: cai.zhang <cai.zhang@zilliz.com>	2024-05-15 12:59:34 +08:00
wayblink	42d0412e93	enhance: Add channelCPs in FlushResponse (#32044 ) #32609 Signed-off-by: wayblink <anyang.wang@zilliz.com>	2024-04-30 09:45:27 +08:00
SimFG	c012e6786f	feat: support rate limiter based on db and partition levels (#31070 ) issue: https://github.com/milvus-io/milvus/issues/30577 co-author: @jaime0815 --------- Signed-off-by: Patrick Weizhi Xu <weizhi.xu@zilliz.com> Signed-off-by: SimFG <bang.fu@zilliz.com> Co-authored-by: Patrick Weizhi Xu <weizhi.xu@zilliz.com>	2024-04-12 16:01:19 +08:00
yihao.dai	9a13b9822f	enhance: Return more fields in import progress response (#31539 ) Return more fields in import progress response, include importedRows and totalRows. Additionally, ensure compatibility with the old import progress response by retaining fields of create timestamp and row count. issue: https://github.com/milvus-io/milvus/issues/31448 https://github.com/milvus-io/milvus/issues/31237 https://github.com/milvus-io/milvus/issues/28521 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-03-24 21:57:06 +08:00
yihao.dai	0fe5e90e8b	enhance: Remove import v1 (#31403 ) Remove all code and logic related to import v1. issue: https://github.com/milvus-io/milvus/issues/28521 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-03-22 15:29:09 +08:00
yihao.dai	c408a32db6	feat: Add disk quota checks for import V2 (#31131 ) Return quota error when the files to be imported exceed the disk quota. issue: https://github.com/milvus-io/milvus/issues/28521 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-03-15 14:43:03 +08:00
yihao.dai	b5c67948b7	enhance: Enhance and modify the return content of ImportV2 (#31192 ) 1. The Import APIs now provide detailed progress information for each imported file, including details such as file name, file size, progress, and more. 2. The APIs now return the collection name and the completion time. 3. Other modifications include changing jobID to jobId and other similar adjustments. issue: https://github.com/milvus-io/milvus/issues/28521 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-03-13 19:51:03 +08:00
yihao.dai	c411cb4a49	enhance: Prevent the backlog of channelCP update tasks, perform batch updates of channelCPs (#30941 ) This PR includes the following adjustments: 1. To prevent channelCP update task backlog, only one task with the same vchannel is retained in the updater. Additionally, the lastUpdateTime is refreshed after the flowgraph submits the update task, rather than in the callBack function. 2. Batch updates of multiple vchannel checkpoints are performed in the UpdateChannelCheckpoint RPC (default batch size is 128). Additionally, the lock for channelCPs in DataCoord meta has been switched from key lock to global lock. 3. The concurrency of UpdateChannelCheckpoint RPCs in the datanode has been reduced from 1000 to 10. issue: https://github.com/milvus-io/milvus/issues/30004 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com> Co-authored-by: jaime <yun.zhang@zilliz.com> Co-authored-by: congqixia <congqi.xia@zilliz.com>	2024-03-07 20:39:02 +08:00
congqixia	d81ba164c8	enhance: Add `ListIndexes` API from datacoord (#31104 ) See also #31103 This PR add `listIndexes` API for datacoor server to list all indexes for provided collection. Comparing to the existing `DescribeIndex` API, the new one does NOT check the segment index building progress to ease the burden when invoking it Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-03-07 17:37:01 +08:00
yihao.dai	a434d33e75	feat: Add import scheduler and manager (#29367 ) This PR introduces novel managerial roles for importv2: 1. ImportMeta: To manage all the import tasks; 2. ImportScheduler: To process tasks and modify their states; 3. ImportChecker: To ascertain the completion of all tasks and instigate relevant operations. issue: https://github.com/milvus-io/milvus/issues/28521 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-03-01 18:31:02 +08:00
wayblink	2bc212c6c9	enhance: Add L2 segment level (#29595 ) #28410 Signed-off-by: wayblink <anyang.wang@zilliz.com>	2024-02-18 14:12:49 +08:00
yihao.dai	c5918290e6	feat: Add import executor and manager for datanode (#29438 ) This PR introduces novel importv2 roles for datanode: 1. Executor: To execute tasks, a import task will be divided into the following steps: read data -> hash data -> sync data; 2. Manager: To manage all the tasks; issue: https://github.com/milvus-io/milvus/issues/28521 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-01-31 20:45:04 +08:00
yihao.dai	8780d65b66	fix: Use channel cp as the dml&start position for import segments (#30107 ) This PR discontinuing the subscription to the mq and, instead, employing the channel checkpoint as the DML and starting position for the import segments. issue: https://github.com/milvus-io/milvus/issues/30106 Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-01-22 14:36:55 +08:00
XuanYang-cn	86f48861c1	fix: Add more throughput in related metrics (#30038 ) This PR also fixes bugs in l0 compactor where l0 results would never be removed from datanode See also: #30099 Signed-off-by: yangxuan <xuan.yang@zilliz.com>	2024-01-19 11:34:54 +08:00
smellthemoon	e52ce370b6	enhance:don't store logPath in meta to reduce memory (#28873 ) don't store logPath in meta to reduce memory, when service get segmentinfo, generate logpath from logid. #28885 Signed-off-by: lixinguo <xinguo.li@zilliz.com> Co-authored-by: lixinguo <xinguo.li@zilliz.com>	2024-01-18 22:06:31 +08:00
congqixia	aa967de0a8	enhance: Explicitly pass LevelZero segment ids in vchan info (#29612 ) See also #27675 For `GetRecoveryInfo` & `GetRecoveryInfoV2`, Level zero segment ids shall be specified in vchan info so that querycoord could re-fetch current segment info during watch procedure without having all segment info Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-01-04 16:46:45 +08:00
yah01	a0e1a1eb31	feat: support enable/disable mmap for index (#29005 ) support enable/disable mmap for index, the user could alter the index's mode by `AlterIndex` method related: https://github.com/milvus-io/milvus/issues/21866 --------- Signed-off-by: yah01 <yah2er0ne@outlook.com> Signed-off-by: yah01 <yang.cen@zilliz.com>	2023-12-21 18:07:24 +08:00
congqixia	8a63e53421	enhance: Add http method to control datacoord garbage collection (#29052 ) See also #29051 --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2023-12-14 19:26:39 +08:00
yihao.dai	d26b563a8b	feat: Define import API and metadata (#28731 ) Define the new rpc and metadata for ImportV2. see also: https://github.com/milvus-io/milvus/issues/28521 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2023-12-04 19:56:35 +08:00
XuanYang-cn	9b371067d2	feat: Add Compaction views and triggers (#27906 ) - Add Compaction l0 views - Add Compaction scheduler - Add Compaction triggerv2 - Add Compaction view manager See also: #27606 --------- Signed-off-by: yangxuan <xuan.yang@zilliz.com>	2023-11-23 17:30:25 +08:00
Bingyi Sun	4fedff6d47	feat: integrate storage v2 into the write path (#28440 ) #28378 --------- Signed-off-by: sunby <sunbingyi1992@gmail.com>	2023-11-23 17:26:24 +08:00
XuanYang-cn	40d5c902b6	Enable getting multiple segments in plan result (#28350 ) Compaction plan result contained one segment for one plan. For l0 compaction would write to multiple segments, this PR expand the segments number in plan results and refactor some names for readibility. - Name refactory: - CompactionStateResult -> CompactionPlanResult - CompactionResult -> CompactionSegment See also: #27606 Signed-off-by: yangxuan <xuan.yang@zilliz.com>	2023-11-14 15:56:19 +08:00
congqixia	c41df18b6d	Add compaction meta in `SyncSegmentsRequest` (#28199 ) Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2023-11-07 10:06:17 +08:00
aoiasd	1d4be0d257	Adjust datacoord for L0 Delta (#28021 ) Signed-off-by: aoiasd <zhicheng.yue@zilliz.com>	2023-11-06 15:26:16 +08:00
congqixia	b7bfccaf21	Add channel name paramter in `FlushSegmentsRequest` (#27725 ) Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2023-10-17 12:00:10 +08:00
jaime	e386a62fae	Remove recollect segment stats during starting datacoord (#27410 ) Signed-off-by: jaime <yun.zhang@zilliz.com>	2023-10-16 10:26:09 +08:00

1 2 3

148 Commits