See also #27675, #26177
Make the memory check evict memory buffers until the memory water level is safe.
Also make `EvictBuffer` wait until the sync task is done.
Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>
Use a dedicated buffer size parameter for imports and set the buffer size
to 64MB.
issue: https://github.com/milvus-io/milvus/issues/28521
---------
Signed-off-by: bigsheeper <yihao.dai@zilliz.com>
Do not return `err, nil` from a `conc.Future` task, as the error will be
lost/ignored when `AwaitAll` is used to wait for the future (a sketch follows).
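A minimal sketch of the difference, assuming the generic `conc.Pool`/`conc.AwaitAll` API from pkg/util/conc (`doWork` is a placeholder):

```go
// submitAndWait contrasts both patterns; pool and doWork are placeholders.
func submitAndWait(pool *conc.Pool[any], doWork func() error) error {
	// WRONG: returning (err, nil) puts the error into the value slot,
	// so AwaitAll sees a nil error and the failure is silently dropped.
	bad := pool.Submit(func() (any, error) {
		return doWork(), nil
	})
	_ = bad

	// RIGHT: return the error in the error slot.
	good := pool.Submit(func() (any, error) {
		return nil, doWork()
	})
	return conc.AwaitAll(good) // doWork's failure now propagates
}
```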
issue: https://github.com/milvus-io/milvus/issues/31788
Signed-off-by: bigsheeper <yihao.dai@zilliz.com>
See also #30552
The FlushTS policy was originally designed for flushed/L0 segments only, but
in some edge cases a new growing segment buffer could bypass the flush
request and hold a buffer before the flush ts, which caused flush timeouts.
Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>
Feature Introduced:
1. Ensure ImportV2 waits for the index to be built
Enhancements Introduced:
1. Utilization of local time for timeout ts instead of allocating ts
from rootcoord.
2. Enhanced input file length check for binlog import.
3. Removal of duplicated manager in datanode.
4. Renaming of executor to scheduler in datanode.
5. Utilization of a thread pool in the scheduler in datanode.
issue: https://github.com/milvus-io/milvus/issues/28521
---------
Signed-off-by: bigsheeper <yihao.dai@zilliz.com>
Once the sync task completes its synchronization, the blob within the sync
task must be released, as the caller may continue to reference the sync
task (which would otherwise keep the blob alive).
issue: https://github.com/milvus-io/milvus/issues/31545
Signed-off-by: bigsheeper <yihao.dai@zilliz.com>
The max number of import files per request should not exceed 1024 by
default (configurable).
The file size allowed for importing should not exceed 16GB by
default (configurable).
issue: https://github.com/milvus-io/milvus/issues/28521
Signed-off-by: bigsheeper <yihao.dai@zilliz.com>
"-1" as `InvalidPartitionID` previously used as All partition place
holder in delete cases. It's confusing and hard to maintain when a const
var has more than one meaning.
This PR add `AllPartitionsID` to replace these usages in delete
scenarios.
---------
Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>
Use `retry.Handle` when the request cannot be serviced, but do not retry
unrecoverable errors.
fix #31323
Signed-off-by: xiaofanluan <xiaofan.luan@zilliz.com>
Add sparse float vector support to different Milvus components, including
the proxy and data node (to receive and write sparse float vectors to
binlog), the query node (to handle search requests), the index node (to
build indexes for sparse float columns), etc.
https://github.com/milvus-io/milvus/issues/29419
---------
Signed-off-by: Buqian Zheng <zhengbuqian@gmail.com>
1. The Import APIs now provide detailed progress information for each
imported file, including details such as file name, file size, progress,
and more.
2. The APIs now return the collection name and the completion time.
3. Other modifications, such as changing jobID to jobId and similar
adjustments.
issue: https://github.com/milvus-io/milvus/issues/28521
---------
Signed-off-by: bigsheeper <yihao.dai@zilliz.com>
See also #31143
This PR adds a shortcut for the datanode metacache `WithSegmentIDs` filter,
which fetches segments directly from the map by the provided segmentIDs.
It also adds a benchmark comparing the new implementation against the old one.
Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>
This PR includes the following adjustments:
1. To prevent a channelCP update task backlog, only one task per vchannel
is retained in the updater (see the sketch after this list). Additionally,
lastUpdateTime is refreshed after the flowgraph submits the update task,
rather than in the callback function.
2. Multiple vchannel checkpoints are batched into a single
UpdateChannelCheckpoint RPC (default batch size is 128). Additionally,
the lock for channelCPs in DataCoord meta has been switched from a key
lock to a global lock.
3. The concurrency of UpdateChannelCheckpoint RPCs in the datanode has
been reduced from 1000 to 10.
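A rough sketch of the dedup-and-batch idea; type and field names here are illustrative, not the PR's exact code:

```go
// channelCPUpdater keeps at most one pending update per vchannel and
// flushes them in batches via a single RPC.
type channelCPUpdater struct {
	mu      sync.Mutex
	pending map[string]*msgpb.MsgPosition // vchannel -> latest checkpoint
	// send performs the batched UpdateChannelCheckpoint RPC
	send func(ctx context.Context, cps []*msgpb.MsgPosition) error
}

// submit replaces any queued task for the same vchannel, so at most one
// update per vchannel is ever pending and no backlog can build up.
func (u *channelCPUpdater) submit(vchannel string, pos *msgpb.MsgPosition) {
	u.mu.Lock()
	defer u.mu.Unlock()
	u.pending[vchannel] = pos
}

// flush drains up to batchSize checkpoints into a single RPC call.
func (u *channelCPUpdater) flush(ctx context.Context, batchSize int) error {
	u.mu.Lock()
	batch := make([]*msgpb.MsgPosition, 0, batchSize)
	for vchannel, pos := range u.pending {
		if len(batch) >= batchSize {
			break
		}
		batch = append(batch, pos)
		delete(u.pending, vchannel)
	}
	u.mu.Unlock()
	if len(batch) == 0 {
		return nil
	}
	return u.send(ctx, batch)
}
```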
issue: https://github.com/milvus-io/milvus/issues/30004
---------
Signed-off-by: bigsheeper <yihao.dai@zilliz.com>
Co-authored-by: jaime <yun.zhang@zilliz.com>
Co-authored-by: congqixia <congqi.xia@zilliz.com>
In the cache of the timeTickSender, retain only the latest stats instead
of storing stats for every time tick.
issue: https://github.com/milvus-io/milvus/issues/30967
---------
Signed-off-by: bigsheeper <yihao.dai@zilliz.com>
This PR introduces novel managerial roles for importv2:
1. ImportMeta: To manage all the import tasks;
2. ImportScheduler: To process tasks and modify their states;
3. ImportChecker: To ascertain the completion of all tasks and instigate
relevant operations.
issue: https://github.com/milvus-io/milvus/issues/28521
---------
Signed-off-by: bigsheeper <yihao.dai@zilliz.com>
If DataCoord restarted, those unknown compaction tasks
would never get called back in the DataNode, so the segments in the
compaction task stayed locked, unable to sync or compact again, blocking
checkpoint advance and compaction execution.
See also: #30137
---------
Signed-off-by: yangxuan <xuan.yang@zilliz.com>
1. Set the coordinator graceful-stop timeout to 5s.
2. Change the stop order of the datacoord components.
3. Change the querynode graceful-stop timeout to 900s; we should
potentially change this to 600s once graceful stop is smooth.
issue: #30310
also see pr: #30306
---------
Signed-off-by: chyezh <chyezh@outlook.com>
See also #30571
When `compactionExecutor` stops a compaction task, the `stop` method
causes `injectDone` to be called.
However, in `executeTask`, when the `compact` method returns an error, it
also invokes `injectDone`. That is why the `Unlock of unlocked RWMutex`
panic happened.
This PR adds sync.Once to make sure `injectDone` is called only once
(a sketch follows). We did not remove any of the `injectDone` invocations,
since removing any of them might cause logic problems.
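A minimal sketch of the guard (names illustrative):

```go
type compactionTask struct {
	injectedOnce   sync.Once
	unlockSegments func() // releases the segment locks taken for compaction
}

// injectDone may be reached from both the stop path and the error path;
// sync.Once guarantees the unlock runs exactly once, so a second call can
// no longer hit "Unlock of unlocked RWMutex".
func (t *compactionTask) injectDone() {
	t.injectedOnce.Do(t.unlockSegments)
}
```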
---------
Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>
See also #27675, #30469
A segment could be compacted while a sync task is ongoing. In the
previous implementation, the sync task held only the old segment id as
its KeyLock, in which case compaction on the compacted-to segment could
run in parallel with the delta sync of this sync task.
This PR introduces sync-target segment verification logic: before the
actual sync logic runs, the task checks the target segment lock it is
holding. If this check fails, the sync task returns an
`errTargetSegementNotMatch` error and makes the manager re-fetch the
current target segment id.
Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>
This PR changes the following to speed up L0 compaction and
prevent OOM:
1. Lower the deltabuf limit to 16MB by default, so that each L0 segment
is 4X smaller than before.
2. Add BatchProcess, and use it when memory is sufficient.
3. Make the iterator Deserialize only when HasNext is called, to avoid a
massive memory peak (see the sketch after this list).
4. Add tracing in splitDelta.
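A sketch of the lazy iterator from item 3 (all names illustrative; the decoder is stubbed):

```go
type deltaEntry struct {
	pk int64
	ts uint64
}

// deserializeDeltalog decodes one deltalog blob (stubbed here).
func deserializeDeltalog(blob []byte) []deltaEntry { return nil }

// deltaIterator holds at most one decoded blob at a time; nothing is
// deserialized until HasNext is actually called.
type deltaIterator struct {
	blobs  [][]byte // raw deltalog blobs
	next   int      // index of the next blob to decode
	buf    []deltaEntry
	cursor int
}

func (it *deltaIterator) HasNext() bool {
	for it.cursor >= len(it.buf) {
		if it.next >= len(it.blobs) {
			return false
		}
		it.buf = deserializeDeltalog(it.blobs[it.next]) // lazy decode
		it.next++
		it.cursor = 0
	}
	return true
}

func (it *deltaIterator) Next() deltaEntry {
	e := it.buf[it.cursor]
	it.cursor++
	return e
}
```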
See also: #30191
---------
Signed-off-by: yangxuan <xuan.yang@zilliz.com>
This PR decouples importing segments from the flush process by:
1. Excluding importing segments from the flush policy; this approach
avoids notifying the datanode to flush an importing segment, which may
not exist.
2. Having DataCoord directly set the importing segment state to `Flushed`
when RootCoord calls Flush.
issue: https://github.com/milvus-io/milvus/issues/30359
---------
Signed-off-by: bigsheeper <yihao.dai@zilliz.com>
syncMgr.Block() locks the segment when executing compaction.
The previous implementation was unable to Unblock those segments when
compaction failed. If the next compaction of the same segments arrived,
it would get stuck forever and block all later compaction tasks.
This PR makes sure the compaction executor Unblocks these segments
after a failed compaction.
Apart from that, this PR also refines some logs and cleans up some
compaction/compactor code:
1. Log the segment count instead of segmentIDs to avoid logging too many
segments
2. Flush RPC returns L1 segments only, skipping L0 and L2
3. CompactionType is checked in `Compaction`, no need to check again
inside the compactor
4. Use a lighter method to replace `getSegmentMeta`
5. Log information for L0 compaction when an error is encountered
See also: #30213
---------
Signed-off-by: yangxuan <xuan.yang@zilliz.com>
See also #27606
Previously, L0 linear compaction scanned all target segment ids from the
metacache for each delta entry, which is unnecessary since the compaction
target segments shall all be immutable.
Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>
This PR introduces novel importv2 roles for datanode:
1. Executor: To execute tasks; an import task is divided into the
following steps: read data -> hash data -> sync data;
2. Manager: To manage all the tasks;
issue: https://github.com/milvus-io/milvus/issues/28521
---------
Signed-off-by: bigsheeper <yihao.dai@zilliz.com>
See also #27675
This PR adds back the MemoryHighSyncPolicy implementation. It also changes
MinSegmentSize & CheckInterval to configurable param items.
---------
Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>
See also #27675
BloomFilterSet.current shall be reset after RollStats; otherwise it keeps
tracking the whole segment's data, causing a false-positive ratio larger
than expected.
Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>
See also #30167
After supporting OpenTelemetry tracing, we want to have the traceID as
well. This PR adds util functions to set the traceID with a span &
propagate the traceID between different contexts (a sketch follows).
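A minimal sketch of the propagation half, using the standard OpenTelemetry API (the util function name here is illustrative):

```go
import (
	"context"

	"go.opentelemetry.io/otel/trace"
)

// PropagateTraceID copies the span context (and thus the traceID) from src
// into dst, so work spawned from dst joins the same trace.
func PropagateTraceID(src, dst context.Context) context.Context {
	return trace.ContextWithSpanContext(dst, trace.SpanContextFromContext(src))
}
```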
---------
Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>
See also #27606
`MultiRead` actually downloads files sequentially, which can consume a
lot of time in the L0 compaction download phase.
This PR makes the L0 compactor download deltalogs in parallel using the
conc package & io pool (a sketch follows).
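A sketch of the parallel download, assuming the generic `conc.Pool`/`conc.AwaitAll` API and the `storage.ChunkManager.Read` method; the function name and wiring are illustrative:

```go
// downloadDeltalogs fetches all deltalog files concurrently on the io pool.
func downloadDeltalogs(ctx context.Context, cm storage.ChunkManager,
	pool *conc.Pool[[]byte], paths []string) ([][]byte, error) {
	futures := make([]*conc.Future[[]byte], 0, len(paths))
	for _, p := range paths {
		p := p // capture the loop variable for the closure
		futures = append(futures, pool.Submit(func() ([]byte, error) {
			return cm.Read(ctx, p) // each file downloads on its own worker
		}))
	}
	if err := conc.AwaitAll(futures...); err != nil {
		return nil, err // fail fast if any download failed
	}
	blobs := make([][]byte, 0, len(futures))
	for _, f := range futures {
		blobs = append(blobs, f.Value())
	}
	return blobs, nil
}
```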
---------
Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>
See also #30111
By design, segments can be marked "Flushed" only by the `FlushSegments`
grpc call from datacoord. There are two possible reasons for one segment
getting flushed multiple times:
- The segment stays in flushing state across multiple epochs in the flowgraph
- The segment is flushed by both flushTs & FlushSegments
So this PR fixes:
- Remove the state-change logic from the FlushTs policy
- Change segment flush into a three-stage transition, Sealed -> Flushing ->
Flushed, preventing multiple Flushed=true operations (a sketch follows)
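A minimal sketch of the transition using the commonpb segment states:

```go
// nextFlushState allows Flushed to be reached only from Flushing, so the
// Flushed=true transition can happen exactly once per segment.
func nextFlushState(s commonpb.SegmentState) commonpb.SegmentState {
	switch s {
	case commonpb.SegmentState_Sealed:
		return commonpb.SegmentState_Flushing
	case commonpb.SegmentState_Flushing:
		return commonpb.SegmentState_Flushed
	default:
		return s // already Flushed or not yet sealed: no transition
	}
}
```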
Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>
Resolves #30167
This PR adds tracing for all compactions, from the task start in datacoord
through the execution procedures in datanode.
---------
Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>
This PR discontinues the subscription to the MQ and instead employs the
channel checkpoint as the DML and start position for the import
segments.
issue: https://github.com/milvus-io/milvus/issues/30106
Signed-off-by: bigsheeper <yihao.dai@zilliz.com>
See also: #30121, #27675
This PR changes the delete buffering logic:
- The write buffer buffers inserts first
- Then the delete messages are evaluated against two conditions (see the
sketch after this list):
  - Whether the PK matches the previous bloom filter, whose ts is always smaller
  - Whether the PK matches buffered insert data with a smaller timestamp
- Then the segment bloom filter is updated with the newly buffered PK rows
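A sketch of the delete evaluation step; all names are illustrative, and the checks are passed in as functions to keep it self-contained:

```go
type deleteEntry struct {
	pk int64
	ts uint64
}

// Order per batch: bufferInsert(batch) -> applyDeletes(...) -> rollStats(batch),
// so the BF is only updated after the deletes have been evaluated.
func applyDeletes(
	deletes []deleteEntry,
	mayContainHistory func(pk int64) bool, // historical BF; its ts is always smaller
	hasOlderInsert func(pk int64, ts uint64) bool, // buffered inserts with smaller ts
	bufferDelete func(deleteEntry),
) {
	for _, del := range deletes {
		if mayContainHistory(del.pk) || hasOlderInsert(del.pk, del.ts) {
			bufferDelete(del)
		}
	}
}
```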
---------
Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>
This PR also fixes bugs in the L0 compactor where
L0 results would never be removed from the datanode.
See also: #30099
Signed-off-by: yangxuan <xuan.yang@zilliz.com>
Don't store logPath in meta to reduce memory; when the service gets
segment info, generate the logPath from the logID.
#28885
Signed-off-by: lixinguo <xinguo.li@zilliz.com>
Co-authored-by: lixinguo <xinguo.li@zilliz.com>
See also: #29657
The datanode compactor used the estimated row number from the schema to
decide when to sync the batch of data while executing compaction. This
estimate can stray far from the actual size when the schema contains
variable-length fields (say VarChar, JSON, etc.).
This PR makes the compactor check the actual buffered data size, so it
can sync when the buffer is actually beyond the max binlog size.
---------
Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>
See also: #27675
The bloom filter set initialized each new BF with the fixed configured `n`.
This value is always larger than the actual batch size, causing the
generated BFs to use more memory than needed.
This PR makes the write buffer initialize BFs with the batch size
estimated from the schema & configuration values (a sketch follows).
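A minimal sketch using bits-and-blooms, the BF library Milvus depends on (the wrapper name is illustrative):

```go
import "github.com/bits-and-blooms/bloom/v3"

// newBatchBF sizes the filter for the estimated batch instead of the
// configured worst-case n.
func newBatchBF(estimatedBatchRows uint, fpRate float64) *bloom.BloomFilter {
	// NewWithEstimates derives the bit count and hash count from the target
	// capacity, so small batches stop paying worst-case memory.
	return bloom.NewWithEstimates(estimatedBatchRows, fpRate)
}
```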
---------
Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>
fix: #29757
In the previous code, `ColumnBasedInsertMsgToInsertData` added an empty
field if the insertMsg parameter did not have the column schema defined.
This could lead to unexpected behavior in caller functions.
This PR:
- Adds a missing-column check
- Adds a column length check
- Generates BlobInfo for the ColumnBasedInsertMsgToInsertData result
---------
Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>
See also #27675
`Allocator.Alloc` and `Allocator.AllocOne` might be invoked multiple
times if multiple blobs are set in one sync task.
This PR adds pre-fetch logic for all blobs and caches the logIDs in the
sync task, so that at most one allocator call is needed (a sketch follows).
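A minimal sketch of the pre-fetch, assuming the `allocator.Interface.Alloc(count)` call; `syncBlob` is a placeholder type:

```go
type syncBlob struct{ logID int64 }

// prefetchLogIDs reserves the whole ID range with a single Alloc call and
// hands IDs out from the cached range during serialization.
func prefetchLogIDs(alloc allocator.Interface, blobs []*syncBlob) error {
	start, _, err := alloc.Alloc(uint32(len(blobs))) // one allocator RPC
	if err != nil {
		return err
	}
	for i, b := range blobs {
		b.logID = start + int64(i) // no further Alloc/AllocOne calls needed
	}
	return nil
}
```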
---------
Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>
See also #27675
Fix a logic problem introduced by #29413: the serializer tries to merge
the statslog list while level zero segments have no statslog, which
results in an error being returned. `writeBufferBase` ignored this error,
but it shall only ignore `ErrSegmentNotFound`.
This PR adds a segment-level check before merging the statslog list, and
adds an error-type check for getSyncTask failures.
Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>
See also #27675
For now, level zero segments shall always be synced as `Flushed` ones.
This PR fixes the case where level zero segments selected by policies
other than the flush-ts policy were synced in growing state.
Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>
The array type can't be compacted; the system could continue with the
inserted segments, but these segments could never be compacted.
fix #29503
---------
Signed-off-by: yah01 <yah2er0ne@outlook.com>
See also #27675
Since serializing the segment buffer is unrelated to the sync manager, it
can and shall be done before submitting into the sync manager. This way
the pk statistics file is more accurate and the complex logic inside the
sync manager is reduced.
---------
Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>
See also #29092
`FlushSegments` transfers only `Growing` segments to flushing. If a
segment is in `Sealed` state before the datanode watches the channel, the
state condition for selecting it to be flushed will never be satisfied.
Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>
related to #28427
Add a jitter to the syncStaleBuffer policy so that all segments won't
flush at the same time (a sketch follows).
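A minimal sketch of the jitter (names illustrative, stdlib only):

```go
import (
	"math/rand"
	"time"
)

// staleThresholdWithJitter spreads the stale threshold across segments so
// they don't all become eligible to sync at the same instant.
func staleThresholdWithJitter(base, maxJitter time.Duration) time.Duration {
	return base + time.Duration(rand.Float64()*float64(maxJitter))
}

// policy usage: sync a buffer only if time.Since(buf.lastSync) exceeds the
// jittered threshold computed for that segment buffer.
```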
Signed-off-by: xiaofanluan <xiaofan.luan@zilliz.com>
See also #29223
This PR makes `conc.Pool` resizable by adding a `Resize` method.
It also makes the newly added datanode `MaxParallelSyncMgrTasks` config
refreshable (a sketch follows).
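A sketch of the refresh hook; the `Resize` signature is assumed from this PR's description:

```go
// resizeSyncPool applies a refreshed MaxParallelSyncMgrTasks value to the
// running pool instead of rebuilding it.
func resizeSyncPool(pool *conc.Pool[struct{}], newSize int) {
	if err := pool.Resize(newSize); err != nil {
		log.Warn("failed to resize sync mgr pool", zap.Error(err))
	}
}
```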
---------
Signed-off-by: Congqi.Xia <congqi.xia@zilliz.com>
Since the sync manager is now global in the datanode, the old
`maxParallelSyncTaskNum` no longer fits the current implementation.
This PR adds a new param item for sync-manager parallelism control and
enlarges the default value.
Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>
Related to #27675
The timestamp from/to fields were not filled in the new implementation of
the write buffer & sync manager.
This PR fills these fields for better log information.
Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>
See also #28924
A compaction task generated before the datanode finishes the
`SaveBinlogPath` grpc call may contain segments that are still in Growing
state. The datanode shall verify each non-level-zero segment before
submitting a compaction task to the executor.
Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>
- Change flowgraphManager to fgManagerImpl
- Change close to stop
- Change execute to controlMemWaterLevel
- Change method name of fgManager for readability
- Add mockery for fgmanager
Issue: #28853
---------
Signed-off-by: yangxuan <xuan.yang@zilliz.com>
Related to #28628
The compacted segment syncing counter was not set correctly in the sync
task, and the BF write buffer shall not use compacted segments as
candidates when buffering delta data.
---------
Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>
This PR:
- Separates the compaction scheduler and the check-results loop, so that
slowness in the check loop doesn't influence execution.
- Cleans up compaction tasks when a vchannel is dropped, so a dropped
channel's compaction tasks won't be checked over and over again.
- Skips the meta change when meta has already changed, avoiding a panic.
- Removes the unused injectDone(bool) parameter.
See also: #28628, #28209
---------
Signed-off-by: yangxuan <xuan.yang@zilliz.com>
Related to #28736, #28748
See also #27675
Previous PR: #28646
This PR fixes the `SegmentNotFound` issue when compaction happens multiple
times and the buffer of a first-generation segment is synced due to the
stale policy.
Now the `CompactSegments` API of the metacache updates the compactTo
field of segmentInfo if the compactTo segment is itself compacted, to
keep the bloodline clean.
Also, add the `CompactedSegment` SyncPolicy to sync compacted segments
asap and keep the metacache clean.
`SyncPolicy` is now an interface instead of a function type, so that when
it selects segments to sync, we can log the reason and the target
segments (a sketch follows).
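A sketch of the interface shape, modeled on (not copied from) the PR:

```go
type segmentBuffer struct{ segmentID int64 }

// SyncPolicy now carries a reason alongside the selection logic.
type SyncPolicy interface {
	SelectSegments(buffers []*segmentBuffer, ts uint64) []int64
	Reason() string
}

// funcPolicy lifts the old function type into the interface.
type funcPolicy struct {
	fn     func(buffers []*segmentBuffer, ts uint64) []int64
	reason string
}

func (p funcPolicy) SelectSegments(buffers []*segmentBuffer, ts uint64) []int64 {
	return p.fn(buffers, ts)
}

func (p funcPolicy) Reason() string { return p.reason }

// manager side: log.Info("sync segment", zap.Int64s("ids", ids),
//                        zap.String("reason", policy.Reason()))
```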
Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>
See also #27675
When an L0 segment contains only delta data, the merged statslog shall be
skipped when performing the sync task.
---------
Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>
See also #27675
Compacted segment info shall be removed after all buffers belonging to it
are synced.
This PR adds a cleanup function after the triggerSyncTask logic:
- The buffer is stable and protected by a mutex
- Cleanup fetches compacted & non-syncing segments
- Segment info is removed only when the manager maintains no buffer for it
---------
Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>
Remove some unnecessary assignments, since commonpbutil.NewMsgBase
provides default values.
Signed-off-by: lixinguo <xinguo.li@zilliz.com>
Co-authored-by: lixinguo <xinguo.li@zilliz.com>
See also #28660
This PR adds a request-timeout config item for etcd kv requests.
It also syncs the default timeout value to the same value for the etcdKV
& tikv configs.
Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>
See also #28628
A previous compaction task could block the segment sync task, and could
block the flowgraph when a sync task was generated by the auto-sync
policy: the `BlockAll` call would block forever and cause the whole
flowgraph to get stuck.
Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>
Create the goroutine only once in getOrCreateMergedTimeTickerSender.
Signed-off-by: lixinguo <xinguo.li@zilliz.com>
Co-authored-by: lixinguo <xinguo.li@zilliz.com>
fix #28629
The original unit test closed the channel before setting the segment id,
so there was a chance that the test read the segment id before it was
set. Change the unit test behavior to wait for the future to return now.
Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>
See also #27675, comment from #27874
This PR changes the `IsFull` logic of the insert buffer from a rowNum
limit to a buffer-size limit.
This shall help form better binlog files when the schema has
variable-length fields (a sketch follows).
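A sketch of the size-based check (field names illustrative; assumes `storage.InsertData` exposes its row count and memory size):

```go
// insertBuffer tracks accumulated bytes so fullness is decided by size,
// not by row count.
type insertBuffer struct {
	rows      int64
	size      int64 // accumulated memory size of buffered data, in bytes
	sizeLimit int64 // e.g. the configured binlog size target
}

func (b *insertBuffer) Buffer(data *storage.InsertData) {
	b.rows += int64(data.GetRowNum())
	b.size += int64(data.GetMemorySize())
}

func (b *insertBuffer) IsFull() bool {
	// bytes instead of rows, so wide VarChar/JSON rows fill the buffer sooner
	return b.size >= b.sizeLimit
}
```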
Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>
See also #28575
Add a zero-length check to `storage.NewPrimaryKeyStats`. This function
shall return an error when a non-positive rowNum is passed.
Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>
See also #27675
- Fix: LevelZero segments could not be flushed
- Add level option for syncTask
- Invoke `AddSegment` when new LevelZero segment is allocated
Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>
See also #27675
Since `MetaWriter` needs the `*datapb.FieldBinlog` struct, the sync task
now generates FieldBinlog directly.
Also fix the merged statslog not being generated when the last task has
no inserts.
Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>
This PR removes all the commented code and files from PR #28320
For naming issues:
- Renaming `MinCheckpoint` to `EarliestPosition`, see #28320 comment
- Renaming `writebuffer.Mananger` to `BufferMananger`, see #27874
comment
Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>
See also #27675
This PR brings the previously merged datanode refactoring online:
- Use the write node to replace the insert/delete nodes
- Use the write buffer manager to control all buffers
- Use the sync manager to control sync tasks instead of the flush manager
Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>
The compaction plan result used to contain one segment per plan. Since L0
compaction writes to multiple segments, this PR expands the number of
segments in plan results and refactors some names for readability.
Name refactoring:
- CompactionStateResult -> CompactionPlanResult
- CompactionResult -> CompactionSegment
See also: #27606
Signed-off-by: yangxuan <xuan.yang@zilliz.com>
- Use explicit lifetime control methods: `Start` and `Stop`
- Allow controlling the retry option
- Make sure the tt sender worker exits before `Stop` returns (a sketch
follows)
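A minimal sketch of that contract (stdlib only; `send` is a stub):

```go
type ttSender struct {
	interval time.Duration
	closeCh  chan struct{}
	wg       sync.WaitGroup
}

func (s *ttSender) send() { /* flush merged time ticks; omitted */ }

func (s *ttSender) Start() {
	s.wg.Add(1)
	go func() {
		defer s.wg.Done()
		ticker := time.NewTicker(s.interval)
		defer ticker.Stop()
		for {
			select {
			case <-s.closeCh:
				return // worker exits on Stop
			case <-ticker.C:
				s.send()
			}
		}
	}()
}

// Stop blocks until the worker goroutine has fully exited.
func (s *ttSender) Stop() {
	close(s.closeCh)
	s.wg.Wait()
}
```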
Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>