milvus

mirror of https://gitee.com/milvus-io/milvus.git synced 2024-11-30 02:48:45 +08:00

Author	SHA1	Message	Date
congqixia	d9efea2fea	fix: Cleanup write buffer when flowgraph released (#31376 ) See also #30137 Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-03-19 01:33:05 +08:00
yihao.dai	776709e5ff	fix: Fix binlog import (#31310 ) Fix binlog import functionality by removing the existing check and refining the size retrieval process. issue: https://github.com/milvus-io/milvus/issues/31221, https://github.com/milvus-io/milvus/issues/28521 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-03-17 20:59:04 +08:00
yihao.dai	811316d2ba	fix: Fix binlog import and refine error reporting (#31241 ) 1. Fix binlog import with partition key. 2. Refine binlog import error reporting. 3. Avoid division by zero when retrieving import progress. issue: https://github.com/milvus-io/milvus/issues/31221, https://github.com/milvus-io/milvus/issues/28521 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-03-15 10:55:05 +08:00
jaime	db79be3ae0	fix: ctx cancel should be the last step while stopping server (#31220 ) issue: #31219 Signed-off-by: jaime <yun.zhang@zilliz.com>	2024-03-15 10:33:05 +08:00
XuanYang-cn	a52a52064d	fix: Use lock and map instead of concurrentMap (#31212 ) See also: #31209 --------- Signed-off-by: yangxuan <xuan.yang@zilliz.com>	2024-03-14 18:39:04 +08:00
Buqian Zheng	3c80083f51	feat: [Sparse Float Vector] add sparse vector support to milvus components (#30630 ) add sparse float vector support to different milvus components, including proxy, data node to receive and write sparse float vectors to binlog, query node to handle search requests, index node to build index for sparse float column, etc. https://github.com/milvus-io/milvus/issues/29419 --------- Signed-off-by: Buqian Zheng <zhengbuqian@gmail.com>	2024-03-13 14:32:54 -07:00
yihao.dai	b5c67948b7	enhance: Enhance and modify the return content of ImportV2 (#31192 ) 1. The Import APIs now provide detailed progress information for each imported file, including details such as file name, file size, progress, and more. 2. The APIs now return the collection name and the completion time. 3. Other modifications include changing jobID to jobId and other similar adjustments. issue: https://github.com/milvus-io/milvus/issues/28521 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-03-13 19:51:03 +08:00
congqixia	937f2440ab	fix: TestBlock case use different segment id in testcase (#31173 ) Resolves: #31172 Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-03-11 17:51:03 +08:00
congqixia	ff1e967e89	enhance: Add segment id short cut for WithSegmentID filter (#31144 ) See also #31143 This PR adds a shortcut for the datanode metacache `WithSegmentIDs` filter, which can fetch segments directly from the map using the provided segmentIDs. It also adds a benchmark comparing the new implementation with the old one. Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-03-11 10:55:02 +08:00
Ted Xu	987d9023a5	enhance: Enable binlog deserialize reader in datanode compaction (#31036 ) See #30863 Signed-off-by: Ted Xu <ted.xu@zilliz.com>	2024-03-08 18:25:02 +08:00
yihao.dai	c411cb4a49	enhance: Prevent the backlog of channelCP update tasks, perform batch updates of channelCPs (#30941 ) This PR includes the following adjustments: 1. To prevent channelCP update task backlog, only one task with the same vchannel is retained in the updater. Additionally, the lastUpdateTime is refreshed after the flowgraph submits the update task, rather than in the callBack function. 2. Batch updates of multiple vchannel checkpoints are performed in the UpdateChannelCheckpoint RPC (default batch size is 128). Additionally, the lock for channelCPs in DataCoord meta has been switched from key lock to global lock. 3. The concurrency of UpdateChannelCheckpoint RPCs in the datanode has been reduced from 1000 to 10. issue: https://github.com/milvus-io/milvus/issues/30004 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com> Co-authored-by: jaime <yun.zhang@zilliz.com> Co-authored-by: congqixia <congqi.xia@zilliz.com>	2024-03-07 20:39:02 +08:00
yihao.dai	0a2c255630	enhance: Reduce the memory usage of the timeTickSender (#30968 ) In the cache of the timeTickSender, retain only the latest stats instead of storing stats for every time tick. issue: https://github.com/milvus-io/milvus/issues/30967 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-03-02 10:13:01 +08:00
yihao.dai	a434d33e75	feat: Add import scheduler and manager (#29367 ) This PR introduces novel managerial roles for importv2: 1. ImportMeta: To manage all the import tasks; 2. ImportScheduler: To process tasks and modify their states; 3. ImportChecker: To ascertain the completion of all tasks and instigate relevant operations. issue: https://github.com/milvus-io/milvus/issues/28521 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-03-01 18:31:02 +08:00
wayblink	f3c56c83ab	fix: Fix binlog_io metric name conflict (#30689 ) follow: #29725 Signed-off-by: wayblink <anyang.wang@zilliz.com>	2024-03-01 18:13:02 +08:00
XuanYang-cn	2867f50fcc	fix: Clear DN unknown compaction tasks (#30850 ) If DC restarted, those unknown compaction tasks will never get a callback in DN, so the segments in the compaction task will stay locked, unable to sync or be compacted again, blocking cp advance and compaction execution. See also: #30137 --------- Signed-off-by: yangxuan <xuan.yang@zilliz.com>	2024-03-01 11:31:00 +08:00
yihao.dai	a5a4ca8459	enhance: Remove debug log (#30955 ) Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-03-01 10:02:59 +08:00
chyezh	0c7474d7e8	enhance: add graceful stop timeout to avoid node stop hang under extreme cases (#30317 ) 1. add coordinator graceful stop timeout to 5s 2. change the order of datacoord component while stop 3. change querynode grace stop timeout to 900s, and we should potentially change this to 600s when graceful stop is smooth issue: #30310 also see pr: #30306 --------- Signed-off-by: chyezh <chyezh@outlook.com>	2024-02-29 17:01:50 +08:00
yiwangdr	c6665c2a4c	test: support multiple data/querynodes in integration test (#30618 ) issue: https://github.com/milvus-io/milvus/issues/29507 Signed-off-by: yiwangdr <yiwangdr@gmail.com>	2024-02-21 11:54:53 +08:00
wayblink	f976385421	enhance: replace binlogIO with io.BinlogIO in datanode (#29725 ) #30633 Signed-off-by: wayblink <anyang.wang@zilliz.com>	2024-02-20 14:38:51 +08:00
wayblink	6c89609de7	enhance: Reduce unnecessary log in binlog_io (#30625 ) Signed-off-by: wayblink <anyang.wang@zilliz.com>	2024-02-18 16:50:51 +08:00
congqixia	3c2e0375df	fix: make compactor inject done called no more than once (#30603 ) See also #30571 When `compactionExecutor` stops one compaction task, the `stop` method will cause `injectDone` to be called. However, in `executeTask`, when the `compact` method returns an error, it invokes `injectDone` as well. That is the reason the `Unlock of unlocked RWMutex` panic happened. This PR adds sync.Once to make sure that `injectDone` is called only once. We did not remove any of the `injectDone` calls, since removing any of those invocations may cause logic problems. --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-02-18 14:08:49 +08:00
congqixia	91b02b5d22	enhance: Add param item for datanode l0 batch/linear mode memory ratio (#30523 ) See also #27606 --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-02-18 13:02:50 +08:00
congqixia	b111f3b110	enhance: Use RWMutex and change WLock to RLock (#30557 ) Related to #27675 Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-02-06 17:13:56 +08:00
congqixia	d4100d5442	enhance: Change update channel cp magic number to param item (#30555 ) See also #28817 Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-02-06 16:02:00 +08:00
congqixia	a68b32134a	fix: Verify sync task target segment and retry if not match (#30500 ) See also #27675 #30469 For a sync task, the segment could be compacted while the sync task is running. In the previous implementation, the sync task held only the old segment id as KeyLock, in which case compaction on the compacted-to segment could run in parallel with the delta sync of this sync task. This PR introduces sync target segment verification logic. It checks the target segment lock it is holding before running the actual syncing logic. If this check fails, the sync task returns the `errTargetSegementNotMatch` error and makes the manager re-fetch the current target segment id. Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-02-05 11:33:43 +08:00
yihao.dai	18b979d9b4	enhance: Extend support for varchar autoID to BulkInsertV2 (#30477 ) issue: https://github.com/milvus-io/milvus/issues/30476 Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-02-04 16:57:05 +08:00
XuanYang-cn	e6eb6f2c78	enhance: Speed up L0 compaction (#30410 ) This PR changes the following to speed up L0 compaction and prevent OOM: 1. Lower deltabuf limit to 16MB by default, so that each L0 segment would be 4X smaller than before. 2. Add BatchProcess, use it if memory is sufficient 3. Iterator will Deserialize when HasNext is called, to avoid a massive memory peak 4. Add tracing in spiltDelta See also: #30191 --------- Signed-off-by: yangxuan <xuan.yang@zilliz.com>	2024-02-04 10:49:05 +08:00
yihao.dai	7ce876a072	fix: Decoupling importing segment from flush process (#30402 ) This PR decouples the importing segment from the flush process by: 1. Excluding the importing segment from the flush policy; this approach avoids notifying the datanode to flush the importing segment, which may not exist. 2. When RootCoord calls Flush, DataCoord directly sets the importing segment state to `Flushed`. issue: https://github.com/milvus-io/milvus/issues/30359 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-02-03 13:01:12 +08:00
congqixia	1ab851d73f	enhance: Remove useless frequent log in Mintimestamp (#30471 ) Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-02-02 20:39:05 +08:00
XuanYang-cn	d744962aa1	fix: Correct Size calculation of DeleteData (#30397 ) This PR would correct the actual deltalog size See also: #30191 --------- Signed-off-by: yangxuan <xuan.yang@zilliz.com>	2024-02-02 10:47:04 +08:00
XuanYang-cn	fb5e09d94d	fix: call injectDone after compaction failed (#30277 ) syncMgr.Block() will lock the segment when executing compaction. The previous implementation was unable to Unblock those segments when compaction failed. If the next compaction of the same segments arrives, it will be stuck forever and block all later compaction tasks. This PR makes sure the compaction executor Unblocks these segments after a failed compaction. Apart from that, this PR also refines some logs and cleans up some code of compaction and compactor: 1. Log segment count instead of segmentIDs to avoid logging too many segments 2. Flush RPC returns L1 segments only, skip L0 and L2 3. CompactionType is checked in `Compaction`, no need to check again inside compactor 4. Use a lighter method to replace `getSegmentMeta` 5. Log information for L0 compaction when it encounters an error See also: #30213 --------- Signed-off-by: yangxuan <xuan.yang@zilliz.com>	2024-02-01 14:25:04 +08:00
congqixia	be8831b311	enhance: Reduce get segments scan during l0 compaction (#30408 ) See also #27606 Previously l0 linear compaction will scan all target segment id from metacache for each line of delta entry, which is not needed since compaction target segments shall be all immutable. Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-02-01 10:59:03 +08:00
yihao.dai	c5918290e6	feat: Add import executor and manager for datanode (#29438 ) This PR introduces novel importv2 roles for datanode: 1. Executor: To execute tasks, an import task will be divided into the following steps: read data -> hash data -> sync data; 2. Manager: To manage all the tasks; issue: https://github.com/milvus-io/milvus/issues/28521 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-01-31 20:45:04 +08:00
congqixia	fc0d007bd1	enhance: Add `MemoryHighSyncPolicy` back to write buffer manager (#29997 ) See also #27675 This PR adds back MemoryHighSyncPolicy implementation. Also change MinSegmentSize & CheckInterval to configurable param item. --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-01-31 19:03:04 +08:00
congqixia	b5e078c4d3	enhance: Remove current stats after RollStats action (#30391 ) See also #27675 BloomFilterSet.current shall be reset after RollStats, otherwise it will keep tracking whole segment data causing the false positive ratio larger than expected. Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-01-31 18:55:04 +08:00
chyezh	6d63fb5d3f	fix: panic with datanode negative wait group counter (#30135 ) issue: #29170 Signed-off-by: chyezh <chyezh@outlook.com>	2024-01-30 18:15:04 +08:00
congqixia	0c7a96b48d	enhance: Make compaction log has traceID (#30338 ) See also #30167 After adding support for OpenTelemetry tracing, we want to have the traceID in logs as well. This PR adds util functions to set the traceID with a span and propagate the traceID between different contexts. --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-01-30 10:09:03 +08:00
congqixia	743bdf1434	enhance: Make l0 compactor download files in parallel (#30309 ) See also #27606 `MultiRead` actually downloads files in sequence, which may lead to large time consumption during the l0 compaction download phase. This PR makes the l0 compactor download deltalogs in parallel, utilizing the conc package & io pool. --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-01-30 10:07:09 +08:00
congqixia	6445880753	fix: prevent segments got flushed multiple times (#30240 ) See also #30111 Segments could be "Flushed" only by the `FlushSegments` grpc call from datacoord by design. There are two possible reasons for one segment getting flushed multiple times. - Segment is in flushing state during multiple epochs in flowgraph - Segment is flushed by flushTs & Flush segments So this PR fixes: - Remove state change logic from FlushTs policy - Change Flush segment into a three-stage way: Sealed->Flushing->Flushed, preventing multiple Flushed=true operations. Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-01-24 14:19:00 +08:00
congqixia	6a73860815	enhance: Add open telemetry tracing for compaction (#30168 ) Resolves #30167 This PR adds tracing for all compaction, from the task start in datacoord to the execution procedures in datanode. --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-01-23 10:37:00 +08:00
congqixia	8a6de3d2b1	fix: decompress deltalog path for level zero compaction (#30164 ) Resolves: #30161 See also: #28873 --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-01-22 14:44:55 +08:00
yihao.dai	8780d65b66	fix: Use channel cp as the dml&start position for import segments (#30107 ) This PR discontinues the subscription to the mq and, instead, employs the channel checkpoint as the DML and starting position for the import segments. issue: https://github.com/milvus-io/milvus/issues/30106 Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-01-22 14:36:55 +08:00
XuanYang-cn	3d46096f86	fix: Set segment level for compact to segment (#30129 ) See also: #29204 Signed-off-by: yangxuan <xuan.yang@zilliz.com>	2024-01-19 18:52:53 +08:00
congqixia	0d356b0545	enhance: only buffer delete if it match insert has smaller timestamp (#30122 ) See also: #30121 #27675 This PR changes the delete buffering logic: - Write buffer shall buffer insert first - Then the delete messages shall be evaluated - Whether PK matches previous Bloom filter, whose ts is always smaller - Whether PK matches insert data which has smaller timestamp - Then the segment bloom filter is updated by the newly buffered pk rows --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-01-19 17:28:53 +08:00
XuanYang-cn	86f48861c1	fix: Add more throughput in related metrics (#30038 ) This PR also fixes bugs in l0 compactor where l0 results would never be removed from datanode See also: #30099 Signed-off-by: yangxuan <xuan.yang@zilliz.com>	2024-01-19 11:34:54 +08:00
smellthemoon	e52ce370b6	enhance:don't store logPath in meta to reduce memory (#28873 ) don't store logPath in meta to reduce memory, when service get segmentinfo, generate logpath from logid. #28885 Signed-off-by: lixinguo <xinguo.li@zilliz.com> Co-authored-by: lixinguo <xinguo.li@zilliz.com>	2024-01-18 22:06:31 +08:00
XuanYang-cn	ad7a0b4091	fix: Change finish log level to info (#30031 ) Signed-off-by: yangxuan <xuan.yang@zilliz.com>	2024-01-17 10:12:55 +08:00
SimFG	d9edd50f97	fix: the delete msg disorder issue (#29915 ) /kind improvement Signed-off-by: SimFG <bang.fu@zilliz.com>	2024-01-14 10:26:52 +08:00
congqixia	ed89c6a2ee	enhance: make compactor use actual buffer size to decide when to sync (#29945 ) See also: #29657 Datanode Compactor uses the estimated row number from the schema to decide when to sync the batch of data when executing compaction. This estimated value could deviate far from the actual size when the schema contains variable-length fields (say VarChar, JSON, etc.). This PR makes the compactor able to check the actual buffer data size, making it possible to sync when the buffer is actually beyond the max binlog size. --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-01-13 01:32:52 +08:00
Bingyi Sun	e1258b8cad	feat: integrate storagev2 into loading segment (#29336 ) issue: #29335 --------- Signed-off-by: sunby <sunbingyi1992@gmail.com>	2024-01-12 18:10:51 +08:00


946 Commits