milvus

mirror of https://gitee.com/milvus-io/milvus.git synced 2024-12-04 04:49:08 +08:00

Author	SHA1	Message	Date
SimFG	5016038781	enhance: release the record in delete codec and add some log for compaction (#34454 ) /kind improvement Signed-off-by: SimFG <bang.fu@zilliz.com>	2024-07-09 15:40:17 +08:00
yihao.dai	e15ac2b472	fix: Fix incorrect segment num rows (#34441 ) Repeated calls to UpdateStatistics, this PR correct it. issue: https://github.com/milvus-io/milvus/issues/34440 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-07-08 20:00:14 +08:00
yihao.dai	4e5f1d5f75	enhance: Pre-allocate ids for import (#33958 ) The import is dependent on syncTask, which in turn relies on the allocator. This PR pre-allocate the necessary IDs for import syncTask. issue: https://github.com/milvus-io/milvus/issues/33957 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-07-07 21:26:14 +08:00
yihao.dai	43fd8d19c2	enhance: Check segment existence when FlushSegments and add some key logs (#34438 ) Check if the segment exists during FlushSegments and add some key logs in write path. issue: https://github.com/milvus-io/milvus/issues/34255 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-07-06 08:50:11 +08:00
jaime	21fc5f5d46	enhance: Remove datanode reporting TT based on MQ implementation (#34421 ) issue: #34420 Signed-off-by: jaime <yun.zhang@zilliz.com>	2024-07-05 15:48:09 +08:00
yihao.dai	0b404bff22	fix: Correct the update logic of timerecorder (#34339 ) Correct the update logic of timerecorder in the flowgraph to avoid false failure: "some node(s) haven't received input". issue: https://github.com/milvus-io/milvus/issues/34337 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-07-04 16:34:17 +08:00
XuanYang-cn	8a2be8a457	fix: DataNode might OOM by estimating based on MemorySize (#34201 ) See also: #34136 --------- Signed-off-by: yangxuan <xuan.yang@zilliz.com>	2024-07-04 10:16:17 +08:00
cai.zhang	2e434e4453	fix: Only load or release Flushed segment in datanode meta (#34390 ) issue: #34376 , #34379 , #34375 --------- Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>	2024-07-03 17:52:09 +08:00
cai.zhang	8165961223	enhance: Optimeize clustering compaction (#34313 ) issue: #30633 Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>	2024-07-03 15:14:08 +08:00
cai.zhang	4cf1a358ba	fix: Sync the sealed and flushed segments to datanode (#34301 ) issue: #33696 Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>	2024-07-01 22:42:08 +08:00
jaime	d6afb31b94	enhance: make subfunctions of datanode component modular (#33992 ) issue: #33994 also remove deprecated channel manager based on the etcd implementation Signed-off-by: jaime <yun.zhang@zilliz.com>	2024-07-01 14:46:07 +08:00
wayblink	e5d691d854	Use new stream segment reader in clustering compaction (#34232 ) #32939 Signed-off-by: wayblink <anyang.wang@zilliz.com>	2024-06-30 20:26:07 +08:00
wayblink	fbe3231b1f	fix: fix error ignore in compactor (#34169 ) #34170 Signed-off-by: wayblink <anyang.wang@zilliz.com>	2024-06-26 10:24:03 +08:00
jaime	9630974fbb	enhance: move rocksmq from internal to pkg module (#33881 ) issue: #33956 Signed-off-by: jaime <yun.zhang@zilliz.com>	2024-06-25 21:18:15 +08:00
cai.zhang	c65f41dc60	fix: Only sync flushed segments to datanode (#34156 ) issue: #33540 Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>	2024-06-25 20:46:07 +08:00
congqixia	962a5446f8	enhance: Add ctx in `SyncTask.Run` to be cancellable (#34042 ) Related to #33716 This PR add context param in SyncTask.Run execution functions to make it cancellable from the caller. This make it possible to cancel task when datanode/data sync service is beeing shut down. Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-06-25 14:22:04 +08:00
congqixia	506a915272	fix: Deep copy ImportTask.segmentsInfo to prevent data race (#34090 ) See also #34089 Signed-off-by: Congqi Xia <congqi.xia@zilliz.com> Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-06-25 10:06:02 +08:00
congqixia	b5c9a7364b	fix: Prevent remove new growing L1 segment when SyncSegments (#34056 ) Related to #34018 Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-06-24 10:18:01 +08:00
yihao.dai	6c1d815894	enhance: Remove the unused compaction logic from shard (#33932 ) 1. Remove the `compactTo` field in `SegmentInfo`. 2. Remove the target segment not match and its retry logic in `SyncManager`. issue: https://github.com/milvus-io/milvus/issues/32809 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-06-23 21:12:01 +08:00
wayblink	380d3f4469	fix: Fix memory buffer error & some renaming (#33850 ) #30633 --------- Signed-off-by: wayblink <anyang.wang@zilliz.com>	2024-06-21 17:30:01 +08:00
XuanYang-cn	04edb07d82	enhance: Add deltaRowCount in l0 compaction (#33997 ) See also: #33998 Signed-off-by: yangxuan <xuan.yang@zilliz.com>	2024-06-20 17:46:01 +08:00
cqy123456	dc4437ff82	enhance: use segment id and type to register in MmapChunkManager and opt malloc in variableChunk (#33993 ) issue: https://github.com/milvus-io/milvus/issues/32984 Signed-off-by: cqy123456 <qianya.cheng@zilliz.com>	2024-06-20 17:42:02 +08:00
wei liu	31ef0a1fe8	enhance: Add trace for bf cost in l0 compactor (#33860 ) Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-06-20 10:10:05 +08:00
smellthemoon	2a1356985d	enhance: support null in go payload (#32296 ) #31728 --------- Signed-off-by: lixinguo <xinguo.li@zilliz.com> Co-authored-by: lixinguo <xinguo.li@zilliz.com>	2024-06-19 17:08:00 +08:00
wayblink	5cb0760187	fix: Small fixs of major compaction (#33929 ) #30633 Signed-off-by: wayblink <anyang.wang@zilliz.com>	2024-06-18 16:53:58 +08:00
cqy123456	32f685ff12	enhance: growing segment support mmap (#32633 ) issue: https://github.com/milvus-io/milvus/issues/32984 Signed-off-by: cqy123456 <qianya.cheng@zilliz.com>	2024-06-18 14:42:00 +08:00
cai.zhang	95148866ed	fix: Don't remove growing L0 segment in datanode metacache (#33829 ) issue: #33540 1. gorwing L0 segments is invisible to datacoord. 2. flushed L0 segments need to clean by datacoord. Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>	2024-06-17 10:09:57 +08:00
yihao.dai	1a9ab52f66	enhance: Ensure the idempotency of compaction task (#33872 ) /kind enhancement Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-06-16 22:09:57 +08:00
yihao.dai	8537f3daeb	enhance: Rename Compaction to CompactionV2 (#33858 ) Due to the removal of injection and syncSegments from the compaction, we need to ensure that no compaction is successfully executed during the rolling upgrade. This PR renames Compaction to CompactionV2, with the following effects: - New datacoord + old datanode: Utilizes the CompactionV2 interface, resulting in the datanode error "CompactionV2 not implemented," causing compaction to fail; - Old datacoord + new datanode: Utilizes the CompactionV1 interface, resulting in the datanode error "CompactionV1 not implemented," causing compaction to fail. issue: https://github.com/milvus-io/milvus/issues/32809 Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-06-16 22:07:57 +08:00
yihao.dai	86a36b105a	enhance: Tidy compaction executor (#33778 ) Move compaction executor to compaction pacakge. issue: https://github.com/milvus-io/milvus/issues/32451 Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-06-14 14:34:01 +08:00
wei liu	4987067375	enhance: Execute bloom filter apply in parallel to speed up segment predict (#33792 ) issue: #33610 Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-06-14 11:37:56 +08:00
wei liu	ab93d9c23d	enhance: Use BatchPkExist to reduce bloom filter func call cost (#33611 ) issue:#33610 Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-06-13 17:57:56 +08:00
ArenaSu	2dfa752527	doc: [skip-e2e] add comments for event manager (#33444 ) Add comments for event manager(internal/datanode/event_manager.go). Signed-off-by: ArenaSu <704427617@qq.com>	2024-06-13 17:56:06 +08:00
congqixia	512ea6be5f	enhance: Avoid merging insert data when buffering insert msgs (#33562 ) See also #33561 This PR: - Use zero copy when buffering insert messages - Make `storage.InsertCodec` support serialize multiple insert data chunk into same batch binlog files Signed-off-by: Congqi Xia <congqi.xia@zilliz.com> --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-06-13 11:15:56 +08:00
congqixia	9ab3058da2	fix: Prevent restart timetick sender creating ut datanode (#33790 ) See also #33789 Signed-off-by: Congqi Xia <congqi.xia@zilliz.com> Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-06-12 22:29:58 +08:00
yihao.dai	9a3e4080f1	enhance: Add comment for channel cp updater (#33759 ) /kind enhancement Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-06-12 20:01:55 +08:00
coldWater	6b9901c59f	enhance: add a semaphore for CompactionExecutor (#33558 ) #33182 --------- Signed-off-by: coldWater <254244460@qq.com>	2024-06-11 17:25:55 +08:00
yihao.dai	eb5d4de390	fix: Check if the import job exists (#33672 ) issue: https://github.com/milvus-io/milvus/issues/33671 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-06-10 21:51:55 +08:00
wayblink	a1232fafda	feat: Major compaction (#33620 ) #30633 Signed-off-by: wayblink <anyang.wang@zilliz.com> Co-authored-by: MrPresent-Han <chun.han@zilliz.com>	2024-06-10 21:34:08 +08:00
yihao.dai	3540eee977	enhance: Support L0 import (#33514 ) issue: https://github.com/milvus-io/milvus/issues/33157 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-06-07 14:17:20 +08:00
XuanYang-cn	4dd0c54ca0	fix: Fix l0 compactor may cause DN from OOM (#33554 ) See also: #33547 --------- Signed-off-by: yangxuan <xuan.yang@zilliz.com>	2024-06-06 14:33:52 +08:00
XuanYang-cn	68c9e7db8c	fix: Sync dropped segment for dropped partition (#33331 ) See also: #33330 --------- Signed-off-by: yangxuan <xuan.yang@zilliz.com>	2024-06-06 10:25:52 +08:00
cai.zhang	feeb869ff9	enhance: Remove compaction plans on the datanode (#33548 ) issue: #33546 --------- Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>	2024-06-05 20:27:51 +08:00
yihao.dai	bbdf99a45e	fix: Fix import segment size is uneven (#33605 ) The data coordinator computed the appropriate number of import segments, thus when importing in the data node, one can randomly select a segment. issue: https://github.com/milvus-io/milvus/issues/33604 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-06-05 15:41:51 +08:00
zhenshan.cao	ac4f3997ce	enhance: Reconstructing Compaction to possess persistence capability (#33265 ) issue #33586 Signed-off-by: zhenshan.cao <zhenshan.cao@zilliz.com>	2024-06-05 10:17:50 +08:00
aoiasd	387b7cd7f4	enhance:avoid maintain checkpoint info in sync manager (#33413 ) relate: https://github.com/milvus-io/milvus/issues/32915 Signed-off-by: aoiasd <zhicheng.yue@zilliz.com>	2024-06-05 10:05:50 +08:00
congqixia	7f4698f4a7	enhance: Use map PK to timestamp in buffer insert (#33566 ) Related to #27675 Store pk to minimal timestamp in `inData` instead of bloom filter to check whether some delete entry hit current insert batch Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-06-04 10:07:48 +08:00
XuanYang-cn	0382628668	enhance: Add more tracing for l0 compactor (#33435 ) Signed-off-by: yangxuan <xuan.yang@zilliz.com>	2024-06-03 10:19:49 +08:00
wei liu	c6a1c49e02	enhance: Use Blocked Bloom Filter instead of basic bloom fitler impl. (#33405 ) issue: #32995 To speed up the construction and querying of Bloom filters, we chose a blocked Bloom filter instead of a basic Bloom filter implementation. WARN: This PR is compatible with old version bf impl, but if fall back to old milvus version, it may causes bloom filter deserialize failed. In single Bloom filter test cases with a capacity of 1,000,000 and a false positive rate (FPR) of 0.001, the blocked Bloom filter is 5 times faster than the basic Bloom filter in both querying and construction, at the cost of a 30% increase in memory usage. - Block BF construct time {"time": "54.128131ms"} - Block BF size {"size": 3021578} - Block BF Test cost {"time": "55.407352ms"} - Basic BF construct time {"time": "210.262183ms"} - Basic BF size {"size": 2396308} - Basic BF Test cost {"time": "192.596229ms"} In multi Bloom filter test cases with a capacity of 100,000, an FPR of 0.001, and 100 Bloom filters, we reuse the primary key locations for all Bloom filters to avoid repeated hash computations. As a result, the blocked Bloom filter is also 5 times faster than the basic Bloom filter in querying. - Block BF TestLocation cost {"time": "529.97183ms"} - Basic BF TestLocation cost {"time": "3.197430181s"} --------- Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-05-31 17:49:45 +08:00
cai.zhang	77637180fa	enhance: Periodically synchronize segments to datanode watcher (#33420 ) issue: #32809 --------- Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>	2024-05-30 13:37:44 +08:00

1 2 3 4 5 ...

1050 Commits