milvus

mirror of https://gitee.com/milvus-io/milvus.git synced 2024-11-30 10:59:32 +08:00

Author	SHA1	Message	Date
jaime	da2d3fb430	enhance: enable manual compaction for collections without indexes (#36581 ) issue: #36576 pr: #36577 Signed-off-by: jaime <yun.zhang@zilliz.com>	2024-10-08 14:51:20 +08:00
Zhen Ye	e34fa0461b	fix: port listen racing in mix or standalone mode (#36459 ) issue: #36441 pr: #36442 --------- Signed-off-by: chyezh <chyezh@outlook.com>	2024-09-26 21:01:15 +08:00
wei liu	975a9797a2	enhance: Enable dynamic update loaded collection's replica (#36417 ) issue: #35821 pr: #35822 After collection loaded, if we need to increase/decrease collection's replica, we need to release and load it again. milvus offers 4 solution to update loaded collection's replica, this PR aims to dynamic change the replica number without release, and after replica number changed, milvus will execute load replica or release replica in async, and the replica loaded status can be checked by getReplicas API. Notice that if set too much replicas than querynode can afford，the new replica won't be loaded successfully until enough querynode joins. --------- Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-09-26 10:43:15 +08:00
congqixia	210adbaffc	fix: [2.4] Wait check node id goroutine in case of data race (#36302 ) (#36377 ) Cherry-pick from master pr: #36302 Resolves: #36301 Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-09-20 13:03:12 +08:00
Buqian Zheng	089790a459	enhance: [2.4]Allow empty sparse row (#36061 ) pr: #34700 Signed-off-by: Buqian Zheng <zhengbuqian@gmail.com>	2024-09-12 10:13:09 +08:00
wei liu	ceca666e2a	fix: Fix privilege group hasn't been register for validate (#35938 ) issue: #35471 pr: #35937 --------- Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-09-05 18:09:11 +08:00
yihao.dai	b578064869	fix: Fix DB limiter nodes are mistakenly cleaned up (#35991 ) (#35992 ) This issue only occurs for a short time right after a table is created. To avoid this, we simply reduce the frequency of cleaning up invalid limiter nodes. issue: https://github.com/milvus-io/milvus/issues/35933 pr: https://github.com/milvus-io/milvus/pull/35991 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-09-05 17:31:05 +08:00
jaime	a044a590cc	enhance: add IT for rate limit using db properties (#35931 ) issue: #35929 pr: #35930 Signed-off-by: jaime <yun.zhang@zilliz.com>	2024-09-04 14:35:05 +08:00
wei liu	da026b1e28	enhance: Add depguard rules to ban deprecated proto lib (#35140 ) (#35818 ) See also #34394 #34252 pr: #35140 Signed-off-by: Wei Liu <wei.liu@zilliz.com> Co-authored-by: congqixia <congqi.xia@zilliz.com>	2024-08-30 14:13:01 +08:00
XuanYang-cn	ff68eb53d7	fix: [skip e2e]unstable l0 it (#35613 ) See also: #35617 pr: #35612 Signed-off-by: yangxuan <xuan.yang@zilliz.com>	2024-08-26 18:53:04 +08:00
wei liu	421a00bfe8	enhance: Refresh proxy cache after restore rbac meta (#35636 ) issue: #35443 pr: #35635 --------- Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-08-22 19:09:01 +08:00
wei liu	e2542a1bf5	enhance: Update protobuf-go to protobuf-go v2 (#34394 ) (#35555 ) issue: #34252 pr: #34394 #35072 #35084 Signed-off-by: Wei Liu <wei.liu@zilliz.com> Co-authored-by: Congqi Xia <congqi.xia@zilliz.com>	2024-08-21 18:50:58 +08:00
wei liu	4bf4cbad85	enhance: Mark query node as read only after suspend (#35492 ) (#35586 ) issue: #34985 #35493 pr: #35492 after querynode has been suspended, it's not allow to load segment/channel on it, which means the node is read only. to be compatible with resource group design, after query node has been suspend, we remove it from it's original resource group, make it a read only query node in replica. then two things will happens: 1. it's original resource group will be lacking of query nodes, query coord will assign new node to it. 2. querycoord will try to move out all segments/channels after querynode has been suspended Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-08-20 19:00:56 +08:00
Chun Han	cf8494ef45	enhance: support httpv1/v2 throttle and add it for httpV2(#35350 ) (#35504 ) related: #35350 pr: https://github.com/milvus-io/milvus/pull/35470 Signed-off-by: MrPresent-Han <chun.han@gmail.com> Co-authored-by: MrPresent-Han <chun.han@gmail.com>	2024-08-20 16:32:56 +08:00
XuanYang-cn	4a5e6bc6f6	enhance: Add integration tests for l0 (#35430 ) pr: #35429 See also: #34796 Signed-off-by: yangxuan <xuan.yang@zilliz.com>	2024-08-19 10:54:55 +08:00
wei liu	248a6ea401	enhance: Add BackupRBAC/RestoreRBAC API to enable rbac backup (#35444 ) (#35513 ) issue: #35443 pr: #35444 --------- Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-08-18 13:42:55 +08:00
smellthemoon	23052f6ac2	fix: upsert use the previous pk in insert when autoid(#34672 ) (#35130 ) issue: #34668 pr: #34672 Signed-off-by: lixinguo <xinguo.li@zilliz.com> Co-authored-by: lixinguo <xinguo.li@zilliz.com>	2024-08-12 18:58:25 +08:00
wei liu	0201e00a2f	enhance: enable to set load config in cluster level (#35293 ) issue: #35170 pr: #35169 This PR enable to set load configs in cluster level, such as replicas and resource groups. then when load collections will use the load config. Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-08-07 12:38:21 +08:00
wei liu	2ac1bf7532	enhance: Enable setting the replica number and resource group during collection creation (#34403 ) (#34561 ) issue: #30040 pr: #34403 --------- Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-08-06 15:06:17 +08:00
yellow-shine	3a997e8056	enhance: docker-compose first then try to use docker compose (#35227 ) https://github.com/milvus-io/milvus/issues/35209 https://github.com/milvus-io/milvus/pull/35208 --------- Signed-off-by: Yellow Shine <sammy.huang@zilliz.com>	2024-08-02 19:37:09 +08:00
cai.zhang	c340f387cf	enhance: [cherry-pick] Change the fixed value to a ratio for clustering segment size (#35075 ) issue: #34495 master pr: #35076 Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>	2024-07-31 10:32:00 +08:00
congqixia	079276c6ff	fix: [2.4] Unify hook singleton implementation in proxy (#34888 ) Cherry-pick from master pr: #34887 Related to #34885 --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-07-26 18:07:53 +08:00
wayblink	1e5c71d550	fix: [cherry-pick] fix dropped segment still visible after dropped by L2 single compaction (#35006 ) bug: #35003 Signed-off-by: wayblink <anyang.wang@zilliz.com>	2024-07-26 13:47:48 +08:00
cai.zhang	74adedf750	enhance: Optimized the GC logic to ensure that memory is released in time (#34950 ) issue: #34703 master pr: #34949 Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>	2024-07-24 14:07:43 +08:00
Chun Han	ae1636c2be	fix: refine handling type for segment pruner(#34923 ) (#34926 ) related: #34923 pr: https://github.com/milvus-io/milvus/pull/34925 Signed-off-by: MrPresent-Han <chun.han@gmail.com> Co-authored-by: MrPresent-Han <chun.han@gmail.com>	2024-07-24 12:05:44 +08:00
wayblink	8f3c126129	enhance: [cherry-pick] support l2 single compaction (#34929 ) #34928 pr: #34935 Signed-off-by: wayblink <anyang.wang@zilliz.com>	2024-07-24 11:47:50 +08:00
cai.zhang	4ed62e9dbb	enhance: [cherry-pick] Add integration test for clustering compaction (#34860 ) issue: #34792 master pr: #34881 Signed-off-by: cai.zhang <cai.zhang@zilliz.com>	2024-07-22 17:49:42 +08:00
wayblink	21973a600d	enhance: [cherry-pick] refine clustering compaction basic it (#34794 ) issue: #34792 pr: #34793 Signed-off-by: wayblink <anyang.wang@zilliz.com>	2024-07-21 20:05:41 +08:00
yihao.dai	07bc1b6717	enhance: Seal by total growing segments size (#34692 ) (#34779 ) Seals the largest growing segment if the total size of growing segments of each shard exceeds the size threshold(default 4GB). Introducing this policy can help keep the size of growing segments within a suitable level, alleviating the pressure on the delegator. issue: https://github.com/milvus-io/milvus/issues/34554 pr: https://github.com/milvus-io/milvus/pull/34692 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-07-19 18:25:50 +08:00
XuanYang-cn	edefc3cbb5	enhance: [skip e2e]Enable compaction it test (#34526 ) (#34720 ) pr: #34526 Signed-off-by: yangxuan <xuan.yang@zilliz.com>	2024-07-16 18:19:38 +08:00
smellthemoon	0fdb288de7	enhance: upsert support autoid(#30342 ) (#34633 ) pr: #30342 issue: #29258 Signed-off-by: lixinguo <xinguo.li@zilliz.com>	2024-07-15 20:53:39 +08:00
wei liu	cf701a9bf0	enhance: Preserve fixed-size memory in delegator node for growing segment (#34600 ) issue: #34595 pr: #34596 When consuming insert data on the delegator node, QueryCoord will move out some sealed segments to manage its memory usage. After the growing segment gets flushed, some sealed segments from other workers will be moved back to the delegator node. To avoid the frequent movement of segments, we estimate the maximum growing row count and preserve a fixed-size memory in the delegator node. --------- Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-07-15 20:51:46 +08:00
wei liu	d3e94f9861	enhance: Use Blocked Bloom Filter instead of basic bloom fitler impl (#34377 ) issue: #32995 pr: #33405 To speed up the construction and querying of Bloom filters, we chose a blocked Bloom filter instead of a basic Bloom filter implementation. WARN: This PR is compatible with old version bf impl, but if fall back to old milvus version, it may causes bloom filter deserialize failed. In single Bloom filter test cases with a capacity of 1,000,000 and a false positive rate (FPR) of 0.001, the blocked Bloom filter is 5 times faster than the basic Bloom filter in both querying and construction, at the cost of a 30% increase in memory usage. Block BF construct time {"time": "54.128131ms"} Block BF size {"size": 3021578} Block BF Test cost {"time": "55.407352ms"} Basic BF construct time {"time": "210.262183ms"} Basic BF size {"size": 2396308} Basic BF Test cost {"time": "192.596229ms"} In multi Bloom filter test cases with a capacity of 100,000, an FPR of 0.001, and 100 Bloom filters, we reuse the primary key locations for all Bloom filters to avoid repeated hash computations. As a result, the blocked Bloom filter is also 5 times faster than the basic Bloom filter in querying. Block BF TestLocation cost {"time": "529.97183ms"} Basic BF TestLocation cost {"time": "3.197430181s"} Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-07-05 17:04:10 +08:00
wayblink	c62bf8a0b0	fix: [Cherry-pick]Pick major compaction fixs and optimizations (#34360 ) This PR cherry-picks the following commits: - fix: sync partitiion stats blocking balance task #33742 - fix: Fix meta prefix overlap bug #33830 - fix: Small fixs of major compaction #33929 - fix: Fix memory buffer error & some renaming #33850 - fix: sync part stats task cannot be finished #34027 - Add an option to enable/disable vector field clustering key #34097 - fix: fix error ignore in compactor #34169 - fix:load major compaction partial result #34052 - Use new stream segment reader in clustering compaction #34232 issue: #30633 pr: #33742 #33830 #33929 #33850 #34027 #34097 #34169 #34052 #34232 --------- Signed-off-by: MrPresent-Han <chun.han@zilliz.com> Signed-off-by: wayblink <anyang.wang@zilliz.com> Signed-off-by: MrPresent-Han <chun.han@gmail.com> Co-authored-by: Chun Han <116052805+MrPresent-Han@users.noreply.github.com> Co-authored-by: MrPresent-Han <chun.han@gmail.com>	2024-07-03 09:53:37 +08:00
wayblink	99586066f5	feat: [cherry-pick] Major compaction (#34326 ) This PR cherry-picks the following commits: fix: speed up segment lookup via channel name in datacoord (#33530) needed by the next commit feat: Major compaction (#33620) issue: #30633 pr: #33620 --------- Signed-off-by: yiwangdr <yiwangdr@gmail.com> Signed-off-by: wayblink <anyang.wang@zilliz.com> Co-authored-by: yiwangdr <80064917+yiwangdr@users.noreply.github.com> Co-authored-by: MrPresent-Han <chun.han@zilliz.com>	2024-07-02 18:29:01 +08:00
zhenshan.cao	14a11e379c	enhance: Refactor Compaction to enable persistence(#33265 ) (#34268 ) pr : #33265 issue #33586 Signed-off-by: zhenshan.cao <zhenshan.cao@zilliz.com>	2024-07-01 19:32:07 +08:00
yihao.dai	b1e74dc7cb	enhance: [cherry-pick] Decouple compaction from shard (#34157 ) This PR cherry-picks the following commits: - Implement task limit control logic in datanode. https://github.com/milvus-io/milvus/pull/32881 - Load bf from storage instead of memory during L0 compaction. https://github.com/milvus-io/milvus/pull/32913 - Remove dependencies on shards (e.g. SyncSegments, injection). https://github.com/milvus-io/milvus/pull/33138 - Rename Compaction interface to CompactionV2. https://github.com/milvus-io/milvus/pull/33858 - Remove the unused residual compaction logic. https://github.com/milvus-io/milvus/pull/33932 issue: https://github.com/milvus-io/milvus/issues/32809 pr: https://github.com/milvus-io/milvus/pull/32881, https://github.com/milvus-io/milvus/pull/32913, https://github.com/milvus-io/milvus/pull/33138, https://github.com/milvus-io/milvus/pull/33858, https://github.com/milvus-io/milvus/pull/33932 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-06-25 20:22:03 +08:00
wei liu	061a00c58f	enhance: Enable database level replica num and resource groups for loading collection (#33052 ) (#33981 ) pr: #33052 issue: #30040 This PR introduce two database level props: 1. database.replica.number 2. database.resource_groups User can set those two database props by AlterDatabase API, then can load collection without specified replica_num and resource groups. then it will use database level load param when try to load collections. Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-06-21 16:56:02 +08:00
Cai Yudong	ebd0af14f4	enhance: Handle Float16Vector/BFloat16Vector numpy bulk insert as same as BinaryVector (#33760 ) (#33788 ) pr: #33760 Issue: #22837 Signed-off-by: Cai Yudong <yudong.cai@zilliz.com>	2024-06-13 10:49:57 +08:00
yihao.dai	396f8608dd	fix: Fix multiple vector fields import (#33723 ) (#33724 ) 1. Fix dim mismatch with multi-vector fields and JSON import 2. Enhance: do not display file ID in GetImportResponse. issue: https://github.com/milvus-io/milvus/issues/33681, https://github.com/milvus-io/milvus/issues/33682 pr: https://github.com/milvus-io/milvus/pull/33723 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-06-10 21:55:55 +08:00
yihao.dai	ed1dee9e38	enhance: Support L0 import (#33514 ) (#33712 ) issue: https://github.com/milvus-io/milvus/issues/33157 pr: https://github.com/milvus-io/milvus/pull/33514 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-06-08 11:17:52 +08:00
yihao.dai	8ff5d2793c	fix: Fill stats log id and check validity (#33477 ) (#33478 ) 1. Fill log ID of stats log from import 2. Add a check to validate the log ID before writing to meta issue: https://github.com/milvus-io/milvus/issues/33476 pr: https://github.com/milvus-io/milvus/pull/33477 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-05-31 14:13:46 +08:00
Cai Yudong	68e2d532d8	enhance: Cherry-pick following SparseFloatVector bulk insert PRs to Milvus2.4 (#33391 ) Cherry pick from master pr: #33064 #33101 #33187 #33259 #33224 #33064 Support readable JSON file import for Float16/BFloat16/SparseFloat #33101 Store SparseFloatVector into parquet as JSON string #33187 Fix SparseFloatVector data parse error for parquet #33259 Fix SparseFloatVector data parse error for json #33224 Optimize bulk insert unittest Signed-off-by: Cai Yudong <yudong.cai@zilliz.com>	2024-05-30 10:31:45 +08:00
yihao.dai	ad4c1975bd	fix: Fix filtering by partition key fails for importing data (#33274 ) (#33277 ) Before executing the import, partition IDs should be reordered according to partition names. Otherwise, the data might be hashed to the wrong partition during import. This PR corrects this error. issue: https://github.com/milvus-io/milvus/issues/33237 pr: https://github.com/milvus-io/milvus/pull/33274 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-05-23 11:25:40 +08:00
Cai Yudong	4fc7915c70	enhance: unify data generation test APIs (#32955 ) Issue: #22837 Signed-off-by: Cai Yudong <yudong.cai@zilliz.com>	2024-05-14 14:33:33 +08:00
SimFG	4031abd2fa	enhance: change default partition num to 16 when using partition key (#32950 ) /kind improvement Signed-off-by: SimFG <bang.fu@zilliz.com>	2024-05-13 14:19:31 +08:00
wei liu	e2332bdc17	enhance: Enable channel exclusive balance policy (#32911 ) issue: #32910 * split replica's node list to channels when create replicas * balance nodes among channels when node change happens * implement channel level balance, let balance happens in channel level Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-05-10 17:27:31 +08:00
Cai Yudong	dc89c6f810	enhance: remove duplicated data generation APIs for bulk insert test (#32889 ) Issue: #22837 including following changes: 1. Add API CreateInsertData() and BuildArrayData() in internal/util/testutil 2. Remove duplicated test APIs from importutilv2 unittest and bulk insert integration test Signed-off-by: Cai Yudong <yudong.cai@zilliz.com>	2024-05-10 15:27:31 +08:00
Cai Yudong	bcdbd1966e	feat: Support sparse float vector bulk insert for binlog/json/parquet (#32649 ) Issue: #22837 Signed-off-by: Cai Yudong <yudong.cai@zilliz.com>	2024-05-07 18:43:30 +08:00
yihao.dai	53874ce245	fix: Fix cannot specify partition name in binlog import (#32730 ) issue: https://github.com/milvus-io/milvus/issues/32807 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-05-07 17:19:30 +08:00

1 2 3 4

166 Commits