milvus

mirror of https://gitee.com/milvus-io/milvus.git synced 2024-12-03 04:19:18 +08:00

Author	SHA1	Message	Date
sammy.huang	265453f400	enhance: [skip e2e]revert back to original way to archive, upload and download (#30248 ) Signed-off-by: Sammy Huang <sammy.huang@zilliz.com>	2024-02-01 10:15:04 +08:00
zhuwenxing	b02b525196	test: refine debug log and error handle (#30406 ) refine debug log and error handle --------- Signed-off-by: zhuwenxing <wenxing.zhu@zilliz.com>	2024-02-01 09:59:05 +08:00
congqixia	e677af19b0	enhance: Add PrimaryKeys interface to reduce memory usage (#30405 ) See also #30404 `PrimaryKey` is used to hold pk values for both int64 & varchar data type. Since it is an interface it may occupies more memory than pure slices when holding a group of pks. This PR add `PrimaryKeys` interface when some other module need to hold lots of PrimaryKeys. By using this interface, it could reduce the memory of pk slice to half when using Int64 Pk data type and reduce interface cost for each row of varchar as well. --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-02-01 09:57:11 +08:00
yihao.dai	c5918290e6	feat: Add import executor and manager for datanode (#29438 ) This PR introduces novel importv2 roles for datanode: 1. Executor: To execute tasks, a import task will be divided into the following steps: read data -> hash data -> sync data; 2. Manager: To manage all the tasks; issue: https://github.com/milvus-io/milvus/issues/28521 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-01-31 20:45:04 +08:00
congqixia	fc0d007bd1	enhance: Add `MemoryHighSyncPolicy` back to write buffer manager (#29997 ) See also #27675 This PR adds back MemoryHighSyncPolicy implementation. Also change MinSegmentSize & CheckInterval to configurable param item. --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-01-31 19:03:04 +08:00
congqixia	b5e078c4d3	enhance: Remove current stats after RollStats action (#30391 ) See also #27675 BloomFilterSet.current shall be reset after RollStats, otherwise it will keep tracking whole segment data causing the false positive ratio larger than expected. Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-01-31 18:55:04 +08:00
PowderLi	5cf9bb236e	enhance: restful support import jobs (#30343 ) issue: #28521 #29732 include 1. list collection's import jobs 2. create a new import job 3. get the progress of an import job fix: 1. mix the order of dbName & collectionName #29728 2. trace log keep the same as v1 3. support traceID 4. azure precheck, blob name cannot end with / #29703 --------- Signed-off-by: PowderLi <min.li@zilliz.com>	2024-01-31 17:57:04 +08:00
yah01	2f778d9649	fix: proxy may never setup if the port binded (#30035 ) the proxy miss-returned nil while failed to listen the port, then the server continues to run but we can't connect to service resolve #30034 Signed-off-by: yah01 <yah2er0ne@outlook.com>	2024-01-31 17:37:04 +08:00
yanliang567	54150253e7	enhance: Add more tests for groupby (#30346 ) Related issue: #30033 skip the tests before bug fixes --------- Signed-off-by: yanliang567 <yanliang.qiao@zilliz.com>	2024-01-31 17:23:03 +08:00
yah01	878c4c9463	enhance: limit the max pool size to 16 (#30371 ) according to our benchmark, concurrency level 16 is enough to fully utilize the object storage network bandwidth Signed-off-by: yah01 <yang.cen@zilliz.com>	2024-01-31 14:13:06 +08:00
zhuwenxing	e6daff49a6	test: fix query result verification (#30351 ) fix query result verification: changed the query expression and adopted a more lenient validation method to address the issue of not being able to guarantee the retrieval of specific IDs due to frequent deletion operation Signed-off-by: zhuwenxing <wenxing.zhu@zilliz.com>	2024-01-31 14:11:04 +08:00
sammy.huang	fb9d64ab1e	enhance: hotfix container network (#30389 ) Signed-off-by: Sammy Huang <sammy.huang@zilliz.com>	2024-01-31 10:23:05 +08:00
sammy.huang	fda6f8d04e	enhance: hotfix docker-compose.yml (#30378 ) Signed-off-by: Sammy Huang <sammy.huang@zilliz.com>	2024-01-30 19:23:04 +08:00
chyezh	6d63fb5d3f	fix: panic with datanode negetive wait group counter (#30135 ) issue: #29170 Signed-off-by: chyezh <chyezh@outlook.com>	2024-01-30 18:15:04 +08:00
cai.zhang	47af347d0e	enhance: Limit index pool size of standalone server (#30170 ) issue: #29926 Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>	2024-01-30 16:47:03 +08:00
cai.zhang	f619d792c0	enhance: Break down the granularity of collection info cache expired (#29977 ) issue: #29772 1. `DropPartition` only invalidates the cache related to the partition. 2. `CreateAlias` does not invalidate the cache. Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>	2024-01-30 16:45:02 +08:00
nico	168260cba3	test: update test cases about rbac (#29088 ) issue: #29087 RBAC cases fail a lot. 1. some cases are out of date, for example, the default value of db_name has changed from "default" to "" in some apis 2. add time sleep after the action of grant or revoke, for it costs time to take effect Signed-off-by: nico <cheng.yuan@zilliz.com>	2024-01-30 11:33:02 +08:00
congqixia	7c086a4608	enhance: Set delete scope for LoadSegment streaming data (#30245 ) See also #29474 --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-01-30 11:13:02 +08:00
congqixia	0c7a96b48d	enhance: Make compaction log has traceID (#30338 ) See also #30167 After support open telemetry tracing, we want to have traceID as well, this PR adds util functions to set traceID with span & propagate traceID between different context. --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-01-30 10:09:03 +08:00
congqixia	743bdf1434	enhance: Make l0 compactor download files in parallel (#30309 ) See also #27606 `MultiRead` actually download file in sequence, which may lead to large time consumption during l0 compaction download phase. This PR make l0 compactor download deltalogs in parallel utilizing conc package & io pool. --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-01-30 10:07:09 +08:00
cqy123456	74cfba0249	enhance:limit binlog index rows num (#30173 ) issue: https://github.com/milvus-io/milvus/issues/27678 also relate issue: https://github.com/milvus-io/milvus/issues/30065 Signed-off-by: cqy123456 <qianya.cheng@zilliz.com>	2024-01-29 19:49:02 +08:00
zhuwenxing	f92edc6cc5	test: Update requirements and pytest configuration (#30336 ) Update requirements and pytest configuration Signed-off-by: zhuwenxing <wenxing.zhu@zilliz.com>	2024-01-29 18:59:02 +08:00
binbin	f274c16594	test: add milvus client test cases (#30323 ) issue: #30221 Signed-off-by: binbin lv <binbin.lv@zilliz.com>	2024-01-29 18:33:02 +08:00
chyezh	211143c5e6	enhance: add basic information of milvus into metrics (#29665 ) add basic build information and runtime component dependency into metrics. issue: #29664 Signed-off-by: chyezh <ye.zhen@zilliz.com>	2024-01-29 15:47:02 +08:00
zhuwenxing	fcd9f894ca	test: add invert index and string datatype for bulk insert test (#30334 ) add invert index and string datatype for bulk insert test Signed-off-by: zhuwenxing <wenxing.zhu@zilliz.com>	2024-01-29 14:43:02 +08:00
Bingyi Sun	406bf14e84	enhance: Add growing row count weight (#30271 ) Signed-off-by: sunby <sunbingyi1992@gmail.com>	2024-01-29 14:05:02 +08:00
zhuwenxing	467379d693	test: [skip e2e]add values yaml file for one pod standalone chaos test (#30311 ) add values yaml file for one pod standalone chaos test Signed-off-by: zhuwenxing <wenxing.zhu@zilliz.com>	2024-01-29 13:35:01 +08:00
zhuwenxing	aab7cc9ecd	test: add freshness checker (#30280 ) add freshness checker insert/upsert --> query: Get the time when it can be queried delete --> query: Get the time when it can not be queried Signed-off-by: zhuwenxing <wenxing.zhu@zilliz.com>	2024-01-29 12:09:01 +08:00
xige-16	033eae9e73	enhance: Set segment.maxSize param to 1024M (#30139 ) issue: #25639 /kind improvement When the number of vector columns increases, the number of rows per segment will decrease. In order to reduce the impact on vector indexing performance, it is necessary to increase the segment max limit. If a collection has multiple vector fields with memory and disk indices on different vector fields, the size limit after segment compaction is the minimum of segment.maxSize and segment.diskSegmentMaxSize. Signed-off-by: xige-16 <xi.ge@zilliz.com> --------- Signed-off-by: xige-16 <xi.ge@zilliz.com>	2024-01-29 10:17:02 +08:00
sammy.huang	927d310959	enhance: [skip e2e]change to a better name (#30329 ) Signed-off-by: Sammy Huang <sammy.huang@zilliz.com>	2024-01-29 09:45:08 +08:00
sre-ci-robot	0542a0e7dc	[automated] Update Knowhere Commit (#30332 ) Update Knowhere Commit Signed-off-by: sre-ci-robot sre-ci-robot@users.noreply.github.com Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>	2024-01-29 01:05:01 +08:00
zhagnlu	aeb1e36f00	enhance: change plan desc log from info to debug (#30304 ) #30172 Signed-off-by: luzhang <luzhang@zilliz.com> Co-authored-by: luzhang <luzhang@zilliz.com>	2024-01-28 16:04:38 +08:00
PowderLi	6abbab12fa	feat: restful phase two (#29728 ) issue: #29732 Signed-off-by: PowderLi <min.li@zilliz.com>	2024-01-28 16:03:01 +08:00
PowderLi	8fc4ebfa11	fix: empty MetricType (#30216 ) issue: #30102 #30225 we should read MetricType from SearchResult, because query node never 1. read metricType from LoadMeta 2. store to collection 3. set SearchRequest.MetricType Signed-off-by: PowderLi <min.li@zilliz.com>	2024-01-28 15:33:02 +08:00
smellthemoon	9512af357b	enhance: reduce memory when read data (#30284 ) Signed-off-by: lixinguo <xinguo.li@zilliz.com> Co-authored-by: lixinguo <xinguo.li@zilliz.com>	2024-01-26 20:49:00 +08:00
sre-ci-robot	c3c9dcdc2b	Update all contributors Signed-off-by: sre-ci-robot <sre-ci-robot@zilliz.com>	2024-01-26 12:00:52 +00:00
aoiasd	8385157717	enhance: adjust config source for support config event use paramtable (#29995 ) Adjust config source for support config event which for dynamic config could use paramtable and not deadlock. relate: https://github.com/milvus-io/milvus/issues/29807 Signed-off-by: aoiasd <zhicheng.yue@zilliz.com>	2024-01-26 19:07:00 +08:00
chyezh	f2985d8454	fix: compact operation on datacoord meta should preform as a transcation (#29775 ) issue: #29691 Signed-off-by: chyezh <chyezh@outlook.com>	2024-01-26 16:59:00 +08:00
cai.zhang	d87726e4c7	enhance: Don't expire ShardLeaders cache actively (#29879 ) issue: #29772 The shardLeaders cache does not actively expire, update the cache when search/query fails. Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>	2024-01-26 14:13:01 +08:00
congqixia	20a3569c14	enhance: Set correct role for non-standalone deployment (#30303 ) See also #30211 After fix initialization problem, distributed components do no have their role set. This will cause logger & tracing miss component service info when recording information. Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-01-26 14:05:01 +08:00
xige-16	e9fdd2475d	fix: fix searchPlan metricType modified concurrently (#30227 ) issue: #30225 /kind bug Signed-off-by: xige-16 <xi.ge@zilliz.com> --------- Signed-off-by: xige-16 <xi.ge@zilliz.com>	2024-01-26 14:03:09 +08:00
congqixia	7ced0af197	enhance: Enlarge default datanode sync parallel to 256 (#30270 ) See also #27675 After supporting control sync parallel in datanode globally, the shall change default value to a more suitable value for most use cases. Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-01-26 11:35:00 +08:00
sammy.huang	199c198a55	enhance: support standalone mode with one only pod for e2e test (#30282 ) issue: #30294 --------- Signed-off-by: Sammy Huang <sammy.huang@zilliz.com>	2024-01-26 11:27:02 +08:00
aoiasd	f84d9a589a	fix: channel checker reduce balancing channels. (#30087 ) Ignore leader unavailable when channel checker judge repeat channel to avoid channel checker remove channels balancing. relate: https://github.com/milvus-io/milvus/issues/29841 https://github.com/milvus-io/milvus/issues/29838 Signed-off-by: aoiasd <zhicheng.yue@zilliz.com>	2024-01-26 10:59:00 +08:00
XuanYang-cn	fd19e419f9	fix: Use size bucket for compacted segment size metric (#30028 ) See also: #29204 Signed-off-by: yangxuan <xuan.yang@zilliz.com>	2024-01-26 10:53:02 +08:00
XuanYang-cn	0b6beb7e0f	fix: Fill in info in CompactionSegmentBinlogs (#30279 ) After #28873, PartitionID and CollectionID should be filled in CompactionSegmentBinlog so that DataNode can compose the correct logPath. However There're some places left forgotten to fill in the information, causing Datanode downloading `xxx/0/0/xxxx/xxxx` binlogs during compaction See also: #30213 Signed-off-by: yangxuan <xuan.yang@zilliz.com>	2024-01-26 10:51:02 +08:00
congqixia	405877c8cd	fix: Use correct pools for all CGO methods in segments pkg (#30274 ) See also #30273 This PR: - Rename confusing `LoadIndexInfo` to `UpdateIndexInfo` for LocalSegment - Use `DynamicPool` instead of `LoadPool` for `UpdateSealedSegmentIndex` - Fix cgo call missing pool control Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-01-26 10:09:00 +08:00
binbin	a847d56ac0	test: add part of milvus client test cases (#30222 ) issue: #30221 Signed-off-by: binbin lv <binbin.lv@zilliz.com>	2024-01-26 09:25:01 +08:00
MrPresent-Han	116d0f20b8	fix: groupby bug for ut (#30272 ) related: #29965 Signed-off-by: MrPresent-Han <chun.han@zilliz.com>	2024-01-25 20:57:00 +08:00
yihao.dai	c02fb64ad6	enhance: Allows proactive warming up of chunk cache (#30182 ) Allows proactive warming up of chunk cache. Original vector data will be asynchronously loaded into the chunk cache during the load process. It has the potential to significantly reduce query/search latency for a certain duration after the load, albeit with a concurrent increase in disk usage. issue: https://github.com/milvus-io/milvus/issues/30181 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-01-25 19:55:39 +08:00

... 2 3 4 5 6 ...

19159 Commits