milvus

mirror of https://gitee.com/milvus-io/milvus.git synced 2024-12-04 04:49:08 +08:00

Author	SHA1	Message	Date
Jiquan Long	16b785e149	enhance: optimize the memory usage and speed up loading variable length data (#30787 ) /kind improvement this removes the 1x copying while loading variable length data, also avoids constructing std::string, which could lead to memory fragmentation --------- Signed-off-by: yah01 <yah2er0ne@outlook.com> Signed-off-by: longjiquan <jiquan.long@zilliz.com> Co-authored-by: yah01 <yah2er0ne@outlook.com>	2024-02-28 16:45:00 +08:00
Jiquan Long	4459078e0b	fix: wrong num_entities used when mmap variable length data (#30848 ) https://github.com/milvus-io/milvus/issues/30728 Signed-off-by: longjiquan <jiquan.long@zilliz.com>	2024-02-28 16:38:56 +08:00
congqixia	a115b731ed	enhance: fix old pr cpp format issue (#30894 ) Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-02-28 16:28:20 +08:00
Buqian Zheng	f658dd5faa	enhance: update knowhere version to 60a5c9c (#30788 ) /kind improvement Signed-off-by: Buqian Zheng <zhengbuqian@gmail.com>	2024-02-28 14:18:55 +08:00
Cai Yudong	8a219e0102	feat: Support knowhere trace using OpenTelemetry (#30750 ) Issue: #21508 Signed-off-by: Yudong Cai <yudong.cai@zilliz.com>	2024-02-28 12:29:00 +08:00
sre-ci-robot	6e9f3ea531	[automated] Update Knowhere Commit (#30744 ) Update Knowhere Commit Signed-off-by: sre-ci-robot sre-ci-robot@users.noreply.github.com Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>	2024-02-28 10:50:57 +08:00
yah01	57397b1307	enhance: add new LRU cache impl (#30360 ) - remove the unused LRU cache - add new LRU cache impl which wraps github.com/karlseguin/ccache related #30361 --------- Signed-off-by: yah01 <yang.cen@zilliz.com>	2024-02-27 20:58:40 +08:00
Jiquan Long	3e82d21ca1	enhance: reduce 1x memory copy when loading json (#30753 ) /kind improvement --------- Signed-off-by: longjiquan <jiquan.long@zilliz.com>	2024-02-27 10:18:55 +08:00
Jiquan Long	e2330f02f8	fix: pattern match use incorrect raw data (#30764 ) issue: https://github.com/milvus-io/milvus/issues/30687 We store all varchar data in one contiguous buffer and use string_view to locate each value quickly. In this case, using string_view.data() directly points at the value plus all the varchar data that follows it. --------- Signed-off-by: longjiquan <jiquan.long@zilliz.com>	2024-02-22 19:56:52 +08:00
MrPresent-Han	77eb6defb1	feat: support groupby on growing and non-indexed sealed segment (#30307 ) (#30644 ) related: #30308 Signed-off-by: MrPresent-Han <chun.han@zilliz.com>	2024-02-21 14:02:53 +08:00
zhagnlu	18aac076de	fix: move test from NEON to X86 (#30324 ) #26137 Signed-off-by: luzhang <luzhang@zilliz.com> Co-authored-by: luzhang <luzhang@zilliz.com>	2024-02-21 11:58:53 +08:00
zhagnlu	0118bef2a2	fix: replace sse2 simd interface with older version (#30668 ) #30667 Signed-off-by: luzhang <luzhang@zilliz.com> Co-authored-by: luzhang <luzhang@zilliz.com>	2024-02-21 10:04:54 +08:00
zhagnlu	976b6fc0e4	enhance: change opendal as compile configurable (#30384 ) #30373 Signed-off-by: luzhang <luzhang@zilliz.com> Co-authored-by: luzhang <luzhang@zilliz.com>	2024-02-20 19:16:52 +08:00
yah01	b74673c147	enhance: calculate the accurate memory usage while loading segment (#30473 ) Older versions of Knowhere copy the index data while loading, so we need to account for this to avoid OOM. Knowhere provides a util function that indicates whether it will load the index with disk; if not, we need to double the memory usage prediction for index data Signed-off-by: yah01 <yang.cen@zilliz.com>	2024-02-20 14:52:51 +08:00
foxspy	43e8cd531d	enhance: Update Knowhere version (#30675 ) issue: #30669 Signed-off-by: xianliang <xianliang.li@zilliz.com>	2024-02-19 22:04:51 +08:00
congqixia	18c351efa6	fix: Prevent ChunkCache use absolute path in All-in-one mode (#30666 ) See also #30651 The append operator of `std::filesystem::path` replaces the whole path when the right-hand operand of the "/" operation is an absolute path. In "All-in-one" mode, this causes ChunkCache to remove the original vector data file when building the chunk cache during/after the load procedure. This PR moves the ChunkCache path generation logic into a separate function that checks whether the file path is absolute. If the file path is absolute, it removes the root path prefix and returns the concatenated file path. --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-02-19 20:58:51 +08:00
Cai Yudong	5bb28a9ea4	enhance: Print out range_filter and radius when range search param check fail (#30623 ) Signed-off-by: Yudong Cai <yudong.cai@zilliz.com>	2024-02-18 15:40:48 +08:00
Alexander Guzhva	a297baae9d	enhance: remove unused code (#30601 ) Signed-off-by: Alexandr Guzhva <alexanderguzhva@gmail.com>	2024-02-13 10:26:47 +08:00
zhagnlu	e8a6f1ea2b	fix: erase pk empty check when pk index replace raw data (#30432 ) #30350 Signed-off-by: luzhang <luzhang@zilliz.com> Co-authored-by: luzhang <luzhang@zilliz.com>	2024-02-07 14:56:47 +08:00
MrPresent-Han	92d1d744ae	fix: groupby results lack good results(#29883 ) (#30428 ) related: #29883 Signed-off-by: MrPresent-Han <chun.han@zilliz.com>	2024-02-06 17:08:34 +08:00
cqy123456	5449e862d5	fix: safely access unordered_map and remove some useless code execution (#30504 ) issue: https://github.com/milvus-io/milvus/issues/30358 and https://github.com/milvus-io/milvus/issues/30491 Signed-off-by: cqy123456 <qianya.cheng@zilliz.com>	2024-02-05 22:03:09 +08:00
sre-ci-robot	ebbe32df9a	[automated] Update Knowhere Commit (#30515 ) Update Knowhere Commit Signed-off-by: sre-ci-robot sre-ci-robot@users.noreply.github.com Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>	2024-02-05 01:32:44 +08:00
Jiquan Long	a587450e56	enhance: [skip-e2e] disable asan (#30498 ) fix: #30511 /kind improvement --------- Signed-off-by: longjiquan <jiquan.long@zilliz.com>	2024-02-04 21:25:05 +08:00
sre-ci-robot	20c9cfc587	[automated] Update Knowhere Commit (#30487 ) Update Knowhere Commit Signed-off-by: sre-ci-robot sre-ci-robot@users.noreply.github.com Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>	2024-02-04 01:23:04 +08:00
Jiquan Long	e549148a19	enhance: full-support for wildcard pattern matching (#30288 ) issue: #29988 This PR adds end-to-end support for wildcard pattern matching. Before this PR, users could only use prefix match in their expressions, for example, "like 'prefix%'". With this PR, more flexible syntax can be combined. To do so, this PR makes these changes: - 1. support regex query both on index and raw data; - 2. translate the pattern matching to regex query, so that it can be handled by the regex query logic; - 3. loosen the limits of the expression parsing, which allows general pattern matching syntax; With the support of regex query in the segcore backend, we can also easily add MySQL-like `REGEXP` syntax later. --------- Signed-off-by: longjiquan <jiquan.long@zilliz.com>	2024-02-01 12:37:04 +08:00
PowderLi	5cf9bb236e	enhance: restful support import jobs (#30343 ) issue: #28521 #29732 include 1. list collection's import jobs 2. create a new import job 3. get the progress of an import job fix: 1. mix the order of dbName & collectionName #29728 2. trace log keep the same as v1 3. support traceID 4. azure precheck, blob name cannot end with / #29703 --------- Signed-off-by: PowderLi <min.li@zilliz.com>	2024-01-31 17:57:04 +08:00
yah01	878c4c9463	enhance: limit the max pool size to 16 (#30371 ) according to our benchmark, concurrency level 16 is enough to fully utilize the object storage network bandwidth Signed-off-by: yah01 <yang.cen@zilliz.com>	2024-01-31 14:13:06 +08:00
cqy123456	74cfba0249	enhance:limit binlog index rows num (#30173 ) issue: https://github.com/milvus-io/milvus/issues/27678 also relate issue: https://github.com/milvus-io/milvus/issues/30065 Signed-off-by: cqy123456 <qianya.cheng@zilliz.com>	2024-01-29 19:49:02 +08:00
sre-ci-robot	0542a0e7dc	[automated] Update Knowhere Commit (#30332 ) Update Knowhere Commit Signed-off-by: sre-ci-robot sre-ci-robot@users.noreply.github.com Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>	2024-01-29 01:05:01 +08:00
zhagnlu	aeb1e36f00	enhance: change plan desc log from info to debug (#30304 ) #30172 Signed-off-by: luzhang <luzhang@zilliz.com> Co-authored-by: luzhang <luzhang@zilliz.com>	2024-01-28 16:04:38 +08:00
xige-16	e9fdd2475d	fix: fix searchPlan metricType modified concurrently (#30227 ) issue: #30225 /kind bug Signed-off-by: xige-16 <xi.ge@zilliz.com> --------- Signed-off-by: xige-16 <xi.ge@zilliz.com>	2024-01-26 14:03:09 +08:00
MrPresent-Han	116d0f20b8	fix: groupby bug for ut (#30272 ) related: #29965 Signed-off-by: MrPresent-Han <chun.han@zilliz.com>	2024-01-25 20:57:00 +08:00
yihao.dai	c02fb64ad6	enhance: Allows proactive warming up of chunk cache (#30182 ) Allows proactive warming up of chunk cache. Original vector data will be asynchronously loaded into the chunk cache during the load process. It has the potential to significantly reduce query/search latency for a certain duration after the load, albeit with a concurrent increase in disk usage. issue: https://github.com/milvus-io/milvus/issues/30181 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-01-25 19:55:39 +08:00
yah01	a27c0e86fd	enhance: reduce many I/O operations while loading disk index (#30189 ) Before this, every write of index chunk data to disk took 4 I/O operations: - open the file - seek to the offset - write the data - close the file. This change opens the file only once and writes all data continuously. It also makes loading the files from object storage concurrent Signed-off-by: yah01 <yang.cen@zilliz.com>	2024-01-25 15:23:02 +08:00
zhagnlu	8c58d9af67	enhance: optimize marisa trie range search for performance (#30079 ) #30078 #29986 Signed-off-by: luzhang <luzhang@zilliz.com> Co-authored-by: luzhang <luzhang@zilliz.com>	2024-01-25 10:07:00 +08:00
Patrick Weizhi Xu	0907d76253	enhance: pass partition key scalar info if enabled when build vector index (#29931 ) issue: #29892 Pass optional scalar IVF offsets to Cardinal Signed-off-by: Patrick Weizhi Xu <weizhi.xu@zilliz.com>	2024-01-24 00:04:55 +08:00
cqy123456	42bb4e37e5	fix:diskann search crash when search list = 9999999999 (#30185 ) issue: https://github.com/milvus-io/milvus/issues/29020 JSON cannot pass a value beyond max_int32 into an int32_t, so let knowhere check the value range itself. After this fix, pymilvus will report: pymilvus.exceptions.MilvusException: <MilvusException: (code=65535, message=fail to search on QueryNode 6: worker(6) query failed: => failed to search: arithmetic overflow: param search_list_size should be at most 2147483647)> Signed-off-by: cqy123456 <qianya.cheng@zilliz.com>	2024-01-23 14:46:55 +08:00
cai.zhang	6cf2f09b60	feat: Support tencent cloud object storage for milvus (#30163 ) issue: #30162 Signed-off-by: Cai Zhang <cai.zhang@zilliz.com>	2024-01-23 11:28:56 +08:00
yah01	a77693aa19	enhance: convert the `GetObject` util to async (#30166 ) This makes it much easier to use Signed-off-by: yah01 <yang.cen@zilliz.com>	2024-01-22 19:20:57 +08:00
sre-ci-robot	e967949cc5	[automated] Update Knowhere Commit (#30120 ) Update Knowhere Commit Signed-off-by: sre-ci-robot sre-ci-robot@users.noreply.github.com Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>	2024-01-22 18:40:54 +08:00
MrPresent-Han	4436effdc3	enhance: support groupby based on scalar-index(#29965 ) (#30091 ) related: #29965 Signed-off-by: MrPresent-Han <chun.han@zilliz.com>	2024-01-22 10:50:54 +08:00
xige-16	aee19dcd6b	enhance: Opt vector dimension mismatch error message (#29928 ) issue: https://github.com/milvus-io/milvus/issues/29791 /kind improvement Signed-off-by: xige-16 <xi.ge@zilliz.com> --------- Signed-off-by: xige-16 <xi.ge@zilliz.com>	2024-01-19 17:52:54 +08:00
yah01	f542bdbf3c	enhance: calc the accurate mem size of segment (#30093 ) this stats the real memory size of segment, also reduces the memory usage in mmap mode resolve #30095 Signed-off-by: yah01 <yang.cen@zilliz.com>	2024-01-19 12:32:53 +08:00
xige-16	fa7cf587b0	enhance: Opt metric type does not match error message (#29927 ) issue: #29791 /kind improvement Signed-off-by: xige-16 <xi.ge@zilliz.com> Signed-off-by: xige-16 <xi.ge@zilliz.com>	2024-01-17 20:25:03 +08:00
yah01	1185e4dcd5	fix: written file size is over the int32 range and raises error (#30057 ) we sum the total data size in int32, which could lead to an overflow error related #30056 Signed-off-by: yah01 <yang.cen@zilliz.com>	2024-01-17 16:42:54 +08:00
Bingyi Sun	8030b90891	fix: correct file name when loading index (#29985 ) issue: #29973 Signed-off-by: sunby <sunbingyi1992@gmail.com>	2024-01-16 10:24:52 +08:00
MrPresent-Han	c31e68446e	enhance: refine groupby-performance (#29933 ) related: #29844 Signed-off-by: MrPresent-Han <chun.han@zilliz.com>	2024-01-15 14:12:52 +08:00
chyezh	def717af55	fix: SealedIndexingEntry in SealedIndexingRecord may leak without smart pointer protect. (#29932 ) may related issue: #29828 Signed-off-by: chyezh <ye.zhen@zilliz.com>	2024-01-14 10:28:51 +08:00
Bingyi Sun	e1258b8cad	feat: integrate storagev2 into loading segment (#29336 ) issue: #29335 --------- Signed-off-by: sunby <sunbingyi1992@gmail.com>	2024-01-12 18:10:51 +08:00
yah01	f2e36db488	enhance: optimize the loading index performance (#29894 ) this utilizes concurrent loading Signed-off-by: yah01 <yang.cen@zilliz.com>	2024-01-12 17:44:51 +08:00
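
The message of commit 18c351efa6 above describes how the "/" operator of `std::filesystem::path` discards the left-hand side when the right-hand side is absolute. The sketch below illustrates that behavior on a POSIX system. `NaiveJoin` and `JoinUnderRoot` are hypothetical names, and stripping the root with `relative_path()` is an assumption made for illustration; it is not the helper added by the actual PR, which is described as removing the root path prefix.

```cpp
#include <cassert>
#include <filesystem>

namespace fs = std::filesystem;

// Naive join: if `file` is absolute, operator/ returns `file` alone and
// `root` is silently dropped, so the result can alias the original file.
fs::path NaiveJoin(const fs::path& root, const fs::path& file) {
    return root / file;
}

// Hypothetical helper (not the Milvus implementation): always keep the
// result underneath `root` by stripping the root of an absolute `file`.
fs::path JoinUnderRoot(const fs::path& root, const fs::path& file) {
    assert(!root.empty());
    if (file.is_absolute()) {
        // "/data/vec.bin" -> "data/vec.bin"
        return root / file.relative_path();
    }
    return root / file;
}
```

On POSIX, `NaiveJoin("/cache", "/data/vec.bin")` yields `/data/vec.bin`, the original file, while `JoinUnderRoot("/cache", "/data/vec.bin")` yields `/cache/data/vec.bin`. Path rules differ on Windows, where a path without a drive name is not considered absolute.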


1378 Commits
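
The message of commit e2330f02f8 in the log above notes that calling `string_view.data()` on values packed into one buffer exposes everything after the value as well. The sketch below reproduces that pitfall in isolation. All names (`SplitPacked`, `FromDataPointer`, `FromView`) are hypothetical, and the buffer layout is an assumption made for illustration, not the segcore data structure.

```cpp
#include <cassert>
#include <cstddef>
#include <string>
#include <string_view>
#include <vector>

// Build one view per value over a single packed buffer.
// The views borrow from `buffer`, which must outlive them.
std::vector<std::string_view> SplitPacked(const std::string& buffer,
                                          const std::vector<std::size_t>& lengths) {
    std::vector<std::string_view> views;
    std::size_t offset = 0;
    for (std::size_t len : lengths) {
        assert(offset + len <= buffer.size());
        views.emplace_back(buffer.data() + offset, len);
        offset += len;
    }
    return views;
}

// Pitfall: data() is not NUL-terminated at the end of the view, so treating
// it as a C string reads on to the end of the backing std::string.
std::string FromDataPointer(std::string_view v) {
    return std::string(v.data());
}

// Correct: always pair data() with size().
std::string FromView(std::string_view v) {
    return std::string(v.data(), v.size());
}
```

With the buffer `"applebananacherry"` and lengths `{5, 6, 6}`, `FromDataPointer` on the first view returns the whole buffer, while `FromView` returns `"apple"`. The read in `FromDataPointer` is only well-defined here because the backing store is a `std::string`, which is NUL-terminated; over an arbitrary memory region it could run past the buffer.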