milvus

mirror of https://gitee.com/milvus-io/milvus.git synced 2024-12-02 03:48:37 +08:00

Author	SHA1	Message	Date
foxspy	642a651f60	enhance: remove useless vector index param checker (#37329 ) issue: #34298 because all vector index config checker has been moved into vector_index_checker, then the useless checkers can be removed. Signed-off-by: xianliang.li <xianliang.li@zilliz.com>	2024-11-01 06:20:21 +08:00
foxspy	3224e58c5b	enhance: add unify vector index config management (#36846 ) issue: #34298 Signed-off-by: xianliang.li <xianliang.li@zilliz.com>	2024-11-01 06:18:21 +08:00
zhenshan.cao	63843dce33	fix: Fix conan gdal building problem (#37338 ) issue:https://github.com/milvus-io/milvus/issues/27576 Signed-off-by: zhenshan.cao <zhenshan.cao@zilliz.com>	2024-10-31 21:04:16 +08:00
Hao Tan	67c4340565	feat: Geospatial Data Type and GIS Function Support for milvus server (#35990 ) issue:https://github.com/milvus-io/milvus/issues/27576 # Main Goals 1. Create and describe collections with geospatial fields, enabling both client and server to recognize and process geo fields. 2. Insert geospatial data as payload values in the insert binlog, and print the values for verification. 3. Load segments containing geospatial data into memory. 4. Ensure query outputs can display geospatial data. 5. Support filtering on GIS functions for geospatial columns. # Solution 1. Add Type: Modify the Milvus core by adding a Geospatial type in both the C++ and Go code layers, defining the Geospatial data structure and the corresponding interfaces. 2. Dependency Libraries: Introduce necessary geospatial data processing libraries. In the C++ source code, use Conan package management to include the GDAL library. In the Go source code, add the go-geom library to the go.mod file. 3. Protocol Interface: Revise the Milvus protocol to provide mechanisms for Geospatial message serialization and deserialization. 4. Data Pipeline: Facilitate interaction between the client and proxy using the WKT format for geospatial data. The proxy will convert all data into WKB format for downstream processing, providing column data interfaces, segment encapsulation, segment loading, payload writing, and cache block management. 5. Query Operators: Implement simple display and support for filter queries. Initially, focus on filtering based on spatial relationships for a single column of geospatial literal values, providing parsing and execution for query expressions. 6. Client Modification: Enable the client to handle user input for geospatial data and facilitate end-to-end testing.Check the modification in pymilvus. --------- Signed-off-by: tasty-gumi <1021989072@qq.com>	2024-10-31 20:58:20 +08:00
XuanYang-cn	4926021c02	fix: Skip mark compaction timeout for mix and l0 compaction (#37118 ) Timeout is a bad design for long running tasks, especially using a static timeout config. We should monitor execution progress and fail the task if the progress has been stale for a long time. This pr is a small patch to stop DC from marking compaction tasks timeout, while still waiting for DN to finish. The design is self-conflicted. After this pr, mix and L0 compaction are no longer controlled by DC timeout, but clustering is still under timeout control. The compaction queue capacity grows larger for priority calc, hence timeout compactions appears more often, and when timeout, the queuing tasks will be timeout too, no compaction will success after. See also: #37108, #37015 --------- Signed-off-by: yangxuan <xuan.yang@zilliz.com>	2024-10-28 14:33:29 +08:00
jaime	9d16b972ea	feat: add tasks page into management WebUI (#37002 ) issue: #36621 1. Add API to access task runtime metrics, including: - build index task - compaction task - import task - balance (including load/release of segments/channels and some leader tasks on querycoord) - sync task 2. Add a debug model to the webpage by using debug=true or debug=false in the URL query parameters to enable or disable debug mode. Signed-off-by: jaime <yun.zhang@zilliz.com>	2024-10-28 10:13:29 +08:00
foxspy	d7b2ffe5aa	enhance: add an unify vector index config checker (#36844 ) issue: #34298 Signed-off-by: xianliang.li <xianliang.li@zilliz.com>	2024-10-28 10:11:37 +08:00
zhagnlu	eeb67a3845	fix:reset default auto index type for scalar (#37086 ) #32900 Signed-off-by: luzhang <luzhang@zilliz.com> Co-authored-by: luzhang <luzhang@zilliz.com>	2024-10-27 16:19:29 +08:00
SimFG	1cc9cb49ad	enhance: allow to delete data when disk quota exhausted (#37134 ) - issue: #37133 Signed-off-by: SimFG <bang.fu@zilliz.com>	2024-10-25 16:47:29 +08:00
Gao	1d61b604e1	enhance: support retry search when topk is reduced and result not enough (#35645 ) issue: #35576 This pr is to cover those cases when queryHook optimize search params and make the result size insufficient, add retry search mechanism and add related metrics for alarming. --------- Signed-off-by: chasingegg <chao.gao@zilliz.com>	2024-10-23 19:19:30 +08:00
yihao.dai	f0b3942a08	enhance: Limit import job number (#36891 ) issue: https://github.com/milvus-io/milvus/issues/36890 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-10-23 16:01:28 +08:00
Zhen Ye	f3d9d05a28	fix: use binlog counter to trigger flush but not stats log (#37037 ) issue: #36804 Signed-off-by: chyezh <chyezh@outlook.com>	2024-10-23 15:07:29 +08:00
jaime	4746f47282	feat: management WebUI homepage (#36822 ) issue: #36784 1. Implement an embedded web server for WebUI access. 2. Complete the homepage development. Home page demo: <img width="2177" alt="iShot_2024-10-10_17 57 34" src="https://github.com/user-attachments/assets/38539917-ce09-4e54-a5b5-7f4f7eaac353"> Signed-off-by: jaime <yun.zhang@zilliz.com>	2024-10-23 11:29:28 +08:00
Ted Xu	50da48a30d	enhance: adding mix compaction first prioritizer (#36956 ) Signed-off-by: Ted Xu <ted.xu@zilliz.com>	2024-10-18 11:37:24 +08:00
foxspy	3de57ec4fa	enhance: add vector index mgr to remove vector index type dependency (#36843 ) issue: #34298 Signed-off-by: xianliang.li <xianliang.li@zilliz.com>	2024-10-17 22:15:25 +08:00
cqy123456	b474374ea5	enhance: use growingMmapEnabled to control the behavior of interim index, not vectorField (#36500 ) issue:https://github.com/milvus-io/milvus/issues/36392 related pr: https://github.com/milvus-io/milvus/pull/36391 Signed-off-by: cqy123456 <qianya.cheng@zilliz.com>	2024-10-17 20:25:24 +08:00
congqixia	1184319644	fix: Load original key if ts is MaxTimestamp (#36934 ) Related to #36933 --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-10-17 14:11:29 +08:00
liliu-z	811f6f2402	enhance: mmap for faiss hnsw (#36943 ) Unblock the mmap check for Faiss-based HNSW Signed-off-by: Li Liu <li.liu@zilliz.com>	2024-10-17 13:05:23 +08:00
XuanYang-cn	b172ea1093	fix: Remove enableLevelZeroSegment config (#36535 ) See also: #36504 --------- Signed-off-by: yangxuan <xuan.yang@zilliz.com>	2024-10-17 11:59:24 +08:00
Buqian Zheng	51f13ba7cf	fix: add checks for bm25 k1 and b in sparse index checker (#36907 ) issue: https://github.com/milvus-io/milvus/issues/36883, https://github.com/milvus-io/milvus/issues/35853 Signed-off-by: Buqian Zheng <zhengbuqian@gmail.com>	2024-10-16 19:43:24 +08:00
liliu-z	b0d5866507	enhance: add checkers for Faiss-based HNSW (#36857 ) Knowhere introduced FAISS-based HNSQ Flat/SQ/PQ/PRQ. This PR adds param check in Milvus side. This PR actually do nothing but let the check pass, since: 1. All indexes param check will be moved to Knowhere recently. @foxspy 2. The index name will be changed from `FAISS_HNSW_xx` to `HNSW_xx` after QA test. @alexanderguzhva Signed-off-by: Li Liu <li.liu@zilliz.com>	2024-10-15 12:29:22 +08:00
SimFG	bb3ef5349f	enhance: update the expr version to support automatic conversion of variable types (#36832 ) /kind improvement Signed-off-by: SimFG <bang.fu@zilliz.com>	2024-10-15 10:53:22 +08:00
Buqian Zheng	383350c120	feat: added more checks for function creation check (#36766 ) issue: https://github.com/milvus-io/milvus/issues/35853 * BM25 Function now takes no params, k1, b should be passed via index params * support BM25 full text search when metric type is not present in search request * add more strict validation with functions at collection creation time Signed-off-by: Buqian Zheng <zhengbuqian@gmail.com>	2024-10-13 17:43:22 +08:00
Buqian Zheng	82c5cf2fa2	feat: add bulk insert support for Functions (#36715 ) issue: https://github.com/milvus-io/milvus/issues/35853 and https://github.com/milvus-io/milvus/issues/35856 Signed-off-by: Buqian Zheng <zhengbuqian@gmail.com>	2024-10-12 17:19:20 +08:00
Bingyi Sun	a75bb85f3a	feat: support chunked column for sealed segment (#35764 ) This PR splits sealed segment to chunked data to avoid unnecessary memory copy and save memory usage when loading segments so that loading can be accelerated. To support rollback to previous version, we add an option `multipleChunkedEnable` which is false by default. Signed-off-by: sunby <sunbingyi1992@gmail.com>	2024-10-12 15:04:52 +08:00
SimFG	7acf724185	enhance: support to execute the method which contains the ctx param (#36797 ) /kind improvement Signed-off-by: SimFG <bang.fu@zilliz.com>	2024-10-12 00:01:20 +08:00
aoiasd	db34572c56	feat: support load and query with bm25 metric (#36071 ) relate: https://github.com/milvus-io/milvus/issues/35853 --------- Signed-off-by: aoiasd <zhicheng.yue@zilliz.com>	2024-10-11 10:23:20 +08:00
Buqian Zheng	f7b811450d	feat: add enable_tokenizer params to VarChar field (#36480 ) issue: #35922 add an enable_tokenizer param to varchar field: must be set to true so that a varchar field can enable_match or used as input of BM25 function --------- Signed-off-by: Buqian Zheng <zhengbuqian@gmail.com>	2024-10-10 20:33:21 +08:00
yihao.dai	3685edb264	enhance: Use common gc config (#36668 ) Use the GC config from `common` and remove the GC config from `queryNode`. issue: https://github.com/milvus-io/milvus/issues/36667 related pr: https://github.com/milvus-io/milvus/pull/34949 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-10-09 19:47:19 +08:00
Ted Xu	5fc731795b	enhance: Datacoord to support prioritization of compaction tasks (#36547 ) See #36550 This PR made 2 changes: 1. Introducing a prioritization mechanism, if `dataCoord.compaction.taskPrioritizer` is set to `level`, compaction tasks are always executed as the priority of L0>Mix>Clustering 2. `dataCoord.compaction.maxParallelTaskNum` now controls the parallelism of executing tasks, not the task number of queue + executing. --------- Signed-off-by: Ted Xu <ted.xu@zilliz.com>	2024-10-09 19:11:20 +08:00
SimFG	130a923dec	enhance: the estimate method when loading the collection (#36307 ) - issue: #36530 --------- Signed-off-by: SimFG <bang.fu@zilliz.com> Signed-off-by: xianliang.li <xianliang.li@zilliz.com> Co-authored-by: xianliang.li <xianliang.li@zilliz.com>	2024-10-09 17:35:19 +08:00
XuanYang-cn	c84bdfa766	fix: raise l0 compaction memory ratio to 0.5 (#36690 ) 5 percent of free memory is too less for l0 compaction. This pr will raise it to 50 percent. See also: #36614 Signed-off-by: yangxuan <xuan.yang@zilliz.com>	2024-10-09 17:19:24 +08:00
yihao.dai	0fc2a4aa53	enhance: Optimize import scheduling and add time cost metric (#36601 ) 1. Optimize import scheduling strategic: a. Revise slot weights, calculating them based on the number of files and segments for both import and pre-import tasks. b. Ensure that the DN executes tasks in ascending order of task ID. 2. Add time cost metric and log. issue: https://github.com/milvus-io/milvus/issues/36600, https://github.com/milvus-io/milvus/issues/36518 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-10-09 14:41:20 +08:00
Zhen Ye	2ec6e602d6	enhance: add streaming client metrics (#36523 ) issue: #33285 --------- Signed-off-by: chyezh <chyezh@outlook.com>	2024-10-08 21:25:19 +08:00
Rijin-N	a05a37a583	enhance: GCS native support (GCS implemented using Google Cloud Storage libraries) (#36214 ) Native support for Google cloud storage using the Google Cloud Storage libraries. Authentication is performed using GCS service account credentials JSON. Currently, Milvus supports Google Cloud Storage using S3-compatible APIs via the AWS SDK. This approach has the following limitations: 1. Overhead: Translating requests between S3-compatible APIs and GCS can introduce additional overhead. 2. Compatibility Limitations: Some features of the original S3 API may not fully translate or work as expected with GCS. To address these limitations, This enhancement is needed. Related Issue: #36212	2024-09-30 13:23:32 +08:00
SimFG	9c1772f659	enhance: avoid to create many timer object in the target (#36570 ) /kind improvement Signed-off-by: SimFG <bang.fu@zilliz.com>	2024-09-29 19:23:16 +08:00
Zhen Ye	a6545b2e29	fix: refactor milvus config and change default txn timeout (#36522 ) issue: #36498 Signed-off-by: chyezh <chyezh@outlook.com>	2024-09-29 11:01:15 +08:00
aoiasd	ffc12fb5c4	fix: split delete task msg to MaxMessageSize to avoid mq message too large error (#36197 ) relate: https://github.com/milvus-io/milvus/issues/36089 --------- Signed-off-by: aoiasd <zhicheng.yue@zilliz.com>	2024-09-27 18:15:19 +08:00
Zhen Ye	d29e01e284	fix: port listen racing in mix or standalone mode (#36442 ) issue: #36441 --------- Signed-off-by: chyezh <chyezh@outlook.com>	2024-09-26 21:23:16 +08:00
yihao.dai	9e8cafcbe2	enhance: Skip loading bf in datanode (#36367 ) Skip loading bf in datanode: 1. When watching vchannels, skip loading bloom filters for segments. 2. Bypass bloom filter checks for delete messages, directly writing to L0 segments. 3. Remove flushed segments proactively after flush. issue: https://github.com/milvus-io/milvus/issues/34585 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-09-26 10:11:15 +08:00
jaime	52cce4de58	fix: iaccurate size estimation for encoded array data (#36373 ) issue: #36029 Signed-off-by: jaime <yun.zhang@zilliz.com>	2024-09-24 14:51:14 +08:00
congqixia	1833913f44	enhance: Add streaming forward policy switch for delegator (#36330 ) Related to #35303 --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-09-23 18:01:12 +08:00
SimFG	c50fe71163	fix: long buffering causes mq to be unable to receive messages. (#36420 ) - issue: #36397 Signed-off-by: SimFG <bang.fu@zilliz.com>	2024-09-23 16:33:18 +08:00
wei liu	3b10085f61	enhance: Optimize workload based replica selection policy (#36181 ) issue: #35859 This PR introduce two new param: toleranceFactor and checkRequestNum, after every checkRequestNum request has been assigned, try to compute querynode's workload score. if the diff is less than the toleranceFactor, replica selection policy will fallback to round_robin, which reduce the average cost to about 500ns. if the diff is larger than the toleranceFactor, replica selection policy will compute querynode's score to select the target node with smallest score in every assigment. --------- Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2024-09-20 12:33:11 +08:00
Ted Xu	363004fd44	enhance: simplify reduction on single search result (#36334 ) See: #36122 --------- Signed-off-by: Ted Xu <ted.xu@zilliz.com>	2024-09-20 11:59:10 +08:00
aoiasd	139787371e	feat: support embedding bm25 sparse vector and flush bm25 stats log (#36036 ) relate: https://github.com/milvus-io/milvus/issues/35853 --------- Signed-off-by: aoiasd <zhicheng.yue@zilliz.com>	2024-09-19 10:57:12 +08:00
yihao.dai	763fd0dfc5	enhance: Use a separate mmap config for chunk cache (#36276 ) issue: https://github.com/milvus-io/milvus/issues/35273 Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-09-15 16:23:09 +08:00
Chun Han	b8b4aea4f5	enhance: restrict max group size(#33544 ) (#36223 ) related: #33544 Signed-off-by: MrPresent-Han <chun.han@gmail.com> Co-authored-by: MrPresent-Han <chun.han@gmail.com>	2024-09-14 15:45:08 +08:00
aoiasd	c22a2cebb6	fix: split stream query result to avoid grpc response too large error (#36090 ) relate: https://github.com/milvus-io/milvus/issues/36089 --------- Signed-off-by: aoiasd <zhicheng.yue@zilliz.com>	2024-09-13 15:07:09 +08:00
Ted Xu	b9c037f558	feat: adding cache to expression parse (#36185 ) See #36122 This PR improves the proxy node performance by adding cache to expression parse. --------- Signed-off-by: Ted Xu <ted.xu@zilliz.com>	2024-09-13 10:03:09 +08:00

1 2 3 4 5 ...

680 Commits