milvus

mirror of https://gitee.com/milvus-io/milvus.git synced 2024-11-30 02:48:45 +08:00

Author	SHA1	Message	Date
Ted Xu	63f0154dfb	fix: enable milvus.yaml check (#34567 ) See #32168 --------- Signed-off-by: Ted Xu <ted.xu@zilliz.com>	2024-07-25 18:05:46 +08:00
chyezh	39c7e06bc5	enhance: add message and msgstream msgpack adaptor (#34874 ) issue: #33285 - make message builder and message conversion type safe - add adaptor type and function to adapt old msgstream msgpack and message interface --------- Signed-off-by: chyezh <chyezh@outlook.com>	2024-07-22 20:59:42 +08:00
chyezh	cc8f7aa110	fix: streaming service related fix patch (#34696 ) issue: #33285 - add idAlloc interface - fix binary unsafe bug for message - fix service discovery lost when repeated address with different server id --------- Signed-off-by: chyezh <chyezh@outlook.com>	2024-07-16 15:49:38 +08:00
chyezh	fda720b880	enhance: streaming service grpc utilities (#34436 ) issue: #33285 - add two grpc resolver (by session and by streaming coord assignment service) - add one grpc balancer (by serverID and roundrobin) - add lazy conn to avoid block by first service discovery - add some utility function for streaming service Signed-off-by: chyezh <chyezh@outlook.com>	2024-07-15 20:49:38 +08:00
pingliu	8c42f1341d	doc: [skip e2e] add extraConfig to embed milvus (#34395 ) Signed-off-by: ping.liu <ping.liu@zilliz.com>	2024-07-03 18:20:17 +08:00
sammy.huang	b14482baa6	enhance: [skip e2e] use disk ann when os is ubuntu (#34236 ) issue: https://github.com/milvus-io/milvus/issues/34222 Signed-off-by: Liang Huang <sammy.huang@zilliz.com>	2024-06-27 20:50:13 +08:00
pingliu	b12c34a8ba	doc: [skip e2e] change milvus docker image version to v2.4.5 (#34135 ) Signed-off-by: ping.liu <ping.liu@zilliz.com>	2024-06-25 14:30:04 +08:00
chyezh	b9237280c2	enhance: wal interface definition (#33745 ) issue: #33285 --------- Signed-off-by: chyezh <chyezh@outlook.com>	2024-06-24 10:34:12 +08:00
Yinzuo Jiang	7d74edd6dd	fix: update clang-tidy and clang-format from 10 to 12 (#33141 ) Default llvm toolchain version in Ubuntu 20.04 is 10, while Ubuntu 22.04 does not have `clang-tidy-10` or `clang-format-10` by default. issue: #33142 Signed-off-by: Patrick Weizhi Xu <weizhi.xu@zilliz.com> Signed-off-by: Yinzuo Jiang <jiangyinzuo@foxmail.com>	2024-06-13 15:27:58 +08:00
chyezh	2b7ee1968f	enhance: new messsage interface for log service (#33286 ) issue: #33285 --------- Signed-off-by: chyezh <chyezh@outlook.com>	2024-06-11 10:38:01 +08:00
wayblink	a1232fafda	feat: Major compaction (#33620 ) #30633 Signed-off-by: wayblink <anyang.wang@zilliz.com> Co-authored-by: MrPresent-Han <chun.han@zilliz.com>	2024-06-10 21:34:08 +08:00
chyezh	f53ab54c5d	enhance: async cgo utility (#33133 ) issue: #30926, #33132 - implement future-based cgo utility. --------- Signed-off-by: chyezh <chyezh@outlook.com>	2024-06-09 22:55:53 +08:00
cai.zhang	27cc9f2630	enhance: Support analyze data (#33651 ) issue: #30633 Signed-off-by: Cai Zhang <cai.zhang@zilliz.com> Co-authored-by: chasingegg <chao.gao@zilliz.com>	2024-06-06 17:37:51 +08:00
Jiquan Long	0c5d8660aa	feat: support inverted index for array (#33452 ) issue: https://github.com/milvus-io/milvus/issues/27704 --------- Signed-off-by: longjiquan <jiquan.long@zilliz.com>	2024-05-31 09:47:47 +08:00
shaoting-huang	de7901121f	Upgrade go from 1.20 to 1.21 (#33047 ) Signed-off-by: shaoting-huang [shaoting-huang@zilliz.com] issue: https://github.com/milvus-io/milvus/issues/32982 # Background Go 1.21 introduces several improvements and changes over Go 1.20, which is quite stable now. According to [Go 1.21 Release Notes](https://tip.golang.org/doc/go1.21), the big difference of Go 1.21 is enabling Profile-Guided Optimization by default, which can improve performance by around 2-14%. Here are the summary steps of PGO: 1. Build Initial Binary (Without PGO) 2. Deploying the Production Environment 3. Run the program and collect Performance Analysis Data (CPU pprof) 4. Analyze the Collected Data and Select a Performance Profile for PGO 5. Place the Performance Analysis File in the Main Package Directory and Name It default.pgo 6. go build Detects the default.pgo File and Enables PGO 7. Build and Release the Updated Binary (With PGO) 8. Iterate and Repeat the Above Steps <img width="657" alt="Screenshot 2024-05-14 at 15 57 01" src="https://github.com/milvus-io/milvus/assets/167743503/b08d4300-0be1-44dc-801f-ce681dabc581"> # What does this PR do There are three experiments, search benchmark by Zilliz test platform, search benchmark by open-source [VectorDBBench](https://github.com/zilliztech/VectorDBBench?tab=readme-ov-file), and search benchmark with PGO. We do both search benchmarks by Zilliz test platform and by VectorDBBench to reduce reliance on a single experimental result. Besides, we validate the performance enhancement with PGO. ## Search Benchmark Report by Zilliz Test Platform An upgrade to Go 1.21 was conducted on a Milvus Standalone server, equipped with 16 CPUs and 64GB of memory. The search performance was evaluated using a 1 million entry local dataset with an L2 metric type in a 768-dimensional space. The system was tested for concurrent searches with 50 concurrent tasks for 1 hour, each with a 20-second interval. The reason for using one server rather than two servers to compare is to guarantee the same data source and same segment state after compaction. Test Sequence: 1. Go 1.20 Initial Run: Insert data, build index, load index, and search. 2. Go 1.20 Rebuild: Rebuild the index with the same dataset, load index, and search. 3. Go 1.21 Load: Upload to Go 1.21 within the server. Then load the index from the second run, and search. 4. Go 1.21 Rebuild: Rebuild the index with the same dataset, load index, and search. Search Metrics: \| Metric \| Go 1.20 \| Go 1.20 Rebuild Index \| Go 1.21 \| Go 1.21 Rebuild Index \| \|----------------------------\|------------------\|-----------------\|------------------\|-----------------\| \| `search requests` \| 10,942,683 \| 16,131,726 \| 16,200,887 \| 16,331,052 \| \| `search fails` \| 0 \| 0 \| 0 \| 0 \| \| `search RT_avg` (ms) \| 16.44 \| 11.15 \| 11.11 \| 11.02 \| \| `search RT_min` (ms) \| 1.30 \| 1.28 \| 1.31 \| 1.26 \| \| `search RT_max` (ms) \| 446.61 \| 233.22 \| 235.90 \| 147.93 \| \| `search TP50` (ms) \| 11.74 \| 10.46 \| 10.43 \| 10.35 \| \| `search TP99` (ms) \| 92.30 \| 25.76 \| 25.36 \| 25.23 \| \| `search RPS` \| 3,039 \| 4,481 \| 4,500 \| 4,536 \| ### Key Findings The benchmark tests reveal that the index build time with Go 1.20 at 340.39 ms and Go 1.21 at 337.60 ms demonstrated negligible performance variance in index construction. However, Go 1.21 offers slightly better performance in search operations compared to Go 1.20, with improvements in handling concurrent tasks and reducing response times. ## Search Benchmark Report By VectorDb Bench Follow [VectorDBBench](https://github.com/zilliztech/VectorDBBench?tab=readme-ov-file) to create a VectorDb Bench test for Go 1.20 and Go 1.21. We test the search performance with Go 1.20 and Go 1.21 (without PGO) on the Milvus Standalone system. The tests were conducted using the Cohere dataset with 1 million entries in a 768-dimensional space, utilizing the COSINE metric type. Search Metrics: Metric \| Go 1.20 \| Go 1.21 without PGO -- \| -- \| -- Load Duration (seconds) \| 1195.95 \| 976.37 Queries Per Second (QPS) \| 841.62 \| 875.89 99th Percentile Serial Latency (seconds) \| 0.0047 \| 0.0076 Recall \| 0.9487 \| 0.9489 ### Key Findings Go 1.21 indicates faster index loading times and larger search QPS handling. ## PGO Performance Test Milvus has already added [net/http/pprof](https://pkg.go.dev/net/http/pprof) in the metrics. So we can curl the CPU profile directly by running `curl -o default.pgo "http://${MILVUS_SERVER_IP}:${MILVUS_SERVER_PORT}/debug/pprof/profile?seconds=${TIME_SECOND}"` to collect the profile as the default.pgo during the first search. Then I build Milvus with PGO and use the same index to run the search again. The result is as below: Search Metrics \| Metric \| Go 1.21 Without PGO \| Go 1.21 With PGO \| Change (%) \| \|---------------------------------------------\|------------------\|-----------------\|------------\| \| `search Requests` \| 2,644,583 \| 2,837,726 \| +7.30% \| \| `search Fails` \| 0 \| 0 \| N/A \| \| `search RT_avg` (ms) \| 11.34 \| 10.57 \| -6.78% \| \| `search RT_min` (ms) \| 1.39 \| 1.32 \| -5.18% \| \| `search RT_max` (ms) \| 349.72 \| 143.72 \| -58.91% \| \| `search TP50` (ms) \| 10.57 \| 9.93 \| -6.05% \| \| `search TP99` (ms) \| 26.14 \| 24.16 \| -7.56% \| \| `search RPS` \| 4,407 \| 4,729 \| +7.30% \| ### Key Findings PGO led to a notable enhancement in search performance, particularly in reducing the maximum response time by 58% and increasing the search QPS by 7.3%. ### Further Analysis Generate a diff flame graphs between two CPU profiles by running `go tool pprof -http=:8000 -diff_base nopgo.pgo pgo.pgo -normalize` <img width="1894" alt="goprofiling" src="https://github.com/milvus-io/milvus/assets/167743503/ab9e91eb-95c7-4963-acd9-d1c3c73ee010"> Further insight of HnswIndexNode and Milvus Search Handler <img width="1906" alt="hnsw" src="https://github.com/milvus-io/milvus/assets/167743503/a04cf4a0-7c97-4451-b3cf-98afc20a0b05"> <img width="1873" alt="search_handler" src="https://github.com/milvus-io/milvus/assets/167743503/5f4d3982-18dd-4115-8e76-460f7f534c7f"> After applying PGO to the Milvus server, the CPU utilization of the faiss::fvec_L2 function has decreased. This optimization significantly enhances the performance of the [HnswIndexNode::Search::searchKnn](`e0c9c41aa2/src/index/hnsw/hnsw.cc (L203)`) method, which is frequently invoked by Knowhere during high-concurrency searches. As the explanation from Go release notes, the function might be more aggressively inlined by Go compiler during the second build with the CPU profiling collected from the first run. As a result, the search handler efficiency within Milvus DataNode has improved, allowing the server to process a higher number of search queries per second (QPS). # Conclusion The combination of Go 1.21 and PGO has led to substantial enhancements in search performance for Milvus server, particularly in terms of search QPS and response times, making it more efficient for handling high-concurrency search operations. Signed-off-by: shaoting-huang <shaoting.huang@zilliz.com>	2024-05-22 13:21:39 +08:00
Alexander Guzhva	f20becb725	fix: Download and install cmake for the current platform, not x86_64 only (#32548 ) issue #32476 tested on x86_64 and aarch64. I'm not sure what needs to be done on some exotic architectures. Signed-off-by: Alexandr Guzhva <alexanderguzhva@gmail.com>	2024-05-22 11:15:39 +08:00
pingliu	a1b5d7d6d3	doc: [skip-e2e] change milvus docker image version to v2.4.1 (#33170 ) Signed-off-by: ping.liu <ping.liu@zilliz.com>	2024-05-20 16:21:38 +08:00
Yinzuo Jiang	2cc50d80a3	fix: add openblas in install_deps.sh (#33065 ) Install openblas using apt or yum in scripts/install_deps.sh, update documentations and fix some typos related to build and installation. issue: #33056, #33066 Signed-off-by: Yinzuo Jiang <jiangyinzuo@foxmail.com>	2024-05-17 14:53:37 +08:00
congqixia	8cf2cf5c94	enhance: Add `go-deadlock` as unittest only dependency (#33063 ) See also #33062 This PR: - Add `lock.RWMutex` & `lock.Mutex` alias to switch implementation based on build flags - When build flags has `test` in it, use `go-deadlock` to detect possible deadlocks - Replace all `sync.RWMutex` & `sync.Mutex` in datacoord pkg Signed-off-by: Congqi Xia <congqi.xia@zilliz.com> Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-05-15 16:33:34 +08:00
congqixia	244d2c04f6	feat: Add `milvusclient` package and migrate GoSDK (#32907 ) Related to #31293 Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-05-10 18:01:30 +08:00
aoiasd	54a51b1236	enhance: Support dynamic config for opentelemetry trace (#32169 ) relate: https://github.com/milvus-io/milvus/issues/31940 --------- Signed-off-by: aoiasd <zhicheng.yue@zilliz.com>	2024-05-09 17:43:30 +08:00
Aldrin	b6c06ff2c3	fix: Removed tests on non existent directories and solved name resolution errors (#32266 ) issue: #32254 Signed-off-by: Ald392 <imagesai32@gmail.com>	2024-04-30 11:33:26 +08:00
shaoting-huang	f5e1fc757c	fix: [skip e2e] dev container yaml indentation error (#32601 ) issue: #32594 Signed-off-by: shaoting-huang <shaoting.huang@zilliz.com>	2024-04-26 10:15:27 +08:00
pingliu	72b054e0c3	doc: change milvus docker image version to v2.4.0 (#32489 ) change milvus docker image version to v2.4.0 Signed-off-by: ping.liu <ping.liu@zilliz.com>	2024-04-23 09:47:23 +08:00
Ted Xu	744a54a534	enhance: enforce milvus.yaml assertion in UT (#32357 ) See #32168 Signed-off-by: Ted Xu <ted.xu@zilliz.com>	2024-04-19 16:47:20 +08:00
yiwangdr	1cd15d9322	test: support segment release in integration test (#31190 ) issue: #29507 Notice that api_testonly.go files should be guarded by compiler tag `test`, so that production build rules don't compile them and these APIs don't get misused. Signed-off-by: yiwangdr <yiwangdr@gmail.com>	2024-04-09 11:39:17 +08:00
congqixia	f399416b92	enhance: Use gotestsum to run go unit test (#31622 ) See also #31490 --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-03-27 15:29:10 +08:00
congqixia	4da8b6607d	enhance: Add scripts to use `gotestsum` to execute integration test (#31490 ) See also #31489 --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-03-22 10:29:06 +08:00
pingliu	d5ba221b25	doc: change milvus docker image version to v2.4.0-rc.1 (#31461 ) Signed-off-by: ping.liu <ping.liu@zilliz.com>	2024-03-20 22:15:05 +08:00
pingliu	26539a1b6d	doc: change milvus docker image version to 2.3.12 (#31336 ) Signed-off-by: ping.liu <ping.liu@zilliz.com>	2024-03-18 14:19:05 +08:00
Jiquan Long	375190e76e	fix: cpp format check not work (#30767 ) fix: #30765 --------- Signed-off-by: longjiquan <jiquan.long@zilliz.com>	2024-02-22 19:40:53 +08:00
zhagnlu	976b6fc0e4	enhance: change opendal as compile configurable (#30384 ) #30373 Signed-off-by: luzhang <luzhang@zilliz.com> Co-authored-by: luzhang <luzhang@zilliz.com>	2024-02-20 19:16:52 +08:00
pingliu	f4559cbe54	doc: change image version to 2.3.9 (#30695 ) Signed-off-by: ping.liu <ping.liu@zilliz.com>	2024-02-20 18:52:52 +08:00
yiwangdr	32cff25f97	enhance: decrease coordinator init time (#29822 ) This PR mainly improve two items: 1. Target observer should refresh loading status during init time. An uninitialized loading status blocks search/query. Currently, the target observer refreshes every 10 seconds, i.e. we'd need to wait for 10s for no reason. That's also the reason why we constantly see false log "collection unloaded" upon mixcoord restarts. 2. Delete session when service is stopped. So that the new service doesn't need to wait for the previous session to expire (~10s). Item 1 is the major improvement of this PR, which should speed up init time by 10s. Item 2 is not a big concern in most cases as coordinators usually shut down after stop(). In those cases, coordinator restart triggers serverID change which further triggers an existing logic that deletes expired session. This PR only fixes rare cases where serverID doesn't change. integration test: `go test -tags dynamic -v -coverprofile=profile.out -covermode=atomic tests/integration/coordrecovery/coord_recovery_test.go -timeout=20m` Performance after the change: Average init time of coordinators: 10s Hardware: M2 Pro Test setup: 1000 collections with 1000 rows (dim=128) per collection. issue: #29409 Signed-off-by: yiwangdr <yiwangdr@gmail.com>	2024-02-05 14:00:12 +08:00
pingliu	5cdf0f2490	doc: fix standalone stop cannot start issue (#30481 ) Signed-off-by: ping.liu <ping.liu@zilliz.com>	2024-02-02 21:05:05 +08:00
pingliu	e4a033858e	doc: Add Milvus standalone all in one script (#30459 ) Signed-off-by: ping.liu <ping.liu@zilliz.com>	2024-02-02 18:05:04 +08:00
yah01	ec688f5bf6	fix: failed to download OpenDAL (#30380 ) OpenDAL's url has been changed fix #30379 Signed-off-by: yah01 <yang.cen@zilliz.com>	2024-02-02 14:35:04 +08:00
sammy.huang	265453f400	enhance: [skip e2e]revert back to original way to archive, upload and download (#30248 ) Signed-off-by: Sammy Huang <sammy.huang@zilliz.com>	2024-02-01 10:15:04 +08:00
congqixia	9f8eb0e527	enhance: make integration test case timeout configurable (#30073 ) currently integration test may timeout if any case run time is above 3 minutes. This duration was hard coded. This PR change this duration into a customized parameter and could be passed via test running commands. --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-01-18 12:22:54 +08:00
yah01	031243fee7	feat: support mmap for marisa trie (#29613 ) this supports mmap for marisa trie index related https://github.com/milvus-io/milvus/issues/21866 Signed-off-by: yah01 <yang.cen@zilliz.com>	2024-01-11 10:22:50 +08:00
Jiquan Long	4b3de64733	enhance: add rust to install_dep.sh (#29586 ) fix: #29585 Signed-off-by: longjiquan <jiquan.long@zilliz.com>	2024-01-07 20:04:12 +08:00
yihao.dai	3561586edf	feat: Add import reader for binlog (#28910 ) This PR defines the new import reader interfaces and implement a binlog reader for import. issue: https://github.com/milvus-io/milvus/issues/28521 --------- Signed-off-by: bigsheeper <yihao.dai@zilliz.com>	2024-01-05 11:48:47 +08:00
PowderLi	5f00bad4b8	fix: link with install path's libblob-chunk-manager (#29496 ) issue: #29494 1. link with install path's libblob-chunk-manager 2. performance of `ShouldBindWith` is better than `ShouldBindBodyWith` 3. the middleware shouldn't read the unrefreshed parameter repeatly Signed-off-by: PowderLi <min.li@zilliz.com>	2023-12-31 20:02:48 +08:00
wei liu	e41fd6fbde	enhance: Move proxy client manager to util package (#28955 ) issue: #28898 This PR move the `ProxyClientManager` to util package, in case of reusing it's implementation in querycoord Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2023-12-20 19:22:42 +08:00
zhagnlu	56d7225673	fix: Add LD_LIBRARY_PATH when local start milvus (#29287 ) #28165 Signed-off-by: luzhang <luzhang@zilliz.com> Co-authored-by: luzhang <luzhang@zilliz.com>	2023-12-19 10:08:41 +08:00
zhagnlu	a602171d06	enhance: Refactor runtime and expr framework (#28166 ) #28165 Signed-off-by: luzhang <luzhang@zilliz.com> Co-authored-by: luzhang <luzhang@zilliz.com>	2023-12-18 12:04:42 +08:00
Jiquan Long	ed79505d31	enhance: use specific rust version to compile opendal (#29106 ) /kind improvement --------- Signed-off-by: longjiquan <jiquan.long@zilliz.com>	2023-12-13 16:34:38 +08:00
XuanYang-cn	d2207cac27	fix: Fix rustup update (#29093 ) Signed-off-by: yangxuan <xuan.yang@zilliz.com>	2023-12-11 11:36:37 +08:00
Bingyi Sun	36f69ea031	feat: integrate storagev2 in building index of segcore (#28768 ) issue: https://github.com/milvus-io/milvus/issues/28655 --------- Signed-off-by: sunby <sunbingyi1992@gmail.com>	2023-12-05 16:48:54 +08:00
congqixia	e79c3eaa90	enhance: [skip e2e] Update install_deps.sh cmake version (#28946 ) cmake version needs to be greater or equal than 3.26.4, sync `install_deps.sh` scripts version to 3.26.5 Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2023-12-04 18:26:34 +08:00

1 2 3 4 5 ...

417 Commits