milvus

mirror of https://gitee.com/milvus-io/milvus.git synced 2024-12-05 05:18:52 +08:00

Author	SHA1	Message	Date
Buqian Zheng	3c80083f51	feat: [Sparse Float Vector] add sparse vector support to milvus components (#30630 ) add sparse float vector support to different milvus components, including proxy, data node to receive and write sparse float vectors to binlog, query node to handle search requests, index node to build index for sparse float column, etc. https://github.com/milvus-io/milvus/issues/29419 --------- Signed-off-by: Buqian Zheng <zhengbuqian@gmail.com>	2024-03-13 14:32:54 -07:00
Xiaofan	4bda6c33ad	fix: binary vector should not limit dimension to 32768 (#30676 ) all the vector dimension check should happen on collection creation but not index build fix #30285 Signed-off-by: xiaofanluan <xiaofan.luan@zilliz.com>	2024-03-05 14:21:00 +08:00
PowderLi	6abbab12fa	feat: restful phase two (#29728 ) issue: #29732 Signed-off-by: PowderLi <min.li@zilliz.com>	2024-01-28 16:03:01 +08:00
PowderLi	8fc4ebfa11	fix: empty MetricType (#30216 ) issue: #30102 #30225 we should read MetricType from SearchResult, because query node never 1. read metricType from LoadMeta 2. store to collection 3. set SearchRequest.MetricType Signed-off-by: PowderLi <min.li@zilliz.com>	2024-01-28 15:33:02 +08:00
SimFG	ddccccbcab	enhance: add the bytes data type for merge data and format some code (#30105 ) /kind improvement Signed-off-by: SimFG <bang.fu@zilliz.com>	2024-01-18 22:18:55 +08:00
xige-16	91aa81b4d7	fix: Add more checks to rank params (#29950 ) issue: #29840 #29867 /kind bug Signed-off-by: xige-16 <xi.ge@zilliz.com> Signed-off-by: xige-16 <xi.ge@zilliz.com>	2024-01-17 20:28:58 +08:00
Xu Tong	e429965f32	Add float16 approve for multi-type part (#28427 ) issue：https://github.com/milvus-io/milvus/issues/22837 Add bfloat16 vector, add the index part of float16 vector. Signed-off-by: Writer-X <1256866856@qq.com>	2024-01-11 15:48:51 +08:00
xige-16	9702cef2b5	feat: Support multiple vector search (#29433 ) issue #25639 Signed-off-by: xige-16 <xi.ge@zilliz.com> Signed-off-by: xige-16 <xi.ge@zilliz.com>	2024-01-08 15:34:48 +08:00
congqixia	4f8c540c77	enhance: cache collection schema attributes to reduce proxy cpu (#29668 ) See also #29113 The collection schema is crucial when performing search/query but some of the information is calculated for every request. This PR change schema field of cached collection info into a utility `schemaInfo` type to store some stable result, say pk field, partitionKeyEnabled, etc. And provided field name to id map for search/query services. --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2024-01-04 17:28:46 +08:00
Jiquan Long	3f46c6d459	feat: support inverted index (#28783 ) issue: https://github.com/milvus-io/milvus/issues/27704 Add inverted index for some data types in Milvus. This index type can save a lot of memory compared to loading all data into RAM and speed up the term query and range query. Supported: `INT8`, `INT16`, `INT32`, `INT64`, `FLOAT`, `DOUBLE`, `BOOL` and `VARCHAR`. Not supported: `ARRAY` and `JSON`. Note: - The inverted index for `VARCHAR` is not designed to serve full-text search now. We will treat every row as a whole keyword instead of tokenizing it into multiple terms. - The inverted index don't support retrieval well, so if you create inverted index for field, those operations which depend on the raw data will fallback to use chunk storage, which will bring some performance loss. For example, comparisons between two columns and retrieval of output fields. The inverted index is very easy to be used. Taking below collection as an example: ```python fields = [ FieldSchema(name="pk", dtype=DataType.VARCHAR, is_primary=True, auto_id=False, max_length=100), FieldSchema(name="int8", dtype=DataType.INT8), FieldSchema(name="int16", dtype=DataType.INT16), FieldSchema(name="int32", dtype=DataType.INT32), FieldSchema(name="int64", dtype=DataType.INT64), FieldSchema(name="float", dtype=DataType.FLOAT), FieldSchema(name="double", dtype=DataType.DOUBLE), FieldSchema(name="bool", dtype=DataType.BOOL), FieldSchema(name="varchar", dtype=DataType.VARCHAR, max_length=1000), FieldSchema(name="random", dtype=DataType.DOUBLE), FieldSchema(name="embeddings", dtype=DataType.FLOAT_VECTOR, dim=dim), ] schema = CollectionSchema(fields) collection = Collection("demo", schema) ``` Then we can simply create inverted index for field via: ```python index_type = "INVERTED" collection.create_index("int8", {"index_type": index_type}) collection.create_index("int16", {"index_type": index_type}) collection.create_index("int32", {"index_type": index_type}) collection.create_index("int64", {"index_type": index_type}) collection.create_index("float", {"index_type": index_type}) collection.create_index("double", {"index_type": index_type}) collection.create_index("bool", {"index_type": index_type}) collection.create_index("varchar", {"index_type": index_type}) ``` Then, term query and range query on the field can be speed up automatically by the inverted index: ```python result = collection.query(expr='int64 in [1, 2, 3]', output_fields=["pk"]) result = collection.query(expr='int64 < 5', output_fields=["pk"]) result = collection.query(expr='int64 > 2997', output_fields=["pk"]) result = collection.query(expr='1 < int64 < 5', output_fields=["pk"]) ``` --------- Signed-off-by: longjiquan <jiquan.long@zilliz.com>	2023-12-31 19:50:47 +08:00
xige-16	0a70e8b601	enhance: Remove multiple vector field limit (#27827 ) issue: https://github.com/milvus-io/milvus/issues/25639 /kind improvement Signed-off-by: xige-16 <xi.ge@zilliz.com> Signed-off-by: xige-16 <xi.ge@zilliz.com>	2023-12-28 16:40:46 +08:00
aoiasd	a76e3b2813	Refine delete by expression for forbid proxy dml task scheduler hang (#29340 ) relate: https://github.com/milvus-io/milvus/issues/29146 --------- Signed-off-by: aoiasd <zhicheng.yue@zilliz.com>	2023-12-26 19:52:48 +08:00
yah01	a0e1a1eb31	feat: support enable/disable mmap for index (#29005 ) support enable/disable mmap for index, the user could alter the index's mode by `AlterIndex` method related: https://github.com/milvus-io/milvus/issues/21866 --------- Signed-off-by: yah01 <yah2er0ne@outlook.com> Signed-off-by: yah01 <yang.cen@zilliz.com>	2023-12-21 18:07:24 +08:00
congqixia	bcf8f27aa7	enhance: refine proxy meta cache partition logic (#29315 ) See also #29113 - Unify partition info refresh logic - Prevent parse partition names for each partition key search request --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2023-12-20 10:02:43 +08:00
PowderLi	20fc90c591	enhance: find collection schema from cache (#28782 ) issue: #28781 #28329 1. There is no need to call `DescribeCollection`, if the collection's schema is found in the globalMetaCache 2. did `GetProperties` to check the access to Azure Blob Service while construct the ChunkManager Signed-off-by: PowderLi <min.li@zilliz.com>	2023-12-03 19:22:33 +08:00
SimFG	9c46788d87	enhance: Support to trace restful request and request error (#28685 ) issue: #28348 Signed-off-by: SimFG <bang.fu@zilliz.com>	2023-11-27 20:14:26 +08:00
yah01	3ea0129eb3	enhance: improve the error messages and logs (#28684 ) - better name for log fields - make the error and log consistent Signed-off-by: yah01 <yah2er0ne@outlook.com>	2023-11-24 15:08:24 +08:00
SimFG	de13865769	enhance: Add load/release partitions to replicate msg stream (#28399 ) /kind improvement issue: #25655 Signed-off-by: SimFG <bang.fu@zilliz.com>	2023-11-23 15:38:24 +08:00
Ikko Eltociear Ashimine	ed4f20b0ed	Fix typo in util.go (#27975 ) suppot -> support Signed-off-by: Ikko Eltociear Ashimine <eltociear@gmail.com>	2023-10-30 14:40:27 +08:00
SimFG	9b0ecbdca7	Support to replicate the mq message (#27240 ) Signed-off-by: SimFG <bang.fu@zilliz.com>	2023-10-20 14:26:09 +08:00
zhenshan.cao	020ad9a6bc	Rectify wrong exception messages associated with Array datatype (#27769 ) Signed-off-by: zhenshan.cao <zhenshan.cao@zilliz.com>	2023-10-19 17:24:07 +08:00
SimFG	630636c4ec	Support the apikey authentication for the restful api (#27758 ) Signed-off-by: SimFG <bang.fu@zilliz.com>	2023-10-18 16:36:12 +08:00
PowderLi	09d8b76048	[restful] new context with grpc metadata (#27668 ) Signed-off-by: PowderLi <min.li@zilliz.com>	2023-10-17 20:00:14 +08:00
xige-16	6cbb67832f	Compatible with scalar index types marisa-trie and Ascending (#27638 ) Signed-off-by: xige-16 <xi.ge@zilliz.com>	2023-10-15 13:52:06 +08:00
yah01	3759857bc5	Refine Proxy errors (#27499 ) Signed-off-by: yah01 <yah2er0ne@outlook.com>	2023-10-09 10:09:33 +08:00
yah01	8394b3a1ec	Block creating new error from status reason (#27426 ) Signed-off-by: yah01 <yah2er0ne@outlook.com>	2023-10-07 11:29:32 +08:00
yah01	63ac43a3b8	Refine errors for import (#27379 ) Signed-off-by: yah01 <yah2er0ne@outlook.com>	2023-09-30 10:31:28 +08:00
yah01	6539a5ae2c	Refine DataCoord status (#27262 ) Signed-off-by: yah01 <yah2er0ne@outlook.com>	2023-09-26 17:15:27 +08:00
jaime	7f7c71ea7d	Decoupling client and server API in types interface (#27186 ) Co-authored-by:: aoiasd <zhicheng.yue@zilliz.com> Signed-off-by: jaime <yun.zhang@zilliz.com>	2023-09-26 09:57:25 +08:00
SimFG	26f06dd732	Format the code (#27275 ) Signed-off-by: SimFG <bang.fu@zilliz.com>	2023-09-21 09:45:27 +08:00
cai.zhang	a362bb1457	Support array datatype (#26369 ) Signed-off-by: cai.zhang <cai.zhang@zilliz.com>	2023-09-19 14:23:23 +08:00
congqixia	cc9974979f	Add staticcheck linter and fix existing problems (#27174 ) Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>	2023-09-19 10:05:22 +08:00
yah01	168e82ee10	Fix panic while handling with the nil status (#27040 ) Signed-off-by: yah01 <yah2er0ne@outlook.com>	2023-09-15 10:09:21 +08:00
yah01	00c65fa0d7	Refine QueryNode errors (#27013 ) Signed-off-by: yah01 <yah2er0ne@outlook.com>	2023-09-12 16:07:18 +08:00
aoiasd	e107d0794c	support complex delete expression (#25752 ) Signed-off-by: aoiasd <zhicheng.yue@zilliz.com>	2023-09-12 10:19:17 +08:00
Xu Tong	9166011c4a	Add float16 vector (#25852 ) Signed-off-by: Writer-X <1256866856@qq.com>	2023-09-08 10:03:16 +08:00
yah01	3349db4aa7	Refine errors to remove changes breaking design (#26521 ) Signed-off-by: yah01 <yah2er0ne@outlook.com>	2023-09-04 09:57:09 +08:00
Cai Yudong	8dc16b599b	Add binary metric types SUBSTRUCTURE/SUPERSTRUCTURE back (#26766 ) Signed-off-by: Yudong Cai <yudong.cai@zilliz.com>	2023-08-31 20:07:00 +08:00
SimFG	9311dc91ee	Clear error message in the delete request (#26656 ) Signed-off-by: SimFG <bang.fu@zilliz.com>	2023-08-30 14:47:00 +08:00
smellthemoon	87ecaac703	Add dynamic schema check in upsert (#26644 ) Signed-off-by: lixinguo <xinguo.li@zilliz.com> Co-authored-by: lixinguo <xinguo.li@zilliz.com>	2023-08-30 10:52:26 +08:00
xige-16	1e5836221a	Fix CollectionNotExists when search and retrieve vector (#26524 ) Signed-off-by: xige-16 <xi.ge@zilliz.com>	2023-08-22 17:06:22 +08:00
Enwei Jiao	ca1349708b	Remove time travel ralted testcase (#26119 ) Signed-off-by: Enwei Jiao <enwei.jiao@zilliz.com>	2023-08-10 18:53:17 +08:00
PowderLi	a7eecb1be0	support high-level RESTFUL API, listen on the same port as grpc. (#25108 ) Signed-off-by: PowderLi <min.li@zilliz.com>	2023-08-08 10:15:07 +08:00
Cai Yudong	9a4761dcc7	Remove binary metrics TANIMOTO/SUPERSTRUCTURE/SUBSTRUCTURE (#25708 ) Signed-off-by: Yudong Cai <yudong.cai@zilliz.com>	2023-07-19 16:16:58 +08:00
groot	96c987ed62	Bulkinsert supports partition keys (#25284 ) Signed-off-by: yhmo <yihua.mo@zilliz.com>	2023-07-11 15:18:28 +08:00
xige-16	8b9e3f1127	Fix max_length check (#25207 ) Signed-off-by: xige-16 <xi.ge@zilliz.com>	2023-06-30 11:50:26 +08:00
jaime	18df2ba6fd	[Cherry-Pick] Support Database (#24769 ) Support Database(#23742) Fix db nonexists error for FlushAll (#24222) Fix check collection limits fails (#24235) backward compatibility with empty DB name (#24317) Fix GetFlushAllState with DB (#24347) Remove db from global meta cache after drop database (#24474) Fix db name is empty for describe collection response (#24603) Add RBAC for Database API (#24653) Fix miss load the same name collection during recover stage (#24941) RBAC supports Database validation (#23609) Fix to list grant with db return empty (#23922) Optimize PrivilegeAll permission check (#23972) Add the default db value for the rbac request (#24307) Signed-off-by: jaime <yun.zhang@zilliz.com> Co-authored-by: SimFG <bang.fu@zilliz.com> Co-authored-by: longjiquan <jiquan.long@zilliz.com>	2023-06-25 17:20:43 +08:00
chyezh	ccf3f0066f	[Pick] Enable max result window limit (#24986 ) Signed-off-by: chyezh <ye.zhen@zilliz.com>	2023-06-25 14:42:43 +08:00
Zhao Shunjie	3b5b50bda8	add autoID to varchar dataType (#24907 ) Signed-off-by: shunjiezhao <939038111@qq.com>	2023-06-16 17:00:40 +08:00
wei liu	59457eb75b	fix get partition progress return wrong error msg (#24899 ) Signed-off-by: Wei Liu <wei.liu@zilliz.com>	2023-06-15 11:14:39 +08:00

1 2 3

116 Commits