milvus/README.md

320 lines
11 KiB
Markdown
Raw Normal View History

<img src="https://zillizstorage.blob.core.windows.net/zilliz-assets/zilliz-assets/assets/readme_en_9f910915cf.png" alt="milvus banner">
2021-04-20 13:53:56 +08:00
<div class="column" align="middle">
<a href="https://join.slack.com/t/milvusio/shared_invite/zt-e0u4qu3k-bI2GDNys3ZqX1YCJ9OM~GQ">
<img src="https://img.shields.io/badge/Join-Slack-orange" /></a>
<img src="https://img.shields.io/github/license/milvus-io/milvus" />
<img src="https://img.shields.io/docker/pulls/milvusdb/milvus" />
</div>
2021-04-20 13:53:56 +08:00
<div class="column" align="middle">
<a href="https://internal.zilliz.com:18080/jenkins/job/milvus-ha-ci/job/master/badge/">
<img src="https://internal.zilliz.com:18080/jenkins/job/milvus-ha-ci/job/master/badge/icon" />
</a>
<a href="https://bestpractices.coreinfrastructure.org/projects/3563">
<img src="https://bestpractices.coreinfrastructure.org/projects/3563/badge" />
</a>
<a href="https://codecov.io/gh/milvus-io/milvus">
<img src="https://codecov.io/gh/milvus-io/milvus/branch/master/graph/badge.svg" />
</a>
<a href="https://app.codacy.com/gh/milvus-io/milvus?utm_source=github.com&utm_medium=referral&utm_content=milvus-io/milvus&utm_campaign=Badge_Grade_Dashboard">
<img src="https://api.codacy.com/project/badge/Grade/c4bb2ccfb51b47f99e43bfd1705edd95" />
</a>
</div>
<br />
## What is Milvus?
Milvus is an open-source vector database built to power AI applications and embedding similarity search. Milvus makes unstructured data search more accessible, and provides a consistent user experience regardless of the deployment environment.
Both Milvus Standalone and Milvus Cluster are available.
Milvus was released under the [open-source Apache License 2.0](https://github.com/milvus-io/milvus/blob/master/LICENSE) in October 2019. It is currently a graduate project under [LF AI & Data Foundation](https://lfaidata.foundation/).
## Key features
<details>
<summary><b>Millisecond search on trillion vector datasets</b></summary>
Average latency measured in milliseconds on trillion vector datasets.
</details>
<details>
<summary><b>Simplified unstructured data management</b></summary>
<li>Rich APIs designed for data science workflows.</li><li>Consistent user experience across laptop, local cluster, and cloud.</li><li>Embed real-time search and analytics into virtually any application.</li>
</details>
<details>
<summary><b>Reliable, always on vector database</b></summary>
Milvus built-in replication and failover/failback features ensure data and applications can maintain business continuity in the event of a disruption.
</details>
<details>
<summary><b>Highly scalable and elastic</b></summary>
Component-level scalability makes it possible to scale up and down on demand. Milvus can autoscale at a component level according to the load type, making resource scheduling much more efficient.
</details>
<details>
<summary><b>Hybrid search</b></summary>
In addition to vectors, Milvus supports data types such as Boolean, integers, floating-point numbers, and more. A collection in Milvus can hold multiple fields for accommodating different data features or properties. By complementing scalar filtering to vector similarity search, Milvus makes modern search much more flexible than ever.
</details>
<details>
<summary><b>Unified Lambda structure</b></summary>
Milvus combines stream and batch processing for data storage to balance timeliness and efficiency. Its unified interface makes vector similarity search a breeze.
</details>
<details>
<summary><b>Community supported, industry recognized</b></summary>
With over 1,000 enterprise users, 6,000+ stars on GitHub, and an active open-source community, youre not alone when you use Milvus. As a graduate project under the <a href="https://lfaidata.foundation/">LF AI & Data Foundation</a>, Milvus has institutional support.
</details>
> **IMPORTANT** The master branch is for the development of Milvus v2.0. On March 9th, 2021, we released Milvus v1.0, the first stable version of Milvus with long-term support. To use Milvus v1.0, switch to [branch 1.0](https://github.com/milvus-io/milvus/tree/1.0).
## Installation
### Install Milvus Standalone
Install with Docker-Compose
```bash
$ cd milvus/deployments/docker/standalone
$ sudo docker-compose up -d
```
Install with Helm
```bash
$ helm install -n milvus --set image.all.repository=registry.zilliz.com/milvus/milvus --set image.all.tag=master-latest milvus milvus-helm-charts/charts/milvus-ha
```
Build from source code
```bash
# Clone github repository.
$ cd /home/$USER/
$ git clone https://github.com/milvus-io/milvus.git
# Install third-party dependencies.
$ cd /home/$USER/milvus/
$ ./scripts/install_deps.sh
# Compile Milvus standalone.
$ make standalone
```
### Install Milvus Cluster
2021-04-20 13:53:56 +08:00
Install with Docker-Compose
2021-04-20 13:53:56 +08:00
```bash
$ cd milvus/deployments/docker/distributed
$ sudo docker-compose up -d
```
2021-04-20 13:53:56 +08:00
Install with Helm
2021-04-20 13:53:56 +08:00
```bash
$ helm install -n milvus --set image.all.repository=registry.zilliz.com/milvus/milvus --set image.all.tag=master-latest --set standalone.enabled=false milvus milvus-helm-charts/charts/milvus-ha
```
Build from source code
```bash
# Clone github repository.
$ cd /home/$USER
$ git clone https://github.com/milvus-io/milvus.git
# Install third-party dependencies.
$ cd milvus
$ ./scripts/install_deps.sh
# Compile Milvus Cluster.
$ make milvus
```
## Milvus 2.0 is better than Milvus 1.x
<table class="comparison">
<tr>
<th>&nbsp;</th>
<th><b>Milvus 1.x</b></th>
<th><b>Milvus 2.0</b></th>
</tr>
<tr>
<td><b>Architecture</b></td>
<td>Shared storage</td>
<td>Cloud native</td>
</tr>
<tr>
<td><b>Scalability</b></td>
<td>1 to 32 read nodes with only one write node</td>
<td>500+ nodes</td>
</tr>
<tr>
<td><b>Durability</b></td>
<td><li>Local disk</li><li>Network file system (NFS)</li></td>
<td><li>Object storage service (OSS)</li><li>Distributed file system (DFS)</li></td>
</tr>
<tr>
<td><b>Availability</b></td>
<td>99%</td>
<td>99.9%</td>
</tr>
<tr>
<td><b>Data consistency</b></td>
<td>Eventual consistency</td>
<td>Three levels of consistency:<li>Strong</li><li>Session</li><li>Consistent prefix</li></td>
</tr>
<tr>
<td><b>Data types supported</b></td>
<td>Vectors</td>
<td><li>Vectors</li><li>Fixed-length scalars</li><li>String and text (in planning)</li></td>
</tr>
<tr>
<td><b>Basic operations supported</b></td>
<td><li>Data insertion</li><li>Data deletion</li><li>Approximate nearest neighbor (ANN) Search</li></td>
<td><li>Data insertion</li><li>Data deletion (in planning)</li><li>Data query</li><li>Approximate nearest neighbor (ANN) Search</li><li>Recurrent neural network (RNN) search (in planning)</li></td>
</tr>
<tr>
<td><b>Advanced features</b></td>
<td><li>Mishards</li><li>Milvus DM</li></td>
<td><li>Scalar filtering</li><li>Time Travel</li><li>Multi-site deployment and multi-cloud integration</li><li>Data management tools</li></td>
</tr>
<tr>
<td><b>Index types and libraries</b></td>
<td><li>Faiss</li><li>Annoy</li><li>Hnswlib</li><li>RNSG</li></td>
<td><li>Faiss</li><li>Annoy</li><li>Hnswlib</li><li>RNSG</li><li>ScaNN (in planning)</li><li>On-disk index (in planning)</li></td>
</tr>
<tr>
<td><b>SDKs</b></td>
<td><li>Python</li><li>Java</li><li>Go</li><li>RESTful</li><li>C++</li></td>
<td><li>Python</li><li>Go (in planning)</li><li>RESTful (in planning)</li><li>C++ (in planning)</li></td>
</tr>
<tr>
<td><b>Release status</b></td>
<td>Long-term support (LTS)</td>
<td>Release candidate. A stable version will be released in August.</td>
</tr>
</table>
## Getting Started
### Demos
<table>
<tr>
<td width="30%">
<a href="https://zilliz.com/milvus-demos">
<img src="https://zillizstorage.blob.core.windows.net/zilliz-assets/zilliz-assets/assets/image_search_59a64e4f22.gif" />
</a>
</td>
<td width="30%">
<a href="https://zilliz.com/milvus-demos">
<img src="https://zillizstorage.blob.core.windows.net/zilliz-assets/zilliz-assets/assets/qa_df5ee7bd83.gif" />
</a>
</td>
<td width="30%">
<a href="https://zilliz.com/milvus-demos">
<img src="https://zillizstorage.blob.core.windows.net/zilliz-assets/zilliz-assets/assets/mole_search_76f8340572.gif" />
</a>
</td>
</tr>
<tr>
<th>
<a href="https://zilliz.com/milvus-demos">Image search</a>
</th>
<th>
<a href="https://zilliz.com/milvus-demos">Chatbots</a>
</th>
<th>
<a href="https://zilliz.com/milvus-demos">Chemical structure search</a>
</th>
</tr>
</table>
2021-04-20 13:53:56 +08:00
- [Image Search](https://zilliz.com/milvus-demos)
2021-04-20 13:53:56 +08:00
Images made searchable. Instantaneously return the most similar images from a massive database.
2021-04-20 13:53:56 +08:00
- [Chatbots](https://zilliz.com/milvus-demos)
2021-04-20 13:53:56 +08:00
Interactive digital customer service that saves users time and businesses money.
2021-04-20 13:53:56 +08:00
- [Chemical Structure Search](https://zilliz.com/milvus-demos)
2021-04-20 13:53:56 +08:00
Blazing fast similarity search, substructure search, or superstructure search for a specified molecule.
2021-04-20 13:53:56 +08:00
## Bootcamps
2021-04-20 13:53:56 +08:00
Milvus [bootcamp](https://github.com/milvus-io/bootcamp/tree/new-bootcamp) are designed to expose users to both the simplicity and depth of the vector database. Discover how to run benchmark tests as well as build similarity search applications spanning chatbots, recommendation systems, reverse image search, molecular search, and much more.
2021-04-20 13:53:56 +08:00
## Contributing
2021-04-20 13:53:56 +08:00
Contributions to Milvus are welcome from everyone. See [Guidelines for Contributing](https://github.com/milvus-io/milvus/blob/master/CONTRIBUTING.md) for details on submitting patches and the contribution workflow. See our [community repository](https://github.com/milvus-io/community) to learn about our governance and access more community resources.
2021-04-20 13:53:56 +08:00
## Documentation
2021-04-20 13:53:56 +08:00
### SDK
The implemented SDK and its API documentation are listed below:
- [PyMilvus-ORM](https://github.com/milvus-io/pymilvus-orm)
## Community
Join the Milvus community on [Slack](https://join.slack.com/t/milvusio/shared_invite/zt-e0u4qu3k-bI2GDNys3ZqX1YCJ9OM~GQ) to share your suggestions, advice, and questions with our engineering team.
<a href="https://join.slack.com/t/milvusio/shared_invite/zt-e0u4qu3k-bI2GDNys3ZqX1YCJ9OM~GQ">
<img src="https://zillizstorage.blob.core.windows.net/zilliz-assets/zilliz-assets/assets/readme_slack_4a07c4c92f.png" alt="Miluvs Slack Channel" height="150" width="500">
</a>
You can also check out our [FAQ page](https://milvus.io/docs/v1.0.0/performance_faq.md) to discover solutions or answers to your issues or questions.
Subscribe to our mailing lists:
- [Milvus Technical Steering Committee](https://lists.lfai.foundation/g/milvus-tsc)
- [Milvus Technical Discussions](https://lists.lfai.foundation/g/milvus-technical-discuss)
- [Milvus Announcement](https://lists.lfai.foundation/g/milvus-announce)
Follow us on social media:
- [Milvus Medium](https://medium.com/@milvusio)
- [Milvus Twitter](https://twitter.com/milvusio)
2021-04-20 13:53:56 +08:00
## Join Us
2021-04-20 13:53:56 +08:00
Zilliz, the company behind Milvus, is [actively hiring](https://app.mokahr.com/apply/zilliz/37974#/) full-stack developers and solution engineers to build the next-generation open-source data fabric.
## Acknowledgments
Milvus adopts dependencies from the following:
- Thank [FAISS](https://github.com/facebookresearch/faiss) for the excellent search library.
- Thank [etcd](https://github.com/coreos/etcd) for providing some great open-source tools.
- Thank [Pulsar](https://github.com/apache/pulsar) for its great distributed information pub/sub platform.
- Thank [RocksDB](https://github.com/facebook/rocksdb) for the powerful storage engines.