milvus/internal/datacoord
yihao.dai 3ae1865ffb
fix: Prevent import from generating orphaned files (#34070) (#34071)
When import failed, mark the import segment as dropped instead of drop
it directly to prevent generating orphaned files.

issue: https://github.com/milvus-io/milvus/issues/34068

pr: https://github.com/milvus-io/milvus/pull/34070

---------

Signed-off-by: bigsheeper <yihao.dai@zilliz.com>
2024-06-25 17:00:04 +08:00
..
broker enhance: Improve datacoord segment filtering with collection (#32831) 2024-05-08 21:37:29 +08:00
allocator_test.go Decoupling client and server API in types interface (#27186) 2023-09-26 09:57:25 +08:00
allocator.go feat: Add import scheduler and manager (#29367) 2024-03-01 18:31:02 +08:00
build_index_policy.go Format the code (#27275) 2023-09-21 09:45:27 +08:00
channel_checker_test.go Format the code (#27275) 2023-09-21 09:45:27 +08:00
channel_checker.go Format the code (#27275) 2023-09-21 09:45:27 +08:00
channel_manager_factory.go feat: datacoord/node watch based on rpc (#32036) 2024-05-07 15:49:30 +08:00
channel_manager_test.go enhance: Make channel meta able to writer 200k plus segments (#33300) 2024-05-28 10:09:48 +08:00
channel_manager_v2_test.go fix: Return error when startup Delete/AddNode fail (#33193) (#33258) 2024-05-22 14:49:40 +08:00
channel_manager_v2.go fix: [2.4] Remove loopclosure issue in ChannelManagerImplV2 (#33989) (#34004) 2024-06-20 14:42:00 +08:00
channel_manager.go feat: datacoord/node watch based on rpc (#32036) 2024-05-07 15:49:30 +08:00
channel_store_test.go enhance: Make channel meta able to writer 200k plus segments (#33300) 2024-05-28 10:09:48 +08:00
channel_store_v2_test.go enhance: Make channel meta able to writer 200k plus segments (#33300) 2024-05-28 10:09:48 +08:00
channel_store_v2.go enhance: Make channel meta able to writer 200k plus segments (#33300) 2024-05-28 10:09:48 +08:00
channel_store.go enhance: Make channel meta able to writer 200k plus segments (#33300) 2024-05-28 10:09:48 +08:00
channel.go feat: datacoord/node watch based on rpc (#32036) 2024-05-07 15:49:30 +08:00
cluster_test.go feat: datacoord/node watch based on rpc (#32036) 2024-05-07 15:49:30 +08:00
cluster.go feat: datacoord/node watch based on rpc (#32036) 2024-05-07 15:49:30 +08:00
compaction_l0_view_test.go enhance: Add deltaRowCount in l0 compaction (#33843) 2024-06-20 19:32:02 +08:00
compaction_l0_view.go enhance: Add deltaRowCount in l0 compaction (#33843) 2024-06-20 19:32:02 +08:00
compaction_scheduler_test.go enhance: Add CompactionTaskNum metrics (#29518) 2023-12-28 15:46:55 +08:00
compaction_scheduler.go fix: Trigger l0 compaction when l0 views don't change (#30729) 2024-03-05 16:37:00 +08:00
compaction_test.go enhance: Donot mark compaction tasks to be timeouted (#33833) 2024-06-14 10:45:58 +08:00
compaction_trigger_test.go enhance: Add memory size for binlog (#33025) 2024-05-15 12:59:34 +08:00
compaction_trigger_v2_test.go enhance: Improve datacoord segment filtering with collection (#32831) 2024-05-08 21:37:29 +08:00
compaction_trigger_v2.go fix: Skip to submit l0 tasks when scheduler full (#31270) 2024-03-15 10:21:12 +08:00
compaction_trigger.go enhance: Add memory size for binlog (#33025) 2024-05-15 12:59:34 +08:00
compaction_view_manager_test.go enhance: Add deltaRowCount in l0 compaction (#33843) 2024-06-20 19:32:02 +08:00
compaction_view_manager.go fix: Correct the last empty l0 views (#31198) 2024-03-14 10:31:04 +08:00
compaction_view.go enhance: [cherry-pick] Move compactor into sub package (#34098) 2024-06-25 11:16:02 +08:00
compaction.go enhance: Donot mark compaction tasks to be timeouted (#33833) 2024-06-14 10:45:58 +08:00
const.go enhance: pass partition key scalar info if enabled when build vector index (#29931) 2024-01-24 00:04:55 +08:00
errors_test.go Format the code (#27275) 2023-09-21 09:45:27 +08:00
errors.go Refine DataCoord status (#27262) 2023-09-26 17:15:27 +08:00
garbage_collector_test.go enhance: Improve datacoord segment filtering with collection (#32831) 2024-05-08 21:37:29 +08:00
garbage_collector.go enhance: Improve datacoord segment filtering with collection (#32831) 2024-05-08 21:37:29 +08:00
handler.go enhance: Improve datacoord segment filtering with collection (#32831) 2024-05-08 21:37:29 +08:00
import_checker_test.go enhance: Ensure ImportV2 waits for the index to be built and refine some logic (#31629) 2024-04-01 20:09:13 +08:00
import_checker.go fix: Fix multiple vector fields import (#33723) (#33724) 2024-06-10 21:55:55 +08:00
import_job.go fix: Fix multiple vector fields import (#33723) (#33724) 2024-06-10 21:55:55 +08:00
import_meta_test.go feat: Add import scheduler and manager (#29367) 2024-03-01 18:31:02 +08:00
import_meta.go feat: Add import scheduler and manager (#29367) 2024-03-01 18:31:02 +08:00
import_scheduler_test.go fix: Prevent import from generating orphaned files (#34070) (#34071) 2024-06-25 17:00:04 +08:00
import_scheduler.go fix: Prevent import from generating orphaned files (#34070) (#34071) 2024-06-25 17:00:04 +08:00
import_task.go enhance: Enhance and modify the return content of ImportV2 (#31192) 2024-03-13 19:51:03 +08:00
import_util_test.go fix: Check if the import job exists (#33672) (#33673) 2024-06-10 21:50:29 +08:00
import_util.go fix: Fix multiple vector fields import (#33723) (#33724) 2024-06-10 21:55:55 +08:00
index_builder_test.go enhance: [2.4] support integral type for MV and skip MV if there is only one category (#34005) 2024-06-24 19:12:02 +08:00
index_builder.go feat: support inverted index for array (#33452) (#34053) 2024-06-24 10:50:03 +08:00
index_engine_version_manager_test.go Optimize the codec code of session (#27360) 2023-10-01 10:33:30 +08:00
index_engine_version_manager.go feat: trigger compaction to handle index version (#28442) 2023-11-21 09:26:22 +08:00
index_meta_test.go enhance: speed up minor functions calls in datacoord (#32389) 2024-04-20 07:55:21 +08:00
index_meta.go enhance: speed up minor functions calls in datacoord (#32389) 2024-04-20 07:55:21 +08:00
index_service_test.go enhance: Improve datacoord segment filtering with collection (#32831) 2024-05-08 21:37:29 +08:00
index_service.go enhance: Improve datacoord segment filtering with collection (#32831) 2024-05-08 21:37:29 +08:00
indexnode_manager_test.go Refine state check (#27541) 2023-10-11 21:01:35 +08:00
indexnode_manager.go fix: Refine index param check for Create/Alter Index (#32712) 2024-05-06 15:43:29 +08:00
meta_test.go fix: Fill stats log id and check validity (#33477) (#33478) 2024-05-31 14:13:46 +08:00
meta_util.go
meta.go fix: lost DroppedAt field when DropCollection (#33767) 2024-06-12 20:19:56 +08:00
metrics_info_test.go enhance: Use channel manager interface in server_test (#31621) 2024-04-12 14:59:20 +08:00
metrics_info.go enhance: Add metrics for segment index files size (#32979) 2024-05-13 17:59:32 +08:00
mock_allocator_test.go enhance: index meta use independent rather than global meta lock (#30869) 2024-03-04 16:56:59 +08:00
mock_channel_store.go feat: datacoord/node watch based on rpc (#32036) 2024-05-07 15:49:30 +08:00
mock_channelmanager.go feat: datacoord/node watch based on rpc (#32036) 2024-05-07 15:49:30 +08:00
mock_cluster.go feat: datacoord/node watch based on rpc (#32036) 2024-05-07 15:49:30 +08:00
mock_compaction_meta.go enhance: Improve datacoord segment filtering with collection (#32831) 2024-05-08 21:37:29 +08:00
mock_compaction_plan_context.go fix: Separate schedule and check results loop (#28692) 2023-11-29 10:50:29 +08:00
mock_handler.go fix: fix collectionInfo leak in datacoord (#32175) 2024-04-15 16:33:19 +08:00
mock_index_engine_version_manager.go feat: trigger compaction to handle index version (#28442) 2023-11-21 09:26:22 +08:00
mock_scheduler.go enhance: Add CompactionTaskNum metrics (#29518) 2023-12-28 15:46:55 +08:00
mock_segment_manager.go enhance: Execute bloom filter apply in parallel to speed up segment predict (#33793) 2024-06-13 14:14:04 +08:00
mock_session_manager.go enhance: [cherry-pick] Move compactor into sub package (#34098) 2024-06-25 11:16:02 +08:00
mock_subcluster.go feat: datacoord/node watch based on rpc (#32036) 2024-05-07 15:49:30 +08:00
mock_test.go fix: [2.4] not found database name in the datacoord meta object (#33412) 2024-05-28 10:07:43 +08:00
mock_trigger_manager.go feat: Add Compaction views and triggers (#27906) 2023-11-23 17:30:25 +08:00
OWNERS
policy_test.go enhance: Make channel meta able to writer 200k plus segments (#33300) 2024-05-28 10:09:48 +08:00
policy.go fix: Enable to balance released standby channel (#32986) 2024-05-14 11:51:32 +08:00
README.md
segment_allocation_policy_test.go Adjust datacoord for L0 Delta (#28021) 2023-11-06 15:26:16 +08:00
segment_allocation_policy.go fix: Decoupling importing segment from flush process (#30402) 2024-02-03 13:01:12 +08:00
segment_info_test.go enhance: use WalkWithPrefix api for oss, enable piplined file gc (#31740) 2024-04-25 20:41:27 +08:00
segment_info.go enhance: Add memory size for binlog (#33025) 2024-05-15 12:59:34 +08:00
segment_manager_test.go enhance: Support L0 import (#33514) (#33712) 2024-06-08 11:17:52 +08:00
segment_manager.go enhance: Support L0 import (#33514) (#33712) 2024-06-08 11:17:52 +08:00
segment_operator_test.go fix: Prevent clone when selecting segments from meta (#30928) 2024-03-01 13:59:02 +08:00
segment_operator.go enhance: Improve datacoord segment filtering with collection (#32831) 2024-05-08 21:37:29 +08:00
server_test.go enhance: Support L0 import (#33514) (#33712) 2024-06-08 11:17:52 +08:00
server.go fix: [2.4] not found database name in the datacoord meta object (#33412) 2024-05-28 10:07:43 +08:00
services_test.go fix: Check if the import job exists (#33672) (#33673) 2024-06-10 21:50:29 +08:00
services.go fix: Check if the import job exists (#33672) (#33673) 2024-06-10 21:50:29 +08:00
session_manager_test.go feat: datacoord/node watch based on rpc (#32036) 2024-05-07 15:49:30 +08:00
session_manager.go enhance: Use channel manager interface in server_test (#31621) 2024-04-12 14:59:20 +08:00
session.go feat: datacoord/node watch based on rpc (#32036) 2024-05-07 15:49:30 +08:00
util_test.go enhance: Add memory size for binlog (#33025) 2024-05-15 12:59:34 +08:00
util.go enhance: Add memory size for binlog (#33025) 2024-05-15 12:59:34 +08:00

Data Coordinator

Data cooridnator(datacoord for short) is the component to organize DataNodes and segments allocations.

Dependency

  • KV store: a kv store has all the meta info datacoord needs to operate. (etcd)
  • Message stream: a message stream to communicate statistics information with data nodes. (Pulsar)
  • Root Coordinator: timestamp, id and meta source.
  • Data Node(s): could be an instance or a cluster, actual worker group handles data modification operations.