datacap/configure/etc/conf/plugins/jdbc/clickhouse.yaml


name: ClickHouse
supportTime: '2022-09-22'
configures:
  - field: name
    type: String
    required: true
    message: name is a required field, please be sure to enter it
  - field: host
    type: String
    required: true
    value: 127.0.0.1
    message: host is a required field, please be sure to enter it
  - field: port
    type: Number
    required: true
    min: 1
    max: 65535
    value: 9000
    message: port is a required field, please be sure to enter it
  - field: username
    type: String
    group: authorization
  - field: password
    type: String
    group: authorization
  - field: database
    type: String
    group: advanced
    value: default
    required: true
  - field: configures
    type: Array
    value: [ ]
    group: custom
pipelines:
  - executor: Seatunnel
    type: SOURCE
    fields:
      - field: host
        origin: host|port
        required: true
      - field: username
        origin: username
        required: true
        override: false
      - field: password
        origin: password
        required: false
        override: false
      - field: database
        origin: database
        required: true
        hidden: false
        description: The ClickHouse database
        override: true
        input: true
        width: 300
        type: INPUT
      - field: sql
        origin: context
        required: false
        hidden: false
        description: The SQL query used to fetch data through the ClickHouse server
        override: true
        input: true
        width: 300
        type: TEXT
        tooltip: If a value is entered, it replaces the default SQL statement
      - field: result_table_name
        origin: result_table_name
        description: |
          When result_table_name is not specified, the data processed by this plugin is not registered as a data set (dataStream/dataset), also called a temporary table, that other plugins can access directly.
          When result_table_name is specified, the data processed by this plugin is registered as a data set (dataStream/dataset), also called a temporary table, that other plugins can access directly by specifying source_table_name.
        required: false
        hidden: true
        input: true
        override: true
  - executor: Seatunnel
    type: SINK
    fields:
      - field: host
        origin: host|port
        required: true
      - field: username
        origin: username
        override: false
        required: true
      - field: password
        override: false
        required: true
      - field: database
        origin: database
        description: |
          The ClickHouse database
        override: true
        required: true
        input: true
      - field: table
        origin: table
        description: |
          The table name of the ClickHouse database
        override: true
        required: true
        input: true
      - field: fields
        origin: fields
        description: |
          The data fields to write to ClickHouse. If not configured, they are adapted automatically according to the sink table schema.
          If there are multiple values, each value needs to be on its own row.
        override: true
        required: true
        input: true
        type: TEXT
      - field: bulk_size
        origin: bulk_size
        description: |
          The number of rows written through clickhouse-jdbc in each batch. The default is 20000.
        override: true
        required: false
        input: true
        value: 20000
        width: 100
        type: NUMBER
      - field: split_mode
        origin: split_mode
        description: |
          This mode only supports ClickHouse tables whose engine is 'Distributed', and the internal_replication option should be true. SeaTunnel splits the distributed table data and writes directly to each shard. The shard weights defined in ClickHouse are taken into account.
        override: true
        required: false
        input: true
        value: false
        width: 45
        type: SWITCH
      - field: sharding_key
        origin: sharding_key
        description: |
          When split_mode is used, a node must be chosen for each write. The default is random selection, but the 'sharding_key' parameter can specify the field used by the sharding algorithm. This option only takes effect when 'split_mode' is true.
        override: true
        required: false
        input: true
        width: 100
        type: INPUT
      - field: primary_key
        origin: primary_key
        description: |
          Marks the primary key column of the ClickHouse table. INSERT/UPDATE/DELETE operations on the ClickHouse table are executed based on this primary key.
        override: true
        required: false
        input: true
        width: 100
        type: INPUT
      - field: support_upsert
        origin: support_upsert
        description: |
          Support upserting rows by querying the primary key
        override: true
        required: false
        input: true
        value: false
        width: 45
        type: SWITCH