StarRocks
Last updated
Last updated
StarRocks source connector
Read external data source data through StarRocks. The internal implementation of StarRocks source connector is obtains the query plan from the frontend (FE), delivers the query plan as a parameter to BE nodes, and then obtains data results from BE nodes.
node_urls
list
yes
-
username
string
yes
-
password
string
yes
-
database
string
yes
-
table
string
yes
-
scan_filter
string
no
-
schema
config
yes
-
request_tablet_size
int
no
Integer.MAX_VALUE
scan_connect_timeout_ms
int
no
30000
scan_query_timeout_sec
int
no
3600
scan_keep_alive_min
int
no
10
scan_batch_rows
int
no
1024
scan_mem_limit
long
no
2147483648
max_retries
int
no
3
scan.params.*
string
no
-
StarRocks
cluster address, the format is ["fe_ip:fe_http_port", ...]
StarRocks
user username
StarRocks
user password
The name of StarRocks database
The name of StarRocks table
Filter expression of the query, which is transparently transmitted to StarRocks. StarRocks uses this expression to complete source-side data filtering.
e.g.
The schema of the starRocks that you want to generate
e.g.
The number of StarRocks Tablets corresponding to an Partition. The smaller this value is set, the more partitions will be generated. This will increase the parallelism on the engine side, but at the same time will cause greater pressure on StarRocks.
The following is an example to explain how to use request_tablet_size to controls the generation of partitions
requests connection timeout sent to StarRocks
Query the timeout time of StarRocks, the default value is 1 hour, -1 means no timeout limit
The keep-alive duration of the query task, in minutes. The default value is 10. we recommend that you set this parameter to a value greater than or equal to 5.
The maximum number of data rows to read from BE at a time. Increasing this value reduces the number of connections established between engine and StarRocks and therefore mitigates overhead caused by network latency.
The maximum memory space allowed for a single query in the BE node, in bytes. The default value is 2147483648 (2 GB).
number of retry requests sent to StarRocks
The parameter of the scan data from be
fields [Config]