Source Common Options
Common parameters of source connectors
result_table_name
String
No
-
When result_table_name
is not specified, the data processed by this plugin will not be registered as a data set (dataStream/dataset)
that can be directly accessed by other plugins, or called a temporary table (table)
When result_table_name
is specified, the data processed by this plugin will be registered as a data set (dataStream/dataset)
that can be directly accessed by other plugins, or called a temporary table (table)
. The data set (dataStream/dataset)
registered here can be directly accessed by other plugins by specifying source_table_name
.
parallelism
Int
No
-
When parallelism
is not specified, the parallelism
in env is used by default.
When parallelism is specified, it will override the parallelism in env.
Important note
When the job configuration result_table_name
you must set the source_table_name
parameter
Task Example
Simple:
This registers a stream or batch data source and returns the table name
fake_table
at registration
source {
FakeSourceStream {
result_table_name = "fake_table"
}
}
Multiple Pipeline Simple
This is to convert the data source fake and write it to two different sinks
env {
job.mode = "BATCH"
}
source {
FakeSource {
result_table_name = "fake"
row.num = 100
schema = {
fields {
id = "int"
name = "string"
age = "int"
c_timestamp = "timestamp"
c_date = "date"
c_map = "map<string, string>"
c_array = "array<int>"
c_decimal = "decimal(30, 8)"
c_row = {
c_row = {
c_int = int
}
}
}
}
}
}
transform {
Sql {
source_table_name = "fake"
result_table_name = "fake1"
# the query table name must same as field 'source_table_name'
query = "select id, regexp_replace(name, '.+', 'b') as name, age+1 as age, pi() as pi, c_timestamp, c_date, c_map, c_array, c_decimal, c_row from fake"
}
# The SQL transform support base function and criteria operation
# But the complex SQL unsupported yet, include: multi source table/rows JOIN and AGGREGATE operation and the like
}
sink {
Console {
source_table_name = "fake1"
}
Console {
source_table_name = "fake"
}
}
Last updated