Apache Iceberg
Last updated
Last updated
Apache Iceberg source connector
1.4.2
iceberg catalog
hadoop(2.7.1 , 2.7.5 , 3.1.3)
hive(2.3.9 , 3.1.2)
Source connector for Apache Iceberg. It can support batch and stream mode.
Iceberg
hive-exec
Iceberg
libfb303
BOOLEAN
BOOLEAN
INTEGER
INT
LONG
BIGINT
FLOAT
FLOAT
DOUBLE
DOUBLE
DATE
DATE
TIME
TIME
TIMESTAMP
TIMESTAMP
STRING
STRING
FIXED BINARY
BYTES
DECIMAL
DECIMAL
STRUCT
ROW
LIST
ARRAY
MAP
MAP
catalog_name
string
yes
-
User-specified catalog name.
namespace
string
yes
-
The iceberg database name in the backend catalog.
table
string
yes
-
The iceberg table name in the backend catalog.
iceberg.catalog.config
map
yes
-
hadoop.config
map
no
-
Properties passed through to the Hadoop configuration
iceberg.hadoop-conf-path
string
no
-
The specified loading paths for the 'core-site.xml', 'hdfs-site.xml', 'hive-site.xml' files.
schema
config
no
-
Use projection to select data columns and columns order.
case_sensitive
boolean
no
false
If data columns where selected via schema [config], controls whether the match to the schema will be done with case sensitivity.
start_snapshot_timestamp
long
no
-
Instructs this scan to look for changes starting from the most recent snapshot for the table as of the timestamp. timestamp β the timestamp in millis since the Unix epoch
start_snapshot_id
long
no
-
Instructs this scan to look for changes starting from a particular snapshot (exclusive).
end_snapshot_id
long
no
-
Instructs this scan to look for changes up to a particular snapshot (inclusive).
use_snapshot_id
long
no
-
Instructs this scan to look for use the given snapshot ID.
use_snapshot_timestamp
long
no
-
Instructs this scan to look for use the most recent snapshot as of the given time in milliseconds. timestamp β the timestamp in millis since the Unix epoch
stream_scan_strategy
enum
no
FROM_LATEST_SNAPSHOT
Starting strategy for stream mode execution, Default to use FROM_LATEST_SNAPSHOT
if donβt specify any value,The optional values are:
TABLE_SCAN_THEN_INCREMENTAL: Do a regular table scan then switch to the incremental mode.
FROM_LATEST_SNAPSHOT: Start incremental mode from the latest snapshot inclusive.
FROM_EARLIEST_SNAPSHOT: Start incremental mode from the earliest snapshot inclusive.
FROM_SNAPSHOT_ID: Start incremental mode from a snapshot with a specific id inclusive.
FROM_SNAPSHOT_TIMESTAMP: Start incremental mode from a snapshot with a specific timestamp inclusive.
common-options
no
-
Specify the properties for initializing the Iceberg catalog, which can be referenced in this file:"
Source plugin common parameters, please refer to for details.