Glue / Client / create_integration_table_properties
create_integration_table_properties¶
- Glue.Client.create_integration_table_properties(**kwargs)¶
This API is used to provide optional override properties for the the tables that need to be replicated. These properties can include properties for filtering and partitioning for the source and target tables. To set both source and target properties the same API need to be invoked with the Glue connection ARN as
ResourceArn
withSourceTableConfig
, and the Glue database ARN asResourceArn
withTargetTableConfig
respectively.See also: AWS API Documentation
Request Syntax
response = client.create_integration_table_properties( ResourceArn='string', TableName='string', SourceTableConfig={ 'Fields': [ 'string', ], 'FilterPredicate': 'string', 'PrimaryKey': [ 'string', ], 'RecordUpdateField': 'string' }, TargetTableConfig={ 'UnnestSpec': 'TOPLEVEL'|'FULL'|'NOUNNEST', 'PartitionSpec': [ { 'FieldName': 'string', 'FunctionSpec': 'string', 'ConversionSpec': 'string' }, ], 'TargetTableName': 'string' } )
- Parameters:
ResourceArn (string) –
[REQUIRED]
The Amazon Resource Name (ARN) of the target table for which to create integration table properties. Currently, this API only supports creating integration table properties for target tables, and the provided ARN should be the ARN of the target table in the Glue Data Catalog. Support for creating integration table properties for source connections (using the connection ARN) is not yet implemented and will be added in a future release.
TableName (string) –
[REQUIRED]
The name of the table to be replicated.
SourceTableConfig (dict) –
A structure for the source table configuration. See the
SourceTableConfig
structure to see list of supported source properties.Fields (list) –
A list of fields used for column-level filtering. Currently unsupported.
(string) –
FilterPredicate (string) –
A condition clause used for row-level filtering. Currently unsupported.
PrimaryKey (list) –
Provide the primary key set for this table. Currently supported specifically for SAP
EntityOf
entities upon request. Contact Amazon Web Services Support to make this feature available.(string) –
RecordUpdateField (string) –
Incremental pull timestamp-based field. Currently unsupported.
TargetTableConfig (dict) –
A structure for the target table configuration.
UnnestSpec (string) –
Specifies how nested objects are flattened to top-level elements. Valid values are: “TOPLEVEL”, “FULL”, or “NOUNNEST”.
PartitionSpec (list) –
Determines the file layout on the target.
(dict) –
A structure that describes how data is partitioned on the target.
FieldName (string) –
The field name used to partition data on the target. Avoid using columns that have unique values for each row (for example, LastModifiedTimestamp, SystemModTimeStamp) as the partition column. These columns are not suitable for partitioning because they create a large number of small partitions, which can lead to performance issues.
FunctionSpec (string) –
Specifies the function used to partition data on the target. The only accepted value for this parameter is ‘identity’ (string). The ‘identity’ function ensures that the data partitioning on the target follows the same scheme as the source. In other words, the partitioning structure of the source data is preserved in the target destination.
ConversionSpec (string) –
Specifies the timestamp format of the source data. Valid values are:
epoch_sec
- Unix epoch timestamp in secondsepoch_milli
- Unix epoch timestamp in millisecondsiso
- ISO 8601 formatted timestamp
Note
Only specify
ConversionSpec
when using timestamp-based partition functions (year, month, day, or hour). Glue Zero-ETL uses this parameter to correctly transform source data into timestamp format before partitioning.Do not use high-cardinality columns with the
identity
partition function. High-cardinality columns include:Primary keys
Timestamp fields (such as
LastModifiedTimestamp
,CreatedDate
)System-generated timestamps
Using high-cardinality columns with identity partitioning creates many small partitions, which can significantly degrade ingestion performance.
TargetTableName (string) –
The optional name of a target table.
- Return type:
dict
- Returns:
Response Syntax
{}
Response Structure
(dict) –
Exceptions