@aws-cdk_aws-glue-alpha.DataFormat

class DataFormat ๐Ÿ”น

LanguageType name
.NETAmazon.CDK.AWS.Glue.Alpha.DataFormat
Gogithub.com/aws/aws-cdk-go/awscdkgluealpha/v2#DataFormat
Javasoftware.amazon.awscdk.services.glue.alpha.DataFormat
Pythonaws_cdk.aws_glue_alpha.DataFormat
TypeScript (source)@aws-cdk/aws-glue-alpha ยป DataFormat

Defines the input/output formats and ser/de for a single DataFormat.

Example

declare const myDatabase: glue.Database;
new glue.Table(this, 'MyTable', {
  database: myDatabase,
  columns: [{
    name: 'col1',
    type: glue.Schema.STRING,
  }],
  partitionKeys: [{
    name: 'year',
    type: glue.Schema.SMALL_INT,
  }, {
    name: 'month',
    type: glue.Schema.SMALL_INT,
  }],
  dataFormat: glue.DataFormat.JSON,
});

Initializer

new DataFormat(props: DataFormatProps)

Parameters

  • props DataFormatProps

Properties

NameTypeDescription
inputFormat๐Ÿ”นInputFormatInputFormat for this data format.
outputFormat๐Ÿ”นOutputFormatOutputFormat for this data format.
serializationLibrary๐Ÿ”นSerializationLibrarySerialization library for this data format.
classificationString?๐Ÿ”นClassificationStringClassification string given to tables with this data format.
static APACHE_LOGS๐Ÿ”นDataFormatDataFormat for Apache Web Server Logs.
static AVRO๐Ÿ”นDataFormatDataFormat for Apache Avro.
static CLOUDTRAIL_LOGS๐Ÿ”นDataFormatDataFormat for CloudTrail logs stored on S3.
static CSV๐Ÿ”นDataFormatDataFormat for CSV Files.
static JSON๐Ÿ”นDataFormatStored as plain text files in JSON format.
static LOGSTASH๐Ÿ”นDataFormatDataFormat for Logstash Logs, using the GROK SerDe.
static ORC๐Ÿ”นDataFormatDataFormat for Apache ORC (Optimized Row Columnar).
static PARQUET๐Ÿ”นDataFormatDataFormat for Apache Parquet.
static TSV๐Ÿ”นDataFormatDataFormat for TSV (Tab-Separated Values).

inputFormat๐Ÿ”น

Type: InputFormat

InputFormat for this data format.


outputFormat๐Ÿ”น

Type: OutputFormat

OutputFormat for this data format.


serializationLibrary๐Ÿ”น

Type: SerializationLibrary

Serialization library for this data format.


classificationString?๐Ÿ”น

Type: ClassificationString (optional)

Classification string given to tables with this data format.


static APACHE_LOGS๐Ÿ”น

Type: DataFormat

DataFormat for Apache Web Server Logs.

Also works for CloudFront logs

See also: https://docs.aws.amazon.com/athena/latest/ug/apache.html


static AVRO๐Ÿ”น

Type: DataFormat

DataFormat for Apache Avro.

See also: https://docs.aws.amazon.com/athena/latest/ug/avro.html


static CLOUDTRAIL_LOGS๐Ÿ”น

Type: DataFormat

DataFormat for CloudTrail logs stored on S3.

See also: https://docs.aws.amazon.com/athena/latest/ug/cloudtrail.html


static CSV๐Ÿ”น

Type: DataFormat

DataFormat for CSV Files.

See also: https://docs.aws.amazon.com/athena/latest/ug/csv.html


static JSON๐Ÿ”น

Type: DataFormat

Stored as plain text files in JSON format.

Uses OpenX Json SerDe for serialization and deseralization.

See also: https://docs.aws.amazon.com/athena/latest/ug/json.html


static LOGSTASH๐Ÿ”น

Type: DataFormat

DataFormat for Logstash Logs, using the GROK SerDe.

See also: https://docs.aws.amazon.com/athena/latest/ug/grok.html


static ORC๐Ÿ”น

Type: DataFormat

DataFormat for Apache ORC (Optimized Row Columnar).

See also: https://docs.aws.amazon.com/athena/latest/ug/orc.html


static PARQUET๐Ÿ”น

Type: DataFormat

DataFormat for Apache Parquet.

See also: https://docs.aws.amazon.com/athena/latest/ug/parquet.html


static TSV๐Ÿ”น

Type: DataFormat

DataFormat for TSV (Tab-Separated Values).

See also: https://docs.aws.amazon.com/athena/latest/ug/lazy-simple-serde.html