Package org.apache.beam.sdk.io.fileschematransform
Defines transforms for File reading and writing support with Schema Transform.
Class Description
A FileWriteSchemaTransformFormatProvider for Avro format.
A FileWriteSchemaTransformFormatProvider for CSV format.
Interface that provides a PTransform that reads in a PCollection of FileIO.ReadableFiles and outputs the data represented as a PCollection of Rows.
The configuration for building file writing transforms using SchemaTransform and SchemaTransformProvider.
Configures extra details related to writing CSV formatted files.
Configures extra details related to writing Parquet formatted files.
Configures extra details related to writing XML formatted files.
Provides a PTransform that writes a PCollection of Rows and outputs a PCollection of the file names according to a registered AutoService FileWriteSchemaTransformFormatProvider implementation.
FileWriteSchemaTransformFormatProviders contains FileWriteSchemaTransformFormatProvider implementations.
A TypedSchemaTransformProvider implementation for writing a Row PCollection to file systems, driven by a FileWriteSchemaTransformConfiguration.
A FileReadSchemaTransformFormatProvider that reads newline-delimited JSONs.
A FileWriteSchemaTransformFormatProvider for JSON format.
A FileReadSchemaTransformFormatProvider that reads lines as Strings.
A FileWriteSchemaTransformFormatProvider for Parquet format.
A FileWriteSchemaTransformFormatProvider for XML format.
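
The descriptions above suggest a pattern in which each file format contributes its own provider and the write transform picks one by format name. The following is a minimal plain-Java sketch of that lookup idea only. It does not use the Beam API: WriteFormatProvider, CsvProvider, TsvProvider, loadProviders, and write are hypothetical names invented for illustration, rows are simplified to lists of strings, and the "files" are in-memory lines.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

public class FormatRegistrySketch {

  /** Hypothetical stand-in for a per-format write provider; NOT the Beam interface. */
  interface WriteFormatProvider {
    /** Format name used to look the provider up, e.g. "csv". */
    String identifier();

    /** Turns rows into formatted output lines (in memory, for illustration). */
    List<String> format(List<List<String>> rows);
  }

  /** Joins each row's fields with commas. No quoting or escaping is attempted. */
  static final class CsvProvider implements WriteFormatProvider {
    @Override
    public String identifier() {
      return "csv";
    }

    @Override
    public List<String> format(List<List<String>> rows) {
      List<String> out = new ArrayList<>();
      for (List<String> row : rows) {
        out.add(String.join(",", row));
      }
      return out;
    }
  }

  /** Joins each row's fields with tabs. */
  static final class TsvProvider implements WriteFormatProvider {
    @Override
    public String identifier() {
      return "tsv";
    }

    @Override
    public List<String> format(List<List<String>> rows) {
      List<String> out = new ArrayList<>();
      for (List<String> row : rows) {
        out.add(String.join("\t", row));
      }
      return out;
    }
  }

  /** Indexes providers by identifier and rejects duplicate registrations. */
  static Map<String, WriteFormatProvider> loadProviders(List<WriteFormatProvider> providers) {
    Map<String, WriteFormatProvider> byId = new HashMap<>();
    for (WriteFormatProvider provider : providers) {
      if (byId.put(provider.identifier(), provider) != null) {
        throw new IllegalStateException("duplicate format: " + provider.identifier());
      }
    }
    return byId;
  }

  /** Looks up the provider for the requested format and applies it to the rows. */
  static List<String> write(
      Map<String, WriteFormatProvider> registry, String format, List<List<String>> rows) {
    WriteFormatProvider provider = registry.get(format);
    if (provider == null) {
      throw new IllegalArgumentException("unsupported format: " + format);
    }
    return provider.format(rows);
  }

  public static void main(String[] args) {
    Map<String, WriteFormatProvider> registry =
        loadProviders(List.of(new CsvProvider(), new TsvProvider()));
    List<List<String>> rows = List.of(List.of("a", "1"), List.of("b", "2"));
    System.out.println(write(registry, "csv", rows));
  }
}
```

In this sketch the providers are passed in explicitly. A service-discovery mechanism such as java.util.ServiceLoader could fill the same list, which is the kind of registration the AutoService mention above points to, though the sketch does not model it.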