Class AvroIO.ParseFiles<T>
- All Implemented Interfaces:
Serializable
,HasDisplayData
- Enclosing class:
AvroIO
AvroIO.parseFilesGenericRecords(org.apache.beam.sdk.transforms.SerializableFunction<org.apache.avro.generic.GenericRecord, T>)
.- See Also:
-
Field Summary
Fields inherited from class org.apache.beam.sdk.transforms.PTransform
annotations, displayData, name, resourceHints
-
Constructor Summary
Constructors -
Method Summary
Modifier and TypeMethodDescriptionexpand
(PCollection<FileIO.ReadableFile> input) Override this method to specify how thisPTransform
should be expanded on the givenInputT
.void
populateDisplayData
(DisplayData.Builder builder) Register display data for the given transform or component.Specifies the coder for the result of theparseFn
.withDesiredBundleSizeBytes
(long desiredBundleSizeBytes) Set a value for the bundle size for parallel reads.withFileExceptionHandler
(ReadAllViaFileBasedSource.ReadFileRangesFnExceptionHandler exceptionHandler) Specifies if exceptions should be logged only for streaming pipelines.withUsesReshuffle
(boolean usesReshuffle) Specifies if a Reshuffle should run before file reads occur.Methods inherited from class org.apache.beam.sdk.transforms.PTransform
addAnnotation, compose, compose, getAdditionalInputs, getAnnotations, getDefaultOutputCoder, getDefaultOutputCoder, getDefaultOutputCoder, getKindString, getName, getResourceHints, setDisplayData, setResourceHints, toString, validate, validate
-
Constructor Details
-
ParseFiles
public ParseFiles()
-
-
Method Details
-
withCoder
Specifies the coder for the result of theparseFn
. -
withUsesReshuffle
Specifies if a Reshuffle should run before file reads occur. -
withFileExceptionHandler
public AvroIO.ParseFiles<T> withFileExceptionHandler(ReadAllViaFileBasedSource.ReadFileRangesFnExceptionHandler exceptionHandler) Specifies if exceptions should be logged only for streaming pipelines. -
withDesiredBundleSizeBytes
Set a value for the bundle size for parallel reads. Default is 64 MB. You may want to use a lower value (e.g. 1 MB) for streaming applications. -
expand
Description copied from class:PTransform
Override this method to specify how thisPTransform
should be expanded on the givenInputT
.NOTE: This method should not be called directly. Instead apply the
PTransform
should be applied to theInputT
using theapply
method.Composite transforms, which are defined in terms of other transforms, should return the output of one of the composed transforms. Non-composite transforms, which do not apply any transforms internally, should return a new unbound output and register evaluators (via backend-specific registration methods).
- Specified by:
expand
in classPTransform<PCollection<FileIO.ReadableFile>,
PCollection<T>>
-
populateDisplayData
Description copied from class:PTransform
Register display data for the given transform or component.populateDisplayData(DisplayData.Builder)
is invoked by Pipeline runners to collect display data viaDisplayData.from(HasDisplayData)
. Implementations may callsuper.populateDisplayData(builder)
in order to register display data in the current namespace, but should otherwise usesubcomponent.populateDisplayData(builder)
to use the namespace of the subcomponent.By default, does not register any display data. Implementors may override this method to provide their own display data.
- Specified by:
populateDisplayData
in interfaceHasDisplayData
- Overrides:
populateDisplayData
in classPTransform<PCollection<FileIO.ReadableFile>,
PCollection<T>> - Parameters:
builder
- The builder to populate with display data.- See Also:
-