Class AvroIO.ParseFiles<T>

java.lang.Object
org.apache.beam.sdk.transforms.PTransform<PCollection<FileIO.ReadableFile>,PCollection<T>>
org.apache.beam.sdk.extensions.avro.io.AvroIO.ParseFiles<T>
All Implemented Interfaces:
Serializable, HasDisplayData
Enclosing class:
AvroIO

public abstract static class AvroIO.ParseFiles<T> extends PTransform<PCollection<FileIO.ReadableFile>,PCollection<T>>
See Also:
  • Constructor Details

    • ParseFiles

      public ParseFiles()
  • Method Details

    • withCoder

      public AvroIO.ParseFiles<T> withCoder(Coder<T> coder)
      Specifies the coder for the result of the parseFn.
    • withUsesReshuffle

      public AvroIO.ParseFiles<T> withUsesReshuffle(boolean usesReshuffle)
      Specifies if a Reshuffle should run before file reads occur.
    • withFileExceptionHandler

      public AvroIO.ParseFiles<T> withFileExceptionHandler(ReadAllViaFileBasedSource.ReadFileRangesFnExceptionHandler exceptionHandler)
      Specifies if exceptions should be logged only for streaming pipelines.
    • withDesiredBundleSizeBytes

      public AvroIO.ParseFiles<T> withDesiredBundleSizeBytes(long desiredBundleSizeBytes)
      Set a value for the bundle size for parallel reads. Default is 64 MB. You may want to use a lower value (e.g. 1 MB) for streaming applications.
    • expand

      public PCollection<T> expand(PCollection<FileIO.ReadableFile> input)
      Description copied from class: PTransform
      Override this method to specify how this PTransform should be expanded on the given InputT.

      NOTE: This method should not be called directly. Instead apply the PTransform should be applied to the InputT using the apply method.

      Composite transforms, which are defined in terms of other transforms, should return the output of one of the composed transforms. Non-composite transforms, which do not apply any transforms internally, should return a new unbound output and register evaluators (via backend-specific registration methods).

      Specified by:
      expand in class PTransform<PCollection<FileIO.ReadableFile>,PCollection<T>>
    • populateDisplayData

      public void populateDisplayData(DisplayData.Builder builder)
      Description copied from class: PTransform
      Register display data for the given transform or component.

      populateDisplayData(DisplayData.Builder) is invoked by Pipeline runners to collect display data via DisplayData.from(HasDisplayData). Implementations may call super.populateDisplayData(builder) in order to register display data in the current namespace, but should otherwise use subcomponent.populateDisplayData(builder) to use the namespace of the subcomponent.

      By default, does not register any display data. Implementors may override this method to provide their own display data.

      Specified by:
      populateDisplayData in interface HasDisplayData
      Overrides:
      populateDisplayData in class PTransform<PCollection<FileIO.ReadableFile>,PCollection<T>>
      Parameters:
      builder - The builder to populate with display data.
      See Also: