@Experimental(value=SOURCE_SINK) public class ReadAllViaFileBasedSource<T> extends PTransform<PCollection<FileIO.ReadableFile>,PCollection<T>>
Reads each file in the input PCollection of FileIO.ReadableFile using given parameters for splitting files into offset ranges and for creating a FileBasedSource for a file. The input PCollection must not contain directories.
To obtain the collection of FileIO.ReadableFile from a filepattern, use FileIO.readMatches().
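
The description above mentions splitting files into offset ranges. As a rough illustration of what desiredBundleSizeBytes controls, the following minimal sketch cuts a file of known size into consecutive byte ranges. It is not Beam's implementation: the class OffsetRangeSketch and the method splitIntoRanges are hypothetical names, and the sketch assumes half-open ranges [start, end) where the final range simply takes whatever bytes remain.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical illustration only; this class is not part of Apache Beam.
class OffsetRangeSketch {

  /**
   * Splits the byte range [0, fileSizeBytes) into consecutive half-open ranges
   * of at most desiredBundleSizeBytes each. Every element of the result is a
   * two-item array {startInclusive, endExclusive}.
   */
  static List<long[]> splitIntoRanges(long fileSizeBytes, long desiredBundleSizeBytes) {
    if (fileSizeBytes < 0) {
      throw new IllegalArgumentException("fileSizeBytes must be non-negative");
    }
    if (desiredBundleSizeBytes <= 0) {
      throw new IllegalArgumentException("desiredBundleSizeBytes must be positive");
    }
    List<long[]> ranges = new ArrayList<>();
    long start = 0;
    while (start < fileSizeBytes) {
      // Take a full bundle, or the remainder of the file if less is left.
      long end = start + Math.min(desiredBundleSizeBytes, fileSizeBytes - start);
      ranges.add(new long[] {start, end});
      start = end;
    }
    return ranges;
  }

  public static void main(String[] args) {
    // A 250-byte file with a desired bundle size of 100 bytes.
    for (long[] range : splitIntoRanges(250L, 100L)) {
      System.out.println("[" + range[0] + ", " + range[1] + ")");
    }
  }
}
```

In this sketch a smaller desiredBundleSizeBytes yields more, smaller ranges per file, which is the general trade-off the constructor parameter exposes; how Beam treats a short trailing range is not covered here.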
Modifier and Type | Class and Description |
---|---|
static class | ReadAllViaFileBasedSource.ReadFileRangesFnExceptionHandler: A class to handle errors which occur during file reads. |
Modifier and Type | Field and Description |
---|---|
protected static boolean | DEFAULT_USES_RESHUFFLE |

Fields inherited from class PTransform: name, resourceHints

Constructor and Description |
---|
ReadAllViaFileBasedSource(long desiredBundleSizeBytes, SerializableFunction<java.lang.String,? extends FileBasedSource<T>> createSource, Coder<T> coder) |
ReadAllViaFileBasedSource(long desiredBundleSizeBytes, SerializableFunction<java.lang.String,? extends FileBasedSource<T>> createSource, Coder<T> coder, boolean usesReshuffle, ReadAllViaFileBasedSource.ReadFileRangesFnExceptionHandler exceptionHandler) |
Modifier and Type | Method and Description |
---|---|
PCollection<T> | expand(PCollection<FileIO.ReadableFile> input): Override this method to specify how this PTransform should be expanded on the given InputT. |

Methods inherited from class PTransform: compose, compose, getAdditionalInputs, getDefaultOutputCoder, getDefaultOutputCoder, getDefaultOutputCoder, getKindString, getName, getResourceHints, populateDisplayData, setResourceHints, toString, validate, validate

protected static final boolean DEFAULT_USES_RESHUFFLE
public ReadAllViaFileBasedSource(long desiredBundleSizeBytes, SerializableFunction<java.lang.String,? extends FileBasedSource<T>> createSource, Coder<T> coder)
public ReadAllViaFileBasedSource(long desiredBundleSizeBytes, SerializableFunction<java.lang.String,? extends FileBasedSource<T>> createSource, Coder<T> coder, boolean usesReshuffle, ReadAllViaFileBasedSource.ReadFileRangesFnExceptionHandler exceptionHandler)
public PCollection<T> expand(PCollection<FileIO.ReadableFile> input)
Description copied from class: PTransform
Override this method to specify how this PTransform should be expanded on the given InputT.
NOTE: This method should not be called directly. Instead, the PTransform should be applied to the InputT using the apply method.
Composite transforms, which are defined in terms of other transforms, should return the output of one of the composed transforms. Non-composite transforms, which do not apply any transforms internally, should return a new unbound output and register evaluators (via backend-specific registration methods).
Specified by: expand in class PTransform<PCollection<FileIO.ReadableFile>,PCollection<T>>
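
The note on composite transforms can be pictured with a small stand-alone model. The sketch below does not use Beam: MiniTransform, MiniCollection, MapEach and TrimmedLength are hypothetical stand-ins invented for illustration. It assumes only what the text states, namely that callers go through apply rather than calling expand, and that a composite transform returns the output of one of the transforms it composes.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.function.Function;

// Hypothetical stand-ins for illustration; these are NOT Beam's PTransform or PCollection.
class ExpandSketch {

  /** Stand-in for a transform: subclasses describe their expansion. */
  abstract static class MiniTransform<InT, OutT> {
    abstract MiniCollection<OutT> expand(MiniCollection<InT> input);
  }

  /** Stand-in for a collection; apply() is the entry point callers use. */
  static final class MiniCollection<T> {
    final List<T> elements;

    MiniCollection(List<T> elements) {
      this.elements = elements;
    }

    <OutT> MiniCollection<OutT> apply(MiniTransform<T, OutT> transform) {
      // In this model apply() simply delegates to expand().
      return transform.expand(this);
    }
  }

  /** Non-composite transform: produces a new output by mapping each element. */
  static final class MapEach<InT, OutT> extends MiniTransform<InT, OutT> {
    private final Function<InT, OutT> fn;

    MapEach(Function<InT, OutT> fn) {
      this.fn = fn;
    }

    @Override
    MiniCollection<OutT> expand(MiniCollection<InT> input) {
      List<OutT> out = new ArrayList<>();
      for (InT element : input.elements) {
        out.add(fn.apply(element));
      }
      return new MiniCollection<>(out);
    }
  }

  /** Composite transform: returns the output of the last transform it applies. */
  static final class TrimmedLength extends MiniTransform<String, Integer> {
    @Override
    MiniCollection<Integer> expand(MiniCollection<String> input) {
      return input
          .apply(new MapEach<String, String>(String::trim))
          .apply(new MapEach<String, Integer>(String::length));
    }
  }
}
```

Under these assumptions, user code would write something like input.apply(new ExpandSketch.TrimmedLength()) and never call expand itself, mirroring the NOTE above.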