public class XmlSource<T> extends FileBasedSource<T>
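Implementation of XmlIO.read().

Nested classes/interfaces inherited from class FileBasedSource: FileBasedSource.FileBasedReader<T>, FileBasedSource.Mode
Nested classes/interfaces inherited from class OffsetBasedSource: OffsetBasedSource.OffsetBasedReader<T>
Nested classes/interfaces inherited from class BoundedSource: BoundedSource.BoundedReader<T>
Nested classes/interfaces inherited from class Source: Source.Reader<T>

| Modifier and Type | Method and Description |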
|---|---|
| protected FileBasedSource<T> | createForSubrangeOfFile(MatchResult.Metadata metadata, long start, long end): Creates and returns a new FileBasedSource of the same type as the current FileBasedSource backed by a given file and an offset range. |
| protected FileBasedSource.FileBasedReader<T> | createSingleFileReader(PipelineOptions options): Creates and returns an instance of a FileBasedReader implementation for the current source assuming the source represents a single file. |
| Coder<T> | getOutputCoder(): Returns the Coder to use for the data read from this source. |

Methods inherited from class FileBasedSource: createReader, createSourceForSubrange, getEmptyMatchTreatment, getEstimatedSizeBytes, getFileOrPatternSpec, getFileOrPatternSpecProvider, getMaxEndOffset, getMode, getSingleFileMetadata, isSplittable, populateDisplayData, split, toString, validate
Methods inherited from class OffsetBasedSource: getBytesPerOffset, getEndOffset, getMinBundleSize, getStartOffset
Methods inherited from class Source: getDefaultOutputCoder

protected FileBasedSource<T> createForSubrangeOfFile(MatchResult.Metadata metadata, long start, long end)

Description copied from class: FileBasedSource
Creates and returns a new FileBasedSource of the same type as the current FileBasedSource backed by a given file and an offset range. When the current source is being split, this method is used to generate new sub-sources. When creating the source, subclasses must call the constructor FileBasedSource(Metadata, long, long, long) of FileBasedSource with the corresponding parameter values passed here.

Specified by: createForSubrangeOfFile in class FileBasedSource<T>
Parameters:
metadata - file backing the new FileBasedSource.
start - starting byte offset of the new FileBasedSource.
end - ending byte offset of the new FileBasedSource. May be Long.MAX_VALUE, in which case it will be inferred using FileBasedSource.getMaxEndOffset(org.apache.beam.sdk.options.PipelineOptions).

protected FileBasedSource.FileBasedReader<T> createSingleFileReader(PipelineOptions options)

Description copied from class: FileBasedSource
Creates and returns an instance of a FileBasedReader implementation for the current source assuming the source represents a single file. File patterns will be handled by the FileBasedSource implementation automatically.

Specified by: createSingleFileReader in class FileBasedSource<T>
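
The sub-range contract described for createForSubrangeOfFile (same source type, same backing file, new start and end offsets, with Long.MAX_VALUE standing for "up to the end of the file") can be illustrated with a minimal plain-Java sketch. It does not use the Beam SDK: the class SubrangeSketch, its fields, and the methods createForSubrange, resolvedEnd, and split are hypothetical stand-ins invented for this illustration, and the fixed-size splitting loop is an assumption, not Beam's actual splitting logic.

```java
import java.util.ArrayList;
import java.util.List;

/** Hypothetical stand-in for a file-backed source limited to a byte range. Not a Beam class. */
final class SubrangeSketch {
    final String fileName;     // plays the role of MatchResult.Metadata (assumption)
    final long fileSizeBytes;  // size of the backing file
    final long start;          // starting byte offset, inclusive
    final long end;            // ending byte offset, exclusive; may be Long.MAX_VALUE

    SubrangeSketch(String fileName, long fileSizeBytes, long start, long end) {
        if (start < 0 || start > end) {
            throw new IllegalArgumentException("invalid range: " + start + ".." + end);
        }
        this.fileName = fileName;
        this.fileSizeBytes = fileSizeBytes;
        this.start = start;
        this.end = end;
    }

    /** Mirrors the role of createForSubrangeOfFile: same type, same file, new offsets. */
    SubrangeSketch createForSubrange(long newStart, long newEnd) {
        return new SubrangeSketch(fileName, fileSizeBytes, newStart, newEnd);
    }

    /** Caps the end offset at the file size, which also resolves Long.MAX_VALUE. */
    long resolvedEnd() {
        return Math.min(end, fileSizeBytes);
    }

    /** Splits this range into consecutive sub-sources of at most bundleSizeBytes each. */
    List<SubrangeSketch> split(long bundleSizeBytes) {
        if (bundleSizeBytes <= 0) {
            throw new IllegalArgumentException("bundle size must be positive");
        }
        List<SubrangeSketch> parts = new ArrayList<>();
        long stop = resolvedEnd();
        long s = start;
        while (s < stop) {
            // Compare against the remaining length to avoid overflow in s + bundleSizeBytes.
            long e = (stop - s <= bundleSizeBytes) ? stop : s + bundleSizeBytes;
            parts.add(createForSubrange(s, e));
            s = e;
        }
        return parts;
    }

    public static void main(String[] args) {
        SubrangeSketch whole = new SubrangeSketch("records.xml", 250L, 0L, Long.MAX_VALUE);
        for (SubrangeSketch part : whole.split(100L)) {
            System.out.println(part.start + ".." + part.end);
        }
    }
}
```

Run as written, main prints the ranges 0..100, 100..200, and 200..250 for a 250-byte file. A real XML source would also have to align reads to record boundaries within each range, which this sketch leaves out.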