Type Parameters:
K - Type of keys to be read.
V - Type of values to be read.

public static class HadoopInputFormatIO.HadoopInputFormatBoundedSource<K,V> extends BoundedSource<KV<K,V>> implements java.io.Serializable

Enclosing class: HadoopInputFormatIO
Nested classes inherited from class BoundedSource: BoundedSource.BoundedReader<T>
Nested classes inherited from interface/class Source: Source.Reader<T>

| Modifier | Constructor and Description |
|---|---|
| protected | HadoopInputFormatBoundedSource(SerializableConfiguration conf, Coder<K> keyCoder, Coder<V> valueCoder, SimpleFunction<?,K> keyTranslationFunction, SimpleFunction<?,V> valueTranslationFunction, HadoopInputFormatIO.SerializableSplit inputSplit) |

| Modifier and Type | Method and Description |
|---|---|
| protected void | createInputFormatInstance() Creates instance of InputFormat class. |
| BoundedSource.BoundedReader<KV<K,V>> | createReader(PipelineOptions options) Returns a new BoundedSource.BoundedReader that reads from this source. |
| SerializableConfiguration | getConfiguration() |
| long | getEstimatedSizeBytes(PipelineOptions po) An estimate of the total size (in bytes) of the data that would be read from this source. |
| Coder<KV<K,V>> | getOutputCoder() Returns the Coder to use for the data read from this source. |
| void | populateDisplayData(DisplayData.Builder builder) Register display data for the given transform or component. |
| java.util.List<BoundedSource<KV<K,V>>> | split(long desiredBundleSizeBytes, PipelineOptions options) Splits the source into bundles of approximately desiredBundleSizeBytes. |
| void | validate() Checks that this source is valid, before it can be used in a pipeline. |

Methods inherited from class Source: getDefaultOutputCoder

protected HadoopInputFormatBoundedSource(SerializableConfiguration conf, Coder<K> keyCoder, Coder<V> valueCoder, @Nullable SimpleFunction<?,K> keyTranslationFunction, @Nullable SimpleFunction<?,V> valueTranslationFunction, HadoopInputFormatIO.SerializableSplit inputSplit)

public SerializableConfiguration getConfiguration()

public void validate()
Description copied from class: Source
Checks that this source is valid, before it can be used in a pipeline. It is recommended to use Preconditions for implementing this method.
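
The recommendation above can be illustrated with a minimal sketch. It is not the validate() implementation of this class: SketchSource, its placeholder field types, and the local checkNotNull helper are hypothetical stand-ins written in plain Java so the example has no external dependencies. A real source would typically use a Preconditions utility from a library instead of the local helper.

```java
import java.util.Collections;
import java.util.Map;

public class ValidateSketch {

    // Local stand-in for a Preconditions-style null check (hypothetical helper).
    static <T> T checkNotNull(T reference, String name) {
        if (reference == null) {
            throw new NullPointerException(name + " must not be null");
        }
        return reference;
    }

    // Hypothetical, simplified source. The field names mirror the constructor
    // parameters above, but the types are plain Java placeholders.
    static final class SketchSource {
        private final Map<String, String> conf;
        private final Object keyCoder;
        private final Object valueCoder;

        SketchSource(Map<String, String> conf, Object keyCoder, Object valueCoder) {
            this.conf = conf;
            this.keyCoder = keyCoder;
            this.valueCoder = valueCoder;
        }

        // Fails fast, before the source is used, if a required field is missing.
        void validate() {
            checkNotNull(conf, "conf");
            checkNotNull(keyCoder, "keyCoder");
            checkNotNull(valueCoder, "valueCoder");
        }
    }

    public static void main(String[] args) {
        // A fully populated source passes validation silently.
        new SketchSource(Collections.singletonMap("example.key", "example.value"),
                "keyCoder", "valueCoder").validate();

        // A source with a missing configuration is rejected.
        try {
            new SketchSource(null, "keyCoder", "valueCoder").validate();
        } catch (NullPointerException e) {
            System.out.println("rejected: " + e.getMessage());
        }
    }
}
```

Checking each field separately keeps the failure message specific, so a misconfigured source is reported by field name rather than failing later during a read.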

public void populateDisplayData(DisplayData.Builder builder)
Description copied from class: Source
Register display data for the given transform or component.

populateDisplayData(DisplayData.Builder) is invoked by Pipeline runners to collect display data via DisplayData.from(HasDisplayData). Implementations may call super.populateDisplayData(builder) in order to register display data in the current namespace, but should otherwise use subcomponent.populateDisplayData(builder) to use the namespace of the subcomponent.

By default, does not register any display data. Implementors may override this method to provide their own display data.

Specified by: populateDisplayData in interface HasDisplayData
Overrides: populateDisplayData in class Source<KV<K,V>>
Parameters: builder - The builder to populate with display data.
See Also: HasDisplayData

public java.util.List<BoundedSource<KV<K,V>>> split(long desiredBundleSizeBytes, PipelineOptions options) throws java.lang.Exception
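Description copied from class: BoundedSource
Splits the source into bundles of approximately desiredBundleSizeBytes.
Specified by: split in class BoundedSource<KV<K,V>>
Throws: java.lang.Exception

public long getEstimatedSizeBytes(PipelineOptions po) throws java.lang.Exception
Description copied from class: BoundedSource
An estimate of the total size (in bytes) of the data that would be read from this source. If there is no way to estimate the size of the source implementations MAY return 0L.
Specified by: getEstimatedSizeBytes in class BoundedSource<KV<K,V>>
Throws: java.lang.Exception

protected void createInputFormatInstance() throws java.io.IOException
Creates instance of InputFormat class.
Throws: java.io.IOException

public Coder<KV<K,V>> getOutputCoder()
Description copied from class: Source
Returns the Coder to use for the data read from this source.
Overrides: getOutputCoder in class Source<KV<K,V>>

public BoundedSource.BoundedReader<KV<K,V>> createReader(PipelineOptions options) throws java.io.IOException
Description copied from class: BoundedSource
Returns a new BoundedSource.BoundedReader that reads from this source.
Specified by: createReader in class BoundedSource<KV<K,V>>
Throws: java.io.IOException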
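
The contracts of getEstimatedSizeBytes and split described above can be sketched in plain Java. This is an illustration of the general idea only and is not taken from the implementation of this class: SplitSketch, estimateSizeBytes, and bundle are hypothetical names, split lengths are modelled as plain long values, and a negative length is assumed to mean "unknown". The greedy grouping is one possible strategy for producing bundles of approximately the desired size.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

public class SplitSketch {

    // Sums the split lengths. Returns 0L when any length is unknown (negative),
    // mirroring the "MAY return 0L" allowance when no estimate is possible.
    static long estimateSizeBytes(List<Long> splitLengths) {
        long total = 0L;
        for (long length : splitLengths) {
            if (length < 0) {
                return 0L;
            }
            total += length;
        }
        return total;
    }

    // Greedily groups consecutive splits so that each bundle stays at or below
    // desiredBundleSizeBytes where possible. A single split larger than the
    // desired size still forms its own bundle.
    static List<List<Long>> bundle(List<Long> splitLengths, long desiredBundleSizeBytes) {
        List<List<Long>> bundles = new ArrayList<>();
        List<Long> current = new ArrayList<>();
        long currentSize = 0L;
        for (long length : splitLengths) {
            if (!current.isEmpty() && currentSize + length > desiredBundleSizeBytes) {
                bundles.add(current);
                current = new ArrayList<>();
                currentSize = 0L;
            }
            current.add(length);
            currentSize += length;
        }
        if (!current.isEmpty()) {
            bundles.add(current);
        }
        return bundles;
    }

    public static void main(String[] args) {
        List<Long> lengths = Arrays.asList(40L, 40L, 40L, 100L, 10L);
        System.out.println("estimated bytes: " + estimateSizeBytes(lengths));
        System.out.println("bundles: " + bundle(lengths, 100L));
    }
}
```

The desired bundle size is treated as a target rather than a hard limit, which matches the word "approximately" in the split description: an input split cannot be subdivided in this sketch, so an oversized split is emitted as a bundle of its own.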