public abstract static class TFRecordIO.Read extends PTransform<PBegin,PCollection<byte[]>>
Implementation of TFRecordIO.read().
Constructor and Description |
---|
Read() |
Modifier and Type | Method and Description |
---|---|
PCollection<byte[]> | expand(PBegin input): Applies this PTransform on the given InputT, and returns its Output. |
TFRecordIO.Read | from(java.lang.String filepattern): Returns a transform for reading TFRecord files that reads from the file(s) with the given filename or filename pattern. |
TFRecordIO.Read | from(ValueProvider<java.lang.String> filepattern): Same as from(filepattern), but accepting a ValueProvider. |
protected Coder<byte[]> | getDefaultOutputCoder(): Returns the default Coder to use for the output of this single-output PTransform. |
protected FileBasedSource<byte[]> | getSource() |
void | populateDisplayData(DisplayData.Builder builder): Registers display data for the given transform or component. |
TFRecordIO.Read | withCompressionType(TFRecordIO.CompressionType compressionType): Returns a transform for reading TFRecord files that decompresses all input files using the specified compression type. |
TFRecordIO.Read | withoutValidation(): Returns a transform for reading TFRecord files that has GCS path validation on pipeline creation disabled. |
Methods inherited from class PTransform: getAdditionalInputs, getDefaultOutputCoder, getDefaultOutputCoder, getKindString, getName, toString, validate
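public TFRecordIO.Read from(java.lang.String filepattern)
Returns a transform for reading TFRecord files that reads from the file(s) with the given filename or filename pattern. This can be a local path (if running locally), or a Google Cloud Storage filename or filename pattern of the form "gs://<bucket>/<filepath>" (if running locally or using remote execution). Standard Java Filesystem glob patterns ("*", "?", "[..]") are supported.
public TFRecordIO.Read from(ValueProvider<java.lang.String> filepattern)
Same as from(filepattern), but accepting a ValueProvider.
public TFRecordIO.Read withoutValidation()
Returns a transform for reading TFRecord files that has GCS path validation on pipeline creation disabled. This can be useful when the GCS input does not exist at pipeline creation time but is expected to be available at execution time.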
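To illustrate the glob syntax that from(filepattern) accepts ("*", "?", "[..]"), the sketch below checks a few file names against patterns using the JDK's own glob matcher. It is only an approximation: the class GlobPatternSketch, the helper matches, and the file names are hypothetical, and the way the pipeline runner expands patterns (especially for "gs://" paths) may differ in detail from java.nio.

```java
import java.nio.file.FileSystems;
import java.nio.file.PathMatcher;
import java.nio.file.Paths;

final class GlobPatternSketch {
    // Hypothetical helper, not part of TFRecordIO: tests a bare file name
    // against a glob using the default filesystem's "glob:" syntax.
    static boolean matches(String glob, String fileName) {
        PathMatcher matcher = FileSystems.getDefault().getPathMatcher("glob:" + glob);
        return matcher.matches(Paths.get(fileName));
    }

    public static void main(String[] args) {
        // "*" matches any run of characters, "?" exactly one character,
        // and "[0-4]" one character from the given range.
        String[] globs = {"part-*.tfrecord", "part-0000?.tfrecord", "part-0000[0-4].tfrecord"};
        String[] names = {"part-00001.tfrecord", "part-00007.tfrecord", "part-00012.tfrecord"};
        for (String glob : globs) {
            for (String name : names) {
                System.out.println(glob + " vs " + name + ": " + matches(glob, name));
            }
        }
    }
}
```

Trying candidate patterns this way against a handful of expected file names can catch an overly narrow pattern before the pipeline is constructed.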
public TFRecordIO.Read withCompressionType(TFRecordIO.CompressionType compressionType)
Returns a transform for reading TFRecord files that decompresses all input files using the specified compression type. If no compression type is specified, the default is TFRecordIO.CompressionType.AUTO. In this mode, the compression type of the file is determined by its extension (e.g., *.gz is gzipped, *.zlib is zlib compressed, and all other extensions are uncompressed).
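The extension rule described for AUTO mode can be sketched in plain Java as follows. This is not the library's implementation: the class CompressionByExtensionSketch, the enum Mode, and the method detect are hypothetical names, and the case-sensitive suffix check is an assumption.

```java
final class CompressionByExtensionSketch {
    // Hypothetical stand-in for the concrete compression modes; the real type
    // is TFRecordIO.CompressionType, whose constants are not reproduced here.
    enum Mode { NONE, GZIP, ZLIB }

    // Maps a file name to a mode following the documented rule:
    // *.gz is gzipped, *.zlib is zlib compressed, anything else is uncompressed.
    static Mode detect(String fileName) {
        if (fileName.endsWith(".gz")) {
            return Mode.GZIP;
        }
        if (fileName.endsWith(".zlib")) {
            return Mode.ZLIB;
        }
        return Mode.NONE;
    }

    public static void main(String[] args) {
        String[] names = {"events.tfrecord.gz", "events.tfrecord.zlib", "events.tfrecord"};
        for (String name : names) {
            System.out.println(name + " -> " + detect(name));
        }
    }
}
```

Under this rule, files whose names do not carry the expected extension would be read as uncompressed, which is the case where passing an explicit type to withCompressionType is useful.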
public PCollection<byte[]> expand(PBegin input)
Description copied from class: PTransform
Applies this PTransform on the given InputT, and returns its Output.
Composite transforms, which are defined in terms of other transforms, should return the output of one of the composed transforms. Non-composite transforms, which do not apply any transforms internally, should return a new unbound output and register evaluators (via backend-specific registration methods).
Specified by: expand in class PTransform<PBegin,PCollection<byte[]>>
protected FileBasedSource<byte[]> getSource()
public void populateDisplayData(DisplayData.Builder builder)
Description copied from class: PTransform
Registers display data for the given transform or component. populateDisplayData(DisplayData.Builder) is invoked by Pipeline runners to collect display data via DisplayData.from(HasDisplayData). Implementations may call super.populateDisplayData(builder) in order to register display data in the current namespace, but should otherwise use subcomponent.populateDisplayData(builder) to use the namespace of the subcomponent.
By default, does not register any display data. Implementors may override this method to provide their own display data.
Specified by: populateDisplayData in interface HasDisplayData
Overrides: populateDisplayData in class PTransform<PBegin,PCollection<byte[]>>
Parameters: builder - The builder to populate with display data.
See Also: HasDisplayData
protected Coder<byte[]> getDefaultOutputCoder()
Description copied from class: PTransform
Returns the default Coder to use for the output of this single-output PTransform.
By default, always throws
Overrides: getDefaultOutputCoder in class PTransform<PBegin,PCollection<byte[]>>