public abstract static class ParquetIO.Read extends PTransform<PBegin,PCollection<GenericRecord>>
ParquetIO.read(Schema)
.name, resourceHints
Constructor and Description |
---|
Read() |
Modifier and Type | Method and Description |
---|---|
PCollection<GenericRecord> |
expand(PBegin input)
Override this method to specify how this
PTransform should be expanded on the given
InputT . |
ParquetIO.Read |
from(java.lang.String filepattern)
Like
from(ValueProvider) . |
ParquetIO.Read |
from(ValueProvider<java.lang.String> filepattern)
Reads from the given filename or filepattern.
|
void |
populateDisplayData(DisplayData.Builder builder)
Register display data for the given transform or component.
|
ParquetIO.Read |
withAvroDataModel(GenericData model)
Define the Avro data model; see
AvroParquetReader.Builder#withDataModel(GenericData) . |
ParquetIO.Read |
withBeamSchemas(boolean inferBeamSchema) |
ParquetIO.Read |
withConfiguration(Configuration configuration)
Specify Hadoop configuration for ParquetReader.
|
ParquetIO.Read |
withConfiguration(java.util.Map<java.lang.String,java.lang.String> configuration)
Specify Hadoop configuration for ParquetReader.
|
ParquetIO.Read |
withoutSplit()
Deprecated.
This method may currently be used to opt-out of the default, splittable,
behavior. However, this will be removed in a future release assuming no issues are
discovered.
|
ParquetIO.Read |
withProjection(Schema projectionSchema,
Schema encoderSchema)
Enable the reading with projection.
|
ParquetIO.Read |
withSplit()
Deprecated.
as of version 2.35.0. Splittable reading is enabled by default.
|
compose, compose, getAdditionalInputs, getDefaultOutputCoder, getDefaultOutputCoder, getDefaultOutputCoder, getKindString, getName, getResourceHints, setResourceHints, toString, validate, validate
public ParquetIO.Read from(ValueProvider<java.lang.String> filepattern)
public ParquetIO.Read from(java.lang.String filepattern)
from(ValueProvider)
.public ParquetIO.Read withProjection(Schema projectionSchema, Schema encoderSchema)
public ParquetIO.Read withConfiguration(java.util.Map<java.lang.String,java.lang.String> configuration)
public ParquetIO.Read withConfiguration(Configuration configuration)
@Experimental(value=SCHEMAS) public ParquetIO.Read withBeamSchemas(boolean inferBeamSchema)
@Deprecated public ParquetIO.Read withSplit()
@Deprecated public ParquetIO.Read withoutSplit()
public ParquetIO.Read withAvroDataModel(GenericData model)
AvroParquetReader.Builder#withDataModel(GenericData)
.public PCollection<GenericRecord> expand(PBegin input)
PTransform
PTransform
should be expanded on the given
InputT
.
NOTE: This method should not be called directly. Instead apply the PTransform
should
be applied to the InputT
using the apply
method.
Composite transforms, which are defined in terms of other transforms, should return the output of one of the composed transforms. Non-composite transforms, which do not apply any transforms internally, should return a new unbound output and register evaluators (via backend-specific registration methods).
expand
in class PTransform<PBegin,PCollection<GenericRecord>>
public void populateDisplayData(DisplayData.Builder builder)
PTransform
populateDisplayData(DisplayData.Builder)
is invoked by Pipeline runners to collect
display data via DisplayData.from(HasDisplayData)
. Implementations may call super.populateDisplayData(builder)
in order to register display data in the current namespace,
but should otherwise use subcomponent.populateDisplayData(builder)
to use the namespace
of the subcomponent.
By default, does not register any display data. Implementors may override this method to provide their own display data.
populateDisplayData
in interface HasDisplayData
populateDisplayData
in class PTransform<PBegin,PCollection<GenericRecord>>
builder
- The builder to populate with display data.HasDisplayData