public abstract static class IcebergIO.WriteRows extends PTransform<PCollection<Row>,IcebergWriteResult>
annotations, displayData, name, resourceHints
Constructor and Description |
---|
WriteRows() |
Modifier and Type | Method and Description |
---|---|
IcebergWriteResult |
expand(PCollection<Row> input)
Override this method to specify how this
PTransform should be expanded on the given
InputT . |
IcebergIO.WriteRows |
to(DynamicDestinations destinations) |
IcebergIO.WriteRows |
to(org.apache.iceberg.catalog.TableIdentifier identifier) |
IcebergIO.WriteRows |
withTriggeringFrequency(Duration triggeringFrequency)
Sets the frequency at which data is written to files and a new
Snapshot is produced. |
addAnnotation, compose, compose, getAdditionalInputs, getAnnotations, getDefaultOutputCoder, getDefaultOutputCoder, getDefaultOutputCoder, getKindString, getName, getResourceHints, populateDisplayData, setDisplayData, setResourceHints, toString, validate, validate
public IcebergIO.WriteRows to(org.apache.iceberg.catalog.TableIdentifier identifier)
public IcebergIO.WriteRows to(DynamicDestinations destinations)
public IcebergIO.WriteRows withTriggeringFrequency(Duration triggeringFrequency)
Snapshot
is produced.
Roughly every triggeringFrequency duration, records are written to data files and appended to the respective table. Each append operation created a new table snapshot.
Generally speaking, increasing this duration will result in fewer, larger data files and fewer snapshots.
This is only applicable when writing an unbounded PCollection
(i.e. a streaming
pipeline).
public IcebergWriteResult expand(PCollection<Row> input)
PTransform
PTransform
should be expanded on the given
InputT
.
NOTE: This method should not be called directly. Instead apply the PTransform
should
be applied to the InputT
using the apply
method.
Composite transforms, which are defined in terms of other transforms, should return the output of one of the composed transforms. Non-composite transforms, which do not apply any transforms internally, should return a new unbound output and register evaluators (via backend-specific registration methods).
expand
in class PTransform<PCollection<Row>,IcebergWriteResult>