public abstract static class IcebergIO.WriteRows extends PTransform<PCollection<Row>,IcebergWriteResult>
annotations, displayData, name, resourceHints| Constructor and Description |
|---|
WriteRows() |
| Modifier and Type | Method and Description |
|---|---|
IcebergWriteResult |
expand(PCollection<Row> input)
Override this method to specify how this
PTransform should be expanded on the given
InputT. |
IcebergIO.WriteRows |
to(DynamicDestinations destinations) |
IcebergIO.WriteRows |
to(org.apache.iceberg.catalog.TableIdentifier identifier) |
IcebergIO.WriteRows |
withTriggeringFrequency(Duration triggeringFrequency)
Sets the frequency at which data is written to files and a new
Snapshot is produced. |
addAnnotation, compose, compose, getAdditionalInputs, getAnnotations, getDefaultOutputCoder, getDefaultOutputCoder, getDefaultOutputCoder, getKindString, getName, getResourceHints, populateDisplayData, setDisplayData, setResourceHints, toString, validate, validatepublic IcebergIO.WriteRows to(org.apache.iceberg.catalog.TableIdentifier identifier)
public IcebergIO.WriteRows to(DynamicDestinations destinations)
public IcebergIO.WriteRows withTriggeringFrequency(Duration triggeringFrequency)
Snapshot is produced.
Roughly every triggeringFrequency duration, records are written to data files and appended to the respective table. Each append operation creates a new table snapshot.
Generally speaking, increasing this duration will result in fewer, larger data files and fewer snapshots.
This is only applicable when writing an unbounded PCollection (i.e. a streaming
pipeline).
public IcebergWriteResult expand(PCollection<Row> input)
PTransformPTransform should be expanded on the given
InputT.
NOTE: This method should not be called directly. Instead apply the PTransform should
be applied to the InputT using the apply method.
Composite transforms, which are defined in terms of other transforms, should return the output of one of the composed transforms. Non-composite transforms, which do not apply any transforms internally, should return a new unbound output and register evaluators (via backend-specific registration methods).
expand in class PTransform<PCollection<Row>,IcebergWriteResult>