Group.ByFields (Apache Beam 2.9.0)

java.lang.Object
- org.apache.beam.sdk.transforms.PTransform<PCollection<InputT>,PCollection<KV<Row,java.lang.Iterable<InputT>>>>
- - org.apache.beam.sdk.schemas.transforms.Group.ByFields<InputT>

All Implemented Interfaces:

java.io.Serializable, HasDisplayData

Enclosing class:

Group
```
public static class Group.ByFields<InputT>
extends PTransform<PCollection<InputT>,PCollection<KV<Row,java.lang.Iterable<InputT>>>>
```
a PTransform that groups schema elements based on the given fields.
The output of this transform is a KV where the key type is a Row containing the extracted fields.

See Also:

Serialized Form

Field Summary
- Fields inherited from class org.apache.beam.sdk.transforms.PTransform
  name

Method Summary

All Methods Instance Methods Concrete Methods
Modifier and Type	Method and Description
`<OutputT> Group.CombineByFields<InputT,OutputT>`	`aggregate(Combine.CombineFn<InputT,?,OutputT> combineFn)` Aggregate the grouped data using the specified `Combine.CombineFn`.
`<CombineInputT,AccumT,CombineOutputT> Group.CombineFieldsByFields<InputT>`	`aggregateField(int inputFieldId, Combine.CombineFn<CombineInputT,AccumT,CombineOutputT> fn, Schema.Field outputField)`
`<CombineInputT,AccumT,CombineOutputT> Group.CombineFieldsByFields<InputT>`	`aggregateField(int inputFieldId, Combine.CombineFn<CombineInputT,AccumT,CombineOutputT> fn, java.lang.String outputFieldName)`
`<CombineInputT,AccumT,CombineOutputT> Group.CombineFieldsByFields<InputT>`	`aggregateField(java.lang.String inputFieldName, Combine.CombineFn<CombineInputT,AccumT,CombineOutputT> fn, Schema.Field outputField)` Build up an aggregation function over the input elements.
`<CombineInputT,AccumT,CombineOutputT> Group.CombineFieldsByFields<InputT>`	`aggregateField(java.lang.String inputFieldName, Combine.CombineFn<CombineInputT,AccumT,CombineOutputT> fn, java.lang.String outputFieldName)` Build up an aggregation function over the input elements.
`<CombineInputT,AccumT,CombineOutputT> Group.CombineFieldsByFields<InputT>`	`aggregateFields(FieldAccessDescriptor fieldsToAggregate, Combine.CombineFn<CombineInputT,AccumT,CombineOutputT> fn, Schema.Field outputField)` Build up an aggregation function over the input elements.
`<CombineInputT,AccumT,CombineOutputT> Group.CombineFieldsByFields<InputT>`	`aggregateFields(FieldAccessDescriptor fieldsToAggregate, Combine.CombineFn<CombineInputT,AccumT,CombineOutputT> fn, java.lang.String outputFieldName)` Build up an aggregation function over the input elements.
`<CombineInputT,AccumT,CombineOutputT> Group.CombineFieldsByFields<InputT>`	`aggregateFields(java.util.List<java.lang.String> inputFieldNames, Combine.CombineFn<CombineInputT,AccumT,CombineOutputT> fn, Schema.Field outputField)` Build up an aggregation function over the input elements.
`<CombineInputT,AccumT,CombineOutputT> Group.CombineFieldsByFields<InputT>`	`aggregateFields(java.util.List<java.lang.String> inputFieldNames, Combine.CombineFn<CombineInputT,AccumT,CombineOutputT> fn, java.lang.String outputFieldName)` Build up an aggregation function over the input elements.
`<CombineInputT,AccumT,CombineOutputT> Group.CombineFieldsByFields<InputT>`	`aggregateFieldsById(java.util.List<java.lang.Integer> inputFieldIds, Combine.CombineFn<CombineInputT,AccumT,CombineOutputT> fn, Schema.Field outputField)`
`<CombineInputT,AccumT,CombineOutputT> Group.CombineFieldsByFields<InputT>`	`aggregateFieldsById(java.util.List<java.lang.Integer> inputFieldIds, Combine.CombineFn<CombineInputT,AccumT,CombineOutputT> fn, java.lang.String outputFieldName)`
`PCollection<KV<Row,java.lang.Iterable<InputT>>>`	`expand(PCollection<InputT> input)` Override this method to specify how this `PTransform` should be expanded on the given `InputT`.

Methods inherited from class org.apache.beam.sdk.transforms.PTransform
compose, getAdditionalInputs, getDefaultOutputCoder, getDefaultOutputCoder, getDefaultOutputCoder, getKindString, getName, populateDisplayData, toString, validate

Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, wait, wait, wait

Method Detail

aggregate
```
public <OutputT> Group.CombineByFields<InputT,OutputT> aggregate(Combine.CombineFn<InputT,?,OutputT> combineFn)
```
Aggregate the grouped data using the specified Combine.CombineFn. The resulting PCollection will have type PCollection<KV<Row, OutputT>>.

aggregateField

public <CombineInputT,AccumT,CombineOutputT> Group.CombineFieldsByFields<InputT> aggregateField(java.lang.String inputFieldName,
                                                                                                Combine.CombineFn<CombineInputT,AccumT,CombineOutputT> fn,
                                                                                                java.lang.String outputFieldName)

Build up an aggregation function over the input elements.

This method specifies an aggregation over single field of the input. The union of all calls to aggregateField and aggregateFields will determine the output schema.

Field types in the output schema will be inferred from the provided combine function. Sometimes the field type cannot be inferred due to Java's type erasure. In that case, use the overload that allows setting the output field type explicitly.

aggregateField

public <CombineInputT,AccumT,CombineOutputT> Group.CombineFieldsByFields<InputT> aggregateField(int inputFieldId,
                                                                                                Combine.CombineFn<CombineInputT,AccumT,CombineOutputT> fn,
                                                                                                java.lang.String outputFieldName)

aggregateField

public <CombineInputT,AccumT,CombineOutputT> Group.CombineFieldsByFields<InputT> aggregateField(java.lang.String inputFieldName,
                                                                                                Combine.CombineFn<CombineInputT,AccumT,CombineOutputT> fn,
                                                                                                Schema.Field outputField)

Build up an aggregation function over the input elements.

This method specifies an aggregation over single field of the input. The union of all calls to aggregateField and aggregateFields will determine the output schema.

aggregateField

public <CombineInputT,AccumT,CombineOutputT> Group.CombineFieldsByFields<InputT> aggregateField(int inputFieldId,
                                                                                                Combine.CombineFn<CombineInputT,AccumT,CombineOutputT> fn,
                                                                                                Schema.Field outputField)

aggregateFields

public <CombineInputT,AccumT,CombineOutputT> Group.CombineFieldsByFields<InputT> aggregateFields(java.util.List<java.lang.String> inputFieldNames,
                                                                                                 Combine.CombineFn<CombineInputT,AccumT,CombineOutputT> fn,
                                                                                                 java.lang.String outputFieldName)

Build up an aggregation function over the input elements.

This method specifies an aggregation over multiple fields of the input. The union of all calls to aggregateField and aggregateFields will determine the output schema.

aggregateFieldsById

public <CombineInputT,AccumT,CombineOutputT> Group.CombineFieldsByFields<InputT> aggregateFieldsById(java.util.List<java.lang.Integer> inputFieldIds,
                                                                                                     Combine.CombineFn<CombineInputT,AccumT,CombineOutputT> fn,
                                                                                                     java.lang.String outputFieldName)

aggregateFields

public <CombineInputT,AccumT,CombineOutputT> Group.CombineFieldsByFields<InputT> aggregateFields(FieldAccessDescriptor fieldsToAggregate,
                                                                                                 Combine.CombineFn<CombineInputT,AccumT,CombineOutputT> fn,
                                                                                                 java.lang.String outputFieldName)

Build up an aggregation function over the input elements.

This method specifies an aggregation over multiple fields of the input. The union of all calls to aggregateField and aggregateFields will determine the output schema.

aggregateFields

public <CombineInputT,AccumT,CombineOutputT> Group.CombineFieldsByFields<InputT> aggregateFields(java.util.List<java.lang.String> inputFieldNames,
                                                                                                 Combine.CombineFn<CombineInputT,AccumT,CombineOutputT> fn,
                                                                                                 Schema.Field outputField)

Build up an aggregation function over the input elements.

This method specifies an aggregation over multiple fields of the input. The union of all calls to aggregateField and aggregateFields will determine the output schema.

aggregateFieldsById

public <CombineInputT,AccumT,CombineOutputT> Group.CombineFieldsByFields<InputT> aggregateFieldsById(java.util.List<java.lang.Integer> inputFieldIds,
                                                                                                     Combine.CombineFn<CombineInputT,AccumT,CombineOutputT> fn,
                                                                                                     Schema.Field outputField)

aggregateFields

public <CombineInputT,AccumT,CombineOutputT> Group.CombineFieldsByFields<InputT> aggregateFields(FieldAccessDescriptor fieldsToAggregate,
                                                                                                 Combine.CombineFn<CombineInputT,AccumT,CombineOutputT> fn,
                                                                                                 Schema.Field outputField)

Build up an aggregation function over the input elements.

This method specifies an aggregation over multiple fields of the input. The union of all calls to aggregateField and aggregateFields will determine the output schema.

expand
```
public PCollection<KV<Row,java.lang.Iterable<InputT>>> expand(PCollection<InputT> input)
```
Description copied from class: PTransform

Override this method to specify how this PTransform should be expanded on the given InputT.
NOTE: This method should not be called directly. Instead apply the PTransform should be applied to the InputT using the apply method.
Composite transforms, which are defined in terms of other transforms, should return the output of one of the composed transforms. Non-composite transforms, which do not apply any transforms internally, should return a new unbound output and register evaluators (via backend-specific registration methods).

Specified by:

expand in class PTransform<PCollection<InputT>,PCollection<KV<Row,java.lang.Iterable<InputT>>>>

Class Group.ByFields<InputT>

Field Summary

Fields inherited from class org.apache.beam.sdk.transforms.PTransform

Method Summary

Methods inherited from class org.apache.beam.sdk.transforms.PTransform

Methods inherited from class java.lang.Object

Method Detail

aggregate

aggregateField

aggregateField

aggregateField

aggregateField

aggregateFields

aggregateFieldsById

aggregateFields

aggregateFields

aggregateFieldsById

aggregateFields

expand