Package org.apache.beam.sdk.extensions.sorter
package org.apache.beam.sdk.extensions.sorter
Utility for performing local sort of potentially large sets of values. Will sort in memory and
spill to disk for external sorting if necessary.
-
ClassDescription
Sorter
that will use in memory sorting until the values can't fit into memory and will then fall back to external sorting.Contains configuration for the sorter.Does an external sort of the provided values.ExternalSorter.Options
contains configuration of the sorter.Sorter type.SortValues<PrimaryKeyT,SecondaryKeyT, ValueT> SortValues<PrimaryKeyT, SecondaryKeyT, ValueT>
takes aPCollection<KV<PrimaryKeyT, Iterable<KV<SecondaryKeyT, ValueT>>>>
with elements consisting of a primary key and iterables over<secondary key, value>
pairs, and returns aPCollection<KV<PrimaryKeyT, Iterable<KV<SecondaryKeyT, ValueT>>>
of the same elements but with values sorted by a secondary key.