Package org.apache.beam.sdk.extensions.sketching
Utilities for computing statistical indicators using probabilistic sketches.
Classes
Class Description
PTransforms for computing the approximate number of distinct elements in a stream.
Implements the Combine.CombineFn of ApproximateDistinct transforms.
Implementation of ApproximateDistinct.globally().
Coder for the HyperLogLogPlus class.
Implementation of ApproximateDistinct.perKey().
PTransforms to compute the estimated frequency of each element in a stream.
Implements the Combine.CombineFn of SketchFrequencies transforms.
SketchFrequencies.GlobalSketch<InputT>: Implementation of SketchFrequencies.globally().
Implementation of SketchFrequencies.perKey().
Wraps StreamLib's Count-Min Sketch to support counting all user types by hashing the encoded user type using the supplied deterministic coder.
PTransforms for getting information about quantiles in a stream.
Implementation of TDigestQuantiles.globally().
Implementation of TDigestQuantiles.perKey().
Implements the Combine.CombineFn of TDigestQuantiles transforms.
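
The Count-Min Sketch entry above describes counting arbitrary user types by hashing their encoded form. The following is a minimal, standard-library-only sketch of that idea, not the Beam or StreamLib implementation. The class name MiniCountMinSketch, its methods, and the hash mixing are hypothetical, and UTF-8 string bytes stand in for the output of a deterministic coder.

```java
import java.nio.charset.StandardCharsets;
import java.util.Arrays;

/** Illustrative Count-Min Sketch over encoded elements (hypothetical, not Beam's class). */
final class MiniCountMinSketch {
  private final int depth;      // number of hash rows
  private final int width;      // number of counters per row
  private final long[][] table; // depth x width counters

  MiniCountMinSketch(int depth, int width) {
    if (depth <= 0 || width <= 0) {
      throw new IllegalArgumentException("depth and width must be positive");
    }
    this.depth = depth;
    this.width = width;
    this.table = new long[depth][width];
  }

  /** Assumption: UTF-8 bytes stand in for a deterministic coder's output. */
  static byte[] encode(String value) {
    return value.getBytes(StandardCharsets.UTF_8);
  }

  /** Derives a row-specific bucket from a hash of the encoded bytes. */
  private int bucket(byte[] encoded, int row) {
    long mixed = Arrays.hashCode(encoded) * 0x9E3779B97F4A7C15L
        + (row + 1) * 0xC2B2AE3D27D4EB4FL;
    mixed ^= mixed >>> 32;
    return (int) Math.floorMod(mixed, (long) width);
  }

  /** Adds a non-negative count for the encoded element to one counter per row. */
  void add(byte[] encoded, long count) {
    if (count < 0) {
      throw new IllegalArgumentException("count must be non-negative");
    }
    for (int row = 0; row < depth; row++) {
      table[row][bucket(encoded, row)] += count;
    }
  }

  /** Returns the minimum counter across rows; this never underestimates the true count. */
  long estimate(byte[] encoded) {
    long min = Long.MAX_VALUE;
    for (int row = 0; row < depth; row++) {
      min = Math.min(min, table[row][bucket(encoded, row)]);
    }
    return min;
  }

  public static void main(String[] args) {
    MiniCountMinSketch sketch = new MiniCountMinSketch(4, 1024);
    sketch.add(encode("apple"), 3);
    sketch.add(encode("pear"), 1);
    System.out.println("apple ~ " + sketch.estimate(encode("apple")));
  }
}
```

Because elements are hashed in their encoded form, two values count as the same element only if they encode to identical bytes, which is why the description above calls for a deterministic coder. Collisions can only inflate an estimate, so the result lies between the true count and the total number of additions.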