InputT
- the type of the elements in the input PCollection
public static class SketchFrequencies.CountMinSketchFn<InputT> extends Combine.CombineFn<InputT,SketchFrequencies.Sketch<InputT>,SketchFrequencies.Sketch<InputT>>
Combine.CombineFn
of SketchFrequencies
transforms.Modifier and Type | Method and Description |
---|---|
SketchFrequencies.Sketch<InputT> |
addInput(SketchFrequencies.Sketch<InputT> accumulator,
InputT element)
Adds the given input value to the given accumulator, returning the new accumulator value.
|
static <InputT> SketchFrequencies.CountMinSketchFn<InputT> |
create(Coder<InputT> coder)
Returns a
SketchFrequencies.CountMinSketchFn combiner with the given input coder. |
SketchFrequencies.Sketch<InputT> |
createAccumulator()
Returns a new, mutable accumulator value, representing the accumulation of zero input values.
|
SketchFrequencies.Sketch<InputT> |
extractOutput(SketchFrequencies.Sketch<InputT> accumulator)
Output the whole structure so it can be queried, reused or stored easily.
|
java.lang.reflect.TypeVariable<?> |
getAccumTVariable()
Returns the
TypeVariable of AccumT . |
Coder<SketchFrequencies.Sketch<InputT>> |
getAccumulatorCoder(CoderRegistry registry,
Coder inputCoder)
Returns the
Coder to use for accumulator AccumT values, or null if it is not
able to be inferred. |
Coder<OutputT> |
getDefaultOutputCoder(CoderRegistry registry,
Coder<InputT> inputCoder)
Returns the
Coder to use by default for output OutputT values, or null if it
is not able to be inferred. |
java.lang.String |
getIncompatibleGlobalWindowErrorMessage()
Returns the error message for not supported default values in Combine.globally().
|
java.lang.reflect.TypeVariable<?> |
getInputTVariable()
Returns the
TypeVariable of InputT . |
java.lang.reflect.TypeVariable<?> |
getOutputTVariable()
Returns the
TypeVariable of OutputT . |
SketchFrequencies.Sketch<InputT> |
mergeAccumulators(java.lang.Iterable<SketchFrequencies.Sketch<InputT>> accumulators)
Returns an accumulator representing the accumulation of all the input values accumulated in
the merging accumulators.
|
void |
populateDisplayData(DisplayData.Builder builder)
Register display data for the given transform or component.
|
SketchFrequencies.CountMinSketchFn<InputT> |
withAccuracy(double epsilon,
double confidence)
Returns a new
SketchFrequencies.CountMinSketchFn combiner with new precision accuracy parameters epsilon and confidence . |
apply, compact, defaultValue, getInputType, getOutputType
public static <InputT> SketchFrequencies.CountMinSketchFn<InputT> create(Coder<InputT> coder)
SketchFrequencies.CountMinSketchFn
combiner with the given input coder. coder
- the coder that encodes the elements' typepublic SketchFrequencies.CountMinSketchFn<InputT> withAccuracy(double epsilon, double confidence)
SketchFrequencies.CountMinSketchFn
combiner with new precision accuracy parameters epsilon
and confidence
.
Keep in mind that the lower the epsilon
value, the greater the width, and the
greater the confidence, the greater the depth.
epsilon
- the error relative to the total number of distinct elementsconfidence
- the confidence in the result to not exceed the relative errorpublic SketchFrequencies.Sketch<InputT> createAccumulator()
Combine.CombineFn
createAccumulator
in class Combine.CombineFn<InputT,SketchFrequencies.Sketch<InputT>,SketchFrequencies.Sketch<InputT>>
public SketchFrequencies.Sketch<InputT> addInput(SketchFrequencies.Sketch<InputT> accumulator, InputT element)
Combine.CombineFn
addInput
in class Combine.CombineFn<InputT,SketchFrequencies.Sketch<InputT>,SketchFrequencies.Sketch<InputT>>
accumulator
- may be modified and returned for efficiencyelement
- should not be mutatedpublic SketchFrequencies.Sketch<InputT> mergeAccumulators(java.lang.Iterable<SketchFrequencies.Sketch<InputT>> accumulators)
Combine.CombineFn
mergeAccumulators
in class Combine.CombineFn<InputT,SketchFrequencies.Sketch<InputT>,SketchFrequencies.Sketch<InputT>>
accumulators
- only the first accumulator may be modified and returned for efficiency;
the other accumulators should not be mutated, because they may be shared with other code
and mutating them could lead to incorrect results or data corruption.public SketchFrequencies.Sketch<InputT> extractOutput(SketchFrequencies.Sketch<InputT> accumulator)
extractOutput
in class Combine.CombineFn<InputT,SketchFrequencies.Sketch<InputT>,SketchFrequencies.Sketch<InputT>>
accumulator
- can be modified for efficiencypublic Coder<SketchFrequencies.Sketch<InputT>> getAccumulatorCoder(CoderRegistry registry, Coder inputCoder)
CombineFnBase.GlobalCombineFn
Coder
to use for accumulator AccumT
values, or null if it is not
able to be inferred.
By default, uses the knowledge of the Coder
being used for InputT
values
and the enclosing Pipeline
's CoderRegistry
to try to infer the Coder for
AccumT
values.
This is the Coder used to send data through a communication-intensive shuffle step, so a compact and efficient representation may have significant performance benefits.
getAccumulatorCoder
in interface CombineFnBase.GlobalCombineFn<InputT,SketchFrequencies.Sketch<InputT>,SketchFrequencies.Sketch<InputT>>
public void populateDisplayData(DisplayData.Builder builder)
populateDisplayData(DisplayData.Builder)
is invoked by Pipeline runners to collect
display data via DisplayData.from(HasDisplayData)
. Implementations may call super.populateDisplayData(builder)
in order to register display data in the current namespace,
but should otherwise use subcomponent.populateDisplayData(builder)
to use the namespace
of the subcomponent.
By default, does not register any display data. Implementors may override this method to provide their own display data.
populateDisplayData
in interface HasDisplayData
builder
- The builder to populate with display data.HasDisplayData
public Coder<OutputT> getDefaultOutputCoder(CoderRegistry registry, Coder<InputT> inputCoder) throws CannotProvideCoderException
CombineFnBase.GlobalCombineFn
Coder
to use by default for output OutputT
values, or null if it
is not able to be inferred.
By default, uses the knowledge of the Coder
being used for input InputT
values and the enclosing Pipeline
's CoderRegistry
to try to infer the Coder
for OutputT
values.
getDefaultOutputCoder
in interface CombineFnBase.GlobalCombineFn<InputT,AccumT,OutputT>
CannotProvideCoderException
public java.lang.String getIncompatibleGlobalWindowErrorMessage()
CombineFnBase.GlobalCombineFn
getIncompatibleGlobalWindowErrorMessage
in interface CombineFnBase.GlobalCombineFn<InputT,AccumT,OutputT>
public java.lang.reflect.TypeVariable<?> getInputTVariable()
TypeVariable
of InputT
.public java.lang.reflect.TypeVariable<?> getAccumTVariable()
TypeVariable
of AccumT
.public java.lang.reflect.TypeVariable<?> getOutputTVariable()
TypeVariable
of OutputT
.