Class DetectNewPartitionsTracker
- All Implemented Interfaces:
RestrictionTracker.HasProgress
-
Nested Class Summary
Nested classes/interfaces inherited from class org.apache.beam.sdk.transforms.splittabledofn.GrowableOffsetRangeTracker
GrowableOffsetRangeTracker.RangeEndEstimator
Nested classes/interfaces inherited from class org.apache.beam.sdk.transforms.splittabledofn.RestrictionTracker
RestrictionTracker.HasProgress, RestrictionTracker.IsBounded, RestrictionTracker.Progress, RestrictionTracker.TruncateResult<RestrictionT>
-
Field Summary
Fields inherited from class org.apache.beam.sdk.transforms.splittabledofn.OffsetRangeTracker
lastAttemptedOffset, lastClaimedOffset, range
-
Constructor Summary
Constructors
DetectNewPartitionsTracker(long start)
-
Method Summary
Modifier and Type, Method, Description
trySplit(double fractionOfRemainder)
Splits current restriction based on fractionOfRemainder.
Methods inherited from class org.apache.beam.sdk.transforms.splittabledofn.GrowableOffsetRangeTracker
getProgress, isBounded
Methods inherited from class org.apache.beam.sdk.transforms.splittabledofn.OffsetRangeTracker
checkDone, currentRestriction, toString, tryClaim
-
Constructor Details
-
DetectNewPartitionsTracker
public DetectNewPartitionsTracker(long start)
-
-
Method Details
-
trySplit
Description copied from class: RestrictionTracker
Splits current restriction based on fractionOfRemainder.
If splitting the current restriction is possible, the current restriction is split into a primary and residual restriction pair. This invocation updates RestrictionTracker.currentRestriction() to be the primary restriction, effectively making the current DoFn.ProcessElement execution responsible for performing the work that the primary restriction represents. The residual restriction will be executed in a separate DoFn.ProcessElement invocation (likely in a different process). The work performed by executing the primary and residual restrictions as separate DoFn.ProcessElement invocations MUST be equivalent to the work performed as if this split never occurred.
The fractionOfRemainder should be used in a best-effort manner to choose a primary and residual restriction based upon the fraction of the remaining work that the current DoFn.ProcessElement invocation is responsible for. For example, if a DoFn.ProcessElement was reading a file with a restriction representing the offset range [100, 200) and has processed up to offset 130 with a fractionOfRemainder of 0.7, the primary and residual restrictions returned would be [100, 179), [179, 200) (note: currentOffset + fractionOfRemainder * remainingWork = 130 + 0.7 * 70 = 179).
fractionOfRemainder = 0 means a checkpoint is required.
The API is recommended to be implemented for a batch pipeline: it improves parallel processing performance and is very important for pipeline scaling and end-to-end pipeline execution.
The API is required to be implemented for a streaming pipeline.
- Overrides:
trySplit in class GrowableOffsetRangeTracker
- Parameters:
fractionOfRemainder - A hint as to the fraction of work the primary restriction should represent based upon the current known remaining amount of work.
- Returns:
a SplitResult if a split was possible, otherwise returns null. If the fractionOfRemainder == 0, a null result MUST imply that the restriction tracker is done and there is no more work left to do.
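The split-point arithmetic in the description above can be checked with a small standalone sketch. It does not use any Beam classes and is not the tracker's actual implementation: the class SplitPointSketch and the helper splitOffset are hypothetical names introduced only for illustration, and rounding the retained work down to a whole offset is an assumption.

```java
import java.math.BigDecimal;
import java.math.RoundingMode;

public class SplitPointSketch {

  /**
   * Hypothetical helper (not part of Beam): returns the offset at which the
   * primary restriction ends and the residual restriction begins, following
   * currentOffset + fractionOfRemainder * remainingWork from the description.
   */
  public static long splitOffset(long currentOffset, long rangeEnd, double fractionOfRemainder) {
    // Work left in the current restriction, e.g. 200 - 130 = 70.
    BigDecimal remainingWork = BigDecimal.valueOf(rangeEnd - currentOffset);
    // Share of the remaining work that stays with the primary restriction.
    BigDecimal kept = remainingWork.multiply(BigDecimal.valueOf(fractionOfRemainder));
    // Assumption: round down to a whole offset.
    return currentOffset + kept.setScale(0, RoundingMode.FLOOR).longValueExact();
  }

  public static void main(String[] args) {
    long rangeStart = 100;
    long rangeEnd = 200;
    long split = splitOffset(130, rangeEnd, 0.7);
    System.out.println("primary  = [" + rangeStart + ", " + split + ")");
    System.out.println("residual = [" + split + ", " + rangeEnd + ")");
  }
}
```

In this simplified model a fractionOfRemainder of 0 returns currentOffset itself, so all remaining work moves to the residual restriction, which matches the checkpoint case. A real tracker may treat the last claimed offset differently.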
-