Package org.apache.beam.sdk.io.parquet
Class ParquetIO.ReadFiles.BlockTracker
java.lang.Object
org.apache.beam.sdk.transforms.splittabledofn.RestrictionTracker<OffsetRange,Long>
 
org.apache.beam.sdk.transforms.splittabledofn.OffsetRangeTracker
org.apache.beam.sdk.io.parquet.ParquetIO.ReadFiles.BlockTracker
- All Implemented Interfaces:
- RestrictionTracker.HasProgress
- Enclosing class:
- ParquetIO.ReadFiles
- 
Nested Class SummaryNested classes/interfaces inherited from class org.apache.beam.sdk.transforms.splittabledofn.RestrictionTrackerRestrictionTracker.HasProgress, RestrictionTracker.IsBounded, RestrictionTracker.Progress, RestrictionTracker.TruncateResult<RestrictionT>
- 
Field SummaryFields inherited from class org.apache.beam.sdk.transforms.splittabledofn.OffsetRangeTrackerlastAttemptedOffset, lastClaimedOffset, range
- 
Constructor SummaryConstructors
- 
Method SummaryModifier and TypeMethodDescriptionA representation for the amount of known completed and known remaining work.voidMethods inherited from class org.apache.beam.sdk.transforms.splittabledofn.OffsetRangeTrackercheckDone, currentRestriction, isBounded, toString, tryClaim, trySplit
- 
Constructor Details- 
BlockTracker
 
- 
- 
Method Details- 
makeProgress- Throws:
- IOException
 
- 
getProgressDescription copied from interface:RestrictionTracker.HasProgressA representation for the amount of known completed and known remaining work.It is up to each restriction tracker to convert between their natural representation of completed and remaining work and the doublerepresentation. For example:- Block based file source (e.g. Avro): The number of bytes from the beginning of the restriction to the current block and the number of bytes from the current block to the end of the restriction.
- Pull based queue based source (e.g. Pubsub): The local/global size available in number
       of messages or number of message bytesthat have processed and the number of messages or number ofmessage bytesthat are outstanding.
- Key range based source (e.g. BigQuery, Bigtable, ...): Scale the start key to be one and end key to be zero and interpolate the position of the next splittable key as a position. If information about the probability density function or cumulative distribution function is available, work completed and work remaining interpolation can be improved. Alternatively, if the number of encoded bytes for the keys and values is known for the key range, the number of completed and remaining bytes can be used.
 The work completed and work remaining must be of the same scale whether that be number of messages or number of bytes and should never represent two distinct unit types. - Specified by:
- getProgressin interface- RestrictionTracker.HasProgress
- Overrides:
- getProgressin class- OffsetRangeTracker
 
 
-