All Classes and Interfaces
BeamRelNode to replace Project and Filter node.
Abstract base for runners that execute a Combine.PerKey.
A final combiner that takes in AccumT and produces OutputT.
Adapter interface that allows using a CombineFnBase.GlobalCombineFn to either produce the AccumT as output or to combine several accumulators into an OutputT.
A partial combiner that takes in InputT and produces AccumT.
Abstract base class for iterators that process Spark input data and produce corresponding output values.
Factory class for creating instances that will handle different functions of DoFns.
Factory class for creating instances that will handle each type of record within a change stream
query.
A transform to add new nullable fields to a PCollection's schema.
Inner PTransform for AddFields.
A ClientInterceptor that attaches a provided SDK Harness ID to outgoing messages.
This class adds a pseudo-key with a given cardinality.
A transform to add UUIDs to each message to be written to Pub/Sub Lite.
A Phaser which never terminates.
A composite Trigger that fires when all of its sub-triggers are ready.
A composite Trigger that executes its sub-triggers in order.
A composite Trigger that fires once after at least one of its sub-triggers has fired.
A Trigger that fires at some point after a specified number of input elements have arrived.
A Trigger that fires at a specified point in processing time, relative to when input first arrives.
FOR INTERNAL USE ONLY.
AfterWatermark triggers fire based on progress of the system watermark.
A watermark trigger targeted relative to the end of the window.
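As a rough sketch of how these triggers compose in practice (illustrative only; the input PCollection events, the window size, and the lateness settings are assumptions, not part of this index):

    PCollection<String> windowed =
        events.apply(
            Window.<String>into(FixedWindows.of(Duration.standardMinutes(1)))
                // Fire when the watermark passes the end of the window, then again for each late element.
                .triggering(
                    AfterWatermark.pastEndOfWindow()
                        .withLateFirings(AfterPane.elementCountAtLeast(1)))
                .withAllowedLateness(Duration.standardMinutes(5))
                .accumulatingFiredPanes());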
An aggregate function that can be executed as part of a SQL query.
Wrapper
Combine.CombineFns for aggregation function calls.Builds a MongoDB AggregateIterable object.
AmqpIO supports AMQP 1.0 protocol using the Apache QPid Proton-J library.
A
PTransform to read/receive messages using AMQP 1.0 protocol.A
PTransform to send messages using AMQP 1.0 protocol.A coder for AMQP message.
A
CoderProviderRegistrar for standard types used with AmqpIO.A
PTransform using the Cloud AI Natural language processing capability.ApiIOError is a data class for storing details about an error.Options that allow setting the application name.
PTransforms for estimating the number of distinct elements in a PCollection, or the number of distinct values associated with each key in a PCollection of KVs.
PTransform for estimating the number of distinct elements in a PCollection.
PTransforms for computing the approximate number of distinct elements in a stream.
Implements the Combine.CombineFn of ApproximateDistinct transforms.
Implementation of ApproximateDistinct.globally().
Coder for HyperLogLogPlus class.
Implementation of ApproximateDistinct.perKey().
PTransforms for getting an idea of a PCollection's data distribution using approximate N-tiles (e.g.
The ApproximateQuantilesCombineFn combiner gives an idea of the distribution of a collection of values using approximate N-tiles.
Deprecated.
CombineFn that computes an estimate of the number of distinct values that were combined.
A heap utility class to efficiently track the largest added elements.
PTransform for estimating the number of distinct elements in a PCollection.
PTransform for estimating the number of distinct values associated with each key in a PCollection of KVs.
Converts Arrow schema to Beam row schema.
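A minimal sketch of the ApproximateQuantiles transform mentioned above (the input PCollection measurements is an assumption for illustration):

    // Requesting 5 quantiles yields the minimum, the 25th/50th/75th percentiles, and the maximum.
    PCollection<List<Integer>> quartiles =
        measurements.apply(ApproximateQuantiles.<Integer>globally(5));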
An
ArtifactRetrievalService that uses FileSystems as its backing storage.A pairing of a newly created artifact type and an output stream that will be readable at that
type.
Provides a concrete location to which artifacts can be staged on retrieval.
PTransform for serializing objects to JSON Strings.
Jet Processor implementation for Beam's Windowing primitive.
Assign Windows function.
Assign Window translator.
Async handler that automatically retries unprocessed records in case of a partial success.
Statistics on the batch request.
Asynchronously compute the earliest partition watermark and stores it in memory.
A Coder that serializes and deserializes the AttributeValue objects.
Enables users to specify their own `JMS` backlog reporters, enabling JmsIO to report UnboundedSource.UnboundedReader.getTotalBacklogBytes().
A SchemaProvider for AutoValue classes.
FieldValueTypeSupplier that's based on AutoValue getters.
Utilities for managing AutoValue schemas.
A Coder using Avro binary format.
Create DatumReader and DatumWriter for given schemas.
Specialized AvroDatumFactory for GenericRecord.
Specialized AvroDatumFactory for Java classes transforming to Avro through reflection.
Specialized AvroDatumFactory for SpecificRecord.
AvroCoder specialisation for GenericRecord, needed for cross-language transforms.
Coder registrar for AvroGenericCoder.
Coder translator for AvroGenericCoder.
Utility methods for converting Avro GenericRecord objects to dynamic protocol messages, for use with the Storage write API.
PTransforms for reading and writing Avro files.
Deprecated. See AvroIO.parseAllGenericRecords(SerializableFunction) for details.
Implementation of AvroIO.read(java.lang.Class<T>) and AvroIO.readGenericRecords(org.apache.avro.Schema).
Deprecated. See AvroIO.readAll(Class) for details.
Implementation of AvroIO.readFiles(java.lang.Class<T>).
Deprecated. Users can achieve the same by providing this transform in a ParDo before using AvroIO.write(Class).
Implementation of AvroIO.write(java.lang.Class<T>).
This class is used as the default return value of AvroIO.write(java.lang.Class<T>).
Avro 1.8 ships with joda time conversions only.
Avro 1.8 & 1.9 ship joda time conversions.
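A minimal AvroIO sketch (the Avro schema object, the pipeline, and the file paths are assumptions for illustration):

    // Read GenericRecords matching a file pattern, then write them back out as Avro files.
    PCollection<GenericRecord> records =
        pipeline.apply(AvroIO.readGenericRecords(schema).from("gs://my-bucket/input/*.avro"));
    records.apply(
        AvroIO.writeGenericRecords(schema).to("gs://my-bucket/output/part").withSuffix(".avro"));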
A
SchemaProvider for AVRO generated SpecificRecords and POJOs.An implementation of
SchemaIOProvider for reading and writing Avro files with AvroIO.A
FileBasedSink for Avro files.Do not use in pipelines directly: most users should use
AvroIO.Read.A
BlockBasedSource.BlockBasedReader for reading blocks from Avro files.TableProvider for AvroIO for consumption by Beam SQL.Utils to convert AVRO records to Beam rows.
Wrapper for fixed byte fields.
A
FileWriteSchemaTransformFormatProvider for avro format.Builder factory for AWS
SdkPojo to avoid using reflection to instantiate a builder.A Jackson
Module that registers a JsonSerializer and JsonDeserializer for
AwsCredentialsProvider and some subclasses.Options used to configure Amazon Web Services specific options such as credentials and region.
Attempt to load default region.
Return
DefaultCredentialsProvider as default provider.A registrar containing the default AWS options.
Schema provider for AWS
SdkPojo models using the provided field metadata (@see SdkPojo.sdkFields()) rather than reflection.Utilities for working with AWS Serializables.
AutoService registrar for the AzureBlobStoreFileSystem.A Jackson
Module that registers a JsonSerializer and JsonDeserializer for
Azure credential providers.Attempts to load Azure credentials.
A registrar containing the default Azure options.
An adapter for converting between Apache Beam and Google API client representations of backoffs.
A
ReadableState cell containing a bag of values.Basic implementation of
BeamSqlTable.A factory for creating
JcsmpSessionService instances.A class that manages REST calls to the Solace Element Management Protocol (SEMP) using basic
authentication.
A factory for creating
BasicAuthSempClient instances.Class for Batch, Sink and Stream CDAP wrapper classes that use it to provide common details.
StateRequestHandler that uses a BatchSideInputHandlerFactory.SideInputGetter to access side inputs.Returns the value for the side input with the given PCollection id from the runner.
Class for creating context object of different CDAP classes with batch sink type.
Class for creating context object of different CDAP classes with batch source type.
PTransformOverrideFactories that expands to correctly implement
stateful ParDo using window-unaware BatchViewOverrides.GroupByKeyAndSortValuesOnly to linearize
processing per key.A key-preserving
DoFn that explodes an iterable that has been grouped by key and
window.Batch TransformTranslator interface.
This rule is essentially a wrapper around Calcite's AggregateProjectMergeRule.
BeamRelNode to replace an Aggregate node.
Rule to detect the window/trigger settings.
Aggregation rule that doesn't include projection.
This is a shell TSet environment which is used as a central driver model to fit what Beam expects.
The Twister2 worker that will execute the job logic once the job is submitted from the run
method.
Built-in aggregations functions for COUNT/MAX/MIN/SUM/AVG/VAR_POP/VAR_SAMP.
Built-in Analytic Functions for the aggregation analytics functionality.
BeamBuiltinFunctionClass interface.
Adapter from
BeamSqlTable to a calcite Table.Planner rule to merge a
BeamCalcRel with a BeamCalcRel.BeamRelNode to replace
Project and Filter node.WrappedList translates
List on access.WrappedMap translates
Map on access.WrappedRow translates
Row on access.A
RelOptRule that converts a LogicalCalc into a chain of AbstractBeamCalcRel nodes via CalcRelSplitter.A
BeamJoinRel which does CoGBK JoinRule to convert
LogicalJoin node to BeamCoGBKJoinRel node.VolcanoCost represents the cost of a plan node.Implementation of
RelOptCostFactory that creates
BeamCostModels.BeamRelNode to replace a
Enumerable node.An adapter class that allows one to apply Apache Beam PTransforms directly to Flink DataSets.
An adapter class that allows one to apply Apache Beam PTransforms directly to Flink DataStreams.
A gRPC multiplexer for a specific
Endpoints.ApiServiceDescriptor.An outbound data buffering aggregator with size-based buffer and time-based buffer if
corresponding options are set.
A Beam BoundedSource for Impulse Source.
BeamRelNode to replace an Intersect node.
ConverterRule to replace Intersect with BeamIntersectRel.
BeamRelNode to replace a TableModify node.
BeamRelNode to replace a TableScan node.
Customized data type in Beam.
This is very similar to JoinAssociateRule.
This is essentially the same as JoinPushThroughJoinRule.
An abstract
BeamRelNode to implement Join Rels.Collections of
PTransform and DoFn used to perform JOIN operation.Transform to execute Join as Lookup.
A Kafka topic that saves records in CSV format.
BeamKafkaTable represents a Kafka topic, as source or target.
Convention for Beam SQL.
BeamRelNode to replace a Match node.
ConverterRule to replace Match with BeamMatchRel.
BeamRelNode to replace a Minus node.
ConverterRule to replace Minus with BeamMinusRel.
BeamPCollectionTable converts a PCollection<Row> into a virtual table, so that a downstream query can query it directly.
Customized data type in Beam.
A
RelNode that can also give a PTransform that implements the expression.Utility methods for converting Beam
Row objects to dynamic protocol message, for use with
the Storage write API.RuleSet used in BeamQueryPlanner.Delegate for Set operators:
BeamUnionRel, BeamIntersectRel and
BeamMinusRel.Set operator type.
Collections of
PTransform and DoFn used to perform Set operations.Transform a
BeamSqlRow to a KV<BeamSqlRow, BeamSqlRow>.Filter function used for Set operators.
A
BeamJoinRel which does sideinput JoinRule to convert
LogicalJoin node to BeamSideInputJoinRel node.A
BeamJoinRel which does Lookup JoinRule to convert
LogicalJoin node to BeamSideInputLookupJoinRel node.BeamRelNode to replace a Sort node.ConverterRule to replace Sort with BeamSortRel.BeamSqlCli provides methods to execute Beam SQL with an interactive client.Example pipeline that uses Google Cloud Data Catalog to retrieve the table metadata.
Pipeline options to specify the query and the output for the example.
Contains the metadata of tables/UDF functions, and exposes APIs to
query/validate/optimize/translate SQL statements.
BeamSqlEnv's Builder.
A test PTransform to display output in console.
Options used to configure BeamSQL.
AutoService registrar for BeamSqlPipelineOptions.Utilities for
BeamRelNode.A seekable table converts a JOIN operator to an inline lookup.
This interface defines a Beam Sql Table.
This interface defines Beam SQL Table Filter.
Interface to create a UDF in Beam SQL.
Custom StoppableFunction for backward compatibility.
BeamRelNode to replace TableFunctionScan.
This is the converter rule that converts a Calcite TableFunctionScan to Beam TableFunctionScanRel.
This class stores row count statistics.
Utility methods for working with
BeamTable.BeamRelNode to implement an uncorrelated Uncollect, aka UNNEST.BeamRelNode to replace a Union.BeamRelNode to replace a Values node.ConverterRule to replace Values with BeamValuesRel.BeamRelNode to replace a Window node.A Fn Status service which can collect run-time status information from SDK harnesses for
debugging purposes.
A
BigDecimalCoder encodes a BigDecimal as an integer scale encoded with VarIntCoder and a BigInteger encoded using BigIntegerCoder.Provides converters from
BigDecimal to other numeric types based on the input Schema.TypeName.A
BigEndianIntegerCoder encodes Integers in 4 bytes, big-endian.A
BigEndianLongCoder encodes Longs in 8 bytes, big-endian.A
BigEndianShortCoder encodes Shorts in 2 bytes, big-endian.A
BigIntegerCoder encodes a BigInteger as a byte array containing the big endian
two's-complement representation, encoded via ByteArrayCoder.
A wrapper class for making BigQuery API calls.
A
CoderProviderRegistrar for standard types used with BigQueryIO.An implementation of
TypedSchemaTransformProvider for BigQuery Storage Read API jobs
configured via BigQueryDirectReadSchemaTransformProvider.BigQueryDirectReadSchemaTransformConfiguration.A
SchemaTransform for BigQuery Storage Read API, configured with BigQueryDirectReadSchemaTransformProvider.BigQueryDirectReadSchemaTransformConfiguration and instantiated by BigQueryDirectReadSchemaTransformProvider.Configuration for reading from BigQuery with Storage Read API.
Configuration for reading from BigQuery.
An implementation of
TypedSchemaTransformProvider for BigQuery read jobs configured using
BigQueryExportReadSchemaTransformConfiguration.An implementation of
SchemaTransform for BigQuery read jobs configured using BigQueryExportReadSchemaTransformConfiguration.An implementation of
TypedSchemaTransformProvider for BigQuery write jobs configured
using BigQueryWriteConfiguration.A set of helper functions and classes used by
BigQueryIO.Model definition for BigQueryInsertError.
A
Coder that encodes BigQuery BigQueryInsertError objects.PTransforms for reading and writing BigQuery tables.Implementation of
BigQueryIO.read().Implementation of
BigQueryIO.read(SerializableFunction).Determines the method used to read data from BigQuery.
An enumeration type for the priority of a query.
Implementation of
BigQueryIO.write().An enumeration type for the BigQuery create disposition strings.
Determines the method used to insert data in BigQuery.
An enumeration type for the BigQuery schema update options strings.
An enumeration type for the BigQuery write disposition strings.
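A hedged sketch of reading and writing with BigQueryIO using the dispositions listed above (the table names and tableSchema are assumptions for illustration):

    PCollection<TableRow> rows =
        pipeline.apply(BigQueryIO.readTableRows().from("my-project:my_dataset.source_table"));
    rows.apply(
        BigQueryIO.writeTableRows()
            .to("my-project:my_dataset.target_table")
            .withSchema(tableSchema)
            // Create the table if missing and append to it on each run.
            .withCreateDisposition(BigQueryIO.Write.CreateDisposition.CREATE_IF_NEEDED)
            .withWriteDisposition(BigQueryIO.Write.WriteDisposition.WRITE_APPEND));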
A matcher to verify data in BigQuery by processing given query and comparing with content's
checksum.
Properties needed when using Google BigQuery with the Apache Beam SDK.
An implementation of
SchemaIOProvider for reading and writing to BigQuery with BigQueryIO.Exception to signal that BigQuery schema retrieval failed.
An interface for real, mock, or fake implementations of Cloud BigQuery services.
Container for reading data from streaming endpoints.
An interface to get, create and delete Cloud BigQuery datasets and tables.
An interface for the Cloud BigQuery load service.
An interface representing a client object for making calls to the BigQuery Storage API.
An interface for appending records to a Storage API write stream.
An interface to get, create and flush Cloud BigQuery STORAGE API write streams.
An implementation of
BigQueryServices that actually communicates with the cloud BigQuery
service.
Helper class to create per-worker metrics for BigQuery Sink stages.
A
Source representing reading from a table.An implementation of
TypedSchemaTransformProvider for BigQuery Storage Write API jobs
configured via BigQueryWriteConfiguration.A
SchemaTransform for BigQuery Storage Write API, configured with BigQueryWriteConfiguration and instantiated by BigQueryStorageWriteApiSchemaTransformProvider.BigQuery table provider.
Utility methods for BigQuery related operations.
Options for how to convert BigQuery data to Beam data.
Builder for
BigQueryUtils.ConversionOptions.Controls whether to truncate timestamps to millisecond precision lossily, or to crash when
truncation would result.
Options for how to convert BigQuery schemas to Beam schemas.
Builder for
BigQueryUtils.SchemaConversionOptions.Configuration for writing to BigQuery with SchemaTransforms.
Builder for
BigQueryWriteConfiguration.A BigQuery Write SchemaTransformProvider that routes to either
BigQueryFileLoadsSchemaTransformProvider or BigQueryStorageWriteApiSchemaTransformProvider.This is probably a temporary solution to what is a bigger migration from
cloud-bigtable-client-core to java-bigtable.
Override the configuration of Cloud Bigtable data and admin client.
Configuration for a Cloud Bigtable client.
Transforms for reading from and writing to Google Cloud Bigtable.
Overwrite options to determine what to do if the change stream name is being reused and metadata of the same change stream name already exists.
A
PTransform that reads from Google Cloud Bigtable.A
PTransform that writes to Google Cloud Bigtable.A
PTransform that writes to Google Cloud Bigtable and emits a BigtableWriteResult for each batch written.An implementation of
TypedSchemaTransformProvider for Bigtable Read jobs configured via
BigtableReadSchemaTransformProvider.BigtableReadSchemaTransformConfiguration.Configuration for reading from Bigtable.
The result of writing a batch of rows to Bigtable.
A coder for
BigtableWriteResult.An implementation of
TypedSchemaTransformProvider for Bigtable Write jobs configured via
BigtableWriteSchemaTransformProvider.BigtableWriteSchemaTransformConfiguration.Configuration for writing to Bigtable.
Coder for
BitSet.Construct BlobServiceClientBuilder from Azure pipeline options.
Options used to configure Microsoft Azure Blob Storage.
A
BlockBasedSource is a FileBasedSource where a file consists of blocks of
records.A
Block represents a block of records that can be read.A
Reader that reads records from a BlockBasedSource.Holds an RDD or values for deferred conversion to an RDD if needed.
PTransform that reads a bounded amount of data from an UnboundedSource, specified
as one or both of a maximum number of elements or a maximum period of time to read.A
Source that reads a finite amount of input and, because of that, supports some
additional operations.A
Reader that reads a bounded amount of input and supports some additional operations,
such as progress estimation and dynamic work rebalancing.Jet
Processor implementation for reading from a bounded Beam
source.Internal: For internal use only and not for public consumption.
Implementation of
BoundedTrie.Internal: For internal use only and not for public consumption.
A
BoundedWindow represents window information assigned to data elements.An interface for elements buffered during a checkpoint when using @RequiresStableInput.
Sorter that will use in memory sorting until the values can't fit into memory and will
then fall back to external sorting.Contains configuration for the sorter.
A
DoFnRunner which buffers data for supporting DoFn.RequiresStableInput.A thread safe
StreamObserver which uses a bounded queue to pass elements to a processing
thread responsible for interacting with the underlying CallStreamObserver.Hash Functions.
BuiltinStringFunctions.
TrigonometricFunctions.
An immutable collection of elements which are part of a
PCollection.A handler which is invoked when the SDK returns
BeamFnApi.DelayedBundleApplications as
part of the bundle completion.Utility methods for creating
BundleCheckpointHandlers.A
BundleCheckpointHandler which uses TimerInternals.TimerData and ValueState to reschedule BeamFnApi.DelayedBundleApplication.A handler for the runner when a finalization request has been received.
Utility methods for creating
BundleFinalizationHandlers.A handler for bundle progress messages, both during bundle execution and on its completion.
A handler which is invoked whenever an active bundle is split.
Serializable byte array.
A
Coder for byte[].
Given a Java type, returns the Java type expected for use with Row.
Takes a
StackManipulation that returns a value.
Row is going to call the setter with its internal Java type; however, the user object being set might have a different type internally.
A naming strategy for ByteBuddy classes.
A class representing a key consisting of an array of bytes.
A class representing a range of
ByteKeys.An estimator to provide an estimate on the byte throughput of the outputted elements.
An estimator to provide an estimate on the throughput of the outputted elements.
A duplicate of
ByteStringCoder that uses the Apache Beam vendored protobuf.A
Coder for ByteString objects based on their encoded Protocol Buffer form.Benchmarks for
ByteStringOutputStream.These benchmarks below provide good details as to the cost of creating a new buffer vs copying
a subset of the existing one and re-using the larger one.
Helper functions to evaluate the completeness of collection of ByteStringRanges.
ByteToWindow function.
ByteToWindow function.
ByteToWindow function.
Transforms for reading and writing request/response associations to a cache.
A simple POJO that holds both cache read and write
PTransforms.SideInputReader that caches results for costly
Materializations.SideInputReader that caches materialized views.A wrapper around a
Factory that assumes the schema parameter never changes.Abstract wrapper for
CalciteConnection to simplify extension.Wrapper for
CalciteFactory.The core component to handle through a SQL statement, from explain execution plan, to generate a
Beam pipeline.
Utility methods for Calcite related operations.
A LogicalType corresponding to TIME_WITH_LOCAL_TIME_ZONE.
CalcRelSplitter operates on a
Calc with multiple RexCall sub-expressions that
cannot all be implemented by a single concrete RelNode.Type of relational expression.
A collection of
WindowFns that windows values into calendar-based windows such as spans
of days, months, or years.A
WindowFn that windows elements into periods measured by days.A
WindowFn that windows elements into periods measured by months.A
WindowFn that windows elements into periods measured by years.
Caller interfaces user custom code intended for API calls.
Informs whether a call to an API should back off.
A simplified
ThreadSafe blocking queue that can be cancelled freeing any blocked Threads and preventing future Threads from blocking.The exception thrown when a
CoderRegistry or CoderProvider cannot provide a
Coder that has been requested.Indicates the reason that
Coder inference failed.An IO to read and write from/to Apache Cassandra
Specify the mutation type: either write or delete.
A
PTransform to read data from Apache Cassandra.A
PTransform to read data from Apache Cassandra.A
PTransform to mutate into Apache Cassandra.Set of utilities for casting rows between schemas.
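A sketch of the CassandraIO read path (the Person entity class, which is assumed to be a Serializable POJO mapped to the table, plus the hosts and keyspace, are assumptions for illustration):

    PCollection<Person> persons =
        pipeline.apply(
            CassandraIO.<Person>read()
                .withHosts(Arrays.asList("cassandra-host-1", "cassandra-host-2"))
                .withPort(9042)
                .withKeyspace("beam_ks")
                .withTable("person")
                .withEntity(Person.class)
                .withCoder(SerializableCoder.of(Person.class)));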
Describes compatibility errors during casting.
Narrowing changes type without guarantee to preserve data.
Interface for statically validating casts.
Widening changes to type that can represent any possible value of the original type.
Represents a named and configurable container for managing tables.
Top-level authority that manages
Catalogs.A Calcite
Schema that corresponds to a CatalogManager.Over-arching registrar to capture available
Catalogs.A Calcite
Schema that corresponds to a Catalog.A
CdapIO is a Transform for reading data from source or writing data to sink of a Cdap
Plugin.A
PTransform to read from CDAP source.A
PTransform to write to CDAP sink.A
CEPCall instance represents an operation (node) that contains an operator and a list of
operands.A
CEPFieldRef instance represents a node that points to a specified field in a
Row.CEPKind corresponds to Calcite's SqlKind.CEPLiteral represents a literal node.The
CEPMeasure class represents the Measures clause and contains information about output
columns.CEPOperation is the base class for the evaluation operations defined in the
DEFINE syntax of MATCH_RECOGNIZE.The
CEPOperator records the operators (i.e.Core pattern class that stores the definition of a single pattern.
Some utility methods for transforming Calcite's constructs into our own Beam constructs (for
serialization purpose).
This class is responsible for processing individual ChangeStreamRecord.
Data access object to list and read stream partitions of a table.
Responsible for making change stream queries for a given partition.
Class to aggregate metrics related functionality.
Class to aggregate metrics related functionality.
Represents a Spanner Change Stream Record.
Holds internal execution metrics / metadata for the processed
ChangeStreamRecord.Decorator class over a
ResultSet that provides telemetry for the streamed records.Represents telemetry metadata gathered during the consumption of a change stream query.
Single place for defining the constants used in the
Spanner.readChangeStreams()
connector.Checkpoint data to make it available in future pipeline runs.
Checkpoint dir tree.
Helpers for reporting checkpoint durations.
A child partition represents a new partition that should be queried.
Represents a ChildPartitionsRecord.
This class is part of the process for
ReadChangeStreamPartitionDoFn SDF.Encoder for TIME and DATETIME values, according to civil_time encoding.
A read-only
FileSystem implementation looking up resources using a ClassLoader.AutoService registrar for the ClassLoaderFileSystem.An IO to write to ClickHouse.
A
PTransform to write to ClickHouse.Writes Rows and field values using
ClickHousePipedOutputStream.Factory to build and configure any
AwsClientBuilder using a specific ClientConfiguration or the globally provided settings in AwsOptions as fallback.Default implementation of
ClientBuilderFactory.Trust provider to skip certificate verification.
AWS client configuration.
Access to the current time.
A receiver of streamed data that can be closed.
An
AutoCloseable that wraps a resource that needs to be cleaned up but does not implement
AutoCloseable itself.An exception that wraps errors thrown while a resource is being closed.
A function that knows how to clean up after a resource.
A
ThrowingConsumer that can be closed.A representation of an arbitrary Java object to be instantiated by Dataflow workers.
Utilities for converting an object to a
CloudObject.A translator that takes an object and creates a
CloudObject which can be converted back
to the original object.A class providing transforms between Cloud Pub/Sub and Pub/Sub Lite message types.
Properties needed when using Google CloudResourceManager with the Apache Beam SDK.
Factory class for implementations of
AnnotateImages.Accepts
ByteString (encoded image contents) with optional DoFn.SideInput with a Map of ImageContext to
the image.Accepts
String (image URI on GCS) with optional DoFn.SideInput with a Map of ImageContext to
the image.A
Sink for Spark's
metric system reporting metrics (including Beam step metrics) to a CSV file.A
Sink for Spark's
metric system reporting metrics (including Beam step metrics) to Graphite.A
Coder<T> defines how to encode and decode values of type T into
byte streams.Deprecated.
To implement a coder, do not use any
Coder.Context.Exception thrown by
Coder.verifyDeterministic() if the encoding is not deterministic,
including details of why the encoding is not deterministic.Coder authors have the ability to automatically have their Coder registered with
the Dataflow Runner by creating a ServiceLoader entry and a concrete implementation of
this interface.An
Exception thrown if there is a problem encoding or decoding a value.Serialization utility class.
Serialization utility class.
A function for converting a byte array pair to a key-value pair.
Properties for use in
Coder tests.An
ElementByteSizeObserver that records the observed element sizes for testing
purposes.A
CoderProvider provides Coders.Coder creators have the ability to automatically have their coders
registered with this SDK by creating a ServiceLoader entry and a concrete implementation
of this interface.Static utility methods for creating and working with
CoderProviders.This class is used to estimate the size in bytes of a given element.
Flink
TypeInformation for Beam Coders.Flink
TypeSerializer for Beam Coders.A row result of a
CoGroupByKey.A
Coder for CoGbkResults.A schema for the results of a
CoGroupByKey.A transform that performs equijoins across multiple schema
PCollections.Defines the set of fields to extract for the join key, as well as other per-input join options.
A
PTransform that calculates the cross-product join.The implementing PTransform.
A
PTransform that performs a CoGroupByKey on a tuple of tables.Defines a column type from a Cloud Spanner table with the following information: column name,
column type, flag indicating if column is primary key and column position in the table.
PTransforms for combining PCollection elements globally and per-key.A
CombineFn that uses a subclass of Combine.AccumulatingCombineFn.Accumulator as its
accumulator type.The type of mutable accumulator values used by this
AccumulatingCombineFn.An abstract subclass of
Combine.CombineFn for implementing combiners that are more easily and
efficiently expressed as binary operations on doubles.An abstract subclass of
Combine.CombineFn for implementing combiners that are more easily
expressed as binary operations.An abstract subclass of
Combine.CombineFn for implementing combiners that are more easily and
efficiently expressed as binary operations on ints.
An abstract subclass of
Combine.CombineFn for implementing combiners that are more easily and
efficiently expressed as binary operations on longs.A
CombineFn<InputT, AccumT, OutputT> specifies how to combine a collection of input
values of type InputT into a single output value of type OutputT.Combine.Globally<InputT, OutputT> takes a PCollection<InputT> and returns a
PCollection<OutputT> whose elements are the result of combining all the elements in
each window of the input PCollection, using a specified CombineFn<InputT, AccumT, OutputT>.Combine.GloballyAsSingletonView<InputT, OutputT> takes a PCollection<InputT>
and returns a PCollectionView<OutputT> whose elements are the result of combining all
the elements in each window of the input PCollection, using a specified CombineFn<InputT, AccumT, OutputT>.GroupedValues<K, InputT, OutputT> takes a PCollection<KV<K, Iterable<InputT>>>,
such as the result of GroupByKey, applies a specified CombineFn<InputT, AccumT, OutputT> to each of the input KV<K, Iterable<InputT>>
elements to produce a combined output KV<K, OutputT> element, and returns a
PCollection<KV<K, OutputT>> containing all the combined output elements.
Holds a single value of type
V which may or may not be present.PerKey<K, InputT, OutputT> takes a PCollection<KV<K, InputT>>, groups it by
key, applies a combining function to the InputT values associated with each key to
produce a combined OutputT value, and returns a PCollection<KV<K, OutputT>>
representing a map from each distinct key of the input PCollection to the corresponding
combined value.Like
Combine.PerKey, but sharding the combining of hot keys.Deprecated.
For internal use only; no backwards-compatibility guarantees.
For internal use only; no backwards-compatibility guarantees.
Static utility methods that create combine function instances.
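A minimal sketch of per-key and global combining with a built-in CombineFn (the input PCollection perUserCounts is an assumption for illustration):

    // Sum the Integer values associated with each key.
    PCollection<KV<String, Integer>> totals =
        perUserCounts.apply(Combine.perKey(Sum.ofIntegers()));
    // Drop the keys and sum every value into a single result per window.
    PCollection<Integer> grandTotal =
        perUserCounts.apply(Values.create()).apply(Combine.globally(Sum.ofIntegers()));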
A tuple of outputs produced by a composed combine functions.
A builder class to construct a composed
CombineFnBase.GlobalCombineFn.A composed
Combine.CombineFn that applies multiple CombineFns.A composed
CombineWithContext.CombineFnWithContext that applies multiple CombineFnWithContexts.Utilities for testing
CombineFns.This class contains combine functions that have access to
PipelineOptions and side inputs
through CombineWithContext.Context.A combine function that has access to
PipelineOptions and side inputs through
CombineWithContext.Context.Information accessible to all methods in
CombineFnWithContext and
KeyedCombineFnWithContext.An internal interface for signaling that a
GloballyCombineFn or a
PerKeyCombineFn needs to access CombineWithContext.Context.A
ReadableState cell defined by a Combine.CombineFn, accepting multiple input values,
combining them as specified into accumulators, and producing a single output value.A Source that reads from compressed files.
Reader for a
CompressedSource.Deprecated.
Use
Compression insteadFactory interface for creating channels that decompress the content of an underlying channel.
Various compression types for reading/writing files.
Class for building
PluginConfig object of the specific class.
DeserializerProvider that uses Confluent Schema Registry to resolve a
Deserializers and Coder given a subject.Enumeration of debezium connectors.
Print to console.
Write to console.
PTransform writing PCollection to the console.Pair of a bit of user code (a "closure") and the
Requirements needed to run it.A function from an input to an output that may additionally access
Contextful.Fn.Context when
computing the result.An accessor for additional capabilities available in
Contextful.Fn.apply(InputT, org.apache.beam.sdk.transforms.Contextful.Fn.Context).PTransforms that read text files and collect contextual information of the elements in
the input.Implementation of
ContextualTextIO.read().Implementation of
ContextualTextIO.readFiles().A range of contiguous event sequences and the latest timestamp of the events in the range.
A pool of control clients that brokers incoming SDK harness connections (in the form of
InstructionRequestHandlers.A sink for
InstructionRequestHandlers keyed by worker id.A source of
InstructionRequestHandlers.A set of utilities for converting between different objects supporting schemas.
Helper functions for converting between equivalent schema types.
Return value after converting a schema.
A
BoundedSource reading from Cosmos.
Create a Cosmos client from the pipeline options.
PTransforms to count the elements in a PCollection.
A metric that reports a single long value and can be incremented or decremented.
Implementation of Counter.
Returns the count of TRUE values for expression.
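For reference, counting with the Count transforms looks roughly like this (the input PCollection words is an assumption for illustration):

    // Per-element counts, e.g. word counts.
    PCollection<KV<String, Long>> wordCounts = words.apply(Count.perElement());
    // A single total count of all elements per window.
    PCollection<Long> totalWords = words.apply(Count.globally());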
Pipeline visitors that fills a lookup table of
PValue to number of consumers.Most users should use
GenerateSequence instead.The checkpoint for an unbounded
CountingSource is simply the last value produced.A custom coder for
CounterMark.Combine.CombineFn for Covariance on Number types.A
PipelineRunner that applies no overrides and throws an exception on calls to Pipeline.run().
Create<T> takes a collection of elements of type T known when the pipeline is constructed and returns a PCollection<T> containing the elements.
A PTransform that creates a PCollection whose elements have associated timestamps.
A PTransform that creates a PCollection from a set of in-memory objects.
A PTransform that creates a PCollection whose elements have associated windowing metadata.
A DataflowRunner marker class for creating a PCollectionView.
Enum containing all supported dispositions for table.
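A minimal sketch of Create, typically used to seed a pipeline with in-memory test data (the pipeline object is an assumption for illustration):

    PCollection<String> colors =
        pipeline.apply(Create.of("red", "green", "blue").withCoder(StringUtf8Coder.of()));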
An abstract class that contains common configuration options for creating resources.
An abstract builder for
CreateOptions.A standard configuration options with builder.
Builder for
CreateOptions.StandardCreateOptions.Create an input stream from Queue.
Spark streaming overrides for various view (side input) transforms.
Creates a primitive
PCollectionView.Creates any tables needed before performing streaming writes to the tables.
Construct an oauth credential to be used by the SDK and the SDK workers.
Parameters abstract class to expose the transforms to an external SDK.
PTransforms for reading and writing CSV files.PTransform for writing CSV files.PTransform for Parsing CSV Record Strings into Schema-mapped target types.CsvIOParseError is a data class to store errors from CSV record processing.A
Sink for Spark's
metric system reporting metrics (including Beam step metrics) to a CSV file.A
FileWriteSchemaTransformFormatProvider for CSV format.An implementation of
TypedSchemaTransformProvider for CsvIO.write(java.lang.String, org.apache.commons.csv.CSVFormat).Configuration for writing to BigQuery with Storage Write API.
Builder for
CsvWriteTransformProvider.CsvWriteConfiguration.An abstract base class that implements all methods of
Coder except Coder.encode(T, java.io.OutputStream)
and Coder.decode(java.io.InputStream).Describes a customer.
An optional component to use with the
RetryHttpRequestInitializer in order to provide
custom errors for failing http calls.A Builder which allows building immutable CustomHttpErrors object.
A simple Tuple class for creating a list of HttpResponseMatcher and HttpResponseCustomError to
print for the responses.
A helper class for supporting sources defined as
Source.Interface that table providers can implement if they require custom table name resolution.
A policy for custom record timestamps where timestamps within a partition are expected to be
roughly monotonically increasing with a cap on out of order event delays (say 1 minute).
A Custom X509TrustManager that trusts a user provided CA and default CA's.
Utility class for wiring up Jet DAGs based on Beam pipelines.
Listener that can be registered with a
DAGBuilder in order to be notified when edges
are being registered.Factory class to create data access objects to perform change stream queries and access the
metadata tables.
Pipeline options for Data Catalog table provider.
Uses DataCatalog to get the source type and schema for a table.
A data change record encodes modifications to Cloud Spanner rows.
This class is part of the process for
ReadChangeStreamPartitionDoFn SDF.Wrapper around the generated
Dataflow client to provide common functionality.Specialized implementation of
GroupByKey for translating Redistribute transform into
Dataflow service protos.Registers
DataflowGroupByKey.DataflowGroupByKeyTranslator.An exception that is thrown if the unique job name constraint of the Dataflow service is broken
because an existing job with the same job name is currently active.
An exception that is thrown if the existing job has already been updated within the Dataflow
service and is no longer able to be updated.
A
RuntimeException that contains information about a DataflowPipelineJob.Internal.
Returns the default Dataflow client built from the passed in PipelineOptions.
Creates a
Stager object using the class specified in DataflowPipelineDebugOptions.getStagerClass().Sets Integer value based on old, deprecated field (
DataflowPipelineDebugOptions.getUnboundedReaderMaxReadTimeSec()).A DataflowPipelineJob represents a job submitted to Dataflow using
DataflowRunner.Options that can be used to configure the
DataflowRunner.Set of available Flexible Resource Scheduling goals.
Returns a default staging location under
GcpOptions.getGcpTempLocation().Register the
DataflowPipelineOptions.Register the
DataflowRunner.DataflowPipelineTranslator knows how to translate Pipeline objects into Cloud
Dataflow Service API Jobs.The result of a job translation.
Options that are used to configure the Dataflow pipeline worker pool.
Type of autoscaling algorithm to use.
Options for controlling profiling of pipeline execution.
Configuration the for profiling agent.
A
PipelineRunner that executes the operations in the pipeline by first translating them
to the Dataflow representation using the DataflowPipelineTranslator and then submitting
them to a Dataflow service for execution.A marker
DoFn for writing the contents of a PCollection to a streaming PCollectionView backend implementation.An instance of this class can be passed to the
DataflowRunner to add user defined hooks
to be invoked at various times during pipeline execution.Populates versioning and other information for
DataflowRunner.Signals there was an error retrieving information about a job from the Cloud Dataflow Service.
[Internal] Options for configuring StreamingDataflowWorker.
EnableStreamingEngine defaults to false unless one of the experiments is set.
Read global get config request period from system property
'windmill.global_config_refresh_period'.
Read counter reporting period from system property 'windmill.harness_update_reporting_period'.
Factory for creating local Windmill address.
Read 'MaxStackTraceToReport' from system property 'windmill.max_stack_trace_to_report' or
Integer.MAX_VALUE if unspecified.
Read 'PeriodicStatusPageOutputDirector' from system property
'windmill.periodic_status_page_directory' or null if unspecified.
Factory for setting value of WindmillServiceStreamingRpcBatchLimit based on environment.
A
DataflowPipelineJob that is returned when --templateRunner is set.Helpers for cloud communication.
Options that are used exclusively within the Dataflow worker harness.
Deprecated.
This interface will no longer be the source of truth for worker logging configuration
once jobs are executed using a dedicated SDK harness instead of user code being co-located
alongside Dataflow worker code.
The set of log levels that can be used on the Dataflow worker.
Defines a log level override for a specific class, package, or name.
Wrapper for invoking external Python
DataframeTransform.The main PTransform that encapsulates the data generation logic.
A stateful DoFn that converts a sequence of Longs into structured Rows.
Represents a 'datagen' table within a Beam SQL pipeline.
The service entry point for the 'datagen' table type.
Wrapper for
DataInputView.Wrapper for
DataOutputView.Holder for Spark RDD/DStream.
DatastoreIO provides an API for reading from and writing to Google Cloud Datastore over different
versions of the Cloud Datastore Client libraries.DatastoreV1 provides an API to Read, Write and Delete PCollections of
Google Cloud Datastore version v1 Entity objects.A
PTransform that deletes Entities from Cloud Datastore.A
PTransform that deletes Entities from Cloud Datastore and returns
DatastoreV1.WriteSuccessSummary for each successful write.A
PTransform that deletes Entities associated with the given Keys from Cloud Datastore and returns DatastoreV1.WriteSuccessSummary for each successful delete.A
PTransform that reads the result rows of a Cloud Datastore query as Entity
objects.A
PTransform that writes Entity objects to Cloud Datastore.Summary object produced when a number of writes are successfully written to Datastore in a
single Mutation.
A
PTransform that writes Entity objects to Cloud Datastore and returns DatastoreV1.WriteSuccessSummary for each successful write.An implementation of
SchemaIOProvider for reading and writing payloads with DatastoreIO.An abstraction to create schema aware IOs.
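A hedged sketch of the DatastoreV1 read and write path (the project id and the query object, a com.google.datastore.v1.Query, are assumptions for illustration):

    // Read the entities matched by the query, then write them back to Cloud Datastore.
    PCollection<Entity> entities =
        pipeline.apply(DatastoreIO.v1().read().withProjectId("my-project").withQuery(query));
    entities.apply(DatastoreIO.v1().write().withProjectId("my-project"));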
TableProvider for DatastoreIO for consumption by Beam SQL.DataStreams.DataStreamDecoder treats multiple ByteStrings as a single input stream decoding
values with the supplied iterator.An adapter which converts an
InputStream to a PrefetchableIterator of T
values using the specified Coder.An adapter which wraps an
DataStreams.OutputChunkConsumer as an OutputStream.A callback which is invoked whenever the
DataStreams.outbound(org.apache.beam.sdk.fn.stream.DataStreams.OutputChunkConsumer<org.apache.beam.vendor.grpc.v1p69p0.com.google.protobuf.ByteString>) OutputStream becomes full.A date without a time-zone.
A datetime without a time-zone.
Utility class which exposes an implementation
DebeziumIO.read() and a Debezium configuration.A POJO describing a Debezium configuration.
Implementation of
DebeziumIO.read().A schema-aware transform provider for
DebeziumIO.Exposes
DebeziumIO.Read as an external transform for cross-language usage.A receiver of encoded data, decoding it and passing it onto a downstream consumer.
Remove values with duplicate ids.
A set of
PTransforms which deduplicate input records over a time domain and threshold.Deduplicates keyed values using the key over a specified time domain and threshold.
Deduplicates values over a specified time domain and threshold.
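A minimal sketch of value deduplication over processing time (the input PCollection events and the 10-minute threshold are assumptions for illustration):

    PCollection<String> deduped =
        events.apply(
            Deduplicate.<String>values()
                .withTimeDomain(TimeDomain.PROCESSING_TIME)
                .withDuration(Duration.standardMinutes(10)));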
A
PTransform that uses a SerializableFunction to obtain a representative value
for each input element used for deduplication.Default represents a set of annotations that can be used to annotate getter properties on
PipelineOptions with information representing the default value to be returned if no
value is specified.This represents that the default of the option is the specified boolean primitive value.
This represents that the default of the option is the specified byte primitive value.
This represents that the default of the option is the specified char primitive value.
This represents that the default of the option is the specified
Class value.This represents that the default of the option is the specified double primitive value.
This represents that the default of the option is the specified enum.
This represents that the default of the option is the specified float primitive value.
Value must be of type
DefaultValueFactory and have a default constructor.This represents that the default of the option is the specified int primitive value.
This represents that the default of the option is the specified long primitive value.
This represents that the default of the option is the specified short primitive value.
This represents that the default of the option is the specified
String value.Default implementation of
AutoScaler.Construct BlobServiceClientBuilder with given values of Azure client properties.
The
DefaultCoder annotation specifies a Coder class to handle encoding and
decoding instances of the annotated class.A
CoderProviderRegistrar that registers a CoderProvider which can use the
@DefaultCoder annotation to provide coder providers that creates
Coders.A
CoderProvider that uses the @DefaultCoder annotation to provide coder providers that create Coders.The
CoderCloudObjectTranslatorRegistrar containing the default collection of Coder Cloud Object Translators.Implementation of a
ExecutableStageContext.A default
FileBasedSink.FilenamePolicy for windowed and unwindowed files.Encapsulates constructor parameters to
DefaultFilenamePolicy.A Coder for
DefaultFilenamePolicy.Params.Factory for a default value for Google Cloud region according to
https://cloud.google.com/compute/docs/gcloud-compute/#default-properties.
The default way to construct a
GoogleAdsClient.A
JobBundleFactory for which the implementation can specify a custom EnvironmentFactory for environment management.A container for EnvironmentFactory and its corresponding Grpc servers.
Holder for an
SdkHarnessClient along with its associated state and data servers.A
PipelineOptionsRegistrar containing the PipelineOptions subclasses available by
default.Construct S3ClientBuilder with default values of S3 client properties like path style access,
accelerated mode, etc.
Registers the "s3" uri schema to be handled by
S3FileSystem.The
DefaultSchema annotation specifies a SchemaProvider class to handle obtaining
a schema and row for the specified class.SchemaProvider for default schemas.Registrar for default schemas.
Default global sequence combiner.
This default implementation of
BeamSqlTableFilter interface.A trigger that is equivalent to
Repeatedly.forever(AfterWatermark.pastEndOfWindow()).An interface used with the
Default.InstanceFactory annotation to specify the class that
will be an instance factory to produce default values for a given getter on PipelineOptions.A
DelegateCoder<T, IntermediateT> wraps a Coder for IntermediateT and
encodes/decodes values of type T by converting to/from IntermediateT and then
encoding/decoding using the underlying Coder<IntermediateT>.A
CodingFunction<InputT, OutputT> is a serializable
function from InputT to OutputT that may throw any Exception.Implementation of
Counter that delegates to the instance for the current context.Implementation of
Distribution that delegates to the instance for the current context.Implementation of
Gauge that delegates to the instance for the current context.Implementation of
Histogram that delegates to the instance for the current context.Descriptions are used to generate human readable output when the
--help command is
specified.Provides a configured
Deserializer instance and its associated Coder.This class processes
DetectNewPartitionsDoFn.This class is responsible for scheduling partitions.
A SplittableDoFn (SDF) that is responsible for scheduling partitions to be queried.
This restriction tracker delegates most of its behavior to an internal
TimestampRangeTracker.Metadata of the progress of
DetectNewPartitionsDoFn from the metadata
table.The DicomIO connectors allows Beam pipelines to make calls to the Dicom API of the Google Cloud
Healthcare API (https://cloud.google.com/healthcare/docs/how-tos#dicom-guide).
This class makes a call to the retrieve metadata endpoint
(https://cloud.google.com/healthcare/docs/how-tos/dicomweb#retrieving_metadata).
Options that can be used to configure the
DirectRunner.A
DefaultValueFactory that returns the result of Runtime.availableProcessors()
from the DirectOptions.AvailableParallelismFactory.create(PipelineOptions) method.Registers the
DirectOptions.Registers the
DirectRunner.The result of running a
Pipeline with the DirectRunner.A
StreamObserver which uses synchronization on the underlying CallStreamObserver
to provide thread safety.Internal-only options for tweaking the behavior of the
PipelineOptions.DirectRunner in ways that users
should never do.Static display data associated with a pipeline component.
Utility to build up display data from a component and its included subcomponents.
Unique identifier for a display data item within a component.
Items are the unit of display data.Specifies an
DisplayData.Item to register as display data.Structured path of registered display data within a component hierarchy.
Display data type.
Distinct<T> takes a PCollection<T> and returns a PCollection<T> that has all distinct elements of the input.
A Distinct PTransform that uses a SerializableFunction to obtain a representative value for each input element.
A metric that reports information about the distribution of reported values.
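A minimal Distinct sketch (the input PCollection words is an assumption for illustration):

    // Keep one occurrence of each distinct element.
    PCollection<String> distinctWords = words.apply(Distinct.create());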
Implementation of
Distribution.The result of a
Distribution metric.A
PTransform connecting to Cloud DLP (https://cloud.google.com/dlp/docs/libraries) and
deidentifying text according to provided settings.A
PTransform connecting to Cloud DLP (https://cloud.google.com/dlp/docs/libraries) and
inspecting text for identifying data according to provided settings.A
PTransform connecting to Cloud DLP (https://cloud.google.com/dlp/docs/libraries) and
inspecting text for identifying data according to provided settings.An
EnvironmentFactory that creates docker containers by shelling out to docker.Provider for DockerEnvironmentFactory.
The argument to
ParDo providing the code to use to process elements of the input PCollection.Annotation for declaring that a state parameter is always fetched.
Annotation on a splittable
DoFn
specifying that the DoFn performs a bounded amount of work per input element, so
applying it to a bounded PCollection will produce also a bounded PCollection.A parameter that is accessible during
@StartBundle, @ProcessElement and @FinishBundle that allows the caller
to register a callback that will be invoked after the bundle has been successfully completed
and the runner has committed the output.
An instance of a function that will be invoked after bundle finalization.
Parameter annotation for the input element for
DoFn.ProcessElement, DoFn.GetInitialRestriction, DoFn.GetSize, DoFn.SplitRestriction, DoFn.GetInitialWatermarkEstimatorState, DoFn.NewWatermarkEstimator, and DoFn.NewTracker
methods.Annotation for specifying specific fields that are accessed in a Schema PCollection.
Annotation for the method to use to finish processing a batch of elements.
Annotation for the method that maps an element to an initial restriction for a splittable
DoFn.Annotation for the method that maps an element and restriction to initial watermark estimator
state for a splittable
DoFn.Annotation for the method that returns the coder to use for the restriction of a splittable
DoFn.Annotation for the method that returns the corresponding size for an element and restriction
pair.
Annotation for the method that returns the coder to use for the watermark estimator state of a
splittable
DoFn.Parameter annotation for dereferencing input element key in
KV pair.Receives tagged output for a multi-output function.
Annotation for the method that creates a new
RestrictionTracker for the restriction of
a splittable DoFn.Annotation for the method that creates a new
WatermarkEstimator for the watermark state
of a splittable DoFn.Annotation for registering a callback for a timer.
Annotation for registering a callback for a timerFamily.
Annotation for the method to use for performing actions on window expiration.
Receives values of the given type.
When used as a return value of
DoFn.ProcessElement, indicates whether there is more work to
be done for the current element.Annotation for the method to use for processing elements.
Annotation that may be added to a
DoFn.ProcessElement, DoFn.OnTimer, or DoFn.OnWindowExpiration method to indicate that the runner must ensure that the observable contents
of the input PCollection or mutable state must be stable upon retries.Annotation that may be added to a
DoFn.ProcessElement method to indicate that the runner
must ensure that the observable contents of the input PCollection is sorted by time, in
ascending order.Parameter annotation for the restriction for
DoFn.GetSize, DoFn.SplitRestriction, DoFn.GetInitialWatermarkEstimatorState, DoFn.NewWatermarkEstimator, and DoFn.NewTracker
methods.Annotation for the method to use to prepare an instance for processing bundles of elements.
Parameter annotation for the SideInput for a
DoFn.ProcessElement method.Annotation for the method that splits restriction of a splittable
DoFn into multiple parts to
be processed in parallel.Annotation for the method to use to prepare an instance for processing a batch of elements.
Annotation for declaring and dereferencing state cells.
Annotation for the method to use to clean up this instance before it is discarded.
Parameter annotation for the TimerMap for a
DoFn.ProcessElement method.Annotation for declaring and dereferencing timers.
Parameter annotation for the input element timestamp for
DoFn.ProcessElement, DoFn.GetInitialRestriction, DoFn.GetSize, DoFn.SplitRestriction, DoFn.GetInitialWatermarkEstimatorState, DoFn.NewWatermarkEstimator, and DoFn.NewTracker
methods.Annotation for the method that truncates the restriction of a splittable
DoFn into a bounded one.Annotation on a splittable
DoFn
specifying that the DoFn performs an unbounded amount of work per input element, so
applying it to a bounded PCollection will produce an unbounded PCollection.Parameter annotation for the watermark estimator state for the
DoFn.NewWatermarkEstimator
method.DoFn function.
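A small sketch of a DoFn using the @ProcessElement and @Element annotations described above (the class name, the input PCollection lines, and the splitting logic are illustrative assumptions):

    static class ExtractWordsFn extends DoFn<String, String> {
      @ProcessElement
      public void processElement(@Element String line, OutputReceiver<String> out) {
        // Emit each whitespace-separated token as a separate output element.
        for (String word : line.split("\\s+")) {
          if (!word.isEmpty()) {
            out.output(word);
          }
        }
      }
    }

    PCollection<String> words = lines.apply(ParDo.of(new ExtractWordsFn()));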
Flink operator for executing
DoFns.A
WindowedValueReceiver that can buffer its outputs.Implementation of
DoFnOperator.OutputManagerFactory that creates an DoFnOperator.BufferedOutputManager
that can write to multiple logical outputs by Flink side output.Common
DoFn.OutputReceiver and DoFn.MultiOutputReceiver classes.DoFnRunner decorator which registers
MetricsContainerImpl.DoFnRunner decorator which registers MetricsContainerImpl.Represents information about how a DoFn extracts schemas.
The builder object.
Deprecated.
Use
TestPipeline with the DirectRunner.Deprecated.
Use
TestPipeline with the DirectRunner.A
DoubleCoder encodes Double values in 8 bytes using Java serialization.A transform to drop fields from a schema.
Implementation class for DropFields.
A specialization of
FileBasedSink.DynamicDestinations for AvroIO.This class provides the most general way of specifying dynamic BigQuery table destinations.
Some helper classes that derive from
FileBasedSink.DynamicDestinations.A
Coder using Google Protocol Buffers binary format.IO to read from and write to DynamoDB tables.
Read data from DynamoDB using
DynamoDBIO.Read.getScanRequestFn() and emit an element of type DynamoDBIO.Read
for each ScanResponse using the mapping function DynamoDBIO.Read.getScanResponseMapperFn().Write a PCollection data into DynamoDB.
Transforms for reading and writing data from/to Elasticsearch.
A
BoundedSource reading from Elasticsearch.A
PTransform writing Bulk API entities created by ElasticsearchIO.DocToBulk to
an Elasticsearch cluster.A POJO describing a connection configuration to Elasticsearch.
A
PTransform converting docs to their Bulk API counterparts.A
PTransform reading data from Elasticsearch.A POJO encapsulating a configuration for retry behavior when issuing requests to ES.
A
PTransform writing data to Elasticsearch.Manipulates test data used by the
ElasticsearchIO integration tests.Pipeline options for elasticsearch tests.
Map to tuple function.
An
EnvironmentFactory that communicates to a FnHarness which is executing in the
same process.Provider of EmbeddedEnvironmentFactory.
Passing null values to Spark's Java API may cause problems because of Guava preconditions.
Options for allowing or disallowing filepatterns that match no resources in
FileSystems.match(java.util.List<java.lang.String>).A wrapper around a
Throwable for use with coders.An encoded
BoundedWindow used within Runners to track window information without needing
to decode the window.Flink
TypeComparator for Beam values that have been
encoded to byte data by a Coder.TypeSerializer for values that were encoded using a Coder.Flink
TypeInformation for Beam values that have been encoded to byte data by a Coder.Encoders utility class.Encoder / expression utils that are called from generated code.
Represents an error during encoding (serializing) a class.
Represents an error during encoding (serializing) a class.
This
Schema.LogicalType represent an enumeration over a fixed set of values.This class represents a single enum value.
Creates
environments which communicate to an SdkHarnessClient.Provider for a
EnvironmentFactory and ServerFactory for the environment.ErrorContainer interface.
An Error Handler is a utility object used for plumbing error PCollections to a configured sink.
Error Handlers must be closed before a pipeline is run to properly pipe error collections to the
sink, and the pipeline will be rejected if any handlers aren't closed.
A default, placeholder error handler that exists to allow usage of .addErrorCollection()
without effects.
The
EvaluationContext is the result of a pipeline translation and can be used to evaluate / run the pipeline.The EvaluationContext allows us to define pipeline instructions and translate between
PObject<T>s or PCollection<T>s and Ts or DStreams/RDDs of Ts.Classes extending this interface will be called by
OrderedEventProcessor to examine every
incoming event.The interface that enables querying of a graph of independently executable stages and the inputs
and outputs of those stages.
The context required in order to execute
stages.Creates
ExecutableStageContext instances.This operator is the streaming equivalent of the
FlinkExecutableStageFunction.Drives the execution of a
Pipeline by scheduling work.The state of the driver.
Options for configuring the
ScheduledExecutorService used throughout the Java runtime.Returns the default
ScheduledExecutorService to use within the Apache Beam SDK.A
gRPC Server for an ExpansionService.A service that allows a pipeline to expand transforms from a remote SDK.
A registrar that creates
TransformProvider instances from RunnerApi.FunctionSpecs.Exposes Java transforms via
ExternalTransformRegistrar.Options used to configure the
ExpansionService.Loads the ExpansionService config.
Loads the allow list from
ExpansionServiceOptions.getJavaClassLookupAllowlistFile(), defaulting to an empty
JavaClassLookupTransformProvider.AllowList.Apache Beam provides a number of experimental features that can be enabled with this flag.
An
EnvironmentFactory which requests workers via the given URL in the Environment.Provider of ExternalEnvironmentFactory.
Exposes
PubsubIO.Read as an external transform for cross-language usage.Parameters class to expose the transform to an external SDK.
Does an external sort of the provided values.
ExternalSorter.Options contains configuration of the sorter.Sorter type.
Provides mechanism for acquiring locks related to the job.
An interface for building a transform from an externally provided configuration.
A registrar which contains a mapping from URNs to available
ExternalTransformBuilders.Exposes
PubsubIO.Write as an external transform for cross-language usage.Parameters class to expose the transform to an external SDK.
A Factory interface for schema-related objects for a specific Java type.
Alternative implementation of
PipelineResult used to avoid throwing Exceptions in certain
situations.An immutable tuple of value, timestamp, window, and pane.
A coder for
FailsafeValueInSingleWindow.A generic failure of an SQL transform.
Class FailureCollectorWrapper is a class for collecting ValidationFailure.
A fake implementation of BigQuery's query service.
An implementation of
BigQueryServices.BigQueryServerStream which takes a List as the Iterable to simulate a server stream.A fake dataset service that can be serialized, for use in testReadFromTable.
A fake implementation of BigQuery's job service.
FhirBundleParameter represents a FHIR bundle in JSON format to be executed on a FHIR store.
FhirIO provides an API for reading and writing resources to Google Cloud Healthcare Fhir API.Deidentify FHIR resources from a FHIR store to a destination FHIR store.
A function that schedules a deidentify operation and monitors the status.
The type Execute bundles.
ExecuteBundlesResult contains both successfully executed bundles and information to help debug
failed executions (e.g. metadata and error messages).
Export FHIR resources from a FHIR store to new line delimited json files on GCS or BigQuery.
A function that schedules an export operation and monitors the status.
Writes each bundle of elements to a new-line delimited JSON file on GCS and issues a
fhirStores.import Request for that file.
The enum Content structure.
The type Read.
The type Result.
The type Search.
The type Write.
The type Result.
The enum Write method.
The type FhirIOPatientEverything for querying a FHIR Patient resource's compartment.
PatientEverythingParameter defines required attributes for a FHIR GetPatientEverything request
in
FhirIOPatientEverything.The Result for a
FhirIOPatientEverything request.FhirSearchParameter represents the query parameters for a FHIR search request, used as a
parameter for
FhirIO.Search.FhirSearchParameterCoder is the coder for
FhirSearchParameter, which takes a coder for
type T.Used inside of a
DoFn to describe which fields in a schema
type need to be accessed for processing.Description of a single field.
Builder class.
Qualifier for a list selector.
Qualifier for a map selector.
OneOf union for a collection selector.
The kind of qualifier.
Parser for textual field-access selector.
This class provides an empty implementation of
FieldSpecifierNotationListener,
which can be extended to create a listener which only needs to handle a subset
of the available methods.This class provides an empty implementation of
FieldSpecifierNotationVisitor,
which can be extended to create a visitor which only needs to handle a subset
of the available methods.This interface defines a complete listener for a parse tree produced by
FieldSpecifierNotationParser.This interface defines a complete generic visitor for a parse tree produced
by
FieldSpecifierNotationParser.Utilities for converting between
Schema field types and TypeDescriptors that
define Java objects which can represent these field types.For internal use only; no backwards-compatibility guarantees.
For internal use only; no backwards-compatibility guarantees.
For internal use only; no backwards-compatibility guarantees.
Represents type information for a Java type that will be used to infer a Schema type.
A naming policy for schema fields.
Abstract class for file-based output.
Deprecated.
use
Compression.A class that allows value-dependent writes in
FileBasedSink.A naming policy for output files.
Result of a single bundle write.
A coder for
FileBasedSink.FileResult objects.Provides hints about how to generate output files, such as a suggested filename suffix (e.g.
Implementations create instances of
WritableByteChannel used by FileBasedSink
and related classes to allow decorating, or otherwise transforming, the raw data that
would normally be written directly to the WritableByteChannel passed into FileBasedSink.WritableByteChannelFactory.create(WritableByteChannel).Abstract operation that manages the process of writing to
FileBasedSink.Abstract writer that writes a bundle to a
FileBasedSink.A common base class for all file-based
Sources.A
reader that implements code common to readers of
FileBasedSources.A given
FileBasedSource represents a file resource of one of these types.Matcher to verify checksum of the contents of an
ShardedFile in E2E test.General-purpose transforms for working with files: listing files (matching), reading and writing.
Implementation of
FileIO.match().Implementation of
FileIO.matchAll().Describes configuration for matching filepatterns, such as
EmptyMatchTreatment and
continuous watching for matching files.A utility class for accessing a potentially compressed file.
Implementation of
FileIO.readMatches().Enum to control how directories are handled.
Specifies how to write elements to individual files in
FileIO.write() and FileIO.writeDynamic().Implementation of
FileIO.write() and FileIO.writeDynamic().A policy for generating names for shard files.
Interface that provides a
PTransform that reads in a PCollection of FileIO.ReadableFiles and outputs the data represented as a PCollection of Rows.Flink
metrics reporter for writing
metrics to a file specified via the "metrics.reporter.file.path" config key (assuming an alias of
"file" for this reporter in the "metrics.reporters" setting).File staging related options.
File system interface in Beam.
A registrar that creates
FileSystem instances from PipelineOptions.Clients facing
FileSystem utility.The configuration for building file writing transforms using
SchemaTransform and SchemaTransformProvider.Configures extra details related to writing CSV formatted files.
Configures extra details related to writing Parquet formatted files.
Configures extra details related to writing XML formatted files.
Provides a
PTransform that writes a PCollection of Rows and outputs a
PCollection of the file names according to a registered AutoService FileWriteSchemaTransformFormatProvider
implementation.FileWriteSchemaTransformFormatProviders contains FileWriteSchemaTransformFormatProvider implementations.A
TypedSchemaTransformProvider implementation for writing a Row PCollection to file systems, driven by a FileWriteSchemaTransformConfiguration.Fill gaps in timeseries.
Argument to withInterpolateFunction function.
A
PTransform for filtering a collection of schema types.PTransforms for filtering from a PCollection the elements satisfying a predicate,
or satisfying an inequality with a given value based on the elements' natural ordering.Implementation of the filter.
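A minimal sketch of the predicate and natural-ordering styles of filtering described above, assuming an input PCollection<Integer> named numbers:
    // Keep only even numbers via a predicate.
    PCollection<Integer> evens = numbers.apply(Filter.by((Integer n) -> n % 2 == 0));
    // Keep only numbers greater than 100 via the natural-ordering helper.
    PCollection<Integer> large = numbers.apply(Filter.greaterThan(100));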
Utilities that convert between a SQL filter expression and an Iceberg
Expression.Builds a MongoDB FindQuery object.
FirestoreIO provides an API for reading from and writing to Google Cloud
Firestore.FirestoreV1 provides an API which provides lifecycle managed PTransforms for Cloud Firestore
v1 API.Concrete class representing a
PTransform<PCollection<BatchGetDocumentsRequest>, PTransform<BatchGetDocumentsResponse>> which will read from Firestore.A type safe builder for
FirestoreV1.BatchGetDocuments allowing configuration and instantiation.Concrete class representing a
PTransform<PCollection<Write>, PCollection<FirestoreV1.WriteFailure
> which will write to Firestore.A type safe builder for
FirestoreV1.BatchWriteWithDeadLetterQueue allowing configuration and
instantiation.A type safe builder for
FirestoreV1.BatchWriteWithSummary allowing configuration and
instantiation.Exception that is thrown if one or more
Writes is unsuccessful
with a non-retryable status code.Concrete class representing a
PTransform<PCollection<ListCollectionIdsRequest>, PTransform<ListCollectionIdsResponse>> which will read from Firestore.A type safe builder for
FirestoreV1.ListCollectionIds allowing configuration and instantiation.Concrete class representing a
PTransform<PCollection<ListDocumentsRequest>, PTransform<ListDocumentsResponse
>> which will read from Firestore.A type safe builder for
FirestoreV1.ListDocuments allowing configuration and instantiation.Concrete class representing a
PTransform<PCollection<PartitionQueryRequest>, PTransform<RunQueryRequest>>
which will read from Firestore.A type safe builder for
FirestoreV1.PartitionQuery allowing configuration and instantiation.Type safe builder factory for read operations.
Concrete class representing a
PTransform<PCollection<RunQueryRequest>, PTransform<RunQueryResponse>> which
will read from Firestore.A type safe builder for
FirestoreV1.RunQuery allowing configuration and instantiation.Type safe builder factory for write operations.
Failure details for an attempted
Write.Summary object produced when a number of writes are successfully written to Firestore in a
single BatchWrite.
A LogicalType representing a fixed-length byte array.
Fixed precision numeric types used to represent jdbc NUMERIC and DECIMAL types.
A LogicalType representing a fixed-length string.
A
WindowFn that windows values into fixed-size timestamp-based windows.PTransforms for mapping a simple function that returns iterables over the elements of a
PCollection and merging the results.A
PTransform that adds exception handling to FlatMapElements.Flatten<T> takes multiple PCollection<T>s bundled into a
PCollectionList<T> and returns a single PCollection<T> containing all the elements in
all the input PCollections.FlattenIterables<T> takes a PCollection<Iterable<T>> and returns a
PCollection<T> that contains all the elements from each iterable.A
PTransform that flattens a PCollectionList into a PCollection
containing all the elements of all the PCollections in its input.Jet
Processor implementation for Beam's Flatten primitive.Jet
Processor supplier that will provide instances of FlattenP.An implementation of
TypedSchemaTransformProvider for Flatten.Flatten translator.
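To make the Flatten entries above concrete, a minimal sketch assuming three existing PCollection<String>s pc1, pc2 and pc3:
    // Bundle several PCollections into a PCollectionList, then flatten them into one PCollection.
    PCollectionList<String> all = PCollectionList.of(pc1).and(pc2).and(pc3);
    PCollection<String> merged = all.apply(Flatten.<String>pCollections());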
Category tag for tests that use a
Flatten where the input PCollectionList
contains PCollections with heterogeneous coders.Flink
FlatMapFunction for implementing Window.Assign.A translator that translates bounded portable pipelines into executable Flink pipelines.
Batch translation context.
Predicate to determine whether a URN is a Flink native transform.
Transform translation interface.
A Flink
Source implementation that wraps a
Beam BoundedSource.A Flink
SourceReader implementation
that reads from the assigned FlinkSourceSplits by using Beam BoundedReaders.StateInternals that uses a Flink OperatorStateBackend to manage the broadcast
state.Result of a detached execution of a
Pipeline with Flink.Encapsulates a
DoFn inside a Flink RichMapPartitionFunction.Singleton class that contains one
ExecutableStageContext.Factory per job.Flink operator that passes its input DataSet through an SDK-executed
ExecutableStage.A Flink function that demultiplexes output from a
FlinkExecutableStageFunction.Utilities for Flink execution environments.
Explode
WindowedValue that belongs to multiple windows into multiple "single window"
values, so we can safely group elements by (K, W) tuples.A map function that outputs the input element without any change.
Job Invoker for the
FlinkRunner.Driver program that starts a job server for the Flink runner.
Flink runner-specific Configuration for the jobServer.
Utility functions for dealing with key encoding.
Special version of
FlinkReduceFunction that supports merging windows.Helper class for holding a
MetricsContainerImpl and forwarding Beam metrics to Flink
accumulators and metrics.The base helper class for holding a
MetricsContainerImpl and forwarding
Beam metrics to Flink accumulators and metrics.Entry point for starting an embedded Flink cluster.
A
FlatMapFunction function that filters out those elements that don't belong in this
output.Reduce function for non-merging GBK implementation.
A
StepContext for Flink Batch Runner execution.This is the first step for executing a
Combine.PerKey on
Flink.Options which can be used to configure the Flink Runner.
Maximum bundle size factory.
Maximum bundle time factory.
Runs a Pipeline on Flink via
FlinkRunner.Flink job entry point to launch a Beam pipeline by executing an external SDK driver program.
Interface for portable Flink translators.
A handle used to execute a translated pipeline.
The context used for pipeline translation.
Result of executing a portable
Pipeline with Flink.Various utilities related to portability.
This is the second part for executing a
Combine.PerKey on
Flink, the second part is FlinkReduceFunction.A
PipelineRunner that executes the operations in the pipeline by first translating them
to a Flink Plan and then executing them either locally or on a Flink cluster, depending on the
configuration.AutoService registrar - will register FlinkRunner and FlinkOptions as possible pipeline runner
services.
Pipeline options registrar.
Pipeline runner registrar.
Result of executing a
Pipeline with Flink.A
SideInputReader for the Flink Batch Runner.The base class for
FlinkBoundedSource and FlinkUnboundedSource.An abstract implementation of
SourceReader which encapsulates Beam Sources
for data reading.A Flink
SourceSplit implementation that encapsulates a Beam Source.Constructs a StateBackend to use from flink pipeline options.
A
RichGroupReduceFunction for stateful ParDo in Flink Batch Runner.StateInternals that uses a Flink KeyedStateBackend to manage state.Eagerly create user state to work around https://jira.apache.org/jira/browse/FLINK-12653.
Serializer configuration snapshot for compatibility and format evolution.
Translate an unbounded portable pipeline representation into a Flink pipeline representation.
Predicate to determine whether a URN is a Flink native transform.
Streaming translation context.
A Flink
Source implementation that wraps a
Beam UnboundedSource.A Flink
SourceReader implementation
that reads from the assigned FlinkSourceSplits by using Beam UnboundedReaders.A
FloatCoder encodes Float values in 4 bytes using Java serialization.A client for the control plane of an SDK harness, which can issue requests to it over the Fn API.
A Fn API control service which adds incoming SDK harness connections to a sink.
A receiver of streamed data.
The
FnDataService is able to forward inbound elements to a consumer and is also a
consumer of outbound elements.An interface sharing common behavior with services used during execution of user Fns.
A
ClientResponseObserver which delegates all StreamObserver calls.Base class for table providers that look up table metadata using full table names, instead of
querying it by parts of the name separately.
A metric that reports the latest value out of reported values.
Implementation of
Gauge.The result of a
Gauge metric.Empty
GaugeResult, representing no values reported.Construct an oauth credential to be used by the SDK and the SDK workers.
A registrar containing the default GCP options.
Options used to configure Google Cloud Platform specific options such as the project and
credentials.
Attempts to infer the default project based upon the environment this application is executing
within.
EnableStreamingEngine defaults to false unless one of the two experiments is set.
Returns the default set of OAuth scopes.
Returns
PipelineOptions.getTempLocation() as the default GCP temp location.Attempts to load the GCP credentials.
A registrar containing the default GCP options.
This class implements a
SessionServiceFactory that retrieves the basic authentication
credentials from a Google Cloud Secret Manager secret.An abstract class that contains common configuration options for creating resources.
A builder for
GcsCreateOptions.AutoService registrar for the GcsFileSystem.Options used to configure Google Cloud Storage.
Returns the default
ExecutorService to use within the Apache Beam SDK.Creates a
GcsOptions.GcsCustomAuditEntries that holds key-value pairs to be stored as custom information
in GCS audit logs.Creates a
PathValidator object using the class specified in GcsOptions.getPathValidatorClass().Implements the Java NIO
Path API for Google Cloud Storage paths.GCP implementation of
PathValidator.ResourceId implementation for Google Cloud Storage.Utility class for staging files to GCS.
Provides operations on GCS.
This is a
DefaultValueFactory able to create a GcsUtil using any transport
flags specified on the PipelineOptions.A class that holds either a
StorageObject or an IOException.Class to generate first set of outputs for
DetectNewPartitionsDoFn.A
PTransform that produces longs starting from the given value, and either up to the
given limit or until Long.MAX_VALUE / until the given time elapses.Exposes GenerateSequence as an external transform for cross-language usage.
Parameters class to expose the transform to an external SDK.
Sequence generator table provider.
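A minimal sketch of the GenerateSequence transform described above, assuming a pipeline p:
    // Produce the bounded sequence 0..99.
    PCollection<Long> bounded = p.apply(GenerateSequence.from(0).to(100));
    // Produce an unbounded sequence, emitting roughly five elements per second.
    PCollection<Long> unbounded =
        p.apply(GenerateSequence.from(0).withRate(5, Duration.standardSeconds(1)));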
Helper to generate a DLQ transform to write PCollection to an external system.
A Provider for generic DLQ transforms that handle deserialization failures.
Deprecated.
new implementations should extend the
GetterBasedSchemaProviderV2 class'
methods which receive TypeDescriptors instead of ordinary Classes as
arguments, which permits supporting generic type signatures during schema inference.Benchmarks for
GetterBasedSchemaProvider on reading / writing fields based on toRowFunction / fromRowFunction.A newer version of
GetterBasedSchemaProvider, which works with TypeDescriptors,
and which by default delegates the old, Class based methods, to the new ones.A store to hold the global watermarks for a micro-batch.
A
GlobalWatermarkHolder.SparkWatermarks holds the watermarks and batch time relevant to a micro-batch input
from a specific source.Advance the WMs onBatchCompleted event.
The default window into which all data is placed (via
GlobalWindows).GlobalWindow.Coder for encoding and decoding GlobalWindows.A
WindowFn that assigns all data to the same window.A OIDC web identity token provider implementation that uses the application default credentials
set by the runtime (container, GCE instance, local environment, etc.).
Defines how to construct a
GoogleAdsClient.GoogleAdsIO provides an API for reading from the Google Ads API over supported
versions of the Google Ads client libraries.This interface can be used to implement custom client-side rate limiting policies.
Implement this interface to create a
GoogleAdsIO.RateLimitPolicy.Options used to configure Google Ads API specific options.
Attempts to load the Google Ads credentials.
Constructs and returns
Credentials to be used by Google Ads API calls.GoogleAdsV19 provides an API to read Google Ads API v19 reports.A
PTransform that reads the results of a Google Ads query as GoogleAdsRow
objects.A
PTransform that reads the results of many SearchGoogleAdsStreamRequest
objects as GoogleAdsRow objects.This rate limit policy wraps a
RateLimiter and can be used in low volume and
development use cases as a client-side rate limiting policy.These options configure debug settings for Google API clients created within the Apache Beam SDK.
A
GoogleClientRequestInitializer that adds the trace destination to Google API calls.A
Sink for Spark's
metric system reporting metrics (including Beam step metrics) to Graphite.A generic grouping transform for schema
PCollections.a
PTransform that does a combine using an aggregation built up by calls to
aggregateField and aggregateFields.a
PTransform that groups schema elements based on the given fields.a
PTransform that does a per-key combine using an aggregation built up by calls to
aggregateField and aggregateFields.a
PTransform that does a global combine using an aggregation built up by calls to
aggregateField and aggregateFields.a
PTransform that does a global combine using a provided Combine.CombineFn.A
PTransform for doing global aggregations on schema PCollections.A FlatMap function that groups by windows in batch mode using
ReduceFnRunner.A
PTransform that provides a secure alternative to GroupByKey.GroupByKey<K, V> takes a PCollection<KV<K, V>>, groups the values by key and
windows, and returns a PCollection<KV<K, Iterable<V>>> representing a map from each
distinct key and window of the input PCollection to an Iterable over all the
values associated with that key in the input per window.GroupByKey translator.
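A minimal sketch of GroupByKey as described above, assuming an input PCollection<KV<String, Integer>> named scores:
    // Group all values sharing a key (per window) into a single Iterable.
    PCollection<KV<String, Iterable<Integer>>> grouped =
        scores.apply(GroupByKey.<String, Integer>create());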
Traverses the pipeline to populate the candidates for group by key.
GroupBy window function.
A set of group/combine functions to apply to Spark
RDDs.A
ReadableState cell that combines multiple input values and outputs a single value of a
different type.A
PTransform that batches inputs to a desired batch size.Wrapper class for batching parameters supplied by users.
Functions for GroupByKey with Non-Merging windows translations to Spark.
An
OffsetRangeTracker for tracking a growable offset range.Provides the estimated end offset of the range.
A HeaderAccessorProvider which intercepts the header in a gRPC request and exposes the relevant
fields.
A
FnDataService implemented via gRPC.A
gRPC Server which manages a single FnService.An implementation of the Beam Fn Logging Service over gRPC.
An implementation of the Beam Fn State service.
A
DefaultValueFactory which locates a Hadoop Configuration.AutoService registrar for HadoopFileSystemOptions.AutoService registrar for the HadoopFileSystem.A
HadoopFormatIO is a Transform for reading data from any source or writing data to any
sink which implements Hadoop InputFormat or OutputFormat.Bounded source implementation for
HadoopFormatIO.A
PTransform that reads from any data source which implements Hadoop InputFormat.A wrapper to allow Hadoop
InputSplit to be serialized using
Java's standard serialization mechanisms.A
PTransform that writes to any data sink which implements Hadoop OutputFormat.Builder for External Synchronization defining.
Builder for partitioning determining.
Main builder of Write transformation.
Interface for restrictions for which a default implementation of
DoFn.NewTracker is available, depending only on the restriction
itself.Interface for watermark estimator state for which a default implementation of
DoFn.NewWatermarkEstimator is available, depending only on the watermark estimator state itself.Marker interface for
PTransforms and components to specify display data used
within UIs and diagnostic tools.A Flink combine runner that builds a map of merged windows and produces output after seeing all
input.
Interface for any Spark
Receiver that supports
reading from and to some offset.A
CoderProviderRegistrar for standard types used with HBaseIO.A bounded source and sink for HBase.
A
PTransform that reads from HBase.Implementation of
HBaseIO.readAll().A
PTransform that writes to HBase.Transformation that writes RowMutation objects to an HBase table.
Adapter from HCatalog table schema to Beam
Schema.IO to read and write data using HCatalog.
A
PTransform to read data using HCatalog.A
PTransform to write to a HCatalog managed source.Beam SQL table that wraps
HCatalogIO.Utility classes to enable meta store conf/client creation.
Utilities to convert
HCatRecords to Rows.Implementation of
ExternalSynchronization which registers locks in the HDFS.Interface to access headers in the client request.
Defines a client to communicate with the GCP HCLS API (version v1).
Class for capturing errors on IO operations on Google Cloud Healthcare APIs resources.
Convenience transform to write dead-letter
HealthcareIOErrors to BigQuery TableRows.A heartbeat record serves as a notification that the change stream query has returned all changes
for the partition less than or equal to the record timestamp.
This class is part of the process for
ReadChangeStreamPartitionDoFn SDF.Methods and/or interfaces annotated with
@Hidden will be suppressed from being output
when --help is specified on the command-line.A metric that reports information about the histogram of reported values.
HL7v2IO provides an API for reading from and writing to Google Cloud Healthcare HL7v2 API.The type Read that reads HL7v2 message contents given a PCollection of
HL7v2ReadParameter.PTransform to fetch a message from a Google Cloud Healthcare HL7v2 store based on
msgID.DoFn for fetching messages from the HL7v2 store with error handling.
The type Result includes
PCollection of HL7v2ReadResponse objects for
successfully read results and PCollection of HealthcareIOError objects for
failed reads.List HL7v2 messages in HL7v2 Stores with optional filter.
The type Read that reads HL7v2 message contents given a PCollection of message ID strings.
PTransform to fetch a message from a Google Cloud Healthcare HL7v2 store based on
msgID.DoFn for fetching messages from the HL7v2 store with error handling.
The type Result includes
PCollection of HL7v2Message objects for successfully
read results and PCollection of HealthcareIOError objects for failed reads.The type Write that writes the given PCollection of HL7v2 messages.
The enum Write method.
The type HL7v2 message to wrap the
Message model.HL7v2ReadParameter represents the read parameters for a HL7v2 read request, used as the input
type for
HL7v2IO.HL7v2Read.HL7v2ReadResponse represents the response format for a HL7v2 read request, used as the output
type of
HL7v2IO.HL7v2Read.Coder for
HL7v2ReadResponse.PTransforms to compute HyperLogLogPlusPlus (HLL++) sketches on data streams based on the
ZetaSketch implementation.Provides
PTransforms to extract the estimated count of distinct elements (as
Longs) from each HLL++ sketch.Provides
PTransforms to aggregate inputs into HLL++ sketches.Builder for the
HllCount.Init combining PTransform.Provides
PTransforms to merge HLL++ sketches into a new sketch.HTTP client configuration for both sync and async AWS clients.
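A minimal sketch of the HllCount Init/Extract pattern described in the entries above, assuming an input PCollection<String> named ids:
    // Aggregate the inputs into a single HLL++ sketch, then extract the distinct-count estimate.
    PCollection<byte[]> sketch = ids.apply(HllCount.Init.forStrings().globally());
    PCollection<Long> approxDistinct = sketch.apply(HllCount.Extract.globally());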
A client that talks to the Cloud Healthcare API through HTTP requests.
The type FhirResourcePagesIterator for methods which return paged output.
Wraps
HttpResponse in an exception with a statusCode field for use with HealthcareIOError.The type Hl7v2 message id pages iterator.
SchemaTransform implementation for
IcebergIO.readRows(org.apache.beam.sdk.io.iceberg.IcebergCatalogConfig).A connector that reads and writes to Apache Iceberg
tables.
SchemaTransform implementation for
IcebergIO.readRows(org.apache.beam.sdk.io.iceberg.IcebergCatalogConfig).Utilities for converting between Beam and Iceberg types, made public for user's convenience.
SchemaTransform implementation for
IcebergIO.writeRows(org.apache.beam.sdk.io.iceberg.IcebergCatalogConfig).A generator of unique IDs.
Common
IdGenerator implementations.For internal use only; no backwards-compatibility guarantees.
Flink input format that implements impulses.
Jet
Processor implementation for Beam's Impulse primitive.A
SourceFunc which executes the impulse transform contract.Source function which sends a single global impulse to a downstream operator.
Impulse translator.
Exception thrown by
WindowFn.verifyCompatibility(WindowFn) if two compared WindowFns are
not compatible, including the explanation of incompatibility.A
ProcessFunction which is not a functional interface.IO to read and write from InfluxDB.
A POJO describing a DataSourceConfiguration such as URL, userName and password.
A
PTransform to read from InfluxDB metric or data related to query.A
PTransform to write to a InfluxDB datasource.A DoFn responsible to initialize the metadata table and prepare it for managing the state of the
pipeline.
A DoFn responsible for initializing the change stream Connector.
Utility class to determine initial partition constants and methods.
States to initialize a pipeline outputted by
InitializeDoFn.Holds user state in memory.
A InMemoryJobService that prepares and runs jobs on behalf of a client using a
JobInvoker.A
MetaStore which stores the meta info in memory.A
InMemoryMetaTableProvider is an abstract TableProvider for in-memory types.A retry policy for streaming BigQuery inserts.
Contains information about a failed insert.
Kafka
Deserializer for Instant.Kafka
Serializer for Instant.Interface for any function that can handle a Fn API
BeamFnApi.InstructionRequest.Signifies that a publicly accessible API (public class, method or field) is intended for internal
use only and not for public consumption.
An implementation of
BoundedWindow that represents an interval from IntervalWindow.start
(inclusive) to IntervalWindow.end (exclusive).Encodes an
IntervalWindow as a pair of its upper bound and duration.Exception thrown when the configuration for a
SchemaIO is invalid.Exception thrown when the configuration for a
SchemaIO is invalid.Exception thrown when the schema for a
SchemaIO is invalid.Exception thrown when the request for a table is invalid, such as invalid metadata.
IS_INF(X)
An Ism file is a prefix encoded composite key value file broken into shards.
The footer stores the relevant information required to locate the index and bloom filter.
A
Coder for IsmFormat.Footer.A record containing a composite key and either a value or metadata.
A
Coder for IsmFormat.IsmRecords.A shard descriptor containing shard id, the data block offset, and the index offset for the
given shard.
A coder for
IsmFormat.IsmShards.The prefix used before each key which contains the number of shared and unshared bytes from the
previous key that was read.
A
Coder for IsmFormat.KeyPrefix.A coder for metadata key component.
IS_NAN(X)
An abstract base class with functionality for assembling a
Coder for a class that
implements Iterable.A
SchemaProvider for Java Bean objects.FieldValueTypeSupplier that's based on getter methods.FieldValueTypeSupplier that's based on setter methods.A set of utilities to generate getter and setter classes for JavaBean objects.
An implementation of
TypedSchemaTransformProvider for Explode.A
SchemaTransform for Explode.A
SchemaProvider for Java POJO objects.FieldValueTypeSupplier that's based on public fields.An implementation of
TypedSchemaTransformProvider for Filter for the java language.A
SchemaTransform for Filter-java.An implementation of
TypedSchemaTransformProvider for MapToFields for the java language.A
SchemaTransform for MapToFields-java.Loads
UdfProvider implementations from user-provided jars.A coder for JAXB annotated objects.
A class that manages a connection to a Solace broker using basic authentication.
Beam JDBC Connection.
Calcite JDBC driver with Beam defaults.
IO to read and write data on JDBC.
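A minimal sketch of a JdbcIO read, assuming a pipeline p; the driver class, URL and query are placeholders:
    // Read rows from a JDBC source and map each ResultSet row to a KV.
    PCollection<KV<Integer, String>> rows = p.apply(
        JdbcIO.<KV<Integer, String>>read()
            .withDataSourceConfiguration(JdbcIO.DataSourceConfiguration.create(
                "org.postgresql.Driver", "jdbc:postgresql://host:5432/mydb"))
            .withQuery("SELECT id, name FROM users")
            .withCoder(KvCoder.of(VarIntCoder.of(), StringUtf8Coder.of()))
            .withRowMapper(rs -> KV.of(rs.getInt("id"), rs.getString("name"))));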
A POJO describing a
DataSource, either providing directly a DataSource or all
properties allowing to create a DataSource.Wraps a
JdbcIO.DataSourceConfiguration to provide a DataSource.This is the default
Predicate we use to detect DeadLock.Wraps a
JdbcIO.DataSourceConfiguration to provide a PoolingDataSource.An interface used by the JdbcIO
JdbcIO.ReadAll and JdbcIO.Write to set the parameters of the
PreparedStatement used to setParameters into the database.Implementation of
JdbcIO.read().Implementation of
JdbcIO.readAll().Implementation of
JdbcIO.readRows().Builder used to help with retry configuration for
JdbcIO.An interface used to control if we retry the statements when a
SQLException occurs.An interface used by
JdbcIO.Read for converting each row of the ResultSet into
an element of the resulting PCollection.An interface used by the JdbcIO Write to set the parameters of the
PreparedStatement
used to setParameters into the database.This class is used as the default return value of
JdbcIO.write().A
PTransform to write to a JDBC datasource.A
PTransform to write to a JDBC datasource.An implementation of
SchemaTransformProvider for
reading from JDBC connections using JdbcIO.A helper for
JdbcIO.ReadWithPartitions that handles range calculations.An implementation of
SchemaIOProvider for reading and writing JSON payloads with JdbcIO.Provides utility functions for working with
JdbcIO.The result of writing a row to JDBC datasource.
An implementation of
SchemaTransformProvider for
writing to a JDBC connections using JdbcIO.Jet specific
MetricResults.Jet specific implementation of
MetricsContainer.Pipeline options specific to the Jet runner.
Jet specific implementation of
PipelineResult.Jet specific implementation of Beam's
PipelineRunner.Registers the
JetPipelineOptions.Registers the
JetRunner.An unbounded source for JMS destinations (queues or topics).
An interface used by
JmsIO.Read for converting each jms Message into an element
of the resulting PCollection.A
PTransform to read from a JMS destination.A
PTransform to write to a JMS queue.JmsRecord contains message payload of the record as well as metadata (JMS headers and
properties).
A factory that has all job-scoped information, and can be combined with stage-scoped information
to create a
StageBundleFactory.A subset of
ProvisionApi.ProvisionInfo that
specifies a unique job, while omitting fields that are not known to the runner operator.Internal representation of a Job which has been invoked (prepared and run) by a client.
Factory to create
JobInvocation instances.A job that has been prepared, but not invoked.
Shared code for starting and serving an
InMemoryJobService.Configuration for the jobServer.
Utility class with different versions of joins.
A transform that performs equijoins across two schema
PCollections.Predicate object to specify fields to compare when doing an equi-join.
Implementation class for FieldsEqual.
PTransform representing a full outer join of two collections of KV elements.
Implementation class .
PTransform representing an inner join of two collections of KV elements.
PTransform representing a left outer join of two collections of KV elements.
PTransform representing a right outer join of two collections of KV elements.
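As an illustration of the KV join helpers listed above, a minimal sketch using the join-library style API, assuming two keyed PCollections users (KV<String, String>) and orders (KV<String, Integer>):
    // Inner-join two keyed collections; each output value pairs the matching left and right values.
    PCollection<KV<String, KV<String, Integer>>> joined = Join.innerJoin(users, orders);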
This is a class to catch the built join and check if it is a legal join before passing it to the
actual RelOptRuleCall.
This is a function that gets the output relation and checks if it is a legal relational node.
PTransforms for reading and writing JSON files.PTransform for writing JSON files.Matcher to compare a string or byte[] representing a JSON Object, independent of field order.
A
FileReadSchemaTransformFormatProvider that reads newline-delimited JSONs.The result of a
JsonToRow.withExceptionReporting(Schema) transform.Utils to convert JSON records to Beam
Row.A
FileWriteSchemaTransformFormatProvider for JSON format.An implementation of
TypedSchemaTransformProvider for JsonIO.write(java.lang.String).Configuration for writing to BigQuery with Storage Write API.
Builder for
JsonWriteTransformProvider.JsonWriteConfiguration.A service interface for defining one-time initialization of the JVM during pipeline execution.
Helpers for executing
JvmInitializer implementations.Checkpoint for a
KafkaUnboundedReader.A tuple to hold topic, partition, and offset that comprise the checkpoint for a single
partition.
A
PTransform that commits offsets of KafkaRecord.An unbounded source and a sink for Kafka topics.
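A minimal sketch of a KafkaIO read, assuming a pipeline p; the bootstrap servers and topic are placeholders:
    // Read String key/value records from Kafka, dropping the Kafka metadata.
    PCollection<KV<String, String>> records = p.apply(
        KafkaIO.<String, String>read()
            .withBootstrapServers("broker-1:9092")
            .withTopic("my-topic")
            .withKeyDeserializer(StringDeserializer.class)
            .withValueDeserializer(StringDeserializer.class)
            .withoutMetadata());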
A
PTransform to read from Kafka topics.Exposes
KafkaIO.TypedWithoutMetadata as an external transform for cross-language
usage.Parameters class to expose the Read transform to an external SDK.
A
PTransform to read from KafkaSourceDescriptor.A
PTransform to read from Kafka topics.A
PTransform to write to a Kafka topic with KVs .Exposes
KafkaIO.Write as an external transform for cross-language usage.Parameters class to expose the Write transform to an external SDK.
A
PTransform to write to a Kafka topic with ProducerRecord's.Initialize KafkaIO feature flags on worker.
Utility methods for translating
KafkaIO transforms to and from RunnerApi
representations.Common utility functions and default configurations for
KafkaIO.Read and KafkaIO.ReadSourceDescriptors.Stores and exports metrics for a batch of Kafka Client RPCs.
Metrics of a batch of RPCs.
No-op implementation of
KafkaResults.An interface for providing custom timestamp for elements written to Kafka.
Configuration for reading from a Kafka topic.
Builder for the
KafkaReadSchemaTransformConfiguration.KafkaRecord contains key and value of the record as well as metadata for the record (topic name,
partition id, and offset).
Coder for KafkaRecord.Helper class to create per worker metrics for Kafka Sink stages.
Quick Overview
Represents a Kafka source description.
Kafka table provider.
This is a copy of Kafka's
TimestampType.A keyed implementation of a
BufferingElementsHandler.An immutable tuple of keyed
PCollections with key type K.A utility class to help ensure coherence of tag and input PCollection types.
Keys<K> takes a PCollection of KV<K, V>s and returns a
PCollection<K> of the keys.Thrown when the Kinesis client was throttled due to rate limits.
IO to read from Kinesis streams.
Implementation of
KinesisIO.read().Configuration of Kinesis record aggregation.
Implementation of
KinesisIO.write().Result of
KinesisIO.write().PipelineOptions for
KinesisIO.A registrar containing the default
KinesisIOOptions.Kinesis interface for custom partitioner.
An explicit partitioner that always returns a
Nonnull explicit hash key.KinesisClientRecord enhanced with utility methods.Exposes
KinesisIO.Write and KinesisIO.Read as an external transform for cross-language
usage.A bounded source and sink for Kudu.
An interface used by the KuduIO Write to convert an input record into an Operation to apply as
a mutation in Kudu.
Implementation of
KuduIO.read().A
PTransform that writes to Kudu.An immutable key/value pair.
A
Comparator that orders KVs by the natural ordering of their keys.A
Comparator that orders KVs by the natural ordering of their values.A
KvCoder encodes KVs.KvSwap<K, V> takes a PCollection<KV<K, V>> and returns a PCollection<KV<V,
K>>, where all the keys and values have been swapped.KeySelector that retrieves a key from a KV.Util class for building/parsing labeled
MetricName.Builder class for a labeled
MetricName.Category tags for tests which validate that a Beam runner can handle keys up to a given size.
Tests if a runner supports 100KB keys.
Tests if a runner supports 100MB keys.
Tests if a runner supports 10KB keys.
Tests if a runner supports 10MB keys.
Tests if a runner supports 1MB keys.
HttpRequestInitializer for recording request to response latency of Http-based API calls.
Combine.CombineFn that wraps an AggregateFn.A
Coder which is able to take any existing coder and wrap it such that it is only invoked
in the outer context.Utilities for replacing or wrapping unknown coders with
LengthPrefixCoder.Standard collection of metrics used to record source and sinks information for lineage tracking.
Lineage metrics resource types.
A
FileReadSchemaTransformFormatProvider that reads lines as Strings.AutoService registrar for the LocalFileSystem.Helper functions for producing a
ResourceId that references a local file or directory.An implementation of
TypedSchemaTransformProvider for Logging.A
SchemaTransform for logging.A logical endpoint is a pair of an instruction ID corresponding to the
BeamFnApi.ProcessBundleRequest and the transform within the processing graph.A consumer of
Beam Log Entries.Pipeline visitor that fills lookup table of
PTransform to AppliedPTransform for
usage in FlinkBatchPortablePipelineTranslator.BatchTranslationContext.Top-level
PTransforms that build and instantiate turnkey
transforms.A Factory which creates
ManagedChannel instances.A ManagedFactory produces instances and tears down any produced instances when it is itself
closed.
Pipeline options to tune DockerEnvironment.
Register the
ManualDockerEnvironmentOptions.A
WatermarkEstimator which is controlled manually from within a DoFn.A
ControlClientPool backed by a client map.PTransforms for mapping a simple function over the elements of a PCollection.A
PTransform that adds exception handling to MapElements.MapKeys maps a SerializableFunction<K1,K2> over keys of a
PCollection<KV<K1,V>> and returns a PCollection<KV<K2, V>>.This interface allows you to implement a custom mapper to read and persist elements from/to
Cassandra.
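Picking up the MapElements and MapKeys entries a few lines above, a minimal sketch assuming a PCollection<String> words and a PCollection<KV<String, Long>> counts:
    // Map each word to its length.
    PCollection<Integer> lengths = words.apply(
        MapElements.into(TypeDescriptors.integers()).via((String w) -> w.length()));
    // Upper-case the keys of a keyed collection, leaving the values untouched.
    PCollection<KV<String, Long>> upper = counts.apply(
        MapKeys.into(TypeDescriptors.strings()).via((String k) -> k.toUpperCase()));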
Factory class for creating instances that will map a struct to a connector model.
Util class for mapping plugins.
A
ReadableState cell mapping keys to values.Map to tuple function.
MapValues maps a SerializableFunction<V1,V2> over values of a
PCollection<KV<K,V1>> and returns a PCollection<KV<K, V2>>.The result of
FileSystem.match(java.util.List<java.lang.String>).MatchResult.Metadata of a matched file.Builder class for
MatchResult.Metadata.Status of a
MatchResult.For internal use only; no backwards-compatibility guarantees.
For internal use only; no backwards-compatibility guarantees.
Represents the
PrimitiveViewT supplied to the ViewFn when it declares to use
the iterable materialization.Represents the
PrimitiveViewT supplied to the ViewFn when it declares to use
the multimap materialization.PTransforms for computing the maximum of the elements in a PCollection, or the
maximum of the values associated with each key in a PCollection of KVs.PTransforms for computing the arithmetic mean (a.k.a.Options that are used to control the Memory Monitor.
For internal use only; no backwards compatibility guarantees.
Base class for publishing messages to a Solace broker.
Interface for receiving messages from a Solace broker.
A
Coder for MatchResult.Metadata.This class generates a SpannerConfig for the change stream metadata database by copying only the
necessary fields from the SpannerConfig of the primary database.
Data access object for creating and dropping the metadata table.
Data access object for managing the state of the metadata Bigtable table.
Helper methods that simplifies some conversion and extraction of metadata table content.
The interface to handle CRUD of
BeamSql table metadata.Marker interface for all user-facing metrics.
Implements matching for metrics filters.
Metrics are keyed by the step name they are associated with and the name of the metric.
The name of a metric consists of a
MetricName.getNamespace() and a MetricName.getName().The name of a metric.
The results of a query for metrics.
The results of a single current metric.
Methods for interacting with the metrics of a pipeline that has been executed.
Helper for pretty-printing
Flink metrics.The
Metrics is a utility class for producing various kinds of metrics for reporting
properties of an executing pipeline.Accumulator of
MetricsContainerStepMap.For resilience,
Accumulators are required to be wrapped in a Singleton.AccumulatorV2 for Beam metrics captured in MetricsContainerStepMap.Spark Listener which checkpoints
MetricsContainerStepMap values for fault-tolerance.Holds the metrics for a single step.
AccumulatorV2 implementation for MetricsContainerStepMap.Manages and provides the metrics container associated with each thread.
Set the
MetricsContainer for the associated MetricsEnvironment.Simple POJO representing a filter for querying metrics.
Builder for creating a
MetricsFilter.Extension of
PipelineOptions that defines MetricsSink specific options.A
DefaultValueFactory that obtains the class of the NoOpMetricsSink if it
exists on the classpath, and throws an exception otherwise.Interface for all metric sinks.
A
Source that accommodates Spark's micro-batch oriented nature and wraps an UnboundedSource.A timestamp represented as microseconds since the epoch.
PTransforms for computing the minimum of the elements in a PCollection, or the
minimum of the values associated with each key in a PCollection of KVs.Represents a modification in a table emitted within a
DataChangeRecord.Represents the type of modification applied in the
DataChangeRecord.IO to read and write data on MongoDB GridFS.
Encapsulate the MongoDB GridFS connection logic.
Interface for the parser that is used to parse the GridFSFile into the appropriate types.
Callback for the parser to use to submit data.
A
PTransform to read data from MongoDB GridFS.A
BoundedSource for MongoDB GridFS.A
PTransform to write data to MongoDB GridFS.Function that is called to write the data to the give GridFS OutputStream.
IO to read and write data on MongoDB.
A
PTransform to read data from MongoDB.A
PTransform to write to a MongoDB database.Configures
Metrics throughout various features of RequestResponseIO.A helper class for monitoring jobs submitted to the service.
An interface that can be used for defining callbacks to receive a list of JobMessages
containing monitoring information.
A handler that logs monitoring messages.
Comparator for sorting rows in increasing order based on timestamp.
An object that configures
FileSystems.copy(java.util.List<org.apache.beam.sdk.io.fs.ResourceId>, java.util.List<org.apache.beam.sdk.io.fs.ResourceId>, org.apache.beam.sdk.io.fs.MoveOptions...), FileSystems.rename(java.util.List<org.apache.beam.sdk.io.fs.ResourceId>, java.util.List<org.apache.beam.sdk.io.fs.ResourceId>, org.apache.beam.sdk.io.fs.MoveOptions...), and FileSystems.delete(java.util.Collection<org.apache.beam.sdk.io.fs.ResourceId>, org.apache.beam.sdk.io.fs.MoveOptions...).Defines the standard
MoveOptions.An unbounded source for MQTT broker.
A POJO describing a MQTT connection.
A
PTransform to read from a MQTT broker.A
PTransform to write and send a message to a MQTT server.A container class for MQTT message metadata, including the topic name and payload.
DoFunctions ignore outputs that are not the main output.
A
ReadableState cell mapping keys to bags of values.Mutable state mutates when events apply to it.
A bundle of mutations that must be submitted atomically.
DoFn for reading from Apache Pulsar based on Pulsar
Reader from the start message id.A duration represented in nanoseconds.
A timestamp represented as nanoseconds since the epoch.
Category for integration tests that require Docker.
Category tag for validation tests which utilize
TestPipeline for execution and expect to
be executed by a PipelineRunner.This is a Beam IO to read from, and write data to, Neo4j.
This describes all the information needed to create a Neo4j
Session.Wraps a
Neo4jIO.DriverConfiguration to provide a Driver.This is the class which handles the work behind the
Neo4jIO.readAll() method.An interface used by
Neo4jIO.ReadAll for converting each row of a Neo4j Result
Record into an element of the resulting PCollection.This is the class which handles the work behind the
Neo4jIO.writeUnwind() method.A
Trigger which never fires.The actual trigger class for
Never triggers.Represent new partition as a result of splits and merges.
NFA is an implementation of non-deterministic finite automata.This is a utility class to represent rowCount, rate and window.
This is metadata used for row count and rate estimation.
Handler API.
A non-keyed implementation of a
BufferingElementsHandler.Abstract base class for
WindowFns that do not merge windows.A no-op implementation of Counter.
Construct an oauth credential to be used by the SDK and the SDK workers.
A no-op implementation of Histogram.
For internal use only; no backwards compatibility guarantees.
A
StepContext for Spark Batch Runner execution.doc.
Synchronously compute the earliest partition watermark, by delegating the call to
PartitionMetadataDao#getUnfinishedMinWatermark().
Indicates that we are missing a schema for a type.
A
NullableCoder encodes nullable values of type T using a nested Coder<T>
that does not tolerate null values.A
HttpRequestInitializer for requests that don't have credentials.NoOp implementation of a size estimator.
NoOp implementation of a throughput estimator.
Reference counting object pool to easily share and destroy objects.
Client pool to easily share AWS clients per configuration.
A
BoundedSource that uses offsets to define starting and ending positions.A
Source.Reader that implements code common to readers of all OffsetBasedSources.A restriction represented by a range of integers [from, to).
A coder for
OffsetRanges.A
RangeTracker for non-negative positions of type long.A
RestrictionTracker for claiming offsets in an OffsetRange in a monotonically
increasing fashion.A logical type representing a union of fields.
Represents a single OneOf value.
Describes an order.
Transform for processing ordered events.
The result of the ordered processing.
A
ReadableState cell containing a list of values sorted by timestamp.Parent class for Ordered Processing configuration handlers.
Parent class for Ordered Processing configuration handlers to handle processing of the events
where global sequence is used.
Indicates the status of ordered processing for a particular key.
The
OrderKey class stores the information to sort a column.A
Trigger that executes according to its main trigger until its "finally" trigger fires.Creates factories which determine an underlying
StreamObserver implementation to use in
to interact with fn execution APIs.Creates an outbound observer for the given inbound observer.
A builder for an output, to set all the fields and extended metadata of a Beam value.
A factory that can create output receivers during an executable stage.
A representation used by
Steps to reference the
output of other Steps.Output tag filter.
Helper routines for packages.
Provides information about the pane an element belongs to.
A Coder for encoding PaneInfo instances.
Enumerates the possibilities for the timing of this pane firing related to the input and output
watermarks for its computation.
ParDo is the core element-wise transform in Apache Beam, invoking a user-specified
function on each of the elements of the input PCollection to produce zero or more output
elements, all of which are collected into the output PCollection.A
PTransform that, when applied to a PCollection<InputT>, invokes a
user-specified DoFn<InputT, OutputT> on all its elements, which can emit elements to
any of the PTransform's output PCollections, which are bundled into a result
PCollectionTuple.A
PTransform that, when applied to a PCollection<InputT>, invokes a
user-specified DoFn<InputT, OutputT> on all its elements, with all its outputs
collected into an output PCollection<OutputT>.ParDo translator.
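A minimal sketch of the single-output ParDo form described above, assuming an input PCollection<String> named words:
    // Invoke a DoFn on each element, emitting its length.
    PCollection<Integer> lengths = words.apply(
        ParDo.of(new DoFn<String, Integer>() {
          @ProcessElement
          public void processElement(@Element String word, OutputReceiver<Integer> out) {
            out.output(word.length());
          }
        }));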
A
PTransformOverrideFactory that provides overrides for applications of a ParDo
in the direct runner.Jet
Processor implementation for Beam's ParDo primitive (when no
user-state is being used).Jet
Processor supplier that will provide instances of ParDoP.A function to handle stateful processing in Apache Beam's SparkRunner.
An iterator implementation that processes timers from
SparkTimerInternals.IO to read and write Parquet files.
Implementation of
ParquetIO.parseGenericRecords(SerializableFunction).Implementation of
ParquetIO.parseFilesGenericRecords(SerializableFunction).Implementation of
ParquetIO.read(Schema).Implementation of
ParquetIO.readFiles(Schema).Implementation of
ParquetIO.sink(org.apache.avro.Schema).TableProvider for ParquetIO for consumption by Beam SQL.A
FileWriteSchemaTransformFormatProvider for Parquet format.Exception thrown when Beam SQL is unable to parse the statement.
PTransform for parsing JSON Strings.The result of parsing a single file with Tika: contains the file's location, metadata, extracted
text, and optionally an error.
Partition takes a PCollection<T> and a PartitionFn, uses the
PartitionFn to split the elements of the input PCollection into N partitions,
and returns a PCollectionList<T> that bundles N PCollection<T>s
containing the split elements.A function object that chooses an output partition for an element.
A function object that chooses an output partition for an element.
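A minimal sketch of Partition and its PartitionFn as described above, assuming a PCollection<Integer> named numbers split into 10 buckets by last digit:
    // Split one PCollection into 10 partitions using a PartitionFn.
    PCollectionList<Integer> byLastDigit = numbers.apply(
        Partition.of(10, new Partition.PartitionFn<Integer>() {
          @Override
          public int partitionFor(Integer elem, int numPartitions) {
            return Math.abs(elem) % numPartitions;
          }
        }));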
A partition end record serves as a notification that the client should stop reading the
partition.
This class is part of the process for
ReadChangeStreamPartitionDoFn SDF.A partition event record describes key range changes for a change stream partition.
This class is part of the process for
ReadChangeStreamPartitionDoFn SDF.A
WindowFn that places each value into exactly one window based on its timestamp and
never merges windows.Model for the partition metadata database table used in the Connector.
Partition metadata builder for better user experience.
The state at which a partition can be in the system:
CREATED: the partition has been created, but no query has been done against it yet.
Data access object for creating and dropping the partition metadata table.
Data access object for the Connector metadata tables.
Represents the execution of a read / write transaction in Cloud Spanner.
Represents a result from executing a Cloud Spanner read / write transaction.
This class is responsible for transforming a
Struct to a PartitionMetadata.Configuration for a partition metadata table.
There can be a race when many splits and merges happen to a single partition in quick succession.
Output result of
DetectNewPartitionsDoFn containing
information required to stream a partition.A partition start record serves as a notification that the client should schedule the partitions
to be queried.
This class is part of the process for
ReadChangeStreamPartitionDoFn SDF.
An assertion on the contents of a
PCollection incorporated into the pipeline.Default transform to check that a PAssert was successful.
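A minimal sketch of PAssert in a unit test, assuming a TestPipeline named p:
    // Assert the final contents of a PCollection, then run the pipeline so the assertion executes.
    PCollection<String> output = p.apply(Create.of("a", "b", "c"));
    PAssert.that(output).containsInAnyOrder("c", "b", "a");
    p.run().waitUntilFinish();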
A transform that applies an assertion-checking function over iterables of
ActualT to
the entirety of the contents of its input.A transform that applies an assertion-checking function to the sole element of a
PCollection.Builder interface for assertions applicable to iterables and PCollection contents.
Check that the passed-in matchers match the existing data.
An assertion checker that takes a single
PCollectionView<ActualT>
and an assertion over ActualT, and checks it within a Beam pipeline.Track the place where an assertion is defined.
A
PAssert.IterableAssert about the contents of a PCollection.An assert about the contents of each
PCollection in the given PCollectionList.Builder interface for assertions applicable to a single value.
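A hedged sketch of how the PAssert entries above are typically used in a test, assuming the Beam testing utilities; values are illustrative.

    // Assert on the full contents of a PCollection inside a test pipeline
    // (in JUnit, TestPipeline is normally declared as a @Rule).
    TestPipeline p = TestPipeline.create();
    PCollection<String> output = p.apply(Create.of("a", "b", "c"));
    PAssert.that(output).containsInAnyOrder("c", "b", "a");
    p.run().waitUntilFinish();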
A base class for LogicalTypes that use the same Java type as the underlying base type.
For internal use only; no backwards compatibility guarantees.
PatternCondition stores the function to decide whether a row is a match of a single
pattern.A
PCollection<T> is an immutable collection of values of type
T.The enumeration of cases for whether a
PCollection is bounded.A
PCollectionList<T> is an immutable list of homogeneously typed
PCollection<T>s.A
PCollectionRowTuple is an immutable tuple of PCollections,
"keyed" by a string tag.A
PCollectionTuple is an immutable tuple of heterogeneously-typed PCollections, "keyed" by TupleTags.A
PCollectionView<T> is an immutable view of a PCollection
as a value of type T that can be accessed as a side input to a ParDo transform.For internal use only; no backwards compatibility guarantees.
Implementation which is able to adapt a multimap materialization to an in-memory
List<T>.Implementation which is able to adapt an iterable materialization to an in-memory
List<T>.Implementation which is able to adapt a multimap materialization to an in-memory
Map<K,
V>.Implementation which is able to adapt an iterable materialization to an in-memory
Map<K,
V>.Implementation which is able to adapt a multimap materialization to an in-memory
Map<K,
Iterable<V>>.Implementation which is able to adapt an iterable materialization to an in-memory
Map<K,
Iterable<V>>.Implementation which is able to adapt an iterable materialization to a
List<T>.Deprecated.
Implementation which is able to adapt an iterable materialization to a
Iterable<T>.Deprecated.
Implementation which is able to adapt a multimap materialization to a
List<T>.Deprecated.
Implementation which is able to adapt a multimap materialization to a
Map<K, V>.Deprecated.
Implementation which is able to adapt a multimap materialization to a
Map<K,
Iterable<V>>.A class for
PCollectionView implementations, with additional type parameters that are
not visible at pipeline assembly time when the view is used as a side input.Deprecated.
Implementation which is able to adapt an iterable materialization to a
T.Stores values or metadata about values.
A coder for
PCollectionViews.ValueOrMetadata.PCollectionView translator.
A
PTransform which produces a sequence of elements at fixed runtime intervals.A
PTransform which generates a sequence of timestamped elements at given runtime
intervals.The interface for things that might be input to a
PTransform.A
Pipeline manages a directed acyclic graph of PTransforms, and the
PCollections that the PTransforms consume and produce.For internal use only; no backwards-compatibility guarantees.
Control enum for indicating whether a traversal should process the contents of a
composite transform or not.
Default no-op
Pipeline.PipelineVisitor that enters all composite transforms.Handles failures in the form of exceptions.
PipelineOptions are used to configure Pipelines.
DefaultValueFactory which supplies an ID that is guaranteed to be unique within the
given process.Enumeration of the possible states for a given check.
A
DefaultValueFactory that obtains the class of the DirectRunner if it exists
on the classpath, and throws an exception otherwise.Returns a normalized job name constructed from
ApplicationNameOptions.getAppName(), the
local system user name (if available), the current time, and a random integer.Returns a user agent string constructed from
ReleaseInfo.getName() and ReleaseInfo.getVersion(), in the format [name]/[version].Constructs a
PipelineOptions or any derived interface that is composable to any other
derived interface of PipelineOptions via the PipelineOptions.as(java.lang.Class<T>) method.A fluent
PipelineOptions builder.PipelineOptions creators have the ability to automatically have their PipelineOptions registered with this SDK by creating a ServiceLoader entry and a
concrete implementation of this interface.Validates that the
PipelineOptions conforms to all the Validation criteria.Result of
Pipeline.run().Possible job states, for both completed and ongoing jobs.
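A minimal sketch tying together the PipelineOptions, PipelineOptionsFactory, and Pipeline entries above; the MyOptions interface and its field are illustrative, not part of this index.

    // A custom options interface; its getters/setters become command-line flags.
    public interface MyOptions extends PipelineOptions {
      @Description("Path of the file to read from")
      @Default.String("/tmp/input.txt")
      String getInputFile();
      void setInputFile(String value);
    }

    // Parse and validate the args, then create the pipeline.
    MyOptions options =
        PipelineOptionsFactory.fromArgs(args).withValidation().as(MyOptions.class);
    Pipeline p = Pipeline.create(options);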
A
PipelineRunner runs a Pipeline.The pipeline translator translates a Beam
Pipeline into a Spark correspondence, that can
then be evaluated.Shared, mutable state during the translation of a pipeline and omitted afterwards.
Unresolved translation, allowing the generated Spark DAG to be optimized.
PipelineTranslator for executing a Pipeline in Spark in batch mode.Utilities for pipeline translation.
Class wrapper for a CDAP plugin.
Builder class for a
Plugin.Class for getting any filled
PluginConfig configuration object.Class for CDAP plugin constants.
Format types.
Format provider types.
Hadoop types.
Plugin types.
A set of utilities to generate getter and setter classes for POJOs.
PortablePipelineRunner that bundles the input pipeline along with all dependencies,
artifacts, etc.Contains common code for writing and reading portable pipeline jars.
Pipeline options common to all portable runners.
Result of a portable
PortablePipelineRunner.run(RunnerApi.Pipeline, JobInfo).Runs a portable Beam pipeline on some execution engine.
Registrar for the portable runner.
A DoFn class to gather metrics about the emitted
DataChangeRecords.The interface for things that might be output from a
PTransform.An
Iterable that returns PrefetchableIterators.This class contains static utility functions that operate on or return objects of type
PrefetchableIterable.A default implementation that caches an iterator to be returned when
PrefetchableIterables.Default.prefetch() is
invoked.Iterator that supports prefetching the next set of records.Prepare an input
PCollection for writing to BigQuery.A
PTransformOverrideFactory that produces PrimitiveParDoSingleFactory.ParDoSingle instances from ParDo.SingleOutput instances.A single-output primitive
ParDo.A translator for
PrimitiveParDoSingleFactory.ParDoSingle.Registers the
PrismPipelineOptions and TestPrismPipelineOptions.A
PipelineRunner executed on Prism.Utility methods for creating
BeamFnApi.ProcessBundleDescriptor instances.A container type storing references to the key, value, and window
Coder used when
handling bag user state requests.A container type storing references to the value, and window
Coder used when handling
side input state requests.A container type storing references to the key, timer and payload coders and the remote input
destination used when handling timer requests.
Environment for process-based execution.
An
EnvironmentFactory which forks processes based on the parameters in the Environment.Provider of ProcessEnvironmentFactory.
A function that computes an output value of type
OutputT from an input value of type
InputT and is Serializable.A simple process manager which forks processes and kills them if necessary.
Coder for ProducerRecord.A
ProjectionConsumer is a Schema-aware operation (such as a DoFn or PTransform) that
has a FieldAccessDescriptor describing which fields the
operation accesses.A factory for operations that execute a projection on a
Schema-aware PCollection.Constant property names used by the SDK in CloudWorkflow specifications.
Provides conversions between Protobuf Message and Beam Row.
A
CoderProviderRegistrar for standard types used with Google Protobuf.Utility class for working with Protocol Buffer (Proto) data.
A
Coder using Google Protocol Buffers binary format.ProtoDomain is a container class for Protobuf descriptors.
Deprecated.
A set of
Schema.LogicalType classes to represent protocol buffer types.A Fixed32 type.
A Fixed64 type.
A SFixed32 type.
An SFixed64 type.
A SInt32 type.
A SInt64 type.
A UInt32 type.
A UInt64 type.
Helpers for implementing the "Provider" pattern.
Options needed for a Pub/Sub Lite Publisher.
This class is required to handle callbacks from Solace, to find out if messages were actually
published or whether there was any kind of error.
An (abstract) helper class for talking to Pubsub via an underlying transport.
A message received from Pubsub.
A message to be sent to Pubsub.
Path representing a cloud project id.
Factory for creating clients.
Path representing a Pubsub schema.
Path representing a Pubsub subscription.
Path representing a Pubsub topic.
A
CoderProviderRegistrar for standard types used with PubsubIO.A helper class for talking to Pubsub via grpc.
Read and Write
PTransforms for Cloud Pub/Sub streams.Class representing a Cloud Pub/Sub Subscription.
Class representing a Cloud Pub/Sub Topic.
Implementation of read methods.
Implementation of write methods.
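A hedged sketch of the PubsubIO read and write entries above, assuming the Beam GCP I/O module and an existing Pipeline p; the subscription and topic paths are placeholders.

    // Read UTF-8 strings from a subscription and republish them to a topic.
    PCollection<String> messages =
        p.apply(PubsubIO.readStrings()
            .fromSubscription("projects/my-project/subscriptions/my-subscription"));
    messages.apply(PubsubIO.writeStrings()
        .to("projects/my-project/topics/my-topic"));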
A Pubsub client using JSON transport.
I/O transforms for reading from Google Pub/Sub Lite.
A sink which publishes messages to Pub/Sub Lite.
Pub/Sub Lite table provider.
Class representing a Pub/Sub message.
A coder for PubsubMessage treating the raw bytes being decoded as the message's payload.
Common util functions for converting between PubsubMessage proto and
PubsubMessage.Provides a
SchemaCoder for PubsubMessage, including the topic and all fields of a
PubSub message from server.A coder for PubsubMessage including all fields of a PubSub message from server.
A coder for PubsubMessage including attributes and the message id from the PubSub server.
A coder for PubsubMessage including attributes.
A coder for PubsubMessage treating the raw bytes being decoded as the message's payload, with the
message id from the PubSub server.
A coder for PubsubMessage including the topic from the PubSub server.
Properties that can be set when using Google Cloud Pub/Sub with the Apache Beam SDK.
Configuration for reading from Pub/Sub.
An implementation of
TypedSchemaTransformProvider for Pub/Sub reads configured using
PubsubReadSchemaTransformConfiguration.An implementation of
SchemaIOProvider for reading and writing JSON/AVRO payloads with
PubsubIO.TableProvider for PubsubIO for consumption by Beam SQL.A (partial) implementation of
PubsubClient for use by unit tests.Closing the factory will validate all expected messages were processed.
A PTransform which streams messages to Pubsub.
Users should use PubsubIO#read instead.
Configuration for writing to Pub/Sub.
An implementation of
TypedSchemaTransformProvider for Pub/Sub reads configured using
PubsubWriteSchemaTransformConfiguration.IO connector for reading and writing from Apache Pulsar.
Class representing a Pulsar Message record.
For internal use.
For internal use.
For internal use.
A logical type for PythonCallableSource objects.
Wrapper for invoking external Python transforms.
Pipeline options for
PythonExternalTransform.A registrar for
PythonExternalTransformOptions.Wrapper for invoking external Python
Map transforms.Utility to bootstrap and start a Beam Python service.
The
Quantifier class is intended for storing the information of the quantifier for a
pattern variable.Main action class for querying a partition change stream.
An interface that planners should implement to convert sql statement to
BeamRelNode or
SqlNode.A IO to publish or consume messages with a RabbitMQ broker.
A
PTransform to consume messages from RabbitMQ server.A
PTransform to publish messages to a RabbitMQ server.It contains the message payload, and additional metadata like routing key or attributes.
An implementation of a client-side throttler that enforces a gradual ramp-up, broadly in line
with Datastore best practices.
An elastic-sized byte array which allows you to manipulate it as a stream, or access it directly.
A
Coder which encodes the valid parts of this stream.A
Comparator that compares two byte arrays lexicographically.A
RangeTracker is a thread-safe helper object for implementing dynamic work rebalancing
in position-based BoundedSource.BoundedReader subclasses.Implement this interface to create a
RateLimitPolicy.Default rate limiter that throttles reading from a shard using an exponential backoff if the
response is empty or if the consumer is throttled by AWS.
This corresponds to an integer union tag and value.
A
PTransform for reading from a Source.PTransform that reads from a BoundedSource.Helper class for building
Read transforms.PTransform that reads from a UnboundedSource.A
Coder for FileIO.ReadableFile.A
State that can be read via ReadableState.read().For internal use only; no backwards-compatibility guarantees.
Reads each file in the input
PCollection of FileIO.ReadableFile using given parameters
for splitting files into offset ranges and for creating a FileBasedSource for a file.A class to handle errors which occur during file reads.
Reads each file of the input
PCollection and outputs each element as the value of a
KV, where the key is the filename from which that value came.Parameters class to expose the transform to an external SDK.
This class is part of
ReadChangeStreamPartitionDoFn SDF.A SDF (Splittable DoFn) class which is responsible for performing a change stream query for a
given partition.
RestrictionTracker used by
ReadChangeStreamPartitionDoFn to keep
track of the progress of the stream and to split the restriction for runner initiated
checkpoints.This restriction tracker delegates most of its behavior to an internal
TimestampRangeTracker.Util for invoking
Source.Reader methods that might require a MetricsContainerImpl
to be active.A
ReadOnlyTableProvider provides an in-memory, read-only set of
BeamSqlTables.Encapsulates a spanner read operation.
Source translator.
doc.
This
DoFn reads Cloud Spanner 'information_schema.*' tables to build the SpannerSchema.Helper class for source operations.
Class for building an instance of
Receiver that uses Apache Beam mechanisms instead of
Spark environment.A
PTransform using the Recommendations AI API (https://cloud.google.com/recommendations).A
PTransform connecting to the Recommendations AI API
(https://cloud.google.com/recommendations) and creating CatalogItems.A
PTransform connecting to the Recommendations AI API
(https://cloud.google.com/recommendations) and creating UserEvents.The RecommendationAIIO class acts as a wrapper around the
s that interact with
the Recommendation AI API (https://cloud.google.com/recommendations).
invalid reference
PTransform
A
PTransform using the Recommendations AI API (https://cloud.google.com/recommendations).A
PTransform using the Recommendations AI API (https://cloud.google.com/recommendations).This class just transforms to PublishResult to be able to capture the windowing with the right
strategy.
A helper class based on
Row; it provides Metadata associated with each Record when reading
from file(s) using ContextualTextIO.RedisConnectionConfiguration describes and wraps a connectionConfiguration to Redis
server or cluster.An IO to manipulate Redis key/value database.
Implementation of
RedisIO.read().Implementation of
RedisIO.readKeyPatterns().A
PTransform to write to a Redis server.Determines the method used to insert data in Redis.
A
PTransform to write stream key pairs (https://redis.io/topics/streams-intro) to a
Redis server.A family of
PTransforms that returns a PCollection equivalent to its
input but functions as an operational hint to a runner that redistributing the data in some way
is likely useful.Noop transform that hints to the runner to try to redistribute the work evenly, or via whatever
clever strategy the runner comes up with.
Registers translators for the Redistribute family of transforms.
ExecutableStageContext.Factory which counts ExecutableStageContext reference for book
keeping.Interface for creator which extends Serializable.
A set of reflection helper methods.
Represents a class and a schema.
Represents a type descriptor and a schema.
PTransforms to use Regular Expressions to process elements in a PCollection.Regex.MatchesName<String> takes a PCollection<String> and returns a
PCollection<List<String>> representing the value extracted from all the Regex groups of the
input PCollection to the number of times that element occurs in the input.Regex.Find<String> takes a PCollection<String> and returns a
PCollection<String> representing the value extracted from the Regex groups of the input
PCollection to the number of times that element occurs in the input.Regex.Find<String> takes a PCollection<String> and returns a
PCollection<List<String>> representing the value extracted from the Regex groups of the input
PCollection to the number of times that element occurs in the input.Regex.MatchesKV<KV<String, String>> takes a PCollection<String> and returns a
PCollection<KV<String, String>> representing the key and value extracted from the Regex
groups of the input PCollection to the number of times that element occurs in the
input.Regex.Find<String> takes a PCollection<String> and returns a
PCollection<String> representing the value extracted from the Regex groups of the input
PCollection to the number of times that element occurs in the input.Regex.MatchesKV<KV<String, String>> takes a PCollection<String> and returns a
PCollection<KV<String, String>> representing the key and value extracted from the Regex
groups of the input PCollection to the number of times that element occurs in the
input.Regex.Matches<String> takes a PCollection<String> and returns a
PCollection<String> representing the value extracted from the Regex groups of the input
PCollection to the number of times that element occurs in the input.Regex.MatchesKV<KV<String, String>> takes a PCollection<String> and returns a
PCollection<KV<String, String>> representing the key and value extracted from the Regex
groups of the input PCollection to the number of times that element occurs in the
input.Regex.MatchesName<String> takes a PCollection<String> and returns a
PCollection<String> representing the value extracted from the Regex groups of the input
PCollection to the number of times that element occurs in the input.Regex.MatchesNameKV<KV<String, String>> takes a PCollection<String> and returns
a PCollection<KV<String, String>> representing the key and value extracted from the
Regex groups of the input PCollection to the number of times that element occurs in the
input.Regex.ReplaceAll<String> takes a PCollection<String> and returns a
PCollection<String> with all Strings that matched the Regex being replaced with the
replacement string.Regex.ReplaceFirst<String> takes a PCollection<String> and returns a
PCollection<String> with the first Strings that matched the Regex being replaced with the
replacement string.Regex.Split<String> takes a PCollection<String> and returns a
PCollection<String> with the input string split into individual items in a list.Hamcrest matcher to assert a string matches a pattern.
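A minimal sketch of two of the Regex transforms listed above, assuming standard Beam imports and an existing PCollection<String> named lines (illustrative).

    // Emit the first match of the pattern in each line that matches.
    PCollection<String> words = lines.apply(Regex.find("[A-Za-z']+"));
    // Replace every run of digits in each line with a placeholder.
    PCollection<String> masked = lines.apply(Regex.replaceAll("[0-9]+", "#"));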
PTransforms for converting between explicit and implicit form of various Beam
values.This transform turns a side input into a singleton PCollection that can be used as the main
input for another transform.
Simple
Function to bring the windowing information into the value from the implicit
background representation of the PCollection.This is the implementation of NodeStatsMetadata.
A bundle capable of handling input data elements for a
bundle descriptor by
forwarding them to a remote environment for processing.A handle to an available remote
RunnerApi.Environment.A
RemoteEnvironment which uses the default RemoteEnvironment.close() behavior.Options that are used to control configuration of the remote environment.
Register the
RemoteEnvironmentOptions.An execution-time only
RunnerApi.PTransform which represents an SDK harness reading from a BeamFnApi.RemoteGrpcPort.An execution-time only
RunnerApi.PTransform which represents a write from within an SDK harness to
a BeamFnApi.RemoteGrpcPort.A pair of
which specifies the arguments to a
Coder and
invalid reference
BeamFnApi.Target
FnDataService to send data to a remote harness.A pair of
Coder and FnDataReceiver which can be registered to receive elements
for a LogicalEndpoint.A transform for renaming fields inside an existing schema.
The class implementing the actual PTransform.
A
Trigger that fires according to its subtrigger forever.PTransform for reading from and writing to Web APIs.Describes the run-time requirements of a
Contextful, such as access to side inputs.For internal use only; no backwards compatibility guarantees.
Implementation of
Reshuffle.viaRandomKey().For internal use only; no backwards compatibility guarantees.
An object that configures
ResourceId.resolve(java.lang.String, org.apache.beam.sdk.io.fs.ResolveOptions).Defines the standard resolve options.
Provides a definition of a resource hint known to the SDK.
Pipeline authors can use resource hints to provide additional information to runners about the
desired aspects of the execution environment.
Options that are used to control configuration of the remote environment.
Register the
ResourceHintsOptions.An identifier which represents a file-like resource.
A
Coder for ResourceId.A utility to test
ResourceId implementations.An interrupter for restriction tracker of type T.
Manages access to the restriction and keeps track of its claimed part for a splittable
DoFn.All
RestrictionTrackers SHOULD implement this interface to improve auto-scaling and
splitting performance.A representation for the amount of known completed and remaining work.
A representation of the truncate result.
Support utilities for interacting with
RestrictionTrackers.Interface allowing a runner to observe the calls to
RestrictionTracker.tryClaim(PositionT).A class that manages retrying of callables based on the exceptions they throw.
Configuration of the retry behavior for AWS SDK clients.
Implements a request initializer that adds retry handlers to all HttpRequests.
Models a Cassandra token range.
Row is an immutable tuple-like schema to represent one element in a PCollection.Builder for
Row.Builder for
Row that bases a row on another row.Bundle of rows according to the configured
Factory as input for benchmarks.A sub-class of SchemaCoder that can only encode
Row instances.Translator for row coders.
A convenience class for applying row updates to BigQuery using
BigQueryIO.applyRowMutations().This class indicates how to apply a row update to BigQuery.
A selector interface for extracting fields from a row.
A Concrete subclass of
Row that delegates to a set of provided FieldValueGetters.Concrete subclass of
Row that explicitly stores all fields of the row.Quality of Service manager options for Firestore RPCs.
Mutable Builder class for creating instances of
RpcQosOptions.Wrapper for invoking external Python
RunInference.Construct S3ClientBuilder from S3 pipeline options.
Object used to configure
S3FileSystem.AutoService registrar for the S3FileSystem.A registrar that creates
S3FileSystemConfiguration instances from PipelineOptions.Options used to configure Amazon Web Services S3.
Provide the default s3 upload buffer size in bytes: 64MB if more than 512MB in RAM are
available and 5MB otherwise.
PTransforms for taking samples of the elements in a PCollection, or samples of
the values associated with each key in a PCollection of KVs.CombineFn that computes a fixed-size sample of a collection of values.Classes that represent various SBE semantic types.
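A hedged sketch of the Sample entries above, assuming standard Beam imports and an existing PCollection<String> named events (illustrative).

    // Uniformly sample at most 10 elements from the whole PCollection.
    PCollection<Iterable<String>> uniform = events.apply(Sample.fixedSizeGlobally(10));
    // Or simply take any 10 elements, with no uniformity guarantee.
    PCollection<String> anyTen = events.apply(Sample.any(10));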
Representation of SBE's LocalMktDate.
Represents SBE's TimeOnly composite type.
Represents SBE's TZTimestamp composite type.
Represents SBE's uint16 type.
Represents SBE's uint32 type.
Represents SBE's uint64 type.
Represents SBE's uint8 type.
Representation of SBE's UTCDateOnly.
Represents SBE's UTCTimeOnly composite type.
Represents SBE's UTCTimestamp composite type.
Represents an SBE schema.
Options for configuring schema generation from an
Ir.Builder for
SbeSchema.IrOptions.Utilities for easier interoperability with the Spark Scala API.
A scalar function that can be executed as part of a SQL query.
Annotates the single method in a
ScalarFn implementation that is to be applied to SQL
function arguments.Reflection-based implementation logic for
ScalarFn.Beam-customized version from
ScalarFunctionImpl, to
address BEAM-5921.Builder class for building
Schema objects.Control whether nullable is included in equivalence check.
Field of a row.
Builder for
Schema.Field.A descriptor of a single field type.
A LogicalType allows users to define a custom schema type.
An enumerated list of type constructors.
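To make the Schema.Builder and Schema.Field entries above (together with the Row entry earlier in this index) concrete, a minimal sketch; field names and values are illustrative.

    // Build a schema with two fields, then a Row conforming to it.
    Schema schema = Schema.builder()
        .addStringField("name")
        .addInt32Field("age")
        .build();
    Row row = Row.withSchema(schema)
        .addValues("Alice", 42)
        .build();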
A wrapper for a
GenericRecord and the TableSchema representing the schema of the
table (or query) it was generated from.Each IO in Beam has one table schema, by extending
SchemaBaseBeamTable.SchemaCoder is used as the coder for types that have schemas registered.Translator for Schema coders.
Can be put on a constructor or a static method, in which case that constructor or method will be
used to create instances of the class by Beam's schema code.
When used on a POJO field or a JavaBean getter, that field or getter is ignored from the inferred
schema.
Provides an instance of
ConvertHelpers.ConvertedSchemaInformation.An abstraction to create schema capable and aware IOs.
Provider to create
SchemaIO instances for use in Beam SQL and other SDKs.A general
TableProvider for IOs for consumption by Beam SQL.A schema represented as a serialized proto bytes.
Concrete implementations of this class allow creation of schema service objects that vend a
Schema for a specific type.SchemaProvider creators have the ability to automatically have their schemaProvider registered with this SDK by creating a ServiceLoader entry
and a concrete implementation of this interface.An abstraction representing schema capable and aware transforms.
Provider to create
SchemaTransform instances for use in Beam SQL and other SDKs.A
PTransformTranslation.TransformPayloadTranslator implementation that translates between a Java SchemaTransform and a protobuf payload for that transform.Utility methods for translating schemas.
A creator interface for user types that have schemas.
Provides utility functions for working with Beam
Schema types.A
JdbcIO.RowMapper implementation that converts JDBC
results into Beam Row objects.A set of utility functions for schemas.
Visitor that zips schemas, and accepts pairs of fields and their types.
Context referring to a current position in a schema.
KeySelector that retrieves a key from a KV<KV<element, KV<restriction,
watermarkState>>, size>.A high-level client for an SDK harness.
Options that are used to control configuration of the SDK harness.
The default implementation which detects how much memory to use for a process wide cache.
A
DefaultValueFactory which constructs an instance of the class specified by maxCacheMemoryUsageMbClass to compute the maximum amount of
memory to allocate to the process wide cache within an SDK harness instance.The set of log levels that can be used in the SDK harness.
Specifies the maximum amount of memory to use within the current SDK harness instance.
Defines a log level override for a specific class, package, or name.
A
PTransform for selecting a subset of fields from a schema type.A
PTransform representing a flattened schema.Helper methods to select subrows out of rows.
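A hedged sketch of the Select entry above, assuming an existing schema-aware PCollection<Row> named users that actually contains the referenced fields (names are illustrative).

    // Project a subset of (possibly nested) fields out of a schema-aware PCollection.
    PCollection<Row> trimmed = users.apply(Select.fieldNames("name", "address.city"));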
A class to execute requests to SEMP v2 with Basic Auth authentication.
This interface defines methods for interacting with a Solace message broker using the Solace
Element Management Protocol (SEMP).
This interface serves as a blueprint for creating SempClient objects, which are used to interact
with a Solace message broker using the Solace Element Management Protocol (SEMP).
Default accumulator used to combine sequence ranges.
Util methods to help with serialization / deserialization.
A union of the
BiConsumer and Serializable interfaces.A union of the
BiFunction and Serializable interfaces.A
Coder for Java classes that implement Serializable.A
CoderProviderRegistrar which registers a CoderProvider which can handle
serializable types.A
Comparator that is also Serializable.A wrapper to allow Hadoop
Configurations to be serialized using Java's standard
serialization mechanisms.A function that computes an output value of type
OutputT from an input value of type
InputT, is Serializable, and does not allow checked exceptions to be declared.Useful
SerializableFunction overrides.A wrapper around
Ir that fulfils Java's Serializable contract.A
Matcher that is also Serializable.Static class for building and using
SerializableMatcher instances.SerializableRexFieldAccess.
SerializableRexInputRef.
SerializableRexNode.
SerializableRexNode.Builder.
A
gRPC server factory.Creates a
gRPC Server using the default server factory.Factory that constructs client-accessible URLs from a local server address and port.
A
WindowFn that windows values into sessions separated by periods with no input for at
least the duration specified by Sessions.getGapDuration().The SessionService interface provides a set of methods for managing a session with the Solace
messaging system.
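A minimal sketch of the Sessions WindowFn entry above, assuming standard Beam windowing imports and an existing timestamped PCollection<KV<String, Long>> named clicks (illustrative).

    // Group elements into per-key session windows separated by 10-minute gaps.
    PCollection<KV<String, Long>> sessioned = clicks.apply(
        Window.<KV<String, Long>>into(Sessions.withGapDuration(Duration.standardMinutes(10))));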
This abstract class serves as a blueprint for creating `SessionServiceFactory` objects.
The
PTransforms that allow computing different set functions across PCollections.A
ReadableState cell containing a set of elements.Provided by the user and called within
DoFn.Setup and DoFn.Teardown lifecycle methods of Call's DoFn.Deprecated.
Use
ShardedKey instead.Function for assigning
ShardedKeys to input elements for sharded WriteFiles.Standard shard naming templates.
Broadcast helper for side inputs.
BroadcastVariableInitializer that initializes the broadcast input as a Map from
window to side input.Metadata class for side inputs in Spark runner.
Utility class for creating and managing side input readers in the Spark runner.
SideInputValues serves as a Kryo serializable container that contains a materialized view
of side inputs.General
SideInputValues for BoundedWindows in two possible
states.Specialized
SideInputValues for use with the GlobalWindow in two possible
states.Factory function for loading
SideInputValues from a Dataset.A
SerializableFunction which is not a functional interface.A specialized
ConstantInputDStream that emits its RDD exactly once.Deprecated.
replace with a
DefaultJobBundleFactory when appropriate if the EnvironmentFactory is a DockerEnvironmentFactory, or create an
InProcessJobBundleFactory and inline the creation of the environment if appropriate.IO to read and write data on SingleStoreDB.
A POJO describing a SingleStoreDB
DataSource by providing all properties needed to
create it.A
PTransform for reading data from SingleStoreDB.A
PTransform for reading data from SingleStoreDB.An interface used by
SingleStoreIO.Read and SingleStoreIO.ReadWithPartitions for converting each row of the
ResultSet into an element of the resulting PCollection.A RowMapper that provides a Coder for resulting PCollection.
A RowMapper that requires initialization.
An interface used by the SingleStoreIO
SingleStoreIO.Read to set the parameters of the PreparedStatement.An interface used by the SingleStoreIO
SingleStoreIO.Write to map a data from each element of PCollection to a List of Strings.A
PTransform for writing data to SingleStoreDB.Configuration for reading from SingleStoreDB.
An implementation of
TypedSchemaTransformProvider for SingleStoreDB read jobs configured
using SingleStoreSchemaTransformReadConfiguration.Configuration for writing to SingleStoreDB.
An implementation of
TypedSchemaTransformProvider for SingleStoreDB write jobs configured
using SingleStoreSchemaTransformWriteConfiguration.Singleton keyed word item.
Singleton keyed work item coder.
A Flink combine runner takes elements pre-grouped by window and produces output after seeing all
input.
Standard Sink Metrics.
This class is used to estimate the size in bytes of a given element.
PTransforms to compute the estimate frequency of each element in a stream.Implements the
Combine.CombineFn of SketchFrequencies transforms.Implementation of
SketchFrequencies.globally().Implementation of
SketchFrequencies.perKey().Wrap StreamLib's Count-Min Sketch to support counting all user types by hashing the encoded
user type using the supplied deterministic coder.
A
LogWriter which uses an SLF4J Logger as the underlying log backend.A
WindowFn that windows values into possibly overlapping fixed-size timestamp-based
windows.Wraps an existing coder with Snappy compression.
This is an AutoValue representation of an Iceberg
Snapshot.Class for preparing configuration for batch write and read.
Implementation of
SnowflakeServices.BatchService used in production.POJO describing single Column within Snowflake Table.
Interface for data types to provide SQLs for themselves.
IO to read and write data on Snowflake.
Combines a list of
String to provide one String with paths where files were
staged for write.Interface for user-defined function mapping parts of CSV line into T.
A POJO describing a
DataSource, providing all properties needed to create a DataSource.Wraps
SnowflakeIO.DataSourceConfiguration to provide DataSource.Implementation of
SnowflakeIO.read().Removes temporary staged files after reading.
Parses
String from incoming data in PCollection to have proper format for CSV
files.Interface for user-defined function mapping T into array of Objects.
Implementation of
SnowflakeIO.write().Interface which defines common methods for interacting with Snowflake.
Class for preparing configuration for streaming write.
Implementation of
SnowflakeServices.StreamingService used in production.POJO representing schema of Table in Snowflake.
Exposes
SnowflakeIO.Read and SnowflakeIO.Write as an external transform for
cross-language usage.IO to send notifications via SNS.
Implementation of
SnsIO.write().Creates a
SocketAddress based upon a supplied string.Provides core data models and utilities for working with Solace messages in the context of Apache
Beam pipelines.
The correlation key is an object that is passed back to the client during the event broker ack
or nack.
Represents a Solace message destination (either a Topic or a Queue).
Represents a Solace destination type.
The result of writing a message to Solace.
Represents a Solace queue.
Represents a Solace message record with its associated metadata.
A utility class for mapping
BytesXMLMessage instances to Solace.Record objects.Represents a Solace topic.
Checkpoint for an unbounded Solace source.
A
PTransform to read and write from/to Solace event
broker.The
SolaceIO.Write transform's output return this type, containing the successful
publishes (SolaceOutput.getSuccessfulPublish()).Transforms for reading and writing data from/to Solr.
A POJO describing a connection configuration to Solr.
A
PTransform reading data from Solr.A POJO describing a replica of Solr.
A POJO encapsulating a configuration for retry behavior when issuing requests to Solr.
A
PTransform writing data to Solr.A Flink combine runner that first sorts the elements by window and then does one pass that merges
windows and outputs results.
SortValues<PrimaryKeyT, SecondaryKeyT, ValueT> takes a PCollection<KV<PrimaryKeyT,
Iterable<KV<SecondaryKeyT, ValueT>>>> with elements consisting of a primary key and iterables
over <secondary key, value> pairs, and returns a PCollection<KV<PrimaryKeyT,
Iterable<KV<SecondaryKeyT, ValueT>>>> of the same elements but with values sorted by a secondary
key.Base class for defining input formats and creating a
Source for reading the input.The interface that readers of custom input sources must implement.
Wrapper for executing a
Source as a Flink InputFormat.InputSplit for SourceInputFormat.Standard
Source Metrics.Classes implementing Beam
Source RDDs.A
SourceRDD.Unbounded is the implementation of a micro-batch in a SourceDStream.This class can be used as a mapper for each
SourceRecord retrieved.SourceRecordJson implementation.Interface used to map a Kafka source record.
Helper functions and test harnesses for checking correctness of
Source implementations.Expected outcome of
BoundedSource.BoundedReader.splitAtFraction(double).Manages lifecycle of
DatabaseClient and Spanner instances.Configuration for a Cloud Spanner client.
Builder for
SpannerConfig.Reading from Cloud Spanner
A
PTransform that create a transaction.A builder for
SpannerIO.CreateTransaction.A failure handling strategy.
Implementation of
SpannerIO.read().Implementation of
SpannerIO.readAll().Interface to display the name of the metadata table on Dataflow UI.
A
PTransform that writes Mutation objects to Google Cloud Spanner.Same as
SpannerIO.Write but supports grouped mutations.A provider for reading from Cloud Spanner using a Schema Transform Provider.
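A hedged sketch of the SpannerIO read and write entries above, assuming the Beam GCP I/O module, an existing Pipeline p, and an existing PCollection<Mutation> named mutations; instance, database, and query are placeholders.

    // Read rows from Cloud Spanner as Structs.
    PCollection<Struct> rows = p.apply(
        SpannerIO.read()
            .withInstanceId("my-instance")
            .withDatabaseId("my-database")
            .withQuery("SELECT id, name FROM users"));
    // Write mutations back to the same database.
    mutations.apply(
        SpannerIO.write()
            .withInstanceId("my-instance")
            .withDatabaseId("my-database"));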
Encapsulates Cloud Spanner Schema.
Exception to signal that Spanner schema retrieval failed.
Exposes
SpannerIO.WriteRows, SpannerIO.ReadRows and SpannerIO.ChangeStreamRead as an external transform for cross-language usage.The results of a
SpannerIO.write() transform.An implementation of
Window.Assign for the Spark runner.Translates a bounded portable pipeline into a Spark job.
Predicate to determine whether a URN is a Spark native transform.
A Spark
Source that is tailored to expose a SparkBeamMetric, wrapping an
underlying MetricResults instance.A Spark
Source that is tailored to expose a SparkBeamMetric, wrapping an
underlying MetricResults instance.A
CombineFnBase.GlobalCombineFn with a CombineWithContext.Context for the SparkRunner.Accumulator of WindowedValues holding values for different windows.
Type of the accumulator.
Spark runner
PipelineOptions handles Spark execution-related configurations, such as the
master address, and other user-related knobs.Returns Spark's default storage level for the Dataset or RDD API based on the respective
runner.
Returns the default checkpoint directory of /tmp/${job.name}.
A custom
PipelineOptions to work with properties related to JavaSparkContext.Returns an empty list, to avoid handling null.
Singleton class that contains one
ExecutableStageContext.Factory per job.An implementation of
GroupByKeyViaGroupByKeyOnly.GroupAlsoByWindow logic for grouping by windows and controlling
trigger firings and pane accumulation.Processes Spark's input data iterators using Beam's
DoFnRunner.Creates a job invocation to manage the Spark runner's execution of a portable pipeline.
Driver program that starts a job server for the Spark runner.
Spark runner-specific Configuration for the jobServer.
Pipeline visitor for translating a Beam pipeline into equivalent Spark operations.
SparkPCollectionView is used to pass serialized views to lambdas.
Type of side input.
Spark runner
PipelineOptions handles Spark execution-related configurations, such as the
master address, batch-interval, and other user-related knobs.Represents a Spark pipeline execution result.
Runs a portable pipeline on Apache Spark.
Translator to support translation between Beam transformations and Spark transformations.
Interface for portable Spark translators.
Pipeline options specific to the Spark portable runner running a streaming job.
Holds current processing context for
SparkInputDataProcessor.Streaming sources for Spark
Receiver.A
PTransform to read from Spark Receiver.The SparkRunner translate operations defined on a pipeline to a representation executable by
Spark, and then submitting the job to Spark to be executed.
Evaluator on the pipeline.
Pipeline runner which translates a Beam pipeline into equivalent Spark operations, without
running them.
PipelineResult of running a
Pipeline using SparkRunnerDebugger. Use SparkRunnerDebugger.DebugSparkPipelineResult.getDebugString() to get a String representation of the Pipeline translated into
Spark native operations.Custom
KryoRegistrators for Beam's Spark runner needs, registering classes used in Spark
translation for better serialization performance.Registers the
SparkPipelineOptions.Registers the
SparkRunner.A
JavaStreamingContext factory for resilience.KryoRegistrator for Spark to serialize broadcast variables used for side-inputs.SideInputReader using broadcasted
SideInputValues.A
SideInputReader for the SparkRunner.An implementation of
StateInternals for the SparkRunner.Translates an unbounded portable pipeline into a Spark job.
Translation context used to lazily store Spark datasets during streaming portable pipeline
translation and compute them after translation.
Spark runner
PipelineOptions handles Spark execution-related configurations, such as the
master address, and other user-related knobs.A Spark runner build on top of Spark's SQL Engine (Structured
Streaming framework).
Contains the
PipelineRunnerRegistrar and PipelineOptionsRegistrar for the SparkStructuredStreamingRunner.Registers the
SparkStructuredStreamingPipelineOptions.Registers the
SparkStructuredStreamingRunner.An implementation of
TimerInternals for the SparkRunner.PTransform overrides for Spark runner.Translation context used to lazily store Spark data sets during portable pipeline translation and
compute them after translation.
A "composite" InputDStream implementation for
UnboundedSources.A metadata holder for an input stream partition.
A representation of a split result.
Flink operator for executing splittable
DoFns.A
SplunkEvent describes a single payload sent to Splunk's Http Event Collector (HEC)
endpoint.A builder class for creating a
SplunkEvent.A
Coder for SplunkEvent objects.An unbounded sink for Splunk's Http Event Collector (HEC).
Class
SplunkIO.Write provides a PTransform that allows writing SplunkEvent
records into a Splunk HTTP Event Collector end-point using HTTP POST requests.A class for capturing errors that occur while writing
SplunkEvent to Splunk's Http Event
Collector (HEC) end point.A builder class for creating a
SplunkWriteError.Parse tree for
UNIQUE, PRIMARY KEY constraints.Parse tree for column.
Exception thrown when BeamSQL cannot convert sql to BeamRelNode.
Parse tree for
CREATE EXTERNAL TABLE statement.Parse tree for
CREATE FUNCTION statement.Utilities concerning
SqlNode for DDL.Parse tree for
DROP TABLE statement.SQL parse tree node to represent
SET and RESET statements.SqlTransform is the DSL interface of Beam SQL.Beam
Schema.LogicalTypes corresponding to SQL data types.IO to read (unbounded) from and write to SQS queues.
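A minimal sketch of the SqlTransform entry above, assuming the Beam SQL extension and an existing schema-aware PCollection<Row> named orders; the query and field names are illustrative.

    // Run Beam SQL over a single input, which is addressable as PCOLLECTION.
    PCollection<Row> totals = orders.apply(
        SqlTransform.query(
            "SELECT customerId, SUM(amount) AS total FROM PCOLLECTION GROUP BY customerId"));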
A
PTransform to read/receive messages from SQS.Deprecated.
superseded by
SqsIO.WriteBatchesA
PTransform to send messages to SQS.Mapper to create a
SendMessageBatchRequestEntry from a unique batch entry id and the
input T.A more convenient
SqsIO.WriteBatches.EntryMapperFn variant that already sets the entry id.Result of
SqsIO.writeBatches().Configuration class for reading data from an AWS SQS queue.
An implementation of
TypedSchemaTransformProvider for jobs reading data from AWS SQS
queues and configured via SqsReadConfiguration.Customer provided key for use with Amazon S3 server-side encryption.
A bundle factory scoped to a particular
ExecutableStage, which has all of the resources it
needs to provide new RemoteBundles.Interface for staging files needed for running a Dataflow pipeline.
A state cell, supporting a
State.clear() operation.State and Timers wrapper.
For internal use only; no backwards-compatibility guarantees.
For internal use only; no backwards-compatibility guarantees.
For internal use only; no backwards-compatibility guarantees.
The
StateDelegator is able to delegate BeamFnApi.StateRequests to a set of registered
handlers.Allows callers to deregister from receiving further state requests.
Jet
Processor implementation for Beam's stateful ParDo primitive.Jet
Processor supplier that will provide instances of StatefulParDoP.A specialized evaluator for ParDo operations in Spark Streaming context that is invoked when
stateful streaming is detected in the DoFn.
Handler for
StateRequests.A set of utility methods which construct
StateRequestHandlers.A handler for bag user state.
A factory which constructs
StateRequestHandlers.BagUserStateHandlers.A handler for iterable side inputs.
A handler for multimap side inputs.
Marker interface that denotes some type of side input handler.
A factory which constructs
StateRequestHandlers.MultimapSideInputHandlers.A specification of a persistent state cell.
Cases for doing a "switch" on the type of
StateSpec.A base class for a visitor with a default method for cases it is not interested in.
A class containing
StateSpec mappingFunctions.Static methods for working with
StateSpecs.A
provision service that returns a static response to all calls.A
RemoteEnvironment that connects to Dataflow runner harness.An
EnvironmentFactory that creates StaticRemoteEnvironment used by a runner harness that
would like to use an existing InstructionRequestHandler.Provider for StaticRemoteEnvironmentFactory.
A set of utilities for inferring a Beam
Schema from static Java types.Constants and variables for CDC support.
A transform that converts messages to protocol buffers in preparation for writing to BigQuery.
This DoFn flushes and optionally (if requested) finalizes Storage API streams.
This
PTransform manages loads into BigQuery using the Storage API.Class used to wrap elements being sent to the Storage API sinks.
A transform to write sharded records to BigQuery using the Storage API.
A transform to write sharded records to BigQuery using the Storage API (Streaming).
Write records to the Storage API using a standard batch approach.
Deprecated.
Legacy non-portable source which can be replaced by a DoFn with timers.
PTransform that performs streaming BigQuery write.
Stores and exports metrics for a batch of Streaming Inserts RPCs.
No-op implementation of
StreamingInsertsResults.Metrics of a batch of InsertAll RPCs.
Deprecated.
tests which use unbounded PCollections should be in the category
UsesUnboundedPCollections.Options used to configure streaming.
StateRequestHandler that uses SideInputHandler to
access the broadcast state that represents side inputs.Class for creating context object of different CDAP classes with stream source type.
Supports translation between a Beam transform, and Spark's operations on DStreams.
Registers classes specialized by the Spark runner.
Translator matches Beam transformation with the appropriate evaluator.
This transform takes in key-value pairs of
TableRow entries and the TableDestination it should be written to.Position for
ReadChangeStreamPartitionProgressTracker.Stream TransformTranslator interface.
Combine.CombineFns for aggregating strings or bytes with an optional delimiter (default comma).A
Combine.CombineFn that aggregates bytes with a byte array as delimiter.A
Combine.CombineFn that aggregates strings with a string as delimiter.A
Coder that wraps a Coder<String> and encodes/decodes values via string
representations.A metric that reports set of unique string values.
Implementation of
StringSet.The result of a
StringSet metric.Empty
StringSetResult, representing no values reported and is immutable.A collection of static methods for manipulating datastructure representations transferred via the
Dataflow API.
A wrapper around a byte[] that uses structural, value-based equality rather than byte[]'s normal
object identity.
A (Key, Coder) pair that uses the structural value of the key (as provided by
Coder.structuralValue(Object)) to perform equality and hashing.An abstract base class to implement a
Coder that defines equality, hashing, and printing
via the class name and recursively using StructuredCoder.getComponents().An implementation of AwsCredentialsProvider that periodically sends an
AssumeRoleWithWebIdentityRequest to the AWS Security Token Service to maintain short-lived
sessions to use for authentication.Builder class for
StsAssumeRoleForFederatedCredentialsProvider.Output of
PAssert.PTransforms for computing the sum of the elements in a PCollection, or the sum of
the values associated with each key in a PCollection of KVs.A
StreamObserver which provides synchronous access to an underlying StreamObserver.Represents the metadata of a
BeamSqlTable.Builder class for
Table.A wrapper for a
KuduTable and the TableAndRecord representing a typed record.Encapsulates a BigQuery table destination.
A coder for
TableDestination objects.A
Coder for TableDestination that includes time partitioning information.A
Coder for TableDestination that includes time partitioning and clustering
information.Represents a parsed table name that is specified in a FROM clause (and other places).
Helper class to extract table identifiers from the query.
A
TableProvider handles the metadata CRUD of a specified kind of tables.Utility methods for converting JSON
TableRow objects to dynamic protocol message, for use
with the Storage write API.A descriptor for ClickHouse table schema.
A column in ClickHouse table.
A descriptor for a column type.
An enumeration of possible kinds of default values in ClickHouse.
An enumeration of possible types in ClickHouse.
An updatable cache for table schemas.
Helper utilities for handling schema-update responses.
For internal use only; no backwards-compatibility guarantees.
PTransforms for getting information about quantiles in a stream.Implementation of
TDigestQuantiles.globally().Implementation of
TDigestQuantiles.perKey().Implements the
Combine.CombineFn of TDigestQuantiles transforms.A PTransform that returns its input, but also applies its input to an auxiliary PTransform, akin
to the shell
tee command, which is named after the T-splitter used in plumbing.Test rule which creates a new table with a specified schema and randomized name, and exposes a few
APIs to work with it.
Interface to implement a polling assertion.
Mocked table for bounded data sources.
A set of options used to configure the
TestPipeline.TestDataflowRunner is a pipeline runner that wraps a DataflowRunner when running
tests against the TestPipeline.A
TestRule that validates that all submitted tasks finished and were completed.A union of the
ExecutorService and TestRule interfaces.Test Flink runner.
A JobService for tests.
An implementation of
DoFn.OutputReceiver that naively collects all output values.A creator of test pipelines that can be used inside of tests that can be configured to run
locally or against a remote pipeline runner.
An exception thrown in case an abandoned
PTransform is
detected, that is, a PTransform that has not been run.An exception thrown in case a test finishes without invoking
Pipeline.run().Implementation detail of
TestPipeline.newProvider(T), do not use.JUnit 5 extension for
TestPipeline that provides the same functionality as the JUnit 4
TestRule implementation.TestPipelineOptions is a set of options for test pipelines.Matcher which will always pass.
Factory for
PipelineResult matchers which always pass.Options for
TestPortableRunner.Factory for default config.
Register
TestPortablePipelineOptions.TestPortableRunner is a pipeline runner that wraps a PortableRunner when running
tests against the TestPipeline.PipelineOptions for use with the TestPrismRunner.Test rule which creates a new topic and subscription with randomized names and exposes the APIs
to work with them.
Test rule which observes elements of the
PCollection and checks whether they match the
success criteria.A
SparkPipelineOptions for tests.A factory to provide the default watermark to stop a pipeline that reads from an unbounded
source.
The SparkRunner translates operations defined on a pipeline to a representation executable by
Spark, and then submits the job to Spark to be executed.
A testing input that generates an unbounded
PCollection of elements, advancing the
watermark and processing time as elements are emitted.An incomplete
TestStream.A
TestStream.Event that produces elements.An event in a
TestStream.The types of
TestStream.Event that are supported by TestStream.A
TestStream.Event that advances the processing time clock.Coder for
TestStream.A
TestStream.Event that advances the watermark.Utility methods which enable testing of
StreamObservers.A builder for a test
CallStreamObserver that performs various callbacks.Flink source for executing
TestStream.Base class for mocked table.
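A hedged sketch of the TestStream entries above, assuming the Beam testing utilities and a TestPipeline p; element values and timestamps are illustrative.

    // Build a deterministic unbounded input with explicit watermark advancement.
    TestStream<String> events = TestStream.create(StringUtf8Coder.of())
        .addElements(TimestampedValue.of("open", new Instant(0L)))
        .advanceWatermarkTo(new Instant(1000L))
        .addElements(TimestampedValue.of("click", new Instant(1500L)))
        .advanceWatermarkToInfinity();
    PCollection<String> input = p.apply(events);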
Test in-memory table provider for use in tests.
TableWithRows.
Utility functions for mock classes.
A mocked unbounded table.
Register
TestUniversalRunner.Options.Registrar for the portable runner.
PTransforms for reading and writing text files.Deprecated.
Use
Compression.Implementation of
TextIO.read().Deprecated.
See
TextIO.readAll() for details.Implementation of
TextIO.readFiles().Implementation of
TextIO.sink().Implementation of
TextIO.write().This class is used as the default return value of
TextIO.write().TextJsonTable is a BeamSqlTable that reads text files and converts them according
to the JSON format.This returns a row count estimation for files associated with a file pattern.
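A minimal sketch of the TextIO read and write entries above, assuming standard Beam imports and an existing Pipeline p; the file patterns are placeholders.

    // Read lines of text, then write them out sharded with a suffix.
    PCollection<String> lines = p.apply(TextIO.read().from("/tmp/input/*.txt"));
    lines.apply(TextIO.write().to("/tmp/output/result").withSuffix(".txt"));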
Builder for
TextRowCountEstimator.This strategy stops sampling if we sample enough number of bytes.
This strategy stops sampling when total number of sampled bytes are more than some threshold.
An exception that will be thrown if the estimator cannot get an estimation of the number of
lines.
This strategy samples all the files.
Sampling Strategy determines when we should stop reading further files.
Implementation detail of
TextIO.Read.TextTable is a BeamSqlTable that reads text files and converts them according to
the specified format.Text table provider.
Read-side converter for
TextTable with format 'csv'.Read-side converter for
TextTable with format 'lines'.Write-side converter for for
TextTable with format 'lines'.A
Coder that encodes Integers as the ASCII bytes of their textual,
decimal, representation.PTransforms for reading and writing TensorFlow TFRecord files.Deprecated.
Use
Compression.Implementation of
TFRecordIO.read().Implementation of
TFRecordIO.readFiles().Implementation of
TFRecordIO.write().Configuration for reading from TFRecord.
Builder for
TFRecordReadSchemaTransformConfiguration.Configuration for reading from TFRecord.
A
Coder using a Thrift TProtocol to
serialize/deserialize elements.PTransforms for reading and writing files containing Thrift encoded data.Implementation of
ThriftIO.readFiles(java.lang.Class<T>).Implementation of
ThriftIO.sink(org.apache.thrift.protocol.TProtocolFactory).Writer to write Thrift object to
OutputStream.Schema provider for generated thrift types.
An estimator to calculate the throughput of the outputted elements from a DoFn.
A
BiConsumer which can throw Exceptions.A
BiFunction which can throw Exceptions.Transforms for parsing arbitrary files using Apache Tika.
Implementation of
TikaIO.parse().Implementation of
TikaIO.parseFiles().A time without a time-zone.
TimeDomain specifies whether an operation is based on timestamps of elements or current
"real-world" time as reported while processing.A timer for a specified time domain that can be set to register the desire for further processing
at particular time in its specified time domain.
A factory that passes timers to
TimerReceiverFactory.timerDataConsumer.Interface for interacting with time.
A specification for a
Timer.Static methods for working with
TimerSpecs.Utility class for handling timers in the Spark runner.
A marker class used to identify timer keys and values in Spark transformations.
Policies for combining timestamps that occur within a window.
Convert between different Timestamp and Instant classes.
An immutable pair of a value and a timestamp.
A
Coder for TimestampedValue.This encoder/decoder writes a com.google.cloud.Timestamp object as a pair of long and int to avro
and reads a Timestamp object from the same pair.
A
WatermarkEstimator that observes the timestamps of all records output from a DoFn.A timestamp policy to assign event time for messages in a Kafka partition and watermark for it.
The context contains state maintained in the reader for the partition.
An extendable factory to create a
TimestampPolicy for each partition at runtime by
KafkaIO reader.Assigns Kafka's log append time (server side ingestion time) to each record.
A simple policy that uses current time for event time and watermark.
Internal policy to support deprecated withTimestampFn API.
A
TimestampPrefixingWindowCoder wraps arbitrary user custom window coder.A restriction represented by a range of timestamps [from, to).
A
RestrictionTracker for claiming positions in a TimestampRange in a
monotonically increasing fashion.For internal use only; no backwards-compatibility guarantees.
For internal use only; no backwards-compatibility guarantees.
For internal use only; no backwards-compatibility guarantees.
Provides methods in order to convert timestamp to nanoseconds representation and back.
A helper class for converting between Dataflow API and SDK time representations.
Time conversion utilities.
Creates a PTransform that serializes UTF-8 JSON objects from a Schema-aware PCollection (i.e.
PTransforms for finding the largest (or smallest) set of elements in a PCollection, or the largest (or smallest) set of values associated with each key in a PCollection of KVs.
Deprecated. Use Top.Natural instead.
A Serializable Comparator that uses the compared elements' natural ordering.
A Serializable Comparator that uses the reverse of the compared elements' natural ordering.
Deprecated. Use Top.Reversed instead.
CombineFn for Top transforms that combines a bunch of Ts into a single count-long List<T>, using compareFn to choose the largest Ts.
The Coder for encoding and decoding TopicPartition in Beam.
PTransforms for converting a PCollection<?>, PCollection<KV<?,?>>, or PCollection<Iterable<?>> to a PCollection<String>.
A transaction object.
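A minimal sketch of the Top transform described above; the helper name and the count of 3 are illustrative assumptions:

    import java.util.List;
    import org.apache.beam.sdk.transforms.Top;
    import org.apache.beam.sdk.values.PCollection;

    class TopExample {
      // Emits a single element: the list of the 3 largest scores by natural ordering.
      static PCollection<List<Long>> top3(PCollection<Long> scores) {
        return scores.apply(Top.largest(3));
      }
    }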
Describes a PTransform evaluator.
A Runnable that will execute a PTransform on some bundle of input.
Provides a mapping of RunnerApi.FunctionSpec to a PTransform, together with mappings of its inputs and outputs to maps of PCollections.
A utility that can be used to manage a Beam Transform Service.
A TransformTranslator knows how to translate a particular subclass of PTransform for the Cloud Dataflow service.
A TransformTranslator provides the capability to translate a specific primitive or composite PTransform into its Spark counterpart.
Supports translation between a Beam transform and Spark's operations on RDDs.
The interface for a TransformTranslator to build a Dataflow step.
The interface provided to registered callbacks for interacting with the DataflowRunner, including reading and writing the values of PCollections and side inputs.
Translator matches a Beam transformation with the appropriate evaluator.
A set of utilities to help translate Beam transformations into Spark transformations.
doc.
A SparkCombineFn function applied to grouped KVs.
A utility class to filter TupleTags.
Helpers for cloud communication.
Triggers control when the elements for a specific key and window are output.
For internal use only; no backwards-compatibility guarantees.
A TupleTag is a typed tag to use as the key of a heterogeneously typed tuple, like PCollectionTuple.
A TupleTagList is an immutable list of heterogeneously typed TupleTags.
TVFSlidingWindowFn assigns windows based on an input row's "window_start" and "window_end" timestamps.
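A minimal sketch of how the TupleTag and TupleTagList entries above are commonly used to route a multi-output ParDo into a PCollectionTuple; the tag names and the length-based split are illustrative assumptions:

    import org.apache.beam.sdk.transforms.DoFn;
    import org.apache.beam.sdk.transforms.ParDo;
    import org.apache.beam.sdk.values.PCollection;
    import org.apache.beam.sdk.values.PCollectionTuple;
    import org.apache.beam.sdk.values.TupleTag;
    import org.apache.beam.sdk.values.TupleTagList;

    class SplitExample {
      // Anonymous subclasses capture the type parameter for coder inference.
      static final TupleTag<String> SHORT = new TupleTag<String>() {};
      static final TupleTag<String> LONG = new TupleTag<String>() {};

      static PCollectionTuple split(PCollection<String> words) {
        return words.apply(
            ParDo.of(
                    new DoFn<String, String>() {
                      @ProcessElement
                      public void process(@Element String word, MultiOutputReceiver out) {
                        if (word.length() < 5) {
                          out.get(SHORT).output(word);
                        } else {
                          out.get(LONG).output(word);
                        }
                      }
                    })
                .withOutputTags(SHORT, TupleTagList.of(LONG)));
      }
    }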
Provides static constants or utils for TVF streaming.
doc.
Twister2 pipeline translator for batch pipelines.
Twister2BatchTranslationContext.
Twister2 wrapper for Bounded Source.
Empty Source wrapper.
Twister2PipelineExecutionEnvironment.
Twister2PipelineOptions.
Represents a Twister2 pipeline execution result.
Twister2PipelineTranslator; both batch and streaming translators need to extend from this.
Deprecated.
The support for twister2 is scheduled for removal in Beam 3.0.
AutoService registrar - will register Twister2Runner and Twister2Options as possible pipeline
runner services.
Pipeline options registrar.
Pipeline runner registrar.
Sink Function that collects results.
Twister2 pipeline translator for stream pipelines.
Twister2StreamingTranslationContext.
A PipelineRunner that executes the operations in the pipeline by first translating them to a Twister2 Plan and then executing them either locally or on a Twister2 cluster, depending on the configuration.
Twister2TranslationContext.
Represents a type of a column within Cloud Spanner.
A Combine.CombineFn delegating all relevant calls to a given delegate.
A description of a Java type, including actual generic parameters where possible.
A utility class for creating TypeDescriptor objects for different types, such as Java primitive types, containers and KVs of other TypeDescriptor objects, and extracting type variables of parameterized types (e.g.
A helper interface for use with TypeDescriptors.extractFromTypeParameters(Object, Class, TypeVariableExtractor).
Like SchemaTransformProvider except it uses a configuration object instead of Schema and Row.
Captures a free type variable that can be used in TypeDescriptor.where(org.apache.beam.sdk.values.TypeParameter<X>, org.apache.beam.sdk.values.TypeDescriptor<X>).
Implements AggregateFunction to take a Combine.CombineFn as a UDAF.
Beam-customized version of ReflectiveFunctionBase, to address BEAM-5921.
Helps build lists of FunctionParameter.
Provider for user-defined functions written in Java.
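A minimal sketch of the TypeDescriptor and TypeDescriptors utilities described above, showing the factory-method form next to the equivalent anonymous-subclass form; the field names are illustrative:

    import org.apache.beam.sdk.values.KV;
    import org.apache.beam.sdk.values.TypeDescriptor;
    import org.apache.beam.sdk.values.TypeDescriptors;

    class TypeDescriptorSketch {
      // Builds a TypeDescriptor for KV<String, Integer> from the factory helpers.
      static final TypeDescriptor<KV<String, Integer>> KV_TYPE =
          TypeDescriptors.kvs(TypeDescriptors.strings(), TypeDescriptors.integers());

      // Equivalent handwritten form, capturing generics via an anonymous subclass.
      static final TypeDescriptor<KV<String, Integer>> KV_TYPE_EXPLICIT =
          new TypeDescriptor<KV<String, Integer>>() {};
    }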
Defines Java UDFs for use in tests.
Provider for UDF and UDAF.
This DoFn is responsible for writing to Solace in batch mode (holding up any messages) and emitting the corresponding output (success or fail; only for persistent messages), so the SolaceIO.Write connector can be composed with other subsequent transforms in the pipeline.
DStream holder. Can also create a DStream from a supplied queue of values, but mainly for testing.
This DoFn encapsulates common code used both for the UnboundedBatchedSolaceWriter and the UnboundedStreamingSolaceWriter.
A Source that reads an unbounded amount of input and, because of that, supports some additional operations such as checkpointing, watermarks, and record ids.
A marker representing the progress and state of an UnboundedSource.UnboundedReader.
A checkpoint mark that does nothing when finalized.
A Reader that reads an unbounded amount of input.
Jet Processor implementation for reading from an unbounded Beam source.
Wrapper for executing UnboundedSources as a Flink Source.
This DoFn is responsible for writing to Solace in streaming mode (one message at a time, not holding up any message) and emitting the corresponding output (success or fail; only for persistent messages), so the SolaceIO.Write connector can be composed with other subsequent transforms in the pipeline.
A UnionCoder encodes RawUnionValues.
Generate unique IDs that can be used to differentiate different jobs and partitions.
A base class for logical types that are not understood by the Java SDK.
Combines the source event which failed to process with the failure reason.
Options for controlling what to do with unsigned types, specifically whether to use a higher bit
count or, in the case of uint64, a string.
Defines the exact behavior for unsigned types.
Builder for UnsignedOptions.
A legacy snapshot which does not care about schema compatibility.
Builds a MongoDB UpdateConfiguration object.
Update destination schema based on data that is about to be copied into it.
Implements a response interceptor that logs the upload id if the upload id header exists and it is the first request (does not have an upload_id parameter in the request).
Base Exception for signaling errors in user custom code.
Extends UserCodeQuotaException to allow the user custom code to specifically signal a quota or API overuse related error.
A UserCodeExecutionException that signals an error with a remote system.
An extension of UserCodeQuotaException to specifically signal a user code timeout.
Category tag for validation tests which utilize Metrics.
Category tag for validation tests which utilize splittable ParDo with a DoFn.BoundedPerElement DoFn.
Category tag for validation tests which utilize BoundedTrie.
Category tag for validation tests which use DoFn.BundleFinalizer.
Category tag for validation tests which utilize Metrics.
Category tag for validation tests which utilize Counter.
Category tag for validation tests which utilize custom window merging.
Category tag for validation tests which utilize Distribution.
Category tag for tests which rely on a pre-defined port, such as an expansion service or transform service.
Category tag for tests which validate that the correct failure message is provided by a failed pipeline.
Category tag for validation tests which utilize Gauge.
Category for tests that use Impulse transformations.
Category tag for tests which use the expansion service in Java.
Category tag for validation tests which use key.
Category tag for validation tests which utilize --tempRoot from TestPipelineOptions and expect a default KMS key enabled for the bucket specified.
Category tag for validation tests which utilize looping timers in ParDo.
Category tag for validation tests which utilize MapState.
Category tag for validation tests which utilize the metrics pusher feature.
Category tag for validation tests which utilize MultimapState.
Category tag for validation tests which utilize DoFn.OnWindowExpiration.
Category tag for validation tests which utilize OrderedListState.
Category tag for the ParDoLifecycleTest for exclusion (BEAM-3241).
Category tag for validation tests which rely on a runner providing per-key ordering.
Category tag for validation tests which rely on a runner providing per-key ordering in between transforms in the same ProcessBundleRequest.
Category tag for validation tests which utilize timers in ParDo.
Category tag for tests which use the expansion service in Python.
Category tag for validation tests which utilize DoFn.RequiresTimeSortedInput in stateful ParDo.
Category tag for validation tests which utilize schemas.
Category tag for tests which validate that the SDK harness executes in a well-formed environment.
Category tag for validation tests which utilize SetState.
Category tag for validation tests which use side inputs.
Category tag for validation tests which use multiple side inputs with different coders.
Category tag for validation tests which utilize stateful ParDo.
Category for tests that enforce strict event-time ordering of fired timers, even in situations where multiple timers mutually set one another and the watermark hops arbitrarily far into the future.
Category tag for validation tests which utilize StringSet.
Category tag for tests that use System metrics.
Category tag for tests that use TestStream, which is not a part of the Beam model but a special feature currently only implemented by the direct runner and the Flink Runner (streaming).
Subcategory for UsesTestStream tests which use TestStream across multiple stages.
Category tag for validation tests which use outputTimestamp.
Subcategory for UsesTestStream tests which use the processing time feature of TestStream.
Category tag for validation tests which use timerMap.
Category tag for validation tests which utilize timers in ParDo.
Category tag for validation tests which use triggered side inputs.
Category tag for validation tests which utilize at least one unbounded PCollection.
Category tag for validation tests which utilize splittable ParDo with a DoFn.UnboundedPerElement DoFn.
Various common methods used by the Jet based runner.
A wrapper of byte[] that can be used as a hash-map key.
A Uuid storable in a Pub/Sub Lite attribute.
A coder for a Uuid.
Options for deduplicating Pub/Sub Lite messages based on the UUID they were published with.
A transform for deduplicating Pub/Sub Lite messages based on the UUID they were published with.
Base class for types representing UUID as two long values.
Category tag for tests which validate that a Beam runner is correctly implemented.
Validation represents a set of annotations that can be used to annotate getter properties on PipelineOptions with information representing the validation criteria to be used when validating with the PipelineOptionsValidator.
This criterion specifies that the value must not be null.
Kryo serializer for ValueAndCoderLazySerializable.
A holder object that lets you serialize an element with a Coder with minimal wasted space.
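A minimal sketch of the Validation annotations described above; the MyOptions interface and its --inputFile option are hypothetical, not part of the Beam API:

    import org.apache.beam.sdk.options.Description;
    import org.apache.beam.sdk.options.PipelineOptions;
    import org.apache.beam.sdk.options.Validation;

    // Hypothetical options interface; the annotated getter is checked when options are validated.
    public interface MyOptions extends PipelineOptions {
      @Description("Path of the file to read from")
      @Validation.Required
      String getInputFile();

      void setInputFile(String value);
    }

Constructing the options with PipelineOptionsFactory.fromArgs(args).withValidation().as(MyOptions.class) should then fail fast if --inputFile is missing.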
Represents the capture type of a change stream.
An immutable tuple of value, timestamp, window, and pane.
A coder for ValueInSingleWindow.
A ValueProvider abstracts the notion of fetching a value that may or may not be currently available.
For internal use only; no backwards compatibility guarantees.
ValueProvider.NestedValueProvider is an implementation of ValueProvider that allows for wrapping another ValueProvider object.
ValueProvider.RuntimeValueProvider is an implementation of ValueProvider that allows for a value to be provided at execution time rather than at graph construction time.
For internal use only; no backwards compatibility guarantees.
ValueProvider.StaticValueProvider is an implementation of ValueProvider that allows for a static value to be provided.
Utilities for working with the ValueProvider interface.
Values<V> takes a PCollection of KV<K, V>s and returns a PCollection<V> of the values.
A ReadableState cell containing a single value.
For internal use only; no backwards compatibility guarantees.
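A minimal sketch of the ValueProvider variants described above; the literal path and the derived provider are illustrative assumptions:

    import org.apache.beam.sdk.options.ValueProvider;
    import org.apache.beam.sdk.options.ValueProvider.NestedValueProvider;
    import org.apache.beam.sdk.options.ValueProvider.StaticValueProvider;

    class ValueProviderSketch {
      static void demo() {
        // A value that is already known at graph construction time.
        ValueProvider<String> path = StaticValueProvider.of("gs://my-bucket/input.txt");
        // A provider derived lazily from another provider.
        ValueProvider<Integer> pathLength = NestedValueProvider.of(path, String::length);
        if (path.isAccessible()) {
          System.out.println(path.get() + " (" + pathLength.get() + " chars)");
        }
      }
    }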
A LogicalType representing a variable-length byte array with specified maximum length.
A LogicalType representing a variable-length string with specified maximum length.
Combine.CombineFn for Variance on Number types.
Benchmarks for VarInt and variants.
Output to Blackhole.
Input from randomly generated bytes.
Output to ByteStringOutputStream.
Input from randomly generated longs.
Factory class for PTransforms integrating with Google Cloud AI - VideoIntelligence service.
A PTransform taking a PCollection of ByteString and an optional side input with a context map and emitting lists of VideoAnnotationResults for each element.
A PTransform taking a PCollection of KV of ByteString and VideoContext and emitting lists of VideoAnnotationResults for each element.
A PTransform taking a PCollection of String and an optional side input with a context map and emitting lists of VideoAnnotationResults for each element.
Transforms for creating PCollectionViews from PCollections (to read them as side inputs).
For internal use only; no backwards-compatibility guarantees.
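A minimal sketch of the View transforms described above, materializing a lookup PCollection as a map-shaped side input; the method and variable names are illustrative assumptions:

    import java.util.Map;
    import org.apache.beam.sdk.transforms.DoFn;
    import org.apache.beam.sdk.transforms.ParDo;
    import org.apache.beam.sdk.transforms.View;
    import org.apache.beam.sdk.values.KV;
    import org.apache.beam.sdk.values.PCollection;
    import org.apache.beam.sdk.values.PCollectionView;

    class SideInputExample {
      static PCollection<String> enrich(
          PCollection<String> ids, PCollection<KV<String, String>> lookup) {
        // Materialize the lookup collection as a map-shaped side input.
        PCollectionView<Map<String, String>> lookupView = lookup.apply(View.asMap());
        return ids.apply(
            ParDo.of(
                    new DoFn<String, String>() {
                      @ProcessElement
                      public void process(ProcessContext c) {
                        String value = c.sideInput(lookupView).get(c.element());
                        c.output(c.element() + "=" + value);
                      }
                    })
                .withSideInputs(lookupView));
      }
    }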
For internal use only; no backwards-compatibility guarantees.
For internal use only; no backwards-compatibility guarantees.
For internal use only; no backwards-compatibility guarantees.
For internal use only; no backwards-compatibility guarantees.
For internal use only; no backwards-compatibility guarantees.
Provides an index to value mapping using a random starting index and also provides an offset
range for each window seen.
For internal use only; no backwards-compatibility guarantees.
Jet Processor implementation for Beam's side-input-producing primitives.
Delays processing of each window in a PCollection until signaled.
Implementation of Wait.on(org.apache.beam.sdk.values.PCollection<?>...).
Given a "poll function" that produces a potentially growing set of outputs for an input, this transform simultaneously and continuously watches the growth of the output sets of all inputs, until a per-input termination condition is reached.
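A minimal sketch of the Wait transform described above, gating one collection on the completion of another; the method and parameter names are illustrative assumptions:

    import org.apache.beam.sdk.transforms.Wait;
    import org.apache.beam.sdk.values.PCollection;

    class WaitExample {
      // Holds back processing of each window of `main` until the corresponding
      // window of `signal` is complete (e.g. a prior write has finished).
      static PCollection<String> gate(PCollection<String> main, PCollection<?> signal) {
        return main.apply(Wait.on(signal));
      }
    }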
A function that computes the current set of outputs for the given input, in the form of a Watch.Growth.PollResult.
The result of a single invocation of a Watch.Growth.PollFn.
A strategy for determining whether it is time to stop polling the current input regardless of whether its output is complete or not.
A WatermarkEstimator which is used for estimating output watermarks of a splittable DoFn.
Support utilities for interacting with WatermarkEstimators.
A set of WatermarkEstimators that users can use to advance the output watermark for their associated splittable DoFns.
Concrete implementation of a ManualWatermarkEstimator.
A watermark estimator that observes timestamps of records output from a DoFn, reporting the timestamp of the last element seen as the current watermark.
A watermark estimator that tracks wall time.
Interface which allows for accessing the current watermark and watermark estimator state.
For internal use only; no backwards-compatibility guarantees.
WatermarkParameters contains the parameters used for watermark computation.
Implement this interface to define a custom watermark calculation heuristic.
Implement this interface to create a WatermarkPolicy.
ArrivalTimeWatermarkPolicy uses WatermarkPolicyFactory.CustomWatermarkPolicy for watermark computation.
CustomWatermarkPolicy uses parameters defined in WatermarkParameters to compute watermarks.
Watermark policy where the processing time is used as the event time.
Defines the behavior for an OIDC web identity token provider.
Window logically divides up or groups the elements of a PCollection into finite windows according to a WindowFn.
A primitive PTransform that assigns windows to elements based on a WindowFn.
Specifies the conditions under which a final pane will be created when a window is permanently closed.
Specifies the conditions under which an on-time pane will be created when a window is closed.
Flink operator for executing window DoFns.
A value along with Beam's windowing information and all other metadata.
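A minimal sketch of the Window transform described above, assigning elements to fixed event-time windows; the five-minute window size is an illustrative assumption:

    import org.apache.beam.sdk.transforms.windowing.FixedWindows;
    import org.apache.beam.sdk.transforms.windowing.Window;
    import org.apache.beam.sdk.values.PCollection;
    import org.joda.time.Duration;

    class WindowSketch {
      // Groups elements into consecutive 5-minute event-time windows.
      static PCollection<String> intoFixedWindows(PCollection<String> input) {
        return input.apply(Window.<String>into(FixedWindows.of(Duration.standardMinutes(5))));
      }
    }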
Implementations of WindowedValue and static utility methods.
Coder for WindowedValue.
A parameterized coder for WindowedValue.
A WindowedValues which holds exactly one window per value.
Deprecated. Use ParamWindowedValueCoder instead; it is a general-purpose implementation of the same concept but makes timestamp, windows and pane info configurable.
Abstract class for WindowedValue coder.
The argument to the Window transform used to assign elements into windows and to determine how windows are merged.
A utility class for testing WindowFns.
Jet Processor implementation for Beam's GroupByKeyOnly + GroupAlsoByWindow primitives.
A WindowingStrategy describes the windowing behavior for a specific collection of values.
The accumulation modes that can be used with windowing.
An implementation of TypedSchemaTransformProvider for WindowInto.
A function that takes the windows of elements in a main input and maps them to the appropriate window in a PCollectionView consumed as a side input.
Helpers to construct coders for gRPC port reads and writes.
A collection of utilities for writing transforms that can handle exceptions raised during
processing of elements.
A simple handler that extracts information from an exception to a Map<String, String> and returns a KV where the key is the input element that failed processing, and the value is the map of exception attributes.
The value type passed as input to exception handlers.
An intermediate output type for PTransforms that allows an output collection to live alongside
a collection of elements that failed the transform.
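A minimal sketch of the exception-handling utilities described above, using the MapElements exceptionsInto/exceptionsVia pattern; the parsing use case and the failure-string format are illustrative assumptions:

    import org.apache.beam.sdk.transforms.MapElements;
    import org.apache.beam.sdk.transforms.WithFailures;
    import org.apache.beam.sdk.transforms.WithFailures.ExceptionElement;
    import org.apache.beam.sdk.values.PCollection;
    import org.apache.beam.sdk.values.TypeDescriptors;

    class WithFailuresExample {
      // Parses strings to integers; elements that throw are routed to a failure
      // collection instead of failing the bundle.
      static WithFailures.Result<PCollection<Integer>, String> parse(PCollection<String> input) {
        return input.apply(
            MapElements.into(TypeDescriptors.integers())
                .via((String s) -> Integer.parseInt(s))
                .exceptionsInto(TypeDescriptors.strings())
                .exceptionsVia(
                    (ExceptionElement<String> ee) ->
                        ee.element() + ": " + ee.exception().getMessage()));
      }
    }

The returned result exposes the successful outputs via output() and the failed elements via failures().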
WithKeys<K, V> takes a PCollection<V>, and either a constant key of type K or a function from V to K, and returns a PCollection<KV<K, V>>, where each of the values in the input PCollection has been paired with either the constant key or a key computed from the value.
A MetricRegistry decorator-like that supports AggregatorMetric and SparkBeamMetric as Gauges.
A MetricRegistry decorator-like that supports BeamMetricSets as Gauges.
A PTransform for assigning timestamps to all the elements of a PCollection.
Duplicated from beam-examples-java to avoid dependency.
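A minimal sketch of the WithKeys transform described above, together with Values for stripping keys back off; the length-based key is an illustrative assumption:

    import org.apache.beam.sdk.transforms.Values;
    import org.apache.beam.sdk.transforms.WithKeys;
    import org.apache.beam.sdk.values.KV;
    import org.apache.beam.sdk.values.PCollection;
    import org.apache.beam.sdk.values.TypeDescriptors;

    class KeysExample {
      // Pairs each word with its length as the key; withKeyType is needed because
      // the lambda erases the key type.
      static PCollection<KV<Integer, String>> keyByLength(PCollection<String> words) {
        return words.apply(
            WithKeys.of((String word) -> word.length())
                .withKeyType(TypeDescriptors.integers()));
      }

      // Drops the keys again, yielding just the values.
      static PCollection<String> values(PCollection<KV<Integer, String>> keyed) {
        return keyed.apply(Values.create());
      }
    }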
A PTransform that converts a PCollection containing lines of text into a PCollection of
formatted word counts.
A SimpleFunction that converts a Word and Count into a printable string.
Options supported by WordCount.
Workarounds for dealing with limitations of Flink or its libraries.
KeySelector that retrieves a key from a KeyedWorkItem.
Wrapper class for ReceiverSupervisor that doesn't use Spark Environment.
Parameters class to expose the transform to an external SDK.
Enum containing all supported dispositions during writing to table phase.
A PTransform that writes to a FileBasedSink.
The result of a WriteFiles transform.
Return type of JmsIO.Write transform.
The result of a BigQueryIO.Write transform.
DoFn for writing to Apache Pulsar.
Transforms for reading and writing XML files using JAXB mappers.
Implementation of XmlIO.read().
Deprecated. Use Compression instead.
Implementation of XmlIO.readFiles().
Implementation of XmlIO.sink(java.lang.Class<T>).
Implementation of XmlIO.write().
Implementation of XmlIO.read().
A FileWriteSchemaTransformFormatProvider for XML format.
Allows one to invoke Beam YAML transforms from Java.
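A minimal sketch of the XmlIO.read() entry above; the Order record type, element names and path are hypothetical stand-ins for a real JAXB-mapped schema:

    import javax.xml.bind.annotation.XmlRootElement;
    import org.apache.beam.sdk.Pipeline;
    import org.apache.beam.sdk.io.xml.XmlIO;
    import org.apache.beam.sdk.values.PCollection;

    class XmlReadExample {
      // Hypothetical JAXB-mapped record type standing in for a real schema.
      @XmlRootElement(name = "order")
      static class Order {
        public String id;
      }

      static PCollection<Order> readOrders(Pipeline p) {
        return p.apply(
            XmlIO.<Order>read()
                .from("/tmp/orders.xml")
                .withRootElement("orders")
                .withRecordElement("order")
                .withRecordClass(Order.class));
      }
    }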
This is a class to indicate that a TVF is a ZetaSQL SQL native UDTVF.
Wraps an existing coder with Zstandard compression.
ApproximateCountDistinct in the zetasketch extension module, which makes use of the HllCount implementation.