Interface BigQueryOptions
- All Superinterfaces:
ApplicationNameOptions, GcpOptions, GoogleApiDebugOptions, HasDisplayData, PipelineOptions, StreamingOptions
- All Known Subinterfaces:
DataflowPipelineOptions, DataflowWorkerHarnessOptions, TestBigQueryOptions, TestDataflowPipelineOptions
public interface BigQueryOptions
extends ApplicationNameOptions, GcpOptions, PipelineOptions, StreamingOptions
Properties needed when using Google BigQuery with the Apache Beam SDK.
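Each property on this interface is declared as a getter/setter pair, and pipeline options of this kind are usually supplied as command-line flags. The sketch below shows how a getter name could map to its flag, under the assumption that the flag name is the getter name with its prefix removed and the JavaBeans decapitalization rule applied. OptionFlagNames and flagFor are hypothetical helpers written for illustration; they are not part of the Beam API.

```java
import java.beans.Introspector;

// Hypothetical helper for illustration only; not part of the Beam API.
class OptionFlagNames {

  // Derives a "--flag" name from a getter name such as "getTempDatasetId".
  // Assumption: the flag is the getter name without its "get"/"is" prefix,
  // decapitalized according to the JavaBeans rule.
  static String flagFor(String getterName) {
    int prefixLength = getterName.startsWith("is") ? 2 : 3;
    return "--" + Introspector.decapitalize(getterName.substring(prefixLength));
  }

  public static void main(String[] args) {
    System.out.println(flagFor("getTempDatasetId"));
    System.out.println(flagFor("getUseStorageWriteApi"));
    // The JavaBeans rule leaves names that start with two capitals unchanged.
    System.out.println(flagFor("getHTTPReadTimeout"));
  }
}
```

Under this assumption, a name that begins with an acronym, such as HTTPReadTimeout, keeps its leading capitals in the flag.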
-
Nested Class Summary
Nested classes/interfaces inherited from interface org.apache.beam.sdk.extensions.gcp.options.GcpOptions
GcpOptions.DefaultProjectFactory, GcpOptions.EnableStreamingEngineFactory, GcpOptions.GcpOAuthScopesFactory, GcpOptions.GcpTempLocationFactory, GcpOptions.GcpUserCredentialsFactory
Nested classes/interfaces inherited from interface org.apache.beam.sdk.extensions.gcp.options.GoogleApiDebugOptions
GoogleApiDebugOptions.GoogleApiTracer
Nested classes/interfaces inherited from interface org.apache.beam.sdk.options.PipelineOptions
PipelineOptions.AtomicLongFactory, PipelineOptions.CheckEnabled, PipelineOptions.DirectRunner, PipelineOptions.JobNameFactory, PipelineOptions.UserAgentFactory
-
Field Summary
Fields inherited from interface org.apache.beam.sdk.extensions.gcp.options.GcpOptions
STREAMING_ENGINE_EXPERIMENT, WINDMILL_SERVICE_EXPERIMENT
-
Method Summary
Modifier and Type, Method, Description
getBigQueryEndpoint(): BQ endpoint to use.
getGroupFilesFileLoad(): Choose to use a GBK when gathering a list of files in batch FILE_LOAD.
void setBigQueryEndpoint(String value)
void setBigQueryProject(String value)
void setEnableStorageReadApiV2(Boolean value)
void setGroupFilesFileLoad(Boolean value)
void setHTTPReadTimeout(Integer timeout)
void setHTTPWriteTimeout(Integer timeout)
void setInsertBundleParallelism(Integer parallelism)
void setJobLabelsMap(Map<String, String> value)
void setMaxStreamingBatchSize(Long value)
void setMaxStreamingRowsToBatch(Long value)
void setNumStreamingKeys(Integer value)
void setTempDatasetId(String value)
void setUseStorageWriteApi(Boolean value)
Methods inherited from interface org.apache.beam.sdk.options.ApplicationNameOptions
getAppName, setAppName
Methods inherited from interface org.apache.beam.sdk.extensions.gcp.options.GcpOptions
getCredentialFactoryClass, getDataflowKmsKey, getGcpCredential, getGcpOauthScopes, getGcpTempLocation, getImpersonateServiceAccount, getProject, getWorkerRegion, getWorkerZone, getZone, isEnableStreamingEngine, setCredentialFactoryClass, setDataflowKmsKey, setEnableStreamingEngine, setGcpCredential, setGcpOauthScopes, setGcpTempLocation, setImpersonateServiceAccount, setProject, setWorkerRegion, setWorkerZone, setZone
Methods inherited from interface org.apache.beam.sdk.extensions.gcp.options.GoogleApiDebugOptions
getGoogleApiTrace, setGoogleApiTrace
Methods inherited from interface org.apache.beam.sdk.transforms.display.HasDisplayData
populateDisplayData
Methods inherited from interface org.apache.beam.sdk.options.PipelineOptions
as, getGbek, getJobName, getOptionsId, getRunner, getStableUniqueNames, getTempLocation, getUserAgent, outputRuntimeOptions, revision, setGbek, setJobName, setOptionsId, setRunner, setStableUniqueNames, setTempLocation, setUserAgent
Methods inherited from interface org.apache.beam.sdk.options.StreamingOptions
getUpdateCompatibilityVersion, isStreaming, setStreaming, setUpdateCompatibilityVersion
-
Method Details
-
getTempDatasetId
-
setTempDatasetId
-
getHTTPReadTimeout
-
setHTTPReadTimeout
-
getHTTPWriteTimeout
-
setHTTPWriteTimeout
-
getInsertBundleParallelism
-
setInsertBundleParallelism
-
getNumStreamingKeys
-
setNumStreamingKeys
-
getMaxStreamingRowsToBatch
-
setMaxStreamingRowsToBatch
-
getMaxStreamingBatchSize
-
setMaxStreamingBatchSize
-
getBqStreamingApiLoggingFrequencySec
-
setBqStreamingApiLoggingFrequencySec
-
getUseStorageWriteApi
-
setUseStorageWriteApi
-
getUseStorageWriteApiAtLeastOnce
-
setUseStorageWriteApiAtLeastOnce
-
getNumStorageWriteApiStreams
-
setNumStorageWriteApiStreams
-
getNumStorageWriteApiStreamAppendClients
-
setNumStorageWriteApiStreamAppendClients
-
getMinConnectionPoolConnections
-
setMinConnectionPoolConnections
-
getMaxConnectionPoolConnections
-
setMaxConnectionPoolConnections
-
getStorageWriteMaxInflightRequests
-
setStorageWriteMaxInflightRequests
-
getStorageWriteMaxInflightBytes
-
setStorageWriteMaxInflightBytes
-
getUseStorageApiConnectionPool
-
setUseStorageApiConnectionPool
-
getStorageWriteApiTriggeringFrequencySec
Integer getStorageWriteApiTriggeringFrequencySec()
-
setStorageWriteApiTriggeringFrequencySec
-
getStorageWriteApiMaxRetries
-
setStorageWriteApiMaxRetries
-
getMaxBufferingDurationMilliSec
-
setMaxBufferingDurationMilliSec
-
getBigQueryProject
String getBigQueryProject()
-
setBigQueryProject
-
getStorageApiAppendThresholdBytes
-
setStorageApiAppendThresholdBytes
-
getStorageApiAppendThresholdRecordCount
-
setStorageApiAppendThresholdRecordCount
-
getStorageWriteApiMaxRequestSize
-
setStorageWriteApiMaxRequestSize
-
getEnableStorageReadApiV2
-
setEnableStorageReadApiV2
-
getJobLabelsMap
-
setJobLabelsMap
-
getBigQueryEndpoint
BQ endpoint to use. If unspecified, uses the default endpoint.
-
setBigQueryEndpoint
-
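The fallback described for getBigQueryEndpoint can be pictured with the minimal sketch below. It assumes that "unspecified" means the option value is null. EndpointResolution, resolve, and ASSUMED_DEFAULT_ENDPOINT are hypothetical names, and the default URL is a placeholder chosen for illustration; this page does not state which endpoint is actually used as the default.

```java
import java.util.Optional;

// Hypothetical helper for illustration only; not part of the Beam API.
class EndpointResolution {

  // Placeholder value: the real default endpoint is not stated in this page.
  static final String ASSUMED_DEFAULT_ENDPOINT = "https://bigquery.googleapis.com";

  // Returns the configured endpoint, or the assumed default when none was set (null).
  static String resolve(String configuredEndpoint) {
    return Optional.ofNullable(configuredEndpoint).orElse(ASSUMED_DEFAULT_ENDPOINT);
  }

  public static void main(String[] args) {
    // No endpoint configured: the assumed default is used.
    System.out.println(resolve(null));
    // An explicitly configured endpoint wins over the default.
    System.out.println(resolve("https://bigquery.example.test"));
  }
}
```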
getGroupFilesFileLoad
Choose to use a GroupByKey (GBK) when gathering a list of files in batch FILE_LOAD.
This option exists for runner compatibility: some runners have known issues with large side inputs, and turning this option on avoids triggering those side-input-related issues.
This is an experimental pipeline option; backward compatibility is not guaranteed.
-
setGroupFilesFileLoad
-