Interface BigQueryOptions
- All Superinterfaces:
ApplicationNameOptions, GcpOptions, GoogleApiDebugOptions, HasDisplayData, PipelineOptions, StreamingOptions
- All Known Subinterfaces:
DataflowPipelineOptions, DataflowWorkerHarnessOptions, TestBigQueryOptions, TestDataflowPipelineOptions
public interface BigQueryOptions
extends ApplicationNameOptions, GcpOptions, PipelineOptions, StreamingOptions
Properties needed when using Google BigQuery with the Apache Beam SDK.
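The getter/setter pairs declared on this interface have no hand-written implementation; Beam supplies one at runtime through a dynamic proxy. The sketch below imitates that idea in plain Java so that it runs without the Beam SDK. `OptionsProxySketch` and `MiniBigQueryOptions` are hypothetical names introduced for illustration, only two of the real option pairs are mirrored, and the map-backed handler is a simplification rather than Beam's actual handler.

```java
import java.lang.reflect.InvocationHandler;
import java.lang.reflect.Proxy;
import java.util.HashMap;
import java.util.Map;

final class OptionsProxySketch {
  // Hypothetical stand-in for a small slice of BigQueryOptions; not the Beam interface.
  interface MiniBigQueryOptions {
    String getTempDatasetId();
    void setTempDatasetId(String value);
    Boolean getUseStorageWriteApi();
    void setUseStorageWriteApi(Boolean value);
  }

  // Builds an instance of a getter/setter-only interface, keeping values in a map.
  static <T> T create(Class<T> iface) {
    Map<String, Object> values = new HashMap<>();
    InvocationHandler handler = (proxy, method, args) -> {
      String name = method.getName();
      if (name.startsWith("set") && args != null && args.length == 1) {
        values.put(name.substring(3), args[0]); // "setFoo" stores under "Foo"
        return null;
      }
      if (name.startsWith("get") && (args == null || args.length == 0)) {
        return values.get(name.substring(3)); // null when the option was never set
      }
      throw new UnsupportedOperationException(name);
    };
    Object instance =
        Proxy.newProxyInstance(iface.getClassLoader(), new Class<?>[] {iface}, handler);
    return iface.cast(instance);
  }

  public static void main(String[] args) {
    MiniBigQueryOptions options = create(MiniBigQueryOptions.class);
    options.setTempDatasetId("my_temp_dataset");
    options.setUseStorageWriteApi(true);
    System.out.println(options.getTempDatasetId());      // prints my_temp_dataset
    System.out.println(options.getUseStorageWriteApi()); // prints true
  }
}
```

In a real pipeline, the `as` method inherited from PipelineOptions is the usual way to view an existing options object as BigQueryOptions.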
-
Nested Class Summary
Nested classes/interfaces inherited from interface org.apache.beam.sdk.extensions.gcp.options.GcpOptions
GcpOptions.DefaultProjectFactory, GcpOptions.EnableStreamingEngineFactory, GcpOptions.GcpOAuthScopesFactory, GcpOptions.GcpTempLocationFactory, GcpOptions.GcpUserCredentialsFactory
Nested classes/interfaces inherited from interface org.apache.beam.sdk.extensions.gcp.options.GoogleApiDebugOptions
GoogleApiDebugOptions.GoogleApiTracer
Nested classes/interfaces inherited from interface org.apache.beam.sdk.options.PipelineOptions
PipelineOptions.AtomicLongFactory, PipelineOptions.CheckEnabled, PipelineOptions.DirectRunner, PipelineOptions.JobNameFactory, PipelineOptions.UserAgentFactory
-
Field Summary
Fields inherited from interface org.apache.beam.sdk.extensions.gcp.options.GcpOptions
STREAMING_ENGINE_EXPERIMENT, WINDMILL_SERVICE_EXPERIMENT
-
Method Summary
Modifier and Type | Method | Description
String | getBigQueryEndpoint() | BQ endpoint to use.
Boolean | getGroupFilesFileLoad() | Choose to use a GBK when gathering a list of files in batch FILE_LOAD.
void | setBigQueryEndpoint(String value) |
void | setBigQueryProject(String value) |
void | setEnableStorageReadApiV2(Boolean value) |
void | setGroupFilesFileLoad(Boolean value) |
void | setHTTPReadTimeout(Integer timeout) |
void | setHTTPWriteTimeout(Integer timeout) |
void | setInsertBundleParallelism(Integer parallelism) |
void | setJobLabelsMap(Map<String, String> value) |
void | setMaxStreamingBatchSize(Long value) |
void | setMaxStreamingRowsToBatch(Long value) |
void | setNumStreamingKeys(Integer value) |
void | setTempDatasetId(String value) |
void | setUseStorageWriteApi(Boolean value) |
Methods inherited from interface org.apache.beam.sdk.options.ApplicationNameOptions
getAppName, setAppName
Methods inherited from interface org.apache.beam.sdk.extensions.gcp.options.GcpOptions
getCredentialFactoryClass, getDataflowKmsKey, getGcpCredential, getGcpOauthScopes, getGcpTempLocation, getImpersonateServiceAccount, getProject, getWorkerRegion, getWorkerZone, getZone, isEnableStreamingEngine, setCredentialFactoryClass, setDataflowKmsKey, setEnableStreamingEngine, setGcpCredential, setGcpOauthScopes, setGcpTempLocation, setImpersonateServiceAccount, setProject, setWorkerRegion, setWorkerZone, setZone
Methods inherited from interface org.apache.beam.sdk.extensions.gcp.options.GoogleApiDebugOptions
getGoogleApiTrace, setGoogleApiTrace
Methods inherited from interface org.apache.beam.sdk.transforms.display.HasDisplayData
populateDisplayData
Methods inherited from interface org.apache.beam.sdk.options.PipelineOptions
as, getJobName, getOptionsId, getRunner, getStableUniqueNames, getTempLocation, getUserAgent, outputRuntimeOptions, revision, setJobName, setOptionsId, setRunner, setStableUniqueNames, setTempLocation, setUserAgent
Methods inherited from interface org.apache.beam.sdk.options.StreamingOptions
getUpdateCompatibilityVersion, isStreaming, setStreaming, setUpdateCompatibilityVersion
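Each getter/setter pair above corresponds to one named pipeline option that can be passed as a `--name=value` command-line flag. The sketch below derives a flag name from an accessor name using the standard JavaBeans decapitalization rule. That Beam follows exactly this rule is an assumption here, and `OptionFlagNames` and `flagFor` are hypothetical helpers, not part of the SDK.

```java
import java.beans.Introspector;
import java.util.List;

final class OptionFlagNames {
  // Turns an accessor name such as "setTempDatasetId" into "--tempDatasetId".
  // Assumption: the option name follows JavaBeans property naming.
  static String flagFor(String accessorName) {
    boolean isAccessor = accessorName.startsWith("get") || accessorName.startsWith("set");
    if (!isAccessor || accessorName.length() <= 3) {
      throw new IllegalArgumentException("Not a getter or setter: " + accessorName);
    }
    // decapitalize lowercases the first letter unless the first two letters are
    // both uppercase, so "HTTPReadTimeout" is left unchanged.
    return "--" + Introspector.decapitalize(accessorName.substring(3));
  }

  public static void main(String[] args) {
    List<String> accessors =
        List.of("setTempDatasetId", "setUseStorageWriteApi", "setHTTPReadTimeout");
    for (String name : accessors) {
      System.out.println(name + " -> " + flagFor(name));
    }
  }
}
```

Under that assumption, `setHTTPReadTimeout` maps to `--HTTPReadTimeout` rather than `--hTTPReadTimeout`, which is worth checking against the SDK version in use before relying on it.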
-
Method Details
-
getTempDatasetId
-
setTempDatasetId
-
getHTTPReadTimeout
-
setHTTPReadTimeout
-
getHTTPWriteTimeout
-
setHTTPWriteTimeout
-
getInsertBundleParallelism
-
setInsertBundleParallelism
-
getNumStreamingKeys
-
setNumStreamingKeys
-
getMaxStreamingRowsToBatch
-
setMaxStreamingRowsToBatch
-
getMaxStreamingBatchSize
-
setMaxStreamingBatchSize
-
getBqStreamingApiLoggingFrequencySec
-
setBqStreamingApiLoggingFrequencySec
-
getUseStorageWriteApi
-
setUseStorageWriteApi
-
getUseStorageWriteApiAtLeastOnce
-
setUseStorageWriteApiAtLeastOnce
-
getNumStorageWriteApiStreams
-
setNumStorageWriteApiStreams
-
getNumStorageWriteApiStreamAppendClients
-
setNumStorageWriteApiStreamAppendClients
-
getMinConnectionPoolConnections
-
setMinConnectionPoolConnections
-
getMaxConnectionPoolConnections
-
setMaxConnectionPoolConnections
-
getStorageWriteMaxInflightRequests
-
setStorageWriteMaxInflightRequests
-
getStorageWriteMaxInflightBytes
-
setStorageWriteMaxInflightBytes
-
getUseStorageApiConnectionPool
-
setUseStorageApiConnectionPool
-
getStorageWriteApiTriggeringFrequencySec
Integer getStorageWriteApiTriggeringFrequencySec()
-
setStorageWriteApiTriggeringFrequencySec
-
getStorageWriteApiMaxRetries
-
setStorageWriteApiMaxRetries
-
getMaxBufferingDurationMilliSec
-
setMaxBufferingDurationMilliSec
-
getBigQueryProject
String getBigQueryProject()
-
setBigQueryProject
-
getStorageApiAppendThresholdBytes
-
setStorageApiAppendThresholdBytes
-
getStorageApiAppendThresholdRecordCount
-
setStorageApiAppendThresholdRecordCount
-
getStorageWriteApiMaxRequestSize
-
setStorageWriteApiMaxRequestSize
-
getEnableStorageReadApiV2
-
setEnableStorageReadApiV2
-
getJobLabelsMap
-
setJobLabelsMap
-
getBigQueryEndpoint
BQ endpoint to use. If unspecified, uses the default endpoint.
-
setBigQueryEndpoint
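The "unspecified means default" rule for the BigQuery endpoint can be pictured with a small fallback helper. This is only a sketch of the rule, not how Beam or the BigQuery client resolves the endpoint internally. `EndpointFallbackSketch`, `resolveEndpoint`, and the default URL constant are assumptions made for illustration; the source says only "the default endpoint".

```java
import java.util.Optional;

final class EndpointFallbackSketch {
  // Assumed default for illustration; the documentation does not state the URL.
  static final String ASSUMED_DEFAULT_ENDPOINT = "https://bigquery.googleapis.com";

  // Returns the configured endpoint, or the assumed default when none was given.
  static String resolveEndpoint(String configured) {
    return Optional.ofNullable(configured)
        .map(String::trim)
        .filter(s -> !s.isEmpty())
        .orElse(ASSUMED_DEFAULT_ENDPOINT);
  }

  public static void main(String[] args) {
    System.out.println(resolveEndpoint(null));
    // Hypothetical custom endpoint, for example a private or test deployment.
    System.out.println(resolveEndpoint("https://bigquery.example.test"));
  }
}
```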
-
getGroupFilesFileLoad
Choose to use a GBK (GroupByKey) when gathering a list of files in batch FILE_LOAD. The purpose of this option is to accommodate runner compatibility: for example, some runners have known issues with large side inputs, and turning on this option avoids triggering those side-input issues.
This is an experimental pipeline option; no backward compatibility is guaranteed.
-
setGroupFilesFileLoad
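As a plain-Java analogy for what "use a GBK when gathering a list of files" means, the sketch below groups file paths under a key, which is the shape of result a GroupByKey produces. It does not use Beam and does not reproduce the FILE_LOAD transform. The choice of the destination table as the key, the class name `GroupFilesSketch`, and the sample paths are all assumptions, since the source does not say what key the real transform groups on.

```java
import java.util.List;
import java.util.Map;
import java.util.TreeMap;
import java.util.stream.Collectors;

final class GroupFilesSketch {
  // Each entry is (destination, file path). Collects the paths under their
  // destination key, analogous to applying a GroupByKey to keyed elements.
  static Map<String, List<String>> groupByDestination(List<Map.Entry<String, String>> files) {
    return files.stream()
        .collect(Collectors.groupingBy(
            Map.Entry::getKey,
            TreeMap::new, // sorted keys, so printed output is stable
            Collectors.mapping(Map.Entry::getValue, Collectors.toList())));
  }

  public static void main(String[] args) {
    // Hypothetical destinations and temp file paths.
    List<Map.Entry<String, String>> files = List.of(
        Map.entry("dataset.table_a", "gs://my-bucket/tmp/file-1"),
        Map.entry("dataset.table_b", "gs://my-bucket/tmp/file-2"),
        Map.entry("dataset.table_a", "gs://my-bucket/tmp/file-3"));
    System.out.println(groupByDestination(files));
  }
}
```

The contrast with a side input is that a side input hands the whole collection to every consumer at once, whereas a grouping step delivers each key's list as an ordinary element, which is why it can sidestep runner problems with large side inputs.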
-