public interface DataflowPipelineOptions extends PipelineOptions, GcpOptions, ApplicationNameOptions, DataflowPipelineDebugOptions, DataflowPipelineWorkerPoolOptions, BigQueryOptions, GcsOptions, StreamingOptions, CloudDebuggerOptions, DataflowWorkerLoggingOptions, DataflowProfilingOptions, PubsubOptions
Options that can be used to configure the DataflowRunner.
Modifier and Type | Interface and Description |
---|---|
static class | DataflowPipelineOptions.StagingLocationFactory Returns a default staging location under GcpOptions.getGcpTempLocation(). |
DataflowPipelineDebugOptions.DataflowClientFactory, DataflowPipelineDebugOptions.StagerFactory
DataflowPipelineWorkerPoolOptions.AutoscalingAlgorithmType, DataflowPipelineWorkerPoolOptions.WorkerHarnessContainerImageFactory
GcsOptions.ExecutorServiceFactory, GcsOptions.PathValidatorFactory
DataflowWorkerLoggingOptions.Level, DataflowWorkerLoggingOptions.WorkerLogLevelOverrides
DataflowProfilingOptions.DataflowProfilingAgentConfiguration
GcpOptions.DefaultProjectFactory, GcpOptions.GcpTempLocationFactory, GcpOptions.GcpUserCredentialsFactory
GoogleApiDebugOptions.GoogleApiTracer
Modifier and Type | Method and Description |
---|---|
java.lang.String | getProject() Project id to use when launching jobs. |
java.lang.String | getRegion() The Google Compute Engine region for creating Dataflow jobs. |
java.lang.String | getServiceAccount() Run the job as a specific service account, instead of the default GCE robot. |
java.lang.String | getStagingLocation() GCS path for staging local files, e.g. |
java.lang.String | getTemplateLocation() Where the runner should generate a template file. |
boolean | isUpdate() Whether to update the currently running pipeline with the same name as this one. |
void | setProject(java.lang.String value) |
void | setRegion(java.lang.String region) |
void | setServiceAccount(java.lang.String value) |
void | setStagingLocation(java.lang.String value) |
void | setTemplateLocation(java.lang.String value) |
void | setUpdate(boolean value) |
getApiRootUrl, getDataflowClient, getDataflowEndpoint, getDataflowJobFile, getDumpHeapOnOOM, getExperiments, getNumberOfWorkerHarnessThreads, getOverrideWindmillBinary, getStager, getStagerClass, getTransformNameMapping, getWindmillServiceEndpoint, getWindmillServicePort, setApiRootUrl, setDataflowClient, setDataflowEndpoint, setDataflowJobFile, setDumpHeapOnOOM, setExperiments, setNumberOfWorkerHarnessThreads, setOverrideWindmillBinary, setStager, setStagerClass, setTransformNameMapping, setWindmillServiceEndpoint, setWindmillServicePort
getAutoscalingAlgorithm, getDiskSizeGb, getFilesToStage, getMaxNumWorkers, getNetwork, getNumWorkers, getSubnetwork, getUsePublicIps, getWorkerDiskType, getWorkerHarnessContainerImage, getWorkerMachineType, getZone, setAutoscalingAlgorithm, setDiskSizeGb, setFilesToStage, setMaxNumWorkers, setNetwork, setNumWorkers, setSubnetwork, setUsePublicIps, setWorkerDiskType, setWorkerHarnessContainerImage, setWorkerMachineType, setZone
getTempDatasetId, setTempDatasetId
getExecutorService, getGcsEndpoint, getGcsUploadBufferSizeBytes, getGcsUtil, getPathValidator, getPathValidatorClass, setExecutorService, setGcsEndpoint, setGcsUploadBufferSizeBytes, setGcsUtil, setPathValidator, setPathValidatorClass
getDebuggee, getEnableCloudDebugger, getMaxConditionCost, setDebuggee, setEnableCloudDebugger, setMaxConditionCost
getDefaultWorkerLogLevel, getWorkerLogLevelOverrides, getWorkerSystemErrMessageLevel, getWorkerSystemOutMessageLevel, setDefaultWorkerLogLevel, setWorkerLogLevelOverrides, setWorkerSystemErrMessageLevel, setWorkerSystemOutMessageLevel
getProfilingAgentConfiguration, getSaveProfilesToGcs, setProfilingAgentConfiguration, setSaveProfilesToGcs
getPubsubRootUrl, setPubsubRootUrl
getCredentialFactoryClass, getGcpCredential, getGcpTempLocation, getZone, setCredentialFactoryClass, setGcpCredential, setGcpTempLocation, setZone
getGoogleApiTrace, setGoogleApiTrace
@Validation.Required @Default.InstanceFactory(value=GcpOptions.DefaultProjectFactory.class) java.lang.String getProject()
Project id to use when launching jobs.
Specified by: getProject in interface GcpOptions
void setProject(java.lang.String value)
Specified by: setProject in interface GcpOptions
@Default.InstanceFactory(value=DataflowPipelineOptions.StagingLocationFactory.class) java.lang.String getStagingLocation()
Must be a valid Cloud Storage URL, beginning with the prefix "gs://".
If getStagingLocation() is not set, it defaults to a location under GcpOptions.getGcpTempLocation(), which must itself be a valid GCS path.
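The fallback described above can be sketched in plain Java. This is a minimal illustration, not the SDK's StagingLocationFactory: the class `StagingLocationSketch`, the method `resolveStagingLocation`, the bucket names, and the `staging` suffix appended to the temp location are all assumptions made for the example.

```java
/** Hypothetical helper for illustration; not part of the Dataflow SDK. */
final class StagingLocationSketch {
    private static final String GCS_PREFIX = "gs://";

    /**
     * Returns the explicit staging location if one is set; otherwise derives
     * one under the GCP temp location. Both inputs must be gs:// paths.
     */
    static String resolveStagingLocation(String stagingLocation, String gcpTempLocation) {
        if (stagingLocation != null && !stagingLocation.isEmpty()) {
            requireGcsPath(stagingLocation, "stagingLocation");
            return stagingLocation;
        }
        if (gcpTempLocation == null || gcpTempLocation.isEmpty()) {
            throw new IllegalArgumentException(
                "Neither stagingLocation nor gcpTempLocation is set");
        }
        requireGcsPath(gcpTempLocation, "gcpTempLocation");
        // Assumption: the "staging" suffix is illustrative; the real factory
        // may choose a different sub-path under the temp location.
        String base = gcpTempLocation.endsWith("/") ? gcpTempLocation : gcpTempLocation + "/";
        return base + "staging";
    }

    /** Rejects any path that does not begin with the gs:// prefix. */
    private static void requireGcsPath(String path, String optionName) {
        if (!path.startsWith(GCS_PREFIX)) {
            throw new IllegalArgumentException(
                optionName + " must start with " + GCS_PREFIX + ", got: " + path);
        }
    }

    public static void main(String[] args) {
        // Placeholder bucket names, used only to exercise both branches.
        System.out.println(resolveStagingLocation(null, "gs://my-bucket/tmp"));
        System.out.println(resolveStagingLocation("gs://my-bucket/stage", "gs://my-bucket/tmp"));
    }
}
```

An explicit value always wins, and the sketch validates whichever path it ends up using, so a local path such as `/tmp/staging` is rejected in either position.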
void setStagingLocation(java.lang.String value)
boolean isUpdate()
void setUpdate(boolean value)
java.lang.String getTemplateLocation()
void setTemplateLocation(java.lang.String value)
@Hidden @Experimental java.lang.String getServiceAccount()
void setServiceAccount(java.lang.String value)
@Hidden @Experimental @Default.String(value="us-central1") java.lang.String getRegion()
NOTE: The Cloud Dataflow service does not yet honor this setting. Once service support is added, users of this SDK will be able to control the region.
void setRegion(java.lang.String region)
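As a rough illustration of how the properties above are commonly supplied, the sketch below builds command-line flags from option names. It assumes the usual convention in which each getter/setter pair maps to a `--name=value` flag (for example `getStagingLocation()` to `--stagingLocation`); the class `DataflowArgsSketch`, its methods, and all values are hypothetical placeholders, and nothing here calls the SDK itself.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

/** Hypothetical helper for illustration; not part of the Dataflow SDK. */
final class DataflowArgsSketch {

    /** Turns option names and values into "--name=value" command-line flags. */
    static String[] toArgs(Map<String, String> options) {
        List<String> args = new ArrayList<>();
        for (Map.Entry<String, String> option : options.entrySet()) {
            args.add("--" + option.getKey() + "=" + option.getValue());
        }
        return args.toArray(new String[0]);
    }

    /** Example values only; the project id and bucket are placeholders. */
    static Map<String, String> exampleOptions() {
        // LinkedHashMap keeps insertion order, so the flags print predictably.
        Map<String, String> options = new LinkedHashMap<>();
        options.put("project", "my-project-id");
        options.put("stagingLocation", "gs://my-bucket/staging");
        // "us-central1" is the documented default for getRegion().
        options.put("region", "us-central1");
        options.put("update", "false");
        return options;
    }

    public static void main(String[] args) {
        for (String flag : toArgs(exampleOptions())) {
            System.out.println(flag);
        }
    }
}
```

In a real pipeline, an array like this would typically be handed to the SDK's options factory and viewed as DataflowPipelineOptions; that step is omitted here so the sketch stays free of SDK dependencies.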