public interface DataflowPipelineWorkerPoolOptions extends PipelineOptions
Modifier and Type | Interface and Description |
---|---|
static class | DataflowPipelineWorkerPoolOptions.AutoscalingAlgorithmType: Type of autoscaling algorithm to use. |
static class | DataflowPipelineWorkerPoolOptions.WorkerHarnessContainerImageFactory: Returns the default Docker container image that executes Dataflow worker harness, residing in Google Container Registry. |
PipelineOptions.AtomicLongFactory, PipelineOptions.CheckEnabled, PipelineOptions.DirectRunner, PipelineOptions.JobNameFactory, PipelineOptions.NoOpMetricsSink, PipelineOptions.UserAgentFactory
Modifier and Type | Method and Description |
---|---|
DataflowPipelineWorkerPoolOptions.AutoscalingAlgorithmType | getAutoscalingAlgorithm(): [Experimental] The autoscaling algorithm to use for the workerpool. |
int | getDiskSizeGb(): Remote worker disk size, in gigabytes, or 0 to use the default size. |
java.util.List<java.lang.String> | getFilesToStage(): List of local files to make available to workers. |
int | getMaxNumWorkers(): The maximum number of workers to use for the workerpool. |
java.lang.String | getMinCpuPlatform(): Specifies a Minimum CPU platform for VM instances. |
java.lang.String | getNetwork(): GCE network for launching workers. |
int | getNumWorkers(): Number of workers to use when executing the Dataflow job. |
java.lang.String | getSubnetwork(): GCE subnetwork for launching workers. |
java.lang.Boolean | getUsePublicIps(): Specifies whether worker pools should be started with public IP addresses. |
java.lang.String | getWorkerDiskType(): Specifies what type of persistent disk is used. |
java.lang.String | getWorkerHarnessContainerImage(): Docker container image that executes Dataflow worker harness, residing in Google Container Registry. |
java.lang.String | getWorkerMachineType(): Machine type to create Dataflow worker VMs as. |
java.lang.String | getZone(): GCE availability zone for launching workers. |
void | setAutoscalingAlgorithm(DataflowPipelineWorkerPoolOptions.AutoscalingAlgorithmType value) |
void | setDiskSizeGb(int value) |
void | setFilesToStage(java.util.List<java.lang.String> value) |
void | setMaxNumWorkers(int value) |
void | setMinCpuPlatform(java.lang.String minCpuPlatform) |
void | setNetwork(java.lang.String value) |
void | setNumWorkers(int value) |
void | setSubnetwork(java.lang.String value) |
void | setUsePublicIps(java.lang.Boolean value) |
void | setWorkerDiskType(java.lang.String value) |
void | setWorkerHarnessContainerImage(java.lang.String value) |
void | setWorkerMachineType(java.lang.String value) |
void | setZone(java.lang.String value) |
as, getJobName, getMetricsGraphiteHost, getMetricsGraphitePort, getMetricsHttpSinkUrl, getMetricsPushPeriod, getMetricsSink, getOptionsId, getRunner, getStableUniqueNames, getTempLocation, getUserAgent, outputRuntimeOptions, setJobName, setMetricsGraphiteHost, setMetricsGraphitePort, setMetricsHttpSinkUrl, setMetricsPushPeriod, setMetricsSink, setOptionsId, setRunner, setStableUniqueNames, setTempLocation, setUserAgent
populateDisplayData
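int getNumWorkers()
Number of workers to use when executing the Dataflow job. Note that selecting an autoscaling algorithm other than NONE will affect the size of the worker pool. If left unspecified, the Dataflow service will determine the number of workers.
void setNumWorkers(int value)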
@Experimental(value=AUTOSCALING) DataflowPipelineWorkerPoolOptions.AutoscalingAlgorithmType getAutoscalingAlgorithm()
void setAutoscalingAlgorithm(DataflowPipelineWorkerPoolOptions.AutoscalingAlgorithmType value)
int getMaxNumWorkers()
void setMaxNumWorkers(int value)
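The worker-count and autoscaling properties above are typically supplied as command-line flags named after the getters (for example, getNumWorkers() corresponds to --numWorkers). The following is a minimal sketch of assembling such flags. WorkerPoolArgs and its build method are hypothetical helpers, not part of Beam, and treating 0 or null as "leave unset" is an assumption of this sketch.

```java
import java.util.ArrayList;
import java.util.List;

/** Hypothetical helper, not part of Beam: assembles worker-pool flags as arguments. */
final class WorkerPoolArgs {

  /**
   * Builds "--name=value" arguments for the worker-pool properties.
   * A count of 0 or a null algorithm is skipped so the service default applies
   * (an assumption of this sketch).
   */
  static List<String> build(int numWorkers, int maxNumWorkers, String autoscalingAlgorithm) {
    List<String> args = new ArrayList<>();
    if (numWorkers > 0) {
      args.add("--numWorkers=" + numWorkers);
    }
    if (maxNumWorkers > 0) {
      args.add("--maxNumWorkers=" + maxNumWorkers);
    }
    if (autoscalingAlgorithm != null) {
      args.add("--autoscalingAlgorithm=" + autoscalingAlgorithm);
    }
    return args;
  }

  public static void main(String[] unused) {
    // A fixed pool of three workers with autoscaling turned off.
    System.out.println(build(3, 0, "NONE"));
  }
}
```

The resulting list can be converted to a String[] and passed to PipelineOptionsFactory.fromArgs(...), followed by as(DataflowPipelineWorkerPoolOptions.class), to obtain a populated options object.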
int getDiskSizeGb()
void setDiskSizeGb(int value)
@Default.InstanceFactory(value=DataflowPipelineWorkerPoolOptions.WorkerHarnessContainerImageFactory.class) @Hidden java.lang.String getWorkerHarnessContainerImage()
void setWorkerHarnessContainerImage(java.lang.String value)
java.lang.String getNetwork()
Default is up to the Dataflow service.
void setNetwork(java.lang.String value)
java.lang.String getSubnetwork()
Default is up to the Dataflow service. Expected format is regions/REGION/subnetworks/SUBNETWORK or the fully qualified subnetwork name, beginning with https://..., e.g. https://www.googleapis.com/compute/alpha/projects/PROJECT/regions/REGION/subnetworks/SUBNETWORK
void setSubnetwork(java.lang.String value)
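The two subnetwork formats described for getSubnetwork() can be checked before a job is submitted. The sketch below is one way to do that. SubnetworkFormat is a hypothetical helper, not part of Beam, and its patterns only approximate the documented shapes. The Dataflow service remains the authority on what it accepts.

```java
import java.util.regex.Pattern;

/** Hypothetical helper, not part of Beam: checks the two documented subnetwork shapes. */
final class SubnetworkFormat {
  // Short form: regions/REGION/subnetworks/SUBNETWORK
  private static final Pattern SHORT_FORM =
      Pattern.compile("regions/[^/]+/subnetworks/[^/]+");

  // Fully qualified form: an https:// URL that ends with the short form.
  private static final Pattern FULL_FORM =
      Pattern.compile("https://.+/regions/[^/]+/subnetworks/[^/]+");

  /** Returns true if the value matches either shape; null is rejected. */
  static boolean looksValid(String subnetwork) {
    return subnetwork != null
        && (SHORT_FORM.matcher(subnetwork).matches()
            || FULL_FORM.matcher(subnetwork).matches());
  }

  public static void main(String[] args) {
    System.out.println(looksValid("regions/REGION/subnetworks/SUBNETWORK"));
    System.out.println(looksValid("REGION/SUBNETWORK"));
  }
}
```

A check like this catches obvious mistakes, such as passing a bare subnetwork name, before the value reaches setSubnetwork.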
java.lang.String getZone()
Default is up to the Dataflow service.
void setZone(java.lang.String value)
java.lang.String getWorkerMachineType()
See GCE machine types for a list of valid options.
If unset, the Dataflow service will choose a reasonable default.
void setWorkerMachineType(java.lang.String value)
java.util.List<java.lang.String> getFilesToStage()
Files are placed on the worker's classpath.
The default value is the list of jars from the main program's classpath.
void setFilesToStage(java.util.List<java.lang.String> value)
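To see roughly what "the list of jars from the main program's classpath" might contain, the classpath of the running JVM can be inspected directly. The sketch below only illustrates the idea and is not how Beam computes the default. ClasspathJars and jarsOn are hypothetical names, and filtering on the .jar suffix is an assumption of this sketch.

```java
import java.io.File;
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Pattern;

/** Hypothetical helper, not Beam's implementation: lists jar entries on a classpath. */
final class ClasspathJars {

  /** Splits the classpath on the literal separator and keeps entries ending in ".jar". */
  static List<String> jarsOn(String classpath, String separator) {
    List<String> jars = new ArrayList<>();
    for (String entry : classpath.split(Pattern.quote(separator))) {
      if (entry.endsWith(".jar")) {
        jars.add(entry);
      }
    }
    return jars;
  }

  public static void main(String[] args) {
    // Inspect the classpath of the running JVM.
    String classpath = System.getProperty("java.class.path", "");
    System.out.println(jarsOn(classpath, File.pathSeparator));
  }
}
```

A list built this way could be passed to setFilesToStage to override the default explicitly.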
java.lang.String getWorkerDiskType()
void setWorkerDiskType(java.lang.String value)
@Experimental @Nullable java.lang.Boolean getUsePublicIps()
WARNING: This feature is experimental. You must be whitelisted to use it.
void setUsePublicIps(@Nullable java.lang.Boolean value)
@Nullable java.lang.String getMinCpuPlatform()
For more details, see Specifying Pipeline Execution Parameters.
void setMinCpuPlatform(java.lang.String minCpuPlatform)