@Hidden public interface DataflowPipelineDebugOptions extends ExperimentalOptions, PipelineOptions
Modifier and Type | Interface and Description |
---|---|
static class |
DataflowPipelineDebugOptions.DataflowClientFactory
Returns the default Dataflow client built from the passed in PipelineOptions.
|
static class |
DataflowPipelineDebugOptions.StagerFactory
Creates a
Stager object using the class specified in getStagerClass() . |
PipelineOptions.AtomicLongFactory, PipelineOptions.CheckEnabled, PipelineOptions.DirectRunner, PipelineOptions.JobNameFactory, PipelineOptions.UserAgentFactory
STATE_CACHE_SIZE
Modifier and Type | Method and Description |
---|---|
java.lang.String |
getApiRootUrl()
The root URL for the Dataflow API.
|
Dataflow |
getDataflowClient()
An instance of the Dataflow client.
|
java.lang.String |
getDataflowEndpoint()
Dataflow endpoint to use.
|
java.lang.String |
getDataflowJobFile()
The path to write the translated Dataflow job specification out to at job submission time.
|
boolean |
getDumpHeapOnOOM()
If true, save a heap dump before killing a thread or process which is GC thrashing
or out of memory.
|
java.lang.Double |
getGCThrashingPercentagePerPeriod()
The GC thrashing threshold percentage.
|
int |
getNumberOfWorkerHarnessThreads()
Number of threads to use on the Dataflow worker harness.
|
java.lang.String |
getOverrideWindmillBinary()
Custom windmill_main binary to use with the streaming runner.
|
java.lang.String |
getSaveHeapDumpsToGcsPath()
CAUTION: This option implies dumpHeapOnOOM, and has similar caveats.
|
Stager |
getStager()
The resource stager instance that should be used to stage resources.
|
java.lang.Class<? extends Stager> |
getStagerClass()
The class responsible for staging resources to be accessible by workers during job execution.
|
java.util.Map<java.lang.String,java.lang.String> |
getTransformNameMapping()
Mapping of old PTranform names to new ones, specified as JSON
{"oldName":"newName",...}
. |
java.lang.String |
getWindmillServiceEndpoint()
Custom windmill service endpoint.
|
int |
getWindmillServicePort() |
java.lang.Integer |
getWorkerCacheMb()
The size of the worker's in-memory cache, in megabytes.
|
void |
setApiRootUrl(java.lang.String value) |
void |
setDataflowClient(Dataflow value) |
void |
setDataflowEndpoint(java.lang.String value) |
void |
setDataflowJobFile(java.lang.String value) |
void |
setDumpHeapOnOOM(boolean dumpHeapBeforeExit) |
void |
setGCThrashingPercentagePerPeriod(java.lang.Double value) |
void |
setNumberOfWorkerHarnessThreads(int value) |
void |
setOverrideWindmillBinary(java.lang.String value) |
void |
setSaveHeapDumpsToGcsPath(java.lang.String gcsPath) |
void |
setStager(Stager stager) |
void |
setStagerClass(java.lang.Class<? extends Stager> stagerClass) |
void |
setTransformNameMapping(java.util.Map<java.lang.String,java.lang.String> value) |
void |
setWindmillServiceEndpoint(java.lang.String value) |
void |
setWindmillServicePort(int value) |
void |
setWorkerCacheMb(java.lang.Integer value) |
addExperiment, getExperiments, getExperimentValue, hasExperiment, setExperiments
as, getJobName, getOptionsId, getRunner, getStableUniqueNames, getTempLocation, getUserAgent, outputRuntimeOptions, setJobName, setOptionsId, setRunner, setStableUniqueNames, setTempLocation, setUserAgent
populateDisplayData
@Default.String(value="https://dataflow.googleapis.com/") java.lang.String getApiRootUrl()
dataflowEndpoint
can override this value if it
contains an absolute URL, otherwise apiRootUrl
will be combined with dataflowEndpoint
to generate the full URL to communicate with the Dataflow API.void setApiRootUrl(java.lang.String value)
@Default.String(value="") java.lang.String getDataflowEndpoint()
Defaults to the current version of the Google Cloud Dataflow API, at the time the current SDK version was released.
If the string contains "://", then this is treated as a URL, otherwise getApiRootUrl()
is used as the root URL.
void setDataflowEndpoint(java.lang.String value)
java.lang.String getDataflowJobFile()
void setDataflowJobFile(java.lang.String value)
@Default.Class(value=GcsStager.class) java.lang.Class<? extends Stager> getStagerClass()
void setStagerClass(java.lang.Class<? extends Stager> stagerClass)
@Default.InstanceFactory(value=DataflowPipelineDebugOptions.StagerFactory.class) Stager getStager()
void setStager(Stager stager)
@Default.InstanceFactory(value=DataflowPipelineDebugOptions.DataflowClientFactory.class) Dataflow getDataflowClient()
void setDataflowClient(Dataflow value)
java.util.Map<java.lang.String,java.lang.String> getTransformNameMapping()
{"oldName":"newName",...}
. To mark a transform as deleted, make newName the empty string.void setTransformNameMapping(java.util.Map<java.lang.String,java.lang.String> value)
java.lang.String getOverrideWindmillBinary()
void setOverrideWindmillBinary(java.lang.String value)
java.lang.String getWindmillServiceEndpoint()
void setWindmillServiceEndpoint(java.lang.String value)
@Default.Integer(value=443) int getWindmillServicePort()
void setWindmillServicePort(int value)
int getNumberOfWorkerHarnessThreads()
void setNumberOfWorkerHarnessThreads(int value)
boolean getDumpHeapOnOOM()
CAUTION: Heap dumps can of comparable size to the default boot disk. Consider increasing the boot disk size before setting this flag to true.
void setDumpHeapOnOOM(boolean dumpHeapBeforeExit)
@Default.Double(value=50.0) java.lang.Double getGCThrashingPercentagePerPeriod()
If 100 is given as the value, MemoryMonitor will be disabled.
void setGCThrashingPercentagePerPeriod(java.lang.Double value)
@Default.Integer(value=100) java.lang.Integer getWorkerCacheMb()
Currently, this cache is used for storing read values of side inputs. as well as the state for streaming jobs.
void setWorkerCacheMb(java.lang.Integer value)
@Experimental java.lang.String getSaveHeapDumpsToGcsPath()
void setSaveHeapDumpsToGcsPath(java.lang.String gcsPath)