@Hidden public interface DataflowPipelineDebugOptions extends ExperimentalOptions, MemoryMonitorOptions, PipelineOptions
Modifier and Type | Interface and Description |
---|---|
static class |
DataflowPipelineDebugOptions.DataflowClientFactory
Returns the default Dataflow client built from the passed in PipelineOptions.
|
static class |
DataflowPipelineDebugOptions.StagerFactory
Creates a
Stager object using the class specified in getStagerClass() . |
static class |
DataflowPipelineDebugOptions.UnboundedReaderMaxReadTimeFactory
Sets Integer value based on old, deprecated field (
getUnboundedReaderMaxReadTimeSec() ). |
PipelineOptions.AtomicLongFactory, PipelineOptions.CheckEnabled, PipelineOptions.DirectRunner, PipelineOptions.JobNameFactory, PipelineOptions.UserAgentFactory
STATE_CACHE_SIZE, STATE_SAMPLING_PERIOD_MILLIS
Modifier and Type | Method and Description |
---|---|
java.lang.String |
getApiRootUrl()
The root URL for the Dataflow API.
|
Dataflow |
getDataflowClient()
An instance of the Dataflow client.
|
java.lang.String |
getDataflowEndpoint()
Dataflow endpoint to use.
|
java.lang.String |
getDataflowJobFile()
The path to write the translated Dataflow job specification out to at job submission time.
|
int |
getDesiredNumUnboundedSourceSplits()
The desired number of initial splits for UnboundedSources.
|
boolean |
getDumpHeapOnOOM()
If true, save a heap dump before killing a thread or process which is GC thrashing
or out of memory.
|
int |
getJfrRecordingDurationSec() |
int |
getNumberOfWorkerHarnessThreads()
Number of threads to use on the Dataflow worker harness.
|
java.lang.Integer |
getReaderCacheTimeoutSec()
The amount of time before UnboundedReaders are considered idle and closed during streaming
execution.
|
boolean |
getRecordJfrOnGcThrashing()
If true, save a JFR profile when GC thrashing is first detected.
|
java.lang.String |
getSaveHeapDumpsToGcsPath()
CAUTION: This option implies dumpHeapOnOOM, and has similar caveats.
|
java.lang.String |
getSdkHarnessContainerImageOverrides()
Overrides for SDK harness container images.
|
Stager |
getStager()
The resource stager instance that should be used to stage resources.
|
java.lang.Class<? extends Stager> |
getStagerClass()
The class responsible for staging resources to be accessible by workers during job execution.
|
java.util.Map<java.lang.String,java.lang.String> |
getTransformNameMapping()
Mapping of old PTransform names to new ones, specified as JSON
{"oldName":"newName",...}
. |
java.lang.Integer |
getUnboundedReaderMaxElements()
The max elements read from an UnboundedReader before checkpointing.
|
java.lang.Integer |
getUnboundedReaderMaxReadTimeMs()
The max amount of time an UnboundedReader is consumed before checkpointing.
|
java.lang.Integer |
getUnboundedReaderMaxReadTimeSec()
Deprecated.
use
getUnboundedReaderMaxReadTimeMs() instead |
java.lang.Integer |
getUnboundedReaderMaxWaitForElementsMs()
The max amount of time waiting for elements when reading from UnboundedReader.
|
java.lang.Integer |
getWorkerCacheMb()
The size of the worker's in-memory cache, in megabytes.
|
void |
setApiRootUrl(java.lang.String value) |
void |
setDataflowClient(Dataflow value) |
void |
setDataflowEndpoint(java.lang.String value) |
void |
setDataflowJobFile(java.lang.String value) |
void |
setDesiredNumUnboundedSourceSplits(int value) |
void |
setDumpHeapOnOOM(boolean dumpHeapBeforeExit) |
void |
setJfrRecordingDurationSec(int value) |
void |
setNumberOfWorkerHarnessThreads(int value) |
void |
setReaderCacheTimeoutSec(java.lang.Integer value) |
void |
setRecordJfrOnGcThrashing(boolean value) |
void |
setSaveHeapDumpsToGcsPath(java.lang.String gcsPath) |
void |
setSdkHarnessContainerImageOverrides(java.lang.String value) |
void |
setStager(Stager stager) |
void |
setStagerClass(java.lang.Class<? extends Stager> stagerClass) |
void |
setTransformNameMapping(java.util.Map<java.lang.String,java.lang.String> value) |
void |
setUnboundedReaderMaxElements(java.lang.Integer value) |
void |
setUnboundedReaderMaxReadTimeMs(java.lang.Integer value) |
void |
setUnboundedReaderMaxReadTimeSec(java.lang.Integer value) |
void |
setUnboundedReaderMaxWaitForElementsMs(java.lang.Integer value) |
void |
setWorkerCacheMb(java.lang.Integer value) |
addExperiment, getExperiments, getExperimentValue, hasExperiment, setExperiments
getGCThrashingPercentagePerPeriod, setGCThrashingPercentagePerPeriod
as, getJobName, getOptionsId, getRunner, getStableUniqueNames, getTempLocation, getUserAgent, outputRuntimeOptions, revision, setJobName, setOptionsId, setRunner, setStableUniqueNames, setTempLocation, setUserAgent
populateDisplayData
@Default.String(value="https://dataflow.googleapis.com/") java.lang.String getApiRootUrl()
dataflowEndpoint
can override this value if it
contains an absolute URL, otherwise apiRootUrl
will be combined with dataflowEndpoint
to generate the full URL to communicate with the Dataflow API.void setApiRootUrl(java.lang.String value)
@Default.String(value="") java.lang.String getDataflowEndpoint()
Defaults to the current version of the Google Cloud Dataflow API, at the time the current SDK version was released.
If the string contains "://", then this is treated as a URL, otherwise getApiRootUrl()
is used as the root URL.
void setDataflowEndpoint(java.lang.String value)
java.lang.String getDataflowJobFile()
void setDataflowJobFile(java.lang.String value)
@Default.Class(value=GcsStager.class) java.lang.Class<? extends Stager> getStagerClass()
void setStagerClass(java.lang.Class<? extends Stager> stagerClass)
@Default.InstanceFactory(value=DataflowPipelineDebugOptions.StagerFactory.class) Stager getStager()
void setStager(Stager stager)
@Default.InstanceFactory(value=DataflowPipelineDebugOptions.DataflowClientFactory.class) Dataflow getDataflowClient()
void setDataflowClient(Dataflow value)
java.util.Map<java.lang.String,java.lang.String> getTransformNameMapping()
{"oldName":"newName",...}
. To mark a transform as deleted, make newName the empty string.void setTransformNameMapping(java.util.Map<java.lang.String,java.lang.String> value)
int getNumberOfWorkerHarnessThreads()
void setNumberOfWorkerHarnessThreads(int value)
boolean getDumpHeapOnOOM()
CAUTION: Heap dumps can of comparable size to the default boot disk. Consider increasing the boot disk size before setting this flag to true.
void setDumpHeapOnOOM(boolean dumpHeapBeforeExit)
boolean getRecordJfrOnGcThrashing()
Note, JFR profiles are only supported on java 9 and up.
void setRecordJfrOnGcThrashing(boolean value)
@Default.Integer(value=60) int getJfrRecordingDurationSec()
void setJfrRecordingDurationSec(int value)
@Default.Integer(value=100) java.lang.Integer getWorkerCacheMb()
Currently, this cache is used for storing read values of side inputs in batch as well as the user state for streaming jobs.
void setWorkerCacheMb(java.lang.Integer value)
@Default.Integer(value=60) java.lang.Integer getReaderCacheTimeoutSec()
void setReaderCacheTimeoutSec(java.lang.Integer value)
@Deprecated @Default.Integer(value=10) java.lang.Integer getUnboundedReaderMaxReadTimeSec()
getUnboundedReaderMaxReadTimeMs()
insteadvoid setUnboundedReaderMaxReadTimeSec(java.lang.Integer value)
@Default.InstanceFactory(value=DataflowPipelineDebugOptions.UnboundedReaderMaxReadTimeFactory.class) java.lang.Integer getUnboundedReaderMaxReadTimeMs()
void setUnboundedReaderMaxReadTimeMs(java.lang.Integer value)
@Default.Integer(value=10000) java.lang.Integer getUnboundedReaderMaxElements()
void setUnboundedReaderMaxElements(java.lang.Integer value)
@Default.Integer(value=1000) java.lang.Integer getUnboundedReaderMaxWaitForElementsMs()
void setUnboundedReaderMaxWaitForElementsMs(java.lang.Integer value)
@Default.Integer(value=0) int getDesiredNumUnboundedSourceSplits()
void setDesiredNumUnboundedSourceSplits(int value)
java.lang.String getSaveHeapDumpsToGcsPath()
void setSaveHeapDumpsToGcsPath(java.lang.String gcsPath)
java.lang.String getSdkHarnessContainerImageOverrides()
void setSdkHarnessContainerImageOverrides(java.lang.String value)