Package org.apache.beam.runners.spark
Interface SparkCommonPipelineOptions
- All Superinterfaces:
ApplicationNameOptions
,FileStagingOptions
,HasDisplayData
,PipelineOptions
,StreamingOptions
- All Known Subinterfaces:
SparkContextOptions
,SparkPipelineOptions
,SparkPortableStreamingPipelineOptions
,SparkStructuredStreamingPipelineOptions
,TestSparkPipelineOptions
public interface SparkCommonPipelineOptions
extends PipelineOptions, StreamingOptions, ApplicationNameOptions, FileStagingOptions
Spark runner
PipelineOptions
handles Spark execution-related configurations, such as the
master address, and other user-related knobs.-
Nested Class Summary
Nested ClassesModifier and TypeInterfaceDescriptionstatic class
Returns Spark's default storage level for the Dataset or RDD API based on the respective runner.static class
Returns the default checkpoint directory of /tmp/${job.name}.Nested classes/interfaces inherited from interface org.apache.beam.sdk.options.PipelineOptions
PipelineOptions.AtomicLongFactory, PipelineOptions.CheckEnabled, PipelineOptions.DirectRunner, PipelineOptions.JobNameFactory, PipelineOptions.UserAgentFactory
-
Field Summary
Fields -
Method Summary
Modifier and TypeMethodDescriptionstatic void
Classpath contains non jar files (eg.void
setCheckpointDir
(String checkpointDir) void
setEnableSparkMetricSinks
(Boolean enableSparkMetricSinks) void
setPreferGroupByKeyToHandleHugeValues
(Boolean preferGroupByKeyToHandleHugeValues) void
setSparkMaster
(String master) void
setStorageLevel
(String storageLevel) Methods inherited from interface org.apache.beam.sdk.options.ApplicationNameOptions
getAppName, setAppName
Methods inherited from interface org.apache.beam.sdk.options.FileStagingOptions
getFilesToStage, setFilesToStage
Methods inherited from interface org.apache.beam.sdk.transforms.display.HasDisplayData
populateDisplayData
Methods inherited from interface org.apache.beam.sdk.options.PipelineOptions
as, getJobName, getOptionsId, getRunner, getStableUniqueNames, getTempLocation, getUserAgent, outputRuntimeOptions, revision, setJobName, setOptionsId, setRunner, setStableUniqueNames, setTempLocation, setUserAgent
Methods inherited from interface org.apache.beam.sdk.options.StreamingOptions
getUpdateCompatibilityVersion, isStreaming, setStreaming, setUpdateCompatibilityVersion
-
Field Details
-
DEFAULT_MASTER_URL
- See Also:
-
-
Method Details
-
getSparkMaster
-
setSparkMaster
-
getCheckpointDir
-
setCheckpointDir
-
getStorageLevel
-
setStorageLevel
-
getEnableSparkMetricSinks
-
setEnableSparkMetricSinks
-
getPreferGroupByKeyToHandleHugeValues
-
setPreferGroupByKeyToHandleHugeValues
-
prepareFilesToStage
Classpath contains non jar files (eg. directories with .class files or empty directories) will cause exception in running log. Though theSparkContext
can handle this when running in local master, it's better not to include non-jars files in classpath.
-