Package org.apache.beam.runners.spark
Interface SparkCommonPipelineOptions
- All Superinterfaces:
ApplicationNameOptions,FileStagingOptions,HasDisplayData,PipelineOptions,StreamingOptions
- All Known Subinterfaces:
SparkContextOptions,SparkPipelineOptions,SparkPortableStreamingPipelineOptions,SparkStructuredStreamingPipelineOptions,TestSparkPipelineOptions
public interface SparkCommonPipelineOptions
extends PipelineOptions, StreamingOptions, ApplicationNameOptions, FileStagingOptions
Spark runner
PipelineOptions handles Spark execution-related configurations, such as the
master address, and other user-related knobs.-
Nested Class Summary
Nested ClassesModifier and TypeInterfaceDescriptionstatic classReturns Spark's default storage level for the Dataset or RDD API based on the respective runner.static classReturns the default checkpoint directory of /tmp/${job.name}.Nested classes/interfaces inherited from interface org.apache.beam.sdk.options.PipelineOptions
PipelineOptions.AtomicLongFactory, PipelineOptions.CheckEnabled, PipelineOptions.DirectRunner, PipelineOptions.JobNameFactory, PipelineOptions.UserAgentFactory -
Field Summary
Fields -
Method Summary
Modifier and TypeMethodDescriptionstatic voidClasspath contains non jar files (eg.voidsetCheckpointDir(String checkpointDir) voidsetEnableSparkMetricSinks(Boolean enableSparkMetricSinks) voidsetPreferGroupByKeyToHandleHugeValues(Boolean preferGroupByKeyToHandleHugeValues) voidsetSparkMaster(String master) voidsetStorageLevel(String storageLevel) Methods inherited from interface org.apache.beam.sdk.options.ApplicationNameOptions
getAppName, setAppNameMethods inherited from interface org.apache.beam.sdk.options.FileStagingOptions
getFilesToStage, setFilesToStageMethods inherited from interface org.apache.beam.sdk.transforms.display.HasDisplayData
populateDisplayDataMethods inherited from interface org.apache.beam.sdk.options.PipelineOptions
as, getGbek, getJobName, getOptionsId, getRunner, getStableUniqueNames, getTempLocation, getUserAgent, outputRuntimeOptions, revision, setGbek, setJobName, setOptionsId, setRunner, setStableUniqueNames, setTempLocation, setUserAgentMethods inherited from interface org.apache.beam.sdk.options.StreamingOptions
getUpdateCompatibilityVersion, isStreaming, setStreaming, setUpdateCompatibilityVersion
-
Field Details
-
DEFAULT_MASTER_URL
- See Also:
-
-
Method Details
-
getSparkMaster
-
setSparkMaster
-
getCheckpointDir
-
setCheckpointDir
-
getStorageLevel
-
setStorageLevel
-
getEnableSparkMetricSinks
-
setEnableSparkMetricSinks
-
getPreferGroupByKeyToHandleHugeValues
-
setPreferGroupByKeyToHandleHugeValues
-
prepareFilesToStage
Classpath contains non jar files (eg. directories with .class files or empty directories) will cause exception in running log. Though theSparkContextcan handle this when running in local master, it's better not to include non-jars files in classpath.
-