Interface SparkCommonPipelineOptions

All Superinterfaces:
ApplicationNameOptions, FileStagingOptions, HasDisplayData, PipelineOptions, StreamingOptions
All Known Subinterfaces:
SparkContextOptions, SparkPipelineOptions, SparkPortableStreamingPipelineOptions, SparkStructuredStreamingPipelineOptions, TestSparkPipelineOptions

public interface SparkCommonPipelineOptions extends PipelineOptions, StreamingOptions, ApplicationNameOptions, FileStagingOptions
Spark runner PipelineOptions that handle Spark execution-related configuration, such as the master address, along with other user-facing knobs.
  • Field Details

  • Method Details

    • getSparkMaster

      @Default.String("local[4]") String getSparkMaster()
    • setSparkMaster

      void setSparkMaster(String master)
    • getCheckpointDir

    • setCheckpointDir

      void setCheckpointDir(String checkpointDir)
    • getStorageLevel

    • setStorageLevel

      void setStorageLevel(String storageLevel)
    • getEnableSparkMetricSinks

      @Default.Boolean(true) Boolean getEnableSparkMetricSinks()
    • setEnableSparkMetricSinks

      void setEnableSparkMetricSinks(Boolean enableSparkMetricSinks)
    • getPreferGroupByKeyToHandleHugeValues

      @Default.Boolean(false) Boolean getPreferGroupByKeyToHandleHugeValues()
    • setPreferGroupByKeyToHandleHugeValues

      void setPreferGroupByKeyToHandleHugeValues(Boolean preferGroupByKeyToHandleHugeValues)
    • prepareFilesToStage

      @Internal static void prepareFilesToStage(SparkCommonPipelineOptions options)
      A classpath that contains non-JAR entries (e.g. directories with .class files, or empty directories) causes exceptions in the run log. Although the SparkContext can handle such entries when running with a local master, it is better not to include non-JAR files in the classpath.
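
To make the note on prepareFilesToStage concrete, the following is a minimal sketch of how classpath entries could be separated into JAR and non-JAR entries. It is not the implementation of prepareFilesToStage and says nothing about what that method does with non-JAR entries. The class name ClasspathJarFilter and the method jarsOnly are hypothetical, and the check is by file name only, without touching the file system.

```java
import java.io.File;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.Locale;

// Hypothetical helper, for illustration only; not part of Beam or Spark.
public class ClasspathJarFilter {

    /** Returns the entries whose name ends in ".jar" (case-insensitive), in their original order. */
    public static List<String> jarsOnly(List<String> entries) {
        List<String> jars = new ArrayList<>();
        for (String entry : entries) {
            // Directories, empty strings, and other files are skipped.
            if (entry.toLowerCase(Locale.ROOT).endsWith(".jar")) {
                jars.add(entry);
            }
        }
        return jars;
    }

    public static void main(String[] args) {
        // Read the classpath of the running JVM and split it on the platform separator.
        String classpath = System.getProperty("java.class.path", "");
        List<String> entries = Arrays.asList(classpath.split(File.pathSeparator));
        for (String jar : jarsOnly(entries)) {
            System.out.println(jar);
        }
    }
}
```

Running main prints the JAR entries of the current JVM classpath, one per line. Entries such as target/classes are dropped, and these are the kind of entry the note above advises keeping out of the classpath.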