public class DataflowRunner extends PipelineRunner<DataflowPipelineJob>
PipelineRunner that executes the operations in the pipeline by first translating them
to the Dataflow representation using the DataflowPipelineTranslator and then submitting
them to a Dataflow service for execution.
When reading from a Dataflow source or writing to a Dataflow sink using DataflowRunner, the Google cloudservices account and the Google compute engine service account
of the GCP project running the Dataflow Job will need access to the corresponding source/sink.
Please see Google Cloud Dataflow Security and Permissions for more details.
| Modifier and Type | Class and Description |
|---|---|
static class |
DataflowRunner.StreamingPCollectionViewWriterFn<T>
A marker
DoFn for writing the contents of a PCollection to a streaming PCollectionView backend implementation. |
| Modifier and Type | Field and Description |
|---|---|
static java.lang.String |
PROJECT_ID_REGEXP
Project IDs must contain lowercase letters, digits, or dashes.
|
| Modifier | Constructor and Description |
|---|---|
protected |
DataflowRunner(DataflowPipelineOptions options) |
| Modifier and Type | Method and Description |
|---|---|
static DataflowRunner |
fromOptions(PipelineOptions options)
Construct a runner from the provided options.
|
DataflowPipelineTranslator |
getTranslator()
Returns the DataflowPipelineTranslator associated with this object.
|
static boolean |
hasExperiment(DataflowPipelineDebugOptions options,
java.lang.String experiment)
Returns true if the specified experiment is enabled, handling null experiments.
|
protected void |
replaceTransforms(Pipeline pipeline) |
DataflowPipelineJob |
run(Pipeline pipeline)
Processes the given
Pipeline, potentially asynchronously, returning a runner-specific
type of result. |
void |
setHooks(DataflowRunnerHooks hooks)
Sets callbacks to invoke during execution see
DataflowRunnerHooks. |
java.lang.String |
toString() |
create, run, runpublic static final java.lang.String PROJECT_ID_REGEXP
protected DataflowRunner(DataflowPipelineOptions options)
public static DataflowRunner fromOptions(PipelineOptions options)
options - Properties that configure the runner.public DataflowPipelineJob run(Pipeline pipeline)
PipelineRunnerPipeline, potentially asynchronously, returning a runner-specific
type of result.run in class PipelineRunner<DataflowPipelineJob>public static boolean hasExperiment(DataflowPipelineDebugOptions options, java.lang.String experiment)
protected void replaceTransforms(Pipeline pipeline)
public DataflowPipelineTranslator getTranslator()
@Experimental public void setHooks(DataflowRunnerHooks hooks)
DataflowRunnerHooks.public java.lang.String toString()
toString in class java.lang.Object