public static class Regex.FindAll extends PTransform<PCollection<java.lang.String>,PCollection<java.util.List<java.lang.String>>>
Regex.FindAll takes a PCollection<String> and returns a PCollection<List<String>> representing the values extracted from the Regex groups of the input PCollection.
This transform searches each input line for the Regex. If no portion of the line matches the Regex, the line will not be output. If a portion of the line does match, the groups in the Regex will be used, and the output for that line will be the list of Regex groups.
Example of use:
PCollection<String> words = ...;
PCollection<List<String>> values =
words.apply(Regex.findAll("myregex (mygroup)"));
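The per-element behaviour described above can be sketched with plain java.util.regex, without a Beam pipeline. This is a rough illustration only, not Beam's implementation: the class FindAllSketch and the helper findAllGroups are hypothetical names, and the sketch assumes that the whole match (group 0) is listed first, followed by each capturing group. Check the Beam source to confirm exactly which groups are emitted.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Optional;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

public class FindAllSketch {

  // Hypothetical helper (not part of Beam): mimics the per-line logic described above.
  static Optional<List<String>> findAllGroups(Pattern pattern, String line) {
    Matcher matcher = pattern.matcher(line);
    // find() succeeds when any portion of the line matches;
    // matches() would instead require the whole line to match.
    if (!matcher.find()) {
      return Optional.empty(); // no match: the line produces no output
    }
    List<String> groups = new ArrayList<>();
    // Assumption: group 0 (the whole match) comes first, then each capturing group.
    for (int i = 0; i <= matcher.groupCount(); i++) {
      groups.add(matcher.group(i));
    }
    return Optional.of(groups);
  }

  public static void main(String[] args) {
    Pattern pattern = Pattern.compile("myregex (mygroup)");
    // prints Optional[[myregex mygroup, mygroup]]
    System.out.println(findAllGroups(pattern, "xx myregex mygroup yy"));
    // prints Optional.empty
    System.out.println(findAllGroups(pattern, "no match here"));
  }
}
```

In a pipeline, an empty result corresponds to the element being dropped, and a present result corresponds to one List<String> element in the output PCollection.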
Fields inherited from class PTransform: annotations, displayData, name, resourceHints
Constructor and Description |
---|
FindAll(java.util.regex.Pattern pattern) |
Modifier and Type | Method and Description |
---|---|
PCollection<java.util.List<java.lang.String>> | expand(PCollection<java.lang.String> in) Override this method to specify how this PTransform should be expanded on the given InputT. |
Methods inherited from class PTransform: addAnnotation, compose, compose, getAdditionalInputs, getAnnotations, getDefaultOutputCoder, getDefaultOutputCoder, getDefaultOutputCoder, getKindString, getName, getResourceHints, populateDisplayData, setDisplayData, setResourceHints, toString, validate, validate
public PCollection<java.util.List<java.lang.String>> expand(PCollection<java.lang.String> in)
Override this method to specify how this PTransform should be expanded on the given InputT.
NOTE: This method should not be called directly. Instead, the PTransform should be applied to the InputT using the apply method.
Composite transforms, which are defined in terms of other transforms, should return the output of one of the composed transforms. Non-composite transforms, which do not apply any transforms internally, should return a new unbound output and register evaluators (via backend-specific registration methods).
Specified by: expand in class PTransform<PCollection<java.lang.String>,PCollection<java.util.List<java.lang.String>>>