public abstract static class XmlIO.Read<T> extends PTransform<PBegin,PCollection<T>>
XmlIO.read().| Modifier and Type | Class and Description |
|---|---|
static class |
XmlIO.Read.CompressionType
Strategy for determining the compression type of XML files being read.
|
name| Constructor and Description |
|---|
Read() |
| Modifier and Type | Method and Description |
|---|---|
PCollection<T> |
expand(PBegin input)
Override this method to specify how this
PTransform should be expanded
on the given InputT. |
XmlIO.Read<T> |
from(java.lang.String fileOrPatternSpec)
Reads a single XML file or a set of XML files defined by a Java "glob" file pattern.
|
void |
populateDisplayData(DisplayData.Builder builder)
Register display data for the given transform or component.
|
void |
validate(PipelineOptions options)
Called before running the Pipeline to verify this transform is fully and correctly
specified.
|
XmlIO.Read<T> |
withCharset(java.nio.charset.Charset charset)
Sets the XML file charset.
|
XmlIO.Read<T> |
withCompressionType(XmlIO.Read.CompressionType compressionType)
Decompresses all input files using the specified compression type.
|
XmlIO.Read<T> |
withMinBundleSize(long minBundleSize)
Sets a parameter
minBundleSize for the minimum bundle size of the source. |
XmlIO.Read<T> |
withRecordClass(java.lang.Class<T> recordClass)
Sets a JAXB annotated class that can be populated using a record of the provided XML file.
|
XmlIO.Read<T> |
withRecordElement(java.lang.String recordElement)
Sets name of the record element of the XML document.
|
XmlIO.Read<T> |
withRootElement(java.lang.String rootElement)
Sets name of the root element of the XML document.
|
XmlIO.Read<T> |
withValidationEventHandler(javax.xml.bind.ValidationEventHandler validationEventHandler)
Sets the
ValidationEventHandler to use with JAXB. |
getAdditionalInputs, getDefaultOutputCoder, getDefaultOutputCoder, getDefaultOutputCoder, getKindString, getName, toStringpublic XmlIO.Read<T> from(java.lang.String fileOrPatternSpec)
XmlIO.read().public XmlIO.Read<T> withRootElement(java.lang.String rootElement)
public XmlIO.Read<T> withRecordElement(java.lang.String recordElement)
public XmlIO.Read<T> withRecordClass(java.lang.Class<T> recordClass)
public XmlIO.Read<T> withMinBundleSize(long minBundleSize)
minBundleSize for the minimum bundle size of the source. Please
refer to OffsetBasedSource for the definition of minBundleSize. This is an optional
parameter.public XmlIO.Read<T> withCompressionType(XmlIO.Read.CompressionType compressionType)
If no compression type is specified, the default is XmlIO.Read.CompressionType.AUTO. In this
mode, the compression type of the file is determined by its extension. Supports .gz, .bz2,
.zip and .deflate compression.
public XmlIO.Read<T> withCharset(java.nio.charset.Charset charset)
public XmlIO.Read<T> withValidationEventHandler(javax.xml.bind.ValidationEventHandler validationEventHandler)
ValidationEventHandler to use with JAXB. Calling this with a null
parameter will cause the JAXB unmarshaller event handler to be unspecified.public void validate(PipelineOptions options)
PTransformBy default, does nothing.
validate in class PTransform<PBegin,PCollection<T>>public void populateDisplayData(DisplayData.Builder builder)
PTransformpopulateDisplayData(DisplayData.Builder) is invoked by Pipeline runners to collect
display data via DisplayData.from(HasDisplayData). Implementations may call
super.populateDisplayData(builder) in order to register display data in the current
namespace, but should otherwise use subcomponent.populateDisplayData(builder) to use
the namespace of the subcomponent.
By default, does not register any display data. Implementors may override this method to provide their own display data.
populateDisplayData in interface HasDisplayDatapopulateDisplayData in class PTransform<PBegin,PCollection<T>>builder - The builder to populate with display data.HasDisplayDatapublic PCollection<T> expand(PBegin input)
PTransformPTransform should be expanded
on the given InputT.
NOTE: This method should not be called directly. Instead apply the
PTransform should be applied to the InputT using the apply
method.
Composite transforms, which are defined in terms of other transforms, should return the output of one of the composed transforms. Non-composite transforms, which do not apply any transforms internally, should return a new unbound output and register evaluators (via backend-specific registration methods).
expand in class PTransform<PBegin,PCollection<T>>