Package org.apache.beam.sdk.io.hadoop.format
package org.apache.beam.sdk.io.hadoop.format
Defines transforms for writing to Data sinks that implement
HadoopFormatIO
.- See Also:
-
ClassDescriptionProvides mechanism for acquiring locks related to the job.A
HadoopFormatIO
is a Transform for reading data from any source or writing data to any sink which implements HadoopInputFormat
orOutputFormat
.Bounded source implementation forHadoopFormatIO
.HadoopFormatIO.Read<K,V> APTransform
that reads from any data source which implements Hadoop InputFormat.A wrapper to allow HadoopInputSplit
to be serialized using Java's standard serialization mechanisms.HadoopFormatIO.Write<KeyT,ValueT> APTransform
that writes to any data sink which implements Hadoop OutputFormat.HadoopFormatIO.Write.ExternalSynchronizationBuilder<KeyT,ValueT> Builder for External Synchronization defining.HadoopFormatIO.Write.PartitionedWriterBuilder<KeyT,ValueT> Builder for partitioning determining.HadoopFormatIO.Write.WriteBuilder<KeyT,ValueT> Main builder of Write transformation.Implementation ofExternalSynchronization
which registers locks in the HDFS.