T
- the type of the values being transcodedpublic class NullableCoder<T> extends StructuredCoder<T>
NullableCoder
encodes nullable values of type T
using a nested Coder<T>
that does not tolerate null
values. NullableCoder
uses exactly 1 byte per entry
to indicate whether the value is null
, then adds the encoding of the inner coder for
non-null values.Coder.Context, Coder.NonDeterministicException
Modifier and Type | Method and Description |
---|---|
boolean |
consistentWithEquals()
NullableCoder is consistent with equals if the nested Coder is. |
T |
decode(java.io.InputStream inStream)
Decodes a value of type
T from the given input stream in the given context. |
T |
decode(java.io.InputStream inStream,
Coder.Context context)
Decodes a value of type
T from the given input stream in the given context. |
void |
encode(T value,
java.io.OutputStream outStream)
Encodes the given value of type
T onto the given output stream. |
void |
encode(T value,
java.io.OutputStream outStream,
Coder.Context context)
Encodes the given value of type
T onto the given output stream in the given context. |
java.util.List<Coder<T>> |
getCoderArguments()
|
protected long |
getEncodedElementByteSize(T value)
Overridden to short-circuit the default
StructuredCoder behavior of encoding and
counting the bytes. |
TypeDescriptor<T> |
getEncodedTypeDescriptor()
Returns the
TypeDescriptor for the type encoded. |
Coder<T> |
getValueCoder()
Returns the inner
Coder wrapped by this NullableCoder instance. |
boolean |
isRegisterByteSizeObserverCheap(T value)
NullableCoder is cheap if valueCoder is cheap. |
static <T> NullableCoder<T> |
of(Coder<T> valueCoder) |
void |
registerByteSizeObserver(T value,
org.apache.beam.sdk.util.common.ElementByteSizeObserver observer)
Overridden to short-circuit the default
StructuredCoder behavior of encoding and
counting the bytes. |
java.lang.Object |
structuralValue(T value)
Returns an object with an
Object.equals() method that represents structural equality on
the argument. |
void |
verifyDeterministic()
NullableCoder is deterministic if the nested Coder is. |
equals, getComponents, hashCode, toString
verifyDeterministic, verifyDeterministic
public static <T> NullableCoder<T> of(Coder<T> valueCoder)
public Coder<T> getValueCoder()
Coder
wrapped by this NullableCoder
instance.public void encode(T value, java.io.OutputStream outStream) throws java.io.IOException, CoderException
Coder
T
onto the given output stream. Multiple elements can
be encoded next to each other on the output stream, each coder should encode information to
know how many bytes to read when decoding. A common approach is to prefix the encoding with the
element's encoded length.encode
in class Coder<T>
java.io.IOException
- if writing to the OutputStream
fails for some reasonCoderException
- if the value could not be encoded for some reasonpublic void encode(T value, java.io.OutputStream outStream, Coder.Context context) throws java.io.IOException, CoderException
Coder
T
onto the given output stream in the given context.encode
in class Coder<T>
java.io.IOException
- if writing to the OutputStream
fails for some reasonCoderException
- if the value could not be encoded for some reasonpublic T decode(java.io.InputStream inStream) throws java.io.IOException, CoderException
Coder
T
from the given input stream in the given context. Returns the
decoded value. Multiple elements can be encoded next to each other on the input stream, each
coder should encode information to know how many bytes to read when decoding. A common approach
is to prefix the encoding with the element's encoded length.decode
in class Coder<T>
java.io.IOException
- if reading from the InputStream
fails for some reasonCoderException
- if the value could not be decoded for some reasonpublic T decode(java.io.InputStream inStream, Coder.Context context) throws java.io.IOException, CoderException
Coder
T
from the given input stream in the given context. Returns the
decoded value.decode
in class Coder<T>
java.io.IOException
- if reading from the InputStream
fails for some reasonCoderException
- if the value could not be decoded for some reasonpublic java.util.List<Coder<T>> getCoderArguments()
Coder
Coder
for a parameterized type, returns the list of Coder
s being
used for each of the parameters in the same order they appear within the parameterized type's
type signature. If this cannot be done, or this Coder
does not encode/decode a
parameterized type, returns the empty list.getCoderArguments
in class Coder<T>
public void verifyDeterministic() throws Coder.NonDeterministicException
NullableCoder
is deterministic if the nested Coder
is.
Throw Coder.NonDeterministicException
if the coding is not deterministic.
In order for a Coder
to be considered deterministic, the following must be true:
Object.equals()
or Comparable.compareTo()
, if supported) have the same encoding.
Coder
always produces a canonical encoding, which is the same for an instance
of an object even if produced on different computers at different times.
verifyDeterministic
in class Coder<T>
Coder.NonDeterministicException
- if this coder is not deterministic.public boolean consistentWithEquals()
NullableCoder
is consistent with equals if the nested Coder
is.
Returns true
if this Coder
is injective with respect to Object.equals(java.lang.Object)
.
Whenever the encoded bytes of two values are equal, then the original values are equal
according to Objects.equals()
. Note that this is well-defined for null
.
This condition is most notably false for arrays. More generally, this condition is false
whenever equals()
compares object identity, rather than performing a
semantic/structural comparison.
By default, returns false.
consistentWithEquals
in class Coder<T>
public java.lang.Object structuralValue(T value)
Coder
Object.equals()
method that represents structural equality on
the argument.
For any two values x
and y
of type T
, if their encoded bytes are the
same, then it must be the case that structuralValue(x).equals(structuralValue(y))
.
Most notably:
null
should be a proper object with an
equals()
method, even if the input value is null
.
See also Coder.consistentWithEquals()
.
By default, if this coder is Coder.consistentWithEquals()
, and the value is not null,
returns the provided object. Otherwise, encodes the value into a byte[]
, and returns an
object that performs array equality on the encoded bytes.
structuralValue
in class Coder<T>
public void registerByteSizeObserver(T value, org.apache.beam.sdk.util.common.ElementByteSizeObserver observer) throws java.lang.Exception
StructuredCoder
behavior of encoding and
counting the bytes. The size is known (1 byte) when value
is null
, otherwise
the size is 1 byte plus the size of nested Coder
's encoding of value
.
Notifies the ElementByteSizeObserver
about the byte size of the encoded value using
this Coder
.
Not intended to be called by user code, but instead by PipelineRunner
implementations.
By default, this notifies observer
about the byte size of the encoded value using
this coder as returned by Coder.getEncodedElementByteSize(T)
.
registerByteSizeObserver
in class Coder<T>
java.lang.Exception
protected long getEncodedElementByteSize(T value) throws java.lang.Exception
StructuredCoder
behavior of encoding and
counting the bytes. The size is known (1 byte) when value
is null
, otherwise
the size is 1 byte plus the size of nested Coder
's encoding of value
.
Returns the size in bytes of the encoded value using this coder.
getEncodedElementByteSize
in class Coder<T>
java.lang.Exception
public boolean isRegisterByteSizeObserverCheap(T value)
NullableCoder
is cheap if valueCoder
is cheap.
Returns whether Coder.registerByteSizeObserver(T, org.apache.beam.sdk.util.common.ElementByteSizeObserver)
cheap enough to call for every element, that
is, if this Coder
can calculate the byte size of the element to be coded in roughly
constant time (or lazily).
Not intended to be called by user code, but instead by PipelineRunner
implementations.
By default, returns false. The default Coder.registerByteSizeObserver(T, org.apache.beam.sdk.util.common.ElementByteSizeObserver)
implementation
invokes Coder.getEncodedElementByteSize(T)
which requires re-encoding an element unless it is
overridden. This is considered expensive.
isRegisterByteSizeObserverCheap
in class Coder<T>
public TypeDescriptor<T> getEncodedTypeDescriptor()
Coder
TypeDescriptor
for the type encoded.getEncodedTypeDescriptor
in class Coder<T>