Package org.apache.beam.sdk.coders
Class StringUtf8Coder
java.lang.Object
- All Implemented Interfaces:
Serializable
A Coder that encodes Strings in UTF-8. In a nested context, it prefixes the string with an integer length field, encoded via a VarIntCoder.
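The difference between the nested and outer contexts can be illustrated with a standalone sketch. It uses only the JDK and is not the Beam implementation: the class `Utf8WireFormatSketch`, its methods, and the `nested` flag are hypothetical names, and the length prefix is assumed to be a base-128 varint (7 bits per byte, low-order group first) of the UTF-8 byte count.

```java
import java.io.ByteArrayOutputStream;
import java.nio.charset.StandardCharsets;

/** Standalone sketch of the byte layout; hypothetical, not the Beam implementation. */
final class Utf8WireFormatSketch {

  /** Writes a non-negative int as a base-128 varint, low-order 7-bit group first. */
  static void writeVarInt(int v, ByteArrayOutputStream out) {
    while ((v & ~0x7F) != 0) {
      out.write((v & 0x7F) | 0x80); // high bit set: more bytes follow
      v >>>= 7;
    }
    out.write(v); // final byte, high bit clear
  }

  /** Encodes value as UTF-8; when nested is true, the bytes are preceded by their length. */
  static byte[] encode(String value, boolean nested) {
    byte[] utf8 = value.getBytes(StandardCharsets.UTF_8);
    ByteArrayOutputStream out = new ByteArrayOutputStream();
    if (nested) {
      writeVarInt(utf8.length, out); // assumed length prefix: byte count, not char count
    }
    out.write(utf8, 0, utf8.length);
    return out.toByteArray();
  }
}
```

Under these assumptions, `encode("hi", false)` yields the two bytes of `"hi"`, while `encode("hi", true)` yields three bytes: the length `2` followed by the same two bytes.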
Nested Class Summary
Nested classes/interfaces inherited from class org.apache.beam.sdk.coders.Coder
Coder.Context, Coder.NonDeterministicException
-
Method Summary
Modifier and Type, Method, Description

boolean consistentWithEquals()
    Returns true if this Coder is injective with respect to Object.equals(java.lang.Object).
String decode(InputStream inStream)
    Decodes a value of type T from the given input stream in the given context.
String decode(InputStream inStream, Coder.Context context)
    Decodes a value of type T from the given input stream in the given context.
void encode(String value, OutputStream outStream)
    Encodes the given value of type T onto the given output stream.
void encode(String value, OutputStream outStream, Coder.Context context)
    Encodes the given value of type T onto the given output stream in the given context.
long getEncodedElementByteSize(String value)
    Returns the size in bytes of the encoded value using this coder.
TypeDescriptor<String> getEncodedTypeDescriptor()
    Returns the TypeDescriptor for the type encoded.
static StringUtf8Coder of()
void verifyDeterministic()
    Throws Coder.NonDeterministicException if the coding is not deterministic.

Methods inherited from class org.apache.beam.sdk.coders.AtomicCoder
equals, getCoderArguments, getComponents, hashCode
Methods inherited from class org.apache.beam.sdk.coders.StructuredCoder
toString
Methods inherited from class org.apache.beam.sdk.coders.Coder
getEncodedElementByteSizeUsingCoder, isRegisterByteSizeObserverCheap, registerByteSizeObserver, structuralValue, verifyDeterministic, verifyDeterministic
-
Method Details
-
of
-
encode
Description copied from class: Coder
Encodes the given value of type T onto the given output stream. Multiple elements can be encoded next to each other on the output stream, so each coder should encode enough information to know how many bytes to read when decoding. A common approach is to prefix the encoding with the element's encoded length.
- Specified by: encode in class Coder<String>
- Throws: IOException - if writing to the OutputStream fails for some reason
-
encode
Description copied from class: Coder
Encodes the given value of type T onto the given output stream in the given context.
- Overrides: encode in class Coder<String>
- Throws: IOException - if writing to the OutputStream fails for some reason
-
decode
Description copied from class: Coder
Decodes a value of type T from the given input stream in the given context. Returns the decoded value. Multiple elements can be encoded next to each other on the input stream, so each coder should encode enough information to know how many bytes to read when decoding. A common approach is to prefix the encoding with the element's encoded length.
- Specified by: decode in class Coder<String>
- Throws: IOException - if reading from the InputStream fails for some reason
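The length-prefix approach described above is what lets a reader split one stream back into separate elements. The following is a minimal JDK-only sketch of that idea, not Beam code: `LengthPrefixedDecodeSketch` and its methods are hypothetical, and the prefix is assumed to be a base-128 varint holding the UTF-8 byte count.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.List;

/** Standalone sketch of decoding adjacent length-prefixed strings; hypothetical names. */
final class LengthPrefixedDecodeSketch {

  /** Appends one string as varint(byte length) followed by its UTF-8 bytes. */
  static void writeNested(String value, ByteArrayOutputStream out) {
    byte[] utf8 = value.getBytes(StandardCharsets.UTF_8);
    int len = utf8.length;
    while ((len & ~0x7F) != 0) {
      out.write((len & 0x7F) | 0x80);
      len >>>= 7;
    }
    out.write(len);
    out.write(utf8, 0, utf8.length);
  }

  /** Reads one string: first the varint length, then exactly that many bytes. */
  static String readNested(ByteArrayInputStream in) {
    int len = 0;
    int shift = 0;
    int b;
    do {
      b = in.read();
      if (b < 0 || shift > 28) {
        throw new IllegalStateException("truncated or oversized length prefix");
      }
      len |= (b & 0x7F) << shift;
      shift += 7;
    } while ((b & 0x80) != 0);
    if (len < 0) {
      throw new IllegalStateException("negative length prefix");
    }
    byte[] utf8 = new byte[len];
    int n = (len == 0) ? 0 : in.read(utf8, 0, len);
    if (n != len) {
      throw new IllegalStateException("truncated string payload");
    }
    return new String(utf8, StandardCharsets.UTF_8);
  }

  /** Decodes every string in the buffer, relying on the prefixes to find boundaries. */
  static List<String> decodeAll(byte[] encoded) {
    ByteArrayInputStream in = new ByteArrayInputStream(encoded);
    List<String> result = new ArrayList<>();
    while (in.available() > 0) {
      result.add(readNested(in));
    }
    return result;
  }
}
```

Without the prefix, the boundary between two adjacent strings could not be recovered, which is why an unprefixed encoding is only safe for the last element written to a stream.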
-
decode
Description copied from class: Coder
Decodes a value of type T from the given input stream in the given context. Returns the decoded value.
- Overrides: decode in class Coder<String>
- Throws: IOException - if reading from the InputStream fails for some reason
-
verifyDeterministic
public void verifyDeterministic()
Description copied from class: AtomicCoder
Throws Coder.NonDeterministicException if the coding is not deterministic.
In order for a Coder to be considered deterministic, the following must be true:
- two values that compare as equal (via Object.equals() or Comparable.compareTo(), if supported) have the same encoding.
- the Coder always produces a canonical encoding, which is the same for an instance of an object even if produced on different computers at different times.
Unless overridden, does not throw. An AtomicCoder is presumed to be deterministic.
- Overrides: verifyDeterministic in class AtomicCoder<String>
-
consistentWithEquals
public boolean consistentWithEquals()
Returns true if this Coder is injective with respect to Object.equals(java.lang.Object).
Whenever the encoded bytes of two values are equal, then the original values are equal according to Objects.equals(). Note that this is well-defined for null.
This condition is most notably false for arrays. More generally, this condition is false whenever equals() compares object identity, rather than performing a semantic/structural comparison.
By default, returns false.
- Overrides: consistentWithEquals in class Coder<String>
- Returns: true. This coder is injective.
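The contrast between strings and arrays can be seen with plain JDK calls. This is an illustrative sketch only, with a hypothetical class `InjectiveEncodingSketch`; it does not call the coder itself and considers only well-formed strings.

```java
import java.nio.charset.StandardCharsets;
import java.util.Arrays;

/** Standalone sketch contrasting String.equals() with array equals(); hypothetical names. */
final class InjectiveEncodingSketch {

  /** True when both strings produce identical UTF-8 bytes. */
  static boolean sameUtf8Bytes(String a, String b) {
    return Arrays.equals(
        a.getBytes(StandardCharsets.UTF_8), b.getBytes(StandardCharsets.UTF_8));
  }

  /** True when both arrays hold identical bytes, the analogue of equal encodings. */
  static boolean sameContent(byte[] a, byte[] b) {
    return Arrays.equals(a, b);
  }

  /** Array equals() is inherited from Object and compares identity, not content. */
  static boolean equalByEquals(byte[] a, byte[] b) {
    return a.equals(b);
  }
}
```

For two distinct `String` objects with the same characters, `sameUtf8Bytes` and `String.equals()` agree. For two distinct `byte[]` objects with the same content, `sameContent` is true while `equalByEquals` is false, which is the situation the description above refers to when it says the condition is false for arrays.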
-
getEncodedTypeDescriptor
Description copied from class: Coder
Returns the TypeDescriptor for the type encoded.
- Overrides: getEncodedTypeDescriptor in class Coder<String>
-
getEncodedElementByteSize
Returns the size in bytes of the encoded value using this coder.
- Overrides: getEncodedElementByteSize in class Coder<String>
- Returns: the byte size of the UTF-8 encoding of the string or, in a nested context, the byte size of the encoding plus the encoded length prefix.
- Throws: Exception
-
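The nested-context size described above can be worked through by hand. The sketch below is JDK-only and hypothetical (`EncodedSizeSketch` is not part of Beam); it assumes the length prefix is a base-128 varint of the UTF-8 byte count, so the prefix takes one byte for lengths up to 127 and two bytes for lengths up to 16383.

```java
import java.nio.charset.StandardCharsets;

/** Standalone sketch of the nested-context size arithmetic; hypothetical names. */
final class EncodedSizeSketch {

  /** Number of bytes a base-128 varint needs for a non-negative int. */
  static int varIntSize(int v) {
    int n = 1;
    while ((v & ~0x7F) != 0) {
      n++;
      v >>>= 7;
    }
    return n;
  }

  /** Size of the UTF-8 bytes alone, as in the outer context. */
  static long outerSize(String value) {
    return value.getBytes(StandardCharsets.UTF_8).length;
  }

  /** Size of the UTF-8 bytes plus the assumed varint length prefix. */
  static long nestedSize(String value) {
    int utf8Len = value.getBytes(StandardCharsets.UTF_8).length;
    return (long) varIntSize(utf8Len) + utf8Len;
  }
}
```

For example, `"h\u00e9llo"` has five characters but six UTF-8 bytes, so `nestedSize` gives 7 under these assumptions. A string of 200 ASCII characters needs a two-byte prefix, giving 202.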