Package org.apache.beam.sdk.coders
Class StringUtf8Coder
java.lang.Object
- All Implemented Interfaces:
Serializable
A
Coder that encodes Strings in UTF-8 encoding. If in a nested context,
prefixes the string with an integer length field, encoded via a VarIntCoder.- See Also:
-
Nested Class Summary
Nested classes/interfaces inherited from class org.apache.beam.sdk.coders.Coder
Coder.Context, Coder.NonDeterministicException -
Method Summary
Modifier and TypeMethodDescriptionbooleandecode(InputStream inStream) Decodes a value of typeTfrom the given input stream in the given context.decode(InputStream inStream, Coder.Context context) Decodes a value of typeTfrom the given input stream in the given context.voidencode(String value, OutputStream outStream) Encodes the given value of typeTonto the given output stream.voidencode(String value, OutputStream outStream, Coder.Context context) Encodes the given value of typeTonto the given output stream in the given context.longgetEncodedElementByteSize(String value) Returns the size in bytes of the encoded value using this coder.Returns theTypeDescriptorfor the type encoded.static StringUtf8Coderof()voidThrowCoder.NonDeterministicExceptionif the coding is not deterministic.Methods inherited from class org.apache.beam.sdk.coders.AtomicCoder
equals, getCoderArguments, getComponents, hashCodeMethods inherited from class org.apache.beam.sdk.coders.StructuredCoder
toStringMethods inherited from class org.apache.beam.sdk.coders.Coder
getEncodedElementByteSizeUsingCoder, isRegisterByteSizeObserverCheap, registerByteSizeObserver, structuralValue, verifyDeterministic, verifyDeterministic
-
Method Details
-
of
-
encode
Description copied from class:CoderEncodes the given value of typeTonto the given output stream. Multiple elements can be encoded next to each other on the output stream, each coder should encode information to know how many bytes to read when decoding. A common approach is to prefix the encoding with the element's encoded length.- Specified by:
encodein classCoder<String>- Throws:
IOException- if writing to theOutputStreamfails for some reason
-
encode
Description copied from class:CoderEncodes the given value of typeTonto the given output stream in the given context.- Overrides:
encodein classCoder<String>- Throws:
IOException- if writing to theOutputStreamfails for some reason
-
decode
Description copied from class:CoderDecodes a value of typeTfrom the given input stream in the given context. Returns the decoded value. Multiple elements can be encoded next to each other on the input stream, each coder should encode information to know how many bytes to read when decoding. A common approach is to prefix the encoding with the element's encoded length.- Specified by:
decodein classCoder<String>- Throws:
IOException- if reading from theInputStreamfails for some reason
-
decode
Description copied from class:CoderDecodes a value of typeTfrom the given input stream in the given context. Returns the decoded value.- Overrides:
decodein classCoder<String>- Throws:
IOException- if reading from theInputStreamfails for some reason
-
verifyDeterministic
public void verifyDeterministic()Description copied from class:AtomicCoderThrowCoder.NonDeterministicExceptionif the coding is not deterministic.In order for a
Coderto be considered deterministic, the following must be true:- two values that compare as equal (via
Object.equals()orComparable.compareTo(), if supported) have the same encoding. - the
Coderalways produces a canonical encoding, which is the same for an instance of an object even if produced on different computers at different times.
Unless overridden, does not throw. An
AtomicCoderis presumed to be deterministic- Overrides:
verifyDeterministicin classAtomicCoder<String>
- two values that compare as equal (via
-
consistentWithEquals
public boolean consistentWithEquals()Returnstrueif thisCoderis injective with respect toObject.equals(java.lang.Object).Whenever the encoded bytes of two values are equal, then the original values are equal according to
Objects.equals(). Note that this is well-defined fornull.This condition is most notably false for arrays. More generally, this condition is false whenever
equals()compares object identity, rather than performing a semantic/structural comparison.By default, returns false.
- Overrides:
consistentWithEqualsin classCoder<String>- Returns:
true. This coder is injective.
-
getEncodedTypeDescriptor
Description copied from class:CoderReturns theTypeDescriptorfor the type encoded.- Overrides:
getEncodedTypeDescriptorin classCoder<String>
-
getEncodedElementByteSize
Returns the size in bytes of the encoded value using this coder.- Overrides:
getEncodedElementByteSizein classCoder<String>- Returns:
- the byte size of the UTF-8 encoding of the string or, in a nested context, the byte size of the encoding plus the encoded length prefix.
- Throws:
Exception
-