Class IsmFormat.IsmRecordCoder<V>
- All Implemented Interfaces:
Serializable
- Enclosing class:
IsmFormat
Coder for IsmFormat.IsmRecords.
Note that this coder standalone will not produce an Ism file. This coder can be used to
materialize a PCollection of IsmFormat.IsmRecords. Only when this coder is combined with
an IsmSink will one produce an Ism file.
The IsmFormat.IsmRecord encoded format is:
- encoded key component 1 using key component coder 1
- ...
- encoded key component N using key component coder N
- encoded value using value coder
- See Also:
-
Nested Class Summary
Nested classes/interfaces inherited from class org.apache.beam.sdk.coders.Coder
Coder.Context, Coder.NonDeterministicException -
Method Summary
Modifier and TypeMethodDescriptionbooleandecode(InputStream inStream) Decodes a value of typeTfrom the given input stream in the given context.voidencode(IsmFormat.IsmRecord<V> value, OutputStream outStream) Encodes the given value of typeTonto the given output stream.intencodeAndHash(List<?> keyComponents, RandomAccessData keyBytesToMutate) Computes the shard id for the given key component(s).intencodeAndHash(List<?> keyComponents, RandomAccessData keyBytesToMutate, List<Integer> keyComponentByteOffsetsToMutate) Computes the shard id for the given key component(s).boolean<T> Coder<T> getKeyComponentCoder(int index) Returns the key coder at the specified index.Returns the list of key component coders.intgetNumberOfShardKeyCoders(List<?> keyComponents) Returns the value coder.intComputes the shard id for the given key component(s).inthashCode()static <V> IsmFormat.IsmRecordCoder<V> of(int numberOfShardKeyCoders, int numberOfMetadataShardKeyCoders, List<Coder<?>> keyComponentCoders, Coder<V> valueCoder) Returns an IsmRecordCoder with the specified key component coders, value coder.structuralValue(IsmFormat.IsmRecord<V> record) Returns an object with anObject.equals()method that represents structural equality on the argument.voidThrowCoder.NonDeterministicExceptionif the coding is not deterministic.Methods inherited from class org.apache.beam.sdk.coders.Coder
decode, encode, getEncodedElementByteSize, getEncodedElementByteSizeUsingCoder, getEncodedTypeDescriptor, isRegisterByteSizeObserverCheap, registerByteSizeObserver, verifyDeterministic, verifyDeterministic
-
Method Details
-
of
public static <V> IsmFormat.IsmRecordCoder<V> of(int numberOfShardKeyCoders, int numberOfMetadataShardKeyCoders, List<Coder<?>> keyComponentCoders, Coder<V> valueCoder) Returns an IsmRecordCoder with the specified key component coders, value coder. -
getKeyComponentCoders
Returns the list of key component coders. -
getKeyComponentCoder
Returns the key coder at the specified index. -
getValueCoder
Returns the value coder. -
encode
public void encode(IsmFormat.IsmRecord<V> value, OutputStream outStream) throws CoderException, IOException Description copied from class:CoderEncodes the given value of typeTonto the given output stream. Multiple elements can be encoded next to each other on the output stream, each coder should encode information to know how many bytes to read when decoding. A common approach is to prefix the encoding with the element's encoded length.- Specified by:
encodein classCoder<IsmFormat.IsmRecord<V>>- Throws:
CoderException- if the value could not be encoded for some reasonIOException- if writing to theOutputStreamfails for some reason
-
decode
Description copied from class:CoderDecodes a value of typeTfrom the given input stream in the given context. Returns the decoded value. Multiple elements can be encoded next to each other on the input stream, each coder should encode information to know how many bytes to read when decoding. A common approach is to prefix the encoding with the element's encoded length.- Specified by:
decodein classCoder<IsmFormat.IsmRecord<V>>- Throws:
CoderException- if the value could not be decoded for some reasonIOException- if reading from theInputStreamfails for some reason
-
getNumberOfShardKeyCoders
-
hash
Computes the shard id for the given key component(s).The shard keys are encoded into their byte representations and hashed using the 32-bit murmur3 algorithm, x86 variant (little-endian variant), using
1225801234as the seed value. We ensure that shard ids for metadata keys and normal keys do not overlap. -
encodeAndHash
Computes the shard id for the given key component(s).Mutates
keyBytessuch that when returned, contains the encoded version of the key components. -
encodeAndHash
public int encodeAndHash(List<?> keyComponents, RandomAccessData keyBytesToMutate, List<Integer> keyComponentByteOffsetsToMutate) Computes the shard id for the given key component(s).Mutates
keyBytessuch that when returned, contains the encoded version of the key components. Also, mutateskeyComponentByteOffsetsToMutateto store the location where each key component's encoded byte representation ends withinkeyBytes. -
getCoderArguments
Description copied from class:CustomCoderIf this is aCoderfor a parameterized type, returns the list ofCoders being used for each of the parameters in the same order they appear within the parameterized type's type signature. If this cannot be done, or thisCoderdoes not encode/decode a parameterized type, returns the empty list..Returns an empty list. A
CustomCoderhas no default argumentcoders.- Overrides:
getCoderArgumentsin classCustomCoder<IsmFormat.IsmRecord<V>>
-
verifyDeterministic
Description copied from class:CustomCoderThrowCoder.NonDeterministicExceptionif the coding is not deterministic.In order for a
Coderto be considered deterministic, the following must be true:- two values that compare as equal (via
Object.equals()orComparable.compareTo(), if supported) have the same encoding. - the
Coderalways produces a canonical encoding, which is the same for an instance of an object even if produced on different computers at different times.
- Overrides:
verifyDeterministicin classCustomCoder<IsmFormat.IsmRecord<V>>- Throws:
Coder.NonDeterministicException- aCustomCoderis presumed nondeterministic.
- two values that compare as equal (via
-
consistentWithEquals
public boolean consistentWithEquals()Description copied from class:CoderReturnstrueif thisCoderis injective with respect toObject.equals(java.lang.Object).Whenever the encoded bytes of two values are equal, then the original values are equal according to
Objects.equals(). Note that this is well-defined fornull.This condition is most notably false for arrays. More generally, this condition is false whenever
equals()compares object identity, rather than performing a semantic/structural comparison.By default, returns false.
- Overrides:
consistentWithEqualsin classCoder<IsmFormat.IsmRecord<V>>
-
structuralValue
Description copied from class:CoderReturns an object with anObject.equals()method that represents structural equality on the argument.For any two values
xandyof typeT, if their encoded bytes are the same, then it must be the case thatstructuralValue(x).equals(structuralValue(y)).Most notably:
- The structural value for an array coder should perform a structural comparison of the contents of the arrays, rather than the default behavior of comparing according to object identity.
- The structural value for a coder accepting
nullshould be a proper object with anequals()method, even if the input value isnull.
See also
Coder.consistentWithEquals().By default, if this coder is
Coder.consistentWithEquals(), and the value is not null, returns the provided object. Otherwise, encodes the value into abyte[], and returns an object that performs array equality on the encoded bytes.- Overrides:
structuralValuein classCoder<IsmFormat.IsmRecord<V>>
-
equals
-
hashCode
public int hashCode()
-