public class BeamSortRel extends org.apache.calcite.rel.core.Sort implements BeamRelNode
BeamRelNode
to replace a Sort
node.
Since Beam does not fully support global sort, it uses Top
to implement the Sort
algebra. The following types of ORDER BY are supported:
SELECT * FROM t ORDER BY id DESC LIMIT 10;
SELECT * FROM t ORDER BY id DESC LIMIT 10 OFFSET 5;
but an ORDER BY without a LIMIT is NOT supported. For example, the following will throw an exception:
SELECT * FROM t ORDER BY id DESC;
Top
, the result of a ORDER BY LIMIT must fit into the
memory of a single machine.
Constructor and Description |
---|
BeamSortRel(org.apache.calcite.plan.RelOptCluster cluster,
org.apache.calcite.plan.RelTraitSet traits,
org.apache.calcite.rel.RelNode child,
org.apache.calcite.rel.RelCollation collation,
org.apache.calcite.rex.RexNode offset,
org.apache.calcite.rex.RexNode fetch) |
Modifier and Type | Method and Description |
---|---|
BeamCostModel |
beamComputeSelfCost(org.apache.calcite.plan.RelOptPlanner planner,
org.apache.calcite.rel.metadata.RelMetadataQuery mq)
This method is called by
org.apache.beam.sdk.extensions.sql.impl.CalciteQueryPlanner.NonCumulativeCostImpl . |
PTransform<PCollectionList<Row>,PCollection<Row>> |
buildPTransform() |
org.apache.calcite.rel.core.Sort |
copy(org.apache.calcite.plan.RelTraitSet traitSet,
org.apache.calcite.rel.RelNode newInput,
org.apache.calcite.rel.RelCollation newCollation,
org.apache.calcite.rex.RexNode offset,
org.apache.calcite.rex.RexNode fetch) |
NodeStats |
estimateNodeStats(org.apache.calcite.rel.metadata.RelMetadataQuery mq)
This method is called by
org.apache.beam.sdk.extensions.sql.impl.planner.RelMdNodeStats . |
int |
getCount() |
boolean |
isLimitOnly() |
accept, computeSelfCost, copy, copy, explainTerms, getChildExps, getCollation, getCollationList
childrenAccept, deriveRowType, estimateRowCount, getInput, getInputs, replaceInput
accept, collectVariablesSet, collectVariablesUsed, computeDigest, computeSelfCost, explain, getCluster, getConvention, getCorrelVariable, getDescription, getDigest, getExpectedInputRowType, getId, getInput, getQuery, getRelTypeName, getRows, getRowType, getTable, getTraitSet, getVariablesSet, getVariablesStopped, isDistinct, isKey, isValid, isValid, metadata, onRegister, recomputeDigest, register, sole, toString
clone, equals, finalize, getClass, hashCode, notify, notifyAll, wait, wait, wait
getPCollectionInputs, getPipelineOptions, isBounded
accept, accept, childrenAccept, collectVariablesSet, collectVariablesUsed, computeSelfCost, computeSelfCost, copy, estimateRowCount, explain, getChildExps, getCollationList, getConvention, getCorrelVariable, getExpectedInputRowType, getInput, getInputs, getQuery, getRelTypeName, getRows, getRowType, getTable, getVariablesSet, getVariablesStopped, isDistinct, isKey, isValid, isValid, metadata, onRegister, recomputeDigest, register, replaceInput
public BeamSortRel(org.apache.calcite.plan.RelOptCluster cluster, org.apache.calcite.plan.RelTraitSet traits, org.apache.calcite.rel.RelNode child, org.apache.calcite.rel.RelCollation collation, org.apache.calcite.rex.RexNode offset, org.apache.calcite.rex.RexNode fetch)
public NodeStats estimateNodeStats(org.apache.calcite.rel.metadata.RelMetadataQuery mq)
BeamRelNode
org.apache.beam.sdk.extensions.sql.impl.planner.RelMdNodeStats
. This is currently only used in
SQLTransform Path (and not JDBC path). When a RelNode wants to calculate its BeamCost or
estimate its NodeStats, it may need NodeStat of its inputs. However, it should not call this
directly (because maybe its inputs are not physical yet). It should call BeamSqlRelUtils.getNodeStats(org.apache.calcite.rel.RelNode,
org.apache.calcite.rel.metadata.RelMetadataQuery)
instead.estimateNodeStats
in interface BeamRelNode
public BeamCostModel beamComputeSelfCost(org.apache.calcite.plan.RelOptPlanner planner, org.apache.calcite.rel.metadata.RelMetadataQuery mq)
BeamRelNode
org.apache.beam.sdk.extensions.sql.impl.CalciteQueryPlanner.NonCumulativeCostImpl
. This is
currently only used in SQLTransform Path (and not JDBC path). This is needed when Calcite Query
Planner wants to get the cost of a plan. Instead of calling this directly for a node, if we
needed that it should be obtained by calling mq.getNonCumulativeCost. This way RelMetadataQuery
will call this method instead of ComputeSelfCost if the handler is set correctly (see org.apache.beam.sdk.extensions.sql.impl.CalciteQueryPlanner#convertToBeamRel(String)
)beamComputeSelfCost
in interface BeamRelNode
public boolean isLimitOnly()
public int getCount()
public PTransform<PCollectionList<Row>,PCollection<Row>> buildPTransform()
buildPTransform
in interface BeamRelNode
public org.apache.calcite.rel.core.Sort copy(org.apache.calcite.plan.RelTraitSet traitSet, org.apache.calcite.rel.RelNode newInput, org.apache.calcite.rel.RelCollation newCollation, org.apache.calcite.rex.RexNode offset, org.apache.calcite.rex.RexNode fetch)
copy
in class org.apache.calcite.rel.core.Sort