PartitionPruning (Spark 3.0.0-preview JavaDoc)

Object
- org.apache.spark.sql.dynamicpruning.PartitionPruning

```
public class PartitionPruning
extends Object
```
Dynamic partition pruning optimization is performed based on the type and selectivity of the join operation. During query optimization, we insert a predicate on the partitioned table using the filter from the other side of the join and a custom wrapper called DynamicPruning.
The basic mechanism for DPP inserts a duplicated subquery with the filter from the other side, when the following conditions are met: (1) the table to prune is partitioned by the JOIN key (2) the join operation is one of the following types: INNER, LEFT SEMI (partitioned on left), LEFT OUTER (partitioned on right), or RIGHT OUTER (partitioned on left)
In order to enable partition pruning directly in broadcasts, we use a custom DynamicPruning clause that incorporates the In clause with the subquery and the benefit estimation. During query planning, when the join type is known, we use the following mechanism: (1) if the join is a broadcast hash join, we replace the duplicated subquery with the reused results of the broadcast, (2) else if the estimated benefit of partition pruning outweighs the overhead of running the subquery query twice, we keep the duplicated subquery (3) otherwise, we drop the subquery.

Constructor Summary

Constructors
Constructor and Description

PartitionPruning()

Constructors
Constructor and Description
`PartitionPruning()`

Method Summary

All Methods Static Methods Concrete Methods
Modifier and Type	Method and Description
`static org.apache.spark.sql.catalyst.plans.logical.LogicalPlan`	`apply(org.apache.spark.sql.catalyst.plans.logical.LogicalPlan plan)`
`static scala.Option<scala.Tuple2<org.apache.spark.sql.catalyst.expressions.Expression,org.apache.spark.sql.catalyst.plans.logical.LogicalPlan>>`	`findExpressionAndTrackLineageDown(org.apache.spark.sql.catalyst.expressions.Expression exp, org.apache.spark.sql.catalyst.plans.logical.LogicalPlan plan)`
`static scala.Option<org.apache.spark.sql.execution.datasources.LogicalRelation>`	`getPartitionTableScan(org.apache.spark.sql.catalyst.expressions.Expression a, org.apache.spark.sql.catalyst.plans.logical.LogicalPlan plan)` Search the partitioned table scan for a given partition column in a logical plan
`static void`	`org$apache$spark$internal$Logging$$log__$eq(org.slf4j.Logger x$1)`
`static org.slf4j.Logger`	`org$apache$spark$internal$Logging$$log_()`
`static String`	`ruleName()`

Methods inherited from class Object
equals, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait

Constructor Detail
- PartitionPruning
```
public PartitionPruning()
```

Method Detail

getPartitionTableScan

public static scala.Option<org.apache.spark.sql.execution.datasources.LogicalRelation> getPartitionTableScan(org.apache.spark.sql.catalyst.expressions.Expression a,
                                                                                                             org.apache.spark.sql.catalyst.plans.logical.LogicalPlan plan)

Search the partitioned table scan for a given partition column in a logical plan

Parameters:: a - (undocumented); plan - (undocumented)
Returns:: (undocumented)

apply

public static org.apache.spark.sql.catalyst.plans.logical.LogicalPlan apply(org.apache.spark.sql.catalyst.plans.logical.LogicalPlan plan)

ruleName
```
public static String ruleName()
```

org$apache$spark$internal$Logging$$log_

public static org.slf4j.Logger org$apache$spark$internal$Logging$$log_()

org$apache$spark$internal$Logging$$log__$eq

public static void org$apache$spark$internal$Logging$$log__$eq(org.slf4j.Logger x$1)

findExpressionAndTrackLineageDown

public static scala.Option<scala.Tuple2<org.apache.spark.sql.catalyst.expressions.Expression,org.apache.spark.sql.catalyst.plans.logical.LogicalPlan>> findExpressionAndTrackLineageDown(org.apache.spark.sql.catalyst.expressions.Expression exp,
                                                                                                                                                                                         org.apache.spark.sql.catalyst.plans.logical.LogicalPlan plan)

Class PartitionPruning

Constructor Summary

Method Summary

Methods inherited from class Object

Constructor Detail

PartitionPruning

Method Detail

getPartitionTableScan

apply

ruleName

org$apache$spark$internal$Logging$$log_

org$apache$spark$internal$Logging$$log__$eq

findExpressionAndTrackLineageDown