Chapter 4: Query Optimization Theory
This chapter develops the theoretical foundations for cost-based query optimization in unified systems. We formalize the query space as a complete lattice, establish equivalence-preserving transformations, build cost models spanning multiple paradigms, and develop selectivity estimation techniques. The chapter concludes with a category-theoretic perspective that reveals the deep structure underlying query optimization.
4.1 The Query Optimization Problem
Query optimization is the process of transforming a declarative query into an efficient execution plan. The declarative query specifies what data to retrieve; optimization determines how to retrieve it.
4.1.1 Search Space Explosion
Consider a simple join of five tables:
SELECT * FROM A, B, C, D, E
WHERE A.x = B.x AND B.y = C.y AND C.z = D.z AND D.w = E.w;
The number of possible join orderings is $\frac{(2(n-1))!}{(n-1)!}$ for $n$ relations, counting all ordered binary join trees; for $n = 5$ this is $8!/4! = 1680$.
For each ordering, multiple physical implementations exist (nested loop, hash join, merge join). With index choices, the search space grows exponentially.
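The standard count of ordered binary join trees over $n$ relations, $(2(n-1))!/(n-1)!$, can be checked with a short script (the `join_orderings` helper is our own name, not from the text):

```python
from math import factorial

def join_orderings(n: int) -> int:
    # Ordered binary join trees over n relations: (2(n-1))! / (n-1)!
    # (Catalan count of tree shapes times the n! leaf orderings).
    return factorial(2 * (n - 1)) // factorial(n - 1)

for n in range(2, 7):
    print(n, join_orderings(n))  # 2 -> 2, 3 -> 12, 4 -> 120, 5 -> 1680, 6 -> 30240
```

Even before physical operators and index choices multiply the space further, the logical orderings alone grow super-exponentially.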
4.1.2 Multi-Paradigm Complexity
In a unified system, optimization spans paradigms:
SELECT p.title, bm25_score(p.content) as relevance
FROM papers p
WHERE MATCH(p.content) AGAINST ('distributed systems')
AND vector_similarity(p.embedding, ?) > 0.8
AND p.author_id IN (
SELECT vertex_id FROM graph_traverse(:professor, 'ADVISED', 3)
)
ORDER BY relevance DESC
LIMIT 10;
This query involves:
- Full-text search (MATCH...AGAINST)
- Vector similarity (embedding comparison)
- Graph traversal (advisor relationships)
- Relational filtering (author constraint)
The optimizer must reason about posting list intersections, index selection across paradigms, and cost trade-offs between different execution strategies.
4.1.3 Optimization Objectives
A query optimizer seeks to minimize a cost function:
$C(P) = w_{IO} \cdot C_{IO}(P) + w_{CPU} \cdot C_{CPU}(P) + w_{mem} \cdot C_{mem}(P)$
where:
- $C_{IO}$ is I/O cost (disk reads/writes)
- $C_{CPU}$ is CPU cost (operations performed)
- $C_{mem}$ is memory cost (working memory required)
- $w_{IO}$, $w_{CPU}$, $w_{mem}$ are weights reflecting system characteristics
Different systems weight these differently: OLTP systems prioritize latency (minimize per-query cost), OLAP systems prioritize throughput (minimize cost per row), memory-constrained systems bound $C_{mem}$.
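A minimal sketch of the weighted objective; the weight values below are illustrative assumptions, not calibrated constants:

```python
def plan_cost(c_io: float, c_cpu: float, c_mem: float,
              w_io: float, w_cpu: float, w_mem: float) -> float:
    # C(P) = w_io*C_io + w_cpu*C_cpu + w_mem*C_mem
    return w_io * c_io + w_cpu * c_cpu + w_mem * c_mem

# Hypothetical plan: 1000 page reads, 50k CPU ops, 256 pages of working memory.
# An OLTP-style weighting penalizes I/O heavily; an OLAP-style one weights CPU more.
oltp = plan_cost(1000, 50_000, 256, w_io=1.0, w_cpu=0.001, w_mem=0.01)
olap = plan_cost(1000, 50_000, 256, w_io=0.1, w_cpu=0.01, w_mem=0.01)
print(oltp, olap)  # 1052.56 602.56
```

The same physical plan thus ranks differently under different weightings, which is why the weights are system configuration, not constants.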
4.2 Query Space as Complete Lattice
We formalize the space of query plans as a mathematical structure amenable to optimization algorithms.
4.2.1 Logical Query Plans
A logical query plan is a tree of relational operators: selection ($\sigma$), projection ($\pi$), join ($\bowtie$), union ($\cup$), aggregation ($\gamma$), plus paradigm-specific operators such as FTSSearch and GraphTraverse.
Each operator has:
- Input schema(s)
- Output schema
- Semantic meaning (what it computes)
Example logical plan:
Limit(10)
  Sort(relevance DESC)
    Project(title, relevance)
      Filter(vector_sim > 0.8)
        Join(author_id = vertex_id)
          FTSSearch(content, 'distributed systems')
            Scan(papers)
          GraphTraverse(professor, ADVISED, 3)
4.2.2 Physical Query Plans
A physical query plan specifies concrete implementations:
Each physical operator has:
- Implementation algorithm
- Cost characteristics
- Resource requirements
Example physical plan:
HeapTopK(10, relevance DESC)
  Project(title, relevance)
    HashJoin(author_id = vertex_id)
      PostingListIntersect
        WANDSearch(content, 'distributed systems', k=1000)
        HNSWSearch(embedding, query_vec, k=1000, threshold=0.8)
      BFSTraversal(professor, ADVISED, max_depth=3)
4.2.3 Plan Equivalence
Two plans are equivalent if they produce identical results for all database states: $P_1 \equiv P_2 \iff \forall D: \text{eval}(P_1, D) = \text{eval}(P_2, D)$.
Equivalence induces equivalence classes on the plan space.
4.2.4 Partial Ordering by Cost
Define a partial order $\preceq$ on plans by estimated cost: $P_1 \preceq P_2 \iff \text{cost}(P_1) \leq \text{cost}(P_2)$.
Within an equivalence class, the optimizer seeks the minimum element under $\preceq$.
4.2.5 Lattice Structure
Theorem 4.1 (Query Plan Lattice): The space of query plans for a given query, ordered by cost, forms a complete lattice.
Proof sketch:
- Bottom: The optimal plan (minimum cost)
- Top: The worst plan (maximum cost)
- Meet: Given plans $P_1, P_2$, their meet $P_1 \wedge P_2$ is the cheaper of any common refinement
- Join: The join $P_1 \vee P_2$ is the more expensive plan from which both are reachable by cost-reducing transformations
The lattice structure guarantees that local search algorithms (like dynamic programming) can find global optima under appropriate conditions.
4.2.6 Plan Space Navigation
Optimization algorithms navigate the lattice:
Dynamic Programming (DPccp): Bottom-up construction of optimal subplans, guaranteed to find global optimum for join ordering.
Transformation-Based: Apply equivalence rules to transform plans, hill-climbing toward lower cost.
Randomized: Simulated annealing or genetic algorithms for large search spaces.
4.3 Equivalence-Preserving Transformations
Transformations convert one plan to an equivalent plan, enabling search through the plan space.
4.3.1 Selection Pushdown
Rule: Push selections through projections when possible: $\sigma_p(\pi_A(R)) \equiv \pi_A(\sigma_p(R))$, valid when $p$ references only attributes in $A$.
Benefit: Filtering early reduces intermediate result sizes.
Example:
Before: Project(name, price) -> Filter(price > 100) -> Scan(products)
After: Project(name, price) -> Scan(products) with pushed filter price > 100
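The rule is mechanical enough to sketch as a tiny rewriter. Below, plans are encoded as nested tuples and only the single rule $\sigma_p(\pi_A(R)) \equiv \pi_A(\sigma_p(R))$ is applied; the encoding and the `push_selection` helper are our own illustration, not the book's optimizer:

```python
# Plans as nested tuples:
#   ("Scan", table)
#   ("Filter", predicate, child)
#   ("Project", columns, child)

def push_selection(plan):
    """Rewrite Filter(Project(child)) -> Project(Filter(child)).
    Valid when the predicate references only projected columns, which holds
    trivially here because the filter originally sat above the projection."""
    if plan[0] == "Filter" and plan[2][0] == "Project":
        _, pred, (_tag, cols, child) = plan
        return ("Project", cols, ("Filter", pred, child))
    return plan  # rule does not apply; leave the plan unchanged

before = ("Filter", "price > 100",
          ("Project", ["name", "price"], ("Scan", "products")))
after = push_selection(before)
print(after)
```

A production optimizer applies a whole library of such rules to a fixpoint; this shows the shape of one rule application.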
4.3.2 Selection Split and Merge
Rule: Conjunctive predicates can split or merge: $\sigma_{p_1 \wedge p_2}(R) \equiv \sigma_{p_1}(\sigma_{p_2}(R))$.
Benefit: Enables independent optimization of each predicate.
4.3.3 Selection Commutativity
Rule: Independent selections commute: $\sigma_{p_1}(\sigma_{p_2}(R)) \equiv \sigma_{p_2}(\sigma_{p_1}(R))$.
Benefit: Enables ordering by selectivity (most selective first).
4.3.4 Join Commutativity
Rule: Joins are commutative: $R \bowtie S \equiv S \bowtie R$.
Benefit: Choose build/probe sides for hash join based on size.
4.3.5 Join Associativity
Rule: Joins are associative: $(R \bowtie S) \bowtie T \equiv R \bowtie (S \bowtie T)$.
Benefit: Enables exploring all join orderings.
4.3.6 Selection-Join Exchange
Rule: Push selection through join: $\sigma_p(R \bowtie S) \equiv \sigma_p(R) \bowtie S$ when $p$ references only attributes of $R$.
Benefit: Reduces join input sizes.
4.3.7 Projection Pushdown
Rule: Push projection through join: $\pi_A(R \bowtie S) \equiv \pi_A(\pi_{A_R \cup J}(R) \bowtie \pi_{A_S \cup J}(S))$
where $J$ = join attributes, $A_R = A \cap \text{attrs}(R)$, $A_S = A \cap \text{attrs}(S)$.
Benefit: Reduces data width in intermediate results.
4.3.8 Cross-Paradigm Transformations
Unified systems enable novel transformations:
FTS-Filter Interchange: $\sigma_p(\text{FTS}_q(R)) \equiv \text{FTS}_q(\sigma_p(R))$ — a relational filter may be applied before or after full-text matching.
Vector-Filter Interchange: $\sigma_p(\text{kNN}_k(R))$ versus $\text{kNN}_{k'}(\sigma_p(R))$.
Note: Vector search may require $k' > k$ to account for filtered-out results.
Graph-Relational Interchange: $\sigma_p(\text{Traverse}(v, e, d)) \equiv \text{Traverse}_p(v, e, d)$
when filtering can push into the traversal.
4.4 Cost Model Fundamentals
Cost models estimate execution cost without actually running queries.
4.4.1 I/O Cost Model
Sequential Read Cost: $C_{seq} = N_{pages} \cdot c_{seq}$
where $N_{pages}$ = pages read, $c_{seq}$ = cost per sequential page read.
Random Read Cost: $C_{rand} = N_{pages} \cdot c_{rand}$
where $c_{rand} \gg c_{seq}$ (typically 10-100x for HDD, 2-10x for SSD).
Index Scan Cost: $C_{idx} = c_{lookup} + N_{match} \cdot c_{rand}$
Index lookup plus data page fetches for matching rows.
Sequential Scan Cost: $C_{scan} = N_{pages}(T) \cdot c_{seq}$
4.4.2 CPU Cost Model
Filter Cost: $C_{filter} = N_{rows} \cdot c_{pred}$
where $c_{pred}$ depends on predicate complexity.
Hash Table Build Cost: $C_{build} = N_{build} \cdot c_{hash}$
Hash Probe Cost: $C_{probe} = N_{probe} \cdot c_{hash}$
Sort Cost: $C_{sort} = N \log N \cdot c_{cmp}$
4.4.3 Memory Cost Model
Hash Join Memory: $M_{HJ} = |R_{build}| \cdot (\text{tuple size} + \text{hash overhead})$
If $M_{HJ} > M_{avail}$, spill to disk.
Sort Memory: $M_{sort} = \min(|R| \cdot \text{tuple size}, M_{avail})$
External sort required when input exceeds memory.
4.4.4 Join Cost Models
In the following, $B(\cdot)$ denotes pages and $|\cdot|$ denotes rows.
Nested Loop Join: $C_{NLJ} = B(R) + |R| \cdot B(S)$
Hash Join: $C_{HJ} = B(R) + B(S)$ (one pass, build side fits in memory)
Merge Join (sorted inputs): $C_{MJ} = B(R) + B(S)$
Decision Criterion:
- Use NLJ when $|R|$ is small or the inner side is indexed
- Use HJ when memory is sufficient for the build side
- Use MJ when inputs are pre-sorted or the sort cost is amortized
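The decision criterion can be sketched as a cost comparison over simplified page-based formulas. The 4-reads-per-probe and 3x sort factors below are rough assumptions for illustration, not measured constants:

```python
def pick_join(r_pages, r_rows, s_pages, inner_indexed=False, presorted=False):
    """Return the name of the cheapest join algorithm under a toy cost model."""
    costs = {"hash": r_pages + s_pages}  # one pass; build side fits in memory
    if inner_indexed:
        # ~4 random page reads per outer-row probe (assumed B-tree height)
        costs["indexed_nested_loop"] = r_pages + r_rows * 4
    else:
        # block nested loop: rescan S once per page of R
        costs["nested_loop"] = r_pages + r_pages * s_pages
    # crude 3x penalty when both inputs must first be sorted
    costs["merge"] = (r_pages + s_pages) * (1 if presorted else 3)
    return min(costs, key=costs.get)

print(pick_join(r_pages=100, r_rows=10_000, s_pages=500))               # hash
print(pick_join(r_pages=1, r_rows=10, s_pages=500, inner_indexed=True)) # indexed_nested_loop
```

With a large unsorted probe side the hash join wins; with a tiny outer and an index on the inner, indexed nested loop wins, matching the criterion above.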
4.4.5 Posting List Operation Costs
Intersection Cost: $C_{\cap} = O(|L_1| + |L_2|)$ for a linear merge of sorted posting lists.
With skip pointers, seeks skip past non-matching regions, approaching $O(\min(|L_1|, |L_2|))$ when list sizes are skewed.
Union Cost: $C_{\cup} = O(|L_1| + |L_2|)$
Linear merge of sorted lists.
FTS Scoring Cost: $C_{score} = N_{dt} \cdot c_{score}$
Score computation for each of the $N_{dt}$ document-term pairs.
4.4.6 Vector Search Cost
HNSW Search Cost: $C_{HNSW} = O(ef \cdot \log N \cdot c_{dist})$
where $ef$ = search width, $N$ = index size, $c_{dist}$ = distance computation cost.
Brute Force Cost: $C_{BF} = O(N \cdot d)$
where $d$ = vector dimension.
4.4.7 Graph Traversal Cost
BFS Cost: $C_{BFS} = O\left(\sum_{i=1}^{d} \bar{b}^{\,i}\right)$
where $d$ = depth, $\bar{b}$ = average degree.
With Filter: $C = O\left(\sum_{i=1}^{d} (s \cdot \bar{b})^{i}\right)$
Filter selectivity $s$ reduces the effective degree at each hop.
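A back-of-envelope evaluation of the hop-by-hop expansion model (the `bfs_cost` helper is our own; real traversals also deduplicate revisited vertices, which this ignores):

```python
def bfs_cost(avg_degree, depth, sel=1.0, graph_size=None):
    """Vertices touched by a depth-d expansion from one start vertex:
    1 + sum_{i=1..d} (sel * avg_degree)^i, optionally capped by |V|."""
    total = frontier = 1.0
    for _ in range(depth):
        frontier *= sel * avg_degree   # expected next-frontier size
        total += frontier
    return min(total, graph_size) if graph_size is not None else total

print(bfs_cost(10, 3))           # 1 + 10 + 100 + 1000 = 1111.0
print(bfs_cost(10, 3, sel=0.5))  # 1 + 5 + 25 + 125 = 156.0
```

Halving the effective degree via a per-hop filter shrinks the 3-hop cost by roughly an order of magnitude, which is why pushing predicates into traversals matters.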
4.5 Cardinality Estimation
Accurate cardinality estimation is critical for cost-based optimization.
4.5.1 Base Table Statistics
For each table $T$, maintain:
- $|T|$: Row count
- $V(T, a)$: Distinct values for attribute $a$
- $\min(a)$, $\max(a)$: Value range
- $H(a)$: Value histogram
4.5.2 Single Predicate Selectivity
Equality: $sel(a = v) = \frac{1}{V(T, a)}$
Range: $sel(a < v) = \frac{v - \min(a)}{\max(a) - \min(a)}$
With histogram: Use histogram bucket frequencies for more accurate estimation.
4.5.3 Compound Predicate Selectivity
Independence Assumption: $sel(p_1 \wedge p_2) = sel(p_1) \cdot sel(p_2)$
With Correlation: $sel(p_1 \wedge p_2) = sel(p_1) \cdot sel(p_2) \cdot \rho$
where $\rho$ captures dependency (1 = independent, >1 = positive correlation, <1 = negative correlation).
4.5.4 Join Cardinality
Foreign Key Join: $|R \bowtie S| = |R|$ when $R$ holds a foreign key into $S$.
Each row in $R$ matches exactly one row in $S$.
General Equijoin: $|R \bowtie_{a=b} S| = \frac{|R| \cdot |S|}{\max(V(R, a), V(S, b))}$
Assuming uniform distribution.
With Statistics: Refine the estimate per histogram bucket, $|R \bowtie S| \approx \sum_{b} \frac{f_R(b) \cdot f_S(b)}{\max(V_R(b), V_S(b))}$, summing over aligned buckets $b$.
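The uniform-distribution equijoin estimate is one line of arithmetic; a sketch (helper name is ours):

```python
def equijoin_card(r_rows, s_rows, v_r, v_s):
    """|R join_{a=b} S| ~= |R|*|S| / max(V(R,a), V(S,b)), assuming uniform
    value distributions and containment of the smaller domain in the larger."""
    return r_rows * s_rows / max(v_r, v_s)

# Foreign-key case: 1M orders, 50k customers, join on customer_id.
# Each order matches exactly one customer, so the estimate collapses to |orders|.
print(equijoin_card(r_rows=1_000_000, s_rows=50_000, v_r=50_000, v_s=50_000))  # 1000000.0
```

Note how the general formula subsumes the foreign-key case: when the join key is the primary key of $S$, $V(S, b) = |S|$ and the estimate reduces to $|R|$.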
4.5.5 FTS Cardinality
Term Query: $|q_t| = df(t)$
Document frequency from the inverted index.
Conjunction: $|q_{t_1 \wedge t_2}| = N \cdot \frac{df(t_1)}{N} \cdot \frac{df(t_2)}{N}$
Independence assumption.
With Co-occurrence Statistics: $|q_{t_1 \wedge t_2}| = cooc(t_1, t_2)$
Pre-computed co-occurrence counts.
4.5.6 Vector Search Cardinality
k-NN Search: $|result| = k$
By definition.
Threshold Search: $|result| = N \cdot P(sim(x, q) > \tau)$
Requires a distribution model for similarity scores.
4.5.7 Graph Traversal Cardinality
k-Hop Traversal: $|reach_k(v)| \approx \min(\bar{b}^{\,k}, |V|)$
Exponential growth bounded by graph size.
With Selectivity: $|reach_k(v)| \approx \min((s \cdot \bar{b})^{k}, |V|)$
Filter selectivity $s$ applies at each hop.
4.6 Selectivity Estimation Techniques
4.6.1 Histograms
Equi-width Histogram: Divide value range into equal-width buckets.
Equi-depth Histogram: Each bucket contains equal number of values.
Better for skewed distributions.
Compressed Histogram: Store frequent values separately, histogram for remainder.
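An equi-depth histogram and the range-selectivity estimate it supports can be sketched in a few lines; the two helpers below are illustrative names, not a library API:

```python
import bisect

def build_equidepth(values, n_buckets):
    """Bucket boundaries for an equi-depth histogram: each bucket holds
    roughly len(values)/n_buckets rows."""
    vals = sorted(values)
    return [vals[min(round(i * len(vals) / n_buckets), len(vals) - 1)]
            for i in range(n_buckets + 1)]

def sel_less_than(bounds, v):
    """Estimate sel(a < v): each bucket carries 1/n_buckets of the rows;
    interpolate linearly inside the bucket that contains v."""
    n = len(bounds) - 1
    if v <= bounds[0]:
        return 0.0
    if v >= bounds[-1]:
        return 1.0
    i = bisect.bisect_right(bounds, v) - 1
    width = (bounds[i + 1] - bounds[i]) or 1  # guard duplicate boundaries
    return (i + (v - bounds[i]) / width) / n

bounds = build_equidepth(range(100), 4)
print(bounds, sel_less_than(bounds, 30))
```

Because each bucket covers an equal row count rather than an equal value span, the same code stays accurate on skewed data where an equi-width histogram would not.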
4.6.2 Sampling
Random Sampling: Estimate selectivity from a sample: $\hat{s} = \frac{|\{t \in \text{sample} : p(t)\}|}{|\text{sample}|}$
Confidence Interval: $\hat{s} \pm z_{\alpha/2}\sqrt{\frac{\hat{s}(1 - \hat{s})}{n}}$ for sample size $n$.
4.6.3 Sketches
Count-Min Sketch: Estimate frequency of items.
Space: $O\left(\frac{1}{\epsilon}\log\frac{1}{\delta}\right)$ for $\epsilon$-approximation with probability $1 - \delta$.
HyperLogLog: Estimate cardinality (distinct count).
Space: $O\left(\frac{1}{\epsilon^2}\log\log N\right)$ for relative error $\epsilon$.
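A minimal Count-Min sketch, as a sketch of the idea rather than a production implementation (it uses Python's built-in `hash` with per-row salts; real systems use dedicated hash families):

```python
import random

class CountMin:
    """d rows of w counters; a point query takes the minimum over rows.
    The estimate never underestimates, and overestimates by at most
    eps*N with probability 1-delta for w = ceil(e/eps), d = ceil(ln(1/delta))."""
    def __init__(self, width, depth, seed=42):
        rng = random.Random(seed)
        self.width, self.depth = width, depth
        self.salts = [rng.getrandbits(64) for _ in range(depth)]
        self.rows = [[0] * width for _ in range(depth)]

    def _idx(self, item, r):
        return hash((self.salts[r], item)) % self.width

    def add(self, item, count=1):
        for r in range(self.depth):
            self.rows[r][self._idx(item, r)] += count

    def query(self, item):
        return min(self.rows[r][self._idx(item, r)] for r in range(self.depth))

cm = CountMin(width=1024, depth=4)
for term, n in [("db", 100), ("ai", 7)]:
    cm.add(term, n)
print(cm.query("db"))  # >= 100: never underestimates
```

An optimizer can feed such estimates into the selectivity formulas of Section 4.5 without storing exact per-value frequencies.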
4.6.4 Machine Learning Approaches
Query-Driven Learning: Train models on query workload.
Features include: predicate type, column statistics, query structure.
Learned Cardinality Estimation: Neural networks trained on actual cardinalities.
4.7 The DPccp Algorithm
Dynamic Programming for Connected Subgraph Complement Pairs (DPccp) is the standard algorithm for optimal join ordering.
4.7.1 Problem Formulation
Given relations $R_1, \dots, R_n$ and join graph $G = (V, E)$ where vertices are relations and edges are join predicates:
Goal: Find minimum-cost join tree.
4.7.2 Algorithm
Algorithm: DPccp(R_1, ..., R_n, G)
  // Initialize single relations
  for i in 1..n:
    opt[{R_i}] = AccessPath(R_i)
  // Enumerate connected subgraph complement pairs
  for S in subsets({R_1, ..., R_n}) ordered by size:
    for (S_1, S_2) in ccp_pairs(S, G):
      for each join algorithm J:
        cost = C(J, opt[S_1], opt[S_2])
        if cost < opt[S].cost:
          opt[S] = (J, opt[S_1], opt[S_2], cost)
  return opt[{R_1, ..., R_n}]
ccp_pairs(S, G): Enumerate pairs $(S_1, S_2)$ where $S_1 \cup S_2 = S$, $S_1 \cap S_2 = \emptyset$, and both $S_1$ and $S_2$ are connected in $G$.
4.7.3 Complexity
- Subsets: $O(2^n)$
- CCP pairs: from $O(n^3)$ in total for chain join graphs up to $O(3^n)$ for cliques
- Total: $O(3^n)$ in the worst case
For large $n$, heuristics or randomized algorithms become necessary.
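The dynamic-programming core can be sketched compactly. The version below is the simpler DPsub variant (it enumerates all subset splits rather than only connected csg-cmp pairs, and uses sum-of-intermediate-sizes as the cost), so it illustrates the recurrence rather than DPccp's pruned enumeration:

```python
from itertools import combinations

def dp_join_order(card, sel):
    """card: {relation: row count}; sel: {frozenset({a, b}): join selectivity}.
    Returns (cost, join tree) minimizing the sum of intermediate result sizes.
    Missing selectivities default to 1.0 (cross product)."""
    rels = list(card)

    def out_size(s):  # estimated rows after joining the set s
        rows = 1.0
        for r in s:
            rows *= card[r]
        for a, b in combinations(s, 2):
            rows *= sel.get(frozenset({a, b}), 1.0)
        return rows

    best = {frozenset({r}): (0.0, r) for r in rels}
    for size in range(2, len(rels) + 1):
        for s in map(frozenset, combinations(rels, size)):
            for k in range(1, size // 2 + 1):
                for left in map(frozenset, combinations(sorted(s), k)):
                    right = s - left
                    cost = best[left][0] + best[right][0] + out_size(s)
                    if s not in best or cost < best[s][0]:
                        best[s] = (cost, (best[left][1], best[right][1]))
    return best[frozenset(rels)]

cost, tree = dp_join_order(
    {"A": 1000, "B": 10, "C": 100},
    {frozenset({"A", "B"}): 0.01, frozenset({"B", "C"}): 0.1},
)
print(cost, tree)
```

On this toy instance the optimum joins the two connected small relations first and avoids the A-C cross product, exactly the behavior the subset DP is designed to find.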
4.7.4 Extensions for Unified Queries
DPccp extends to multi-paradigm queries by:
- Treating FTS/Vector/Graph operations as "virtual relations"
- Modeling posting list intersections as joins
- Including paradigm-specific cost models
4.8 Category-Theoretic Perspective
For readers with category theory background, we sketch the categorical view of query optimization.
4.8.1 Category of Queries
Define category :
- Objects: Query plans (logical or physical)
- Morphisms: Equivalence-preserving transformations
This forms a groupoid (every morphism is invertible) since transformations preserve equivalence.
4.8.2 Cost Functor
The cost function defines a functor: $C : \mathcal{Q} \to (\mathbb{R}_{\geq 0}, \leq)$
mapping query plans to their estimated costs.
Functoriality: Transformations that preserve equivalence should have predictable cost effects: $C(t(P)) = C(P) + \Delta_t$
where $\Delta_t$ is the cost delta of transformation $t$.
4.8.3 Natural Transformations as Optimizations
An optimization strategy is a natural transformation: $\eta : \mathrm{Id}_{\mathcal{Q}} \Rightarrow \mathrm{Opt}$
where $\mathrm{Opt} : \mathcal{Q} \to \mathcal{Q}$ is the "optimized plan" functor.
Naturality ensures optimization is consistent across equivalent query formulations.
4.8.4 Adjunctions in Query Processing
Free-Forgetful Adjunction: $F \dashv U$
- $F$ (free functor): Compile SQL to logical plan
- $U$ (forgetful functor): Extract SQL semantics from plan
The adjunction captures that plans are "free algebras" over SQL expressions.
Embedding-Projection Adjunction: $E \dashv P$
- $E$ (embed): Logical plan as abstract physical plan
- $P$ (project): Physical plan's logical semantics
This adjunction formalizes the relationship between logical and physical planning.
4.8.5 Monad of Query Execution
Query execution forms a monad $(T, \eta, \mu)$:
with:
- $\eta : P \to T(P)$ (plan becomes executable)
- $\mu : T(T(P)) \to T(P)$ (flatten nested execution)
The monad laws ensure consistent execution semantics.
4.9 Practical Optimization Architecture
4.9.1 Two-Phase Optimization
Phase 1: Logical Optimization
- Apply transformation rules
- Generate alternative logical plans
- Prune obviously suboptimal plans
Phase 2: Physical Optimization
- Select physical operators
- Choose access paths
- Determine join algorithms
4.9.2 Rule-Based Optimization
Apply transformation rules in priority order:
- Predicate simplification: Constant folding, contradiction detection
- Predicate pushdown: Move filters toward data sources
- Projection pruning: Remove unused columns
- Join elimination: Remove unnecessary joins (e.g., to unique key)
- Subquery unnesting: Convert correlated subqueries to joins
4.9.3 Cost-Based Optimization
After rule-based transformations:
- Enumerate join orderings (DPccp or heuristic)
- For each ordering, select physical operators
- Estimate cost of each plan
- Select minimum-cost plan
4.9.4 Adaptive Optimization
Modern optimizers adapt during execution:
Adaptive Join Selection: Switch join algorithm based on runtime cardinalities.
Adaptive Parallelism: Adjust parallelism based on observed throughput.
Re-optimization: Re-plan the query mid-execution if cardinality estimates prove badly wrong.
4.10 Cross-Paradigm Optimization
4.10.1 Unified Cost Model
Cross-paradigm optimization requires a unified cost model: $C_{total} = C_{rel} + C_{FTS} + C_{vec} + C_{graph} + C_{int}$
The integration cost $C_{int}$ captures posting list operations that combine paradigm results.
4.10.2 Paradigm Selection
Given a predicate expressible in multiple paradigms, choose the cheapest: $\text{paradigm}^{*} = \arg\min_i C_i(\text{predicate})$
Example: Find documents where category = 'electronics'
Options:
- Relational: Secondary index scan
- FTS: Term query on category field
- Graph: Label-based vertex filter
Cost comparison: estimate $C_{rel}$, $C_{FTS}$, and $C_{graph}$ for the predicate and select the minimum.
4.10.3 Intersection Ordering
For conjunctive queries across paradigms:
WHERE MATCH(content) AGAINST ('database') -- FTS
AND category = 'tech' -- Relational
AND vector_sim(embedding, ?) > 0.8 -- Vector
Order intersections by:
- Selectivity (most selective first)
- Evaluation cost (cheapest first, tie-breaker)
If FTS selectivity = 0.01, relational selectivity = 0.1, vector selectivity = 0.05:
Order: FTS (0.01) -> Vector (0.05) -> Relational (0.1)
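The ordering rule reduces to a sort key; a sketch with the selectivities from the example above and assumed per-row evaluation costs:

```python
def order_predicates(preds):
    """Order conjunctive predicates by selectivity ascending (most selective
    first), breaking ties by per-row evaluation cost.
    preds: list of (name, selectivity, eval_cost)."""
    return sorted(preds, key=lambda p: (p[1], p[2]))

plan = order_predicates([
    ("relational: category = 'tech'", 0.10, 1.0),
    ("fts: MATCH 'database'",         0.01, 2.0),
    ("vector: sim > 0.8",             0.05, 8.0),
])
print([name for name, _, _ in plan])  # fts first, then vector, then relational
```

Richer models weight selectivity against evaluation cost (e.g. rank by cost/(1 - selectivity)) rather than using cost only as a tie-breaker; the pure-selectivity ordering here matches the text's example.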
4.10.4 Materialization Points
Decide where to materialize intermediate results:
Eager Materialization: Materialize after each paradigm operation.
- Pro: Simple, predictable memory
- Con: May materialize large intermediate results
Lazy Materialization: Defer materialization until necessary.
- Pro: Avoids unnecessary work
- Con: Complex planning, potential repeated evaluation
Hybrid: Materialize based on estimated intermediate sizes.
4.11 Summary
This chapter established the theoretical foundations for query optimization:
- Query space as lattice formalizes the search space with partial ordering by cost, enabling systematic exploration toward optimal plans
- Equivalence-preserving transformations include classical rules (selection pushdown, join reordering) extended with cross-paradigm transformations for unified systems
- Cost models span I/O, CPU, and memory costs for relational, FTS, vector, and graph operations, enabling unified cost estimation
- Selectivity estimation uses histograms, sampling, and sketches for cardinality prediction, critical for accurate cost estimation
- The DPccp algorithm provides optimal join ordering through dynamic programming over connected subgraph complement pairs
- The category-theoretic perspective reveals query optimization as navigation through a category of plans with cost as a functor and optimization strategies as natural transformations
- Cross-paradigm optimization requires unified cost models, paradigm selection, intersection ordering, and materialization decisions
The following chapters (Part II) examine the storage engine that underlies all these operations, starting with LSM-tree architecture in Chapter 5.