Merge Sort

ABSTRACT

Merge Sort is a comparison-based sorting algorithm that uses the Divide and Conquer strategy to achieve a guaranteed $O (n lo g n)$ time complexity. It relies on the fact that merging two sorted lists is significantly more efficient ( $O (n)$ ) than sorting an unsorted one from scratch.

The Strategy

Divide: Split the unsorted list into two sub-lists of roughly $\frac{n}{2}$ size.
Recursively Sort: Call MergeSort on both sub-lists until base cases are reached.
Conquer (Merge): Combine the two sorted sub-lists into one sorted result using the RMerge helper.

Formal Proof of Correctness

Part 1: The Merge Helper (`RMerge`)

We prove the helper function using Regular Induction because the total number of elements ( $k + l$ ) decreases by exactly 1 in each recursive call.

Base Case: If both lists are empty ( $n = 0$ ), it returns an empty list, which is sorted.
Inductive Hypothesis: Assume RMerge correctly merges any two sorted lists with combined size $n - 1$ .
Inductive Step: For a combined size $n$ , we compare the heads ( $a_{1}, b_{1}$ ). The smaller element is prepended to the result of RMerge called on the remaining $n - 1$ elements. By the hypothesis, the sub-call is correct; thus, the final list is sorted.

Part 2: The Main Algorithm

We prove MergeSort using Strong Induction because each subsequent call halves the input size ( $\frac{n}{2} < n - 1$ ).

Base Case:
- $n = 0$ : Returns an empty list (trivially true).
- $n = 1$ : Returns $a_{1}$ , a trivially sorted list containing all elements.
Inductive Hypothesis: Assume MergeSort correctly sorts all lists with $k$ elements for any $0 \leq k < n$ , where $n > 1$ .
Inductive Step:
1. Divide the list of size $n$ into two halves of size $m = ⌊ n /2 ⌋$ and $n - m$ .
2. By the Strong Inductive Hypothesis, since both halves have size $< n$ , the recursive calls $L_{1} = M er g e S or t (Left)$ and $L_{2} = M er g e S or t (Right)$ return correctly sorted lists.
3. By the correctness of RMerge, $RM er g e (L_{1}, L_{2})$ results in a sorted list of all $n$ elements.

Time Analysis

1. Recurrence Extraction

Let $T (n)$ be the runtime of MergeSort on a list of size $n$ .

T (0) T (1) T (n) = c_{0} = c_{1} = 2 T (n /2) + T_{m er g e} (n)

Since $T_{m er g e} (n) = O (n)$ , the expression is:

T (n) = 2 T (n /2) + O (n)

2. Method A: Master Theorem

Comparing to $T (n) = a T (n / b) + O (n^{d})$ :

$a = 2$ (number of recursive calls)
$b = 2$ (factor by which size is reduced)
$d = 1$ (exponent of non-recursive work $O (n^{1})$ )

Comparison: $a = 2$ and $b^{d} = 2^{1} = 2$ .

Since $a = b^{d}$ , we use Case 2:

T (n) = O (n lo g n)

3. Method B: Unraveling (Iteration)

We substitute the recurrence into itself to find the pattern:

T (n) = 2 T (n /2) + c n = 2 [2 T (n / 2^{2}) + c (n /2)] + c n = 2^{2} T (n / 2^{2}) + 2 c n = 2^{2} [2 T (n / 2^{3}) + c (n / 2^{2})] + 2 c n = 2^{3} T (n / 2^{3}) + 3 c n \dots = 2^{k} T (n / 2^{k}) + k c n

To reach the base case $T (1)$ , we set $n / 2^{k} = 1 ⟹ k = lo g_{2} n$ :

T (n) = n T (1) + (lo g_{2} n) c n = n c_{1} + c n lo g_{2} n = O (n lo g n)

Fast Multiplication – Another application of D&C.
Master Theorem – Deep dive into the cases used here.
Recursive Proofs – General framework for Strong vs. Regular Induction.

Jason's Notebook

Explorer

Merge Sort

The Strategy

Formal Proof of Correctness

Part 1: The Merge Helper (`RMerge`)

Part 2: The Main Algorithm

Time Analysis

1. Recurrence Extraction

2. Method A: Master Theorem

3. Method B: Unraveling (Iteration)

Graph View

Table of Contents

Backlinks

Jason's Notebook

Explorer

Merge Sort

The Strategy

Formal Proof of Correctness

Part 1: The Merge Helper (RMerge)

Part 2: The Main Algorithm

Time Analysis

1. Recurrence Extraction

2. Method A: Master Theorem

3. Method B: Unraveling (Iteration)

Related Notes

Graph View

Table of Contents

Backlinks

Part 1: The Merge Helper (`RMerge`)