Greedy Technique and Prim's Algorithm

Greedy Technique

The greedy technique solves an optimization problem by building a solution iteratively. At each iteration it selects the choice that looks best at that moment (the locally optimal choice). Because each step needs the best remaining candidate, greedy algorithms often use a priority queue.

Consider making change with the minimal number of coins. Almost everyone uses a greedy approach: first return the largest-denomination coin that does not exceed the amount left to return, then repeat. What items does the priority queue contain?

The greedy technique works for our denominations of coins: quarters, dimes, nickels and pennies. It does not always work. Consider a set of denominations that includes 7c, 5c and 1c. Make change for 10c using the greedy technique.
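The coin example can be tried out with a short Python sketch. The function name greedy_change and the list-based representation of denominations are assumptions made for illustration. For 10c with 7c, 5c and 1c coins the greedy rule returns four coins, while 5c + 5c uses only two.

```python
def greedy_change(amount, denominations):
    """Return the coins chosen by the greedy rule, largest coin first."""
    coins = []
    for coin in sorted(denominations, reverse=True):
        # Keep returning this coin while it does not exceed the amount left.
        while amount >= coin:
            coins.append(coin)
            amount -= coin
    if amount != 0:
        raise ValueError("amount cannot be made with these denominations")
    return coins


print(greedy_change(65, [25, 10, 5, 1]))  # [25, 25, 10, 5]
print(greedy_change(10, [7, 5, 1]))       # [7, 1, 1, 1]
```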

Prim's Algorithm

Prim's Algorithm constructs a minimal spanning tree (MST) of a connected graph or of a connected component.

Minimal Spanning Tree

A minimal spanning tree of a weighted graph is a spanning tree that has the minimal sum of edge weights.

Prim's Algorithm solves the minimal spanning tree problem using the greedy technique. It grows the spanning tree by repeatedly adding the minimal-weight edge that joins a vertex in the tree to a vertex not already in the tree.

Algorithm Prim(G)

    V_T ← {v₀}

    E_T ← {} // empty set

    for i ← 1 to |V| - 1 do

        find the minimum weight edge, e* = (v*, u*), such that v* is in V_T and u* is in V - V_T

        V_T ← V_T union {u*}

        E_T ← E_T union {e*}

    return E_T
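The pseudocode above can be sketched directly in Python. This is a minimal version that scans every candidate edge on each iteration, not an efficient implementation. The dict-of-dicts graph representation, the function name prim, and the small example graph are assumptions made for illustration. The loop runs |V| - 1 times because v₀ is already in the tree.

```python
def prim(graph, start):
    """Return the MST edges of a connected, undirected weighted graph.

    graph: dict mapping each vertex to a dict {neighbor: weight}
           (an assumed representation, chosen for this sketch).
    """
    in_tree = {start}   # V_T
    tree_edges = []     # E_T
    for _ in range(len(graph) - 1):
        # Scan every edge (v, u) with v in V_T and u in V - V_T.
        best = None
        for v in in_tree:
            for u, w in graph[v].items():
                if u not in in_tree and (best is None or w < best[0]):
                    best = (w, v, u)
        if best is None:
            raise ValueError("graph is not connected")
        w, v, u = best
        in_tree.add(u)
        tree_edges.append((v, u, w))
    return tree_edges


# Hypothetical example graph with distinct edge weights.
example = {
    "a": {"b": 3, "c": 1},
    "b": {"a": 3, "c": 2, "d": 4},
    "c": {"a": 1, "b": 2, "d": 5},
    "d": {"b": 4, "c": 5},
}
mst = prim(example, "a")
print(mst)                        # [('a', 'c', 1), ('c', 'b', 2), ('b', 'd', 4)]
print(sum(w for _, _, w in mst))  # 7
```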

Illustrate the algorithm

Proof of correctness

Greedy algorithms are typically easy to write. Proving that they construct an optimal solution can be difficult.

We prove Prim's algorithm is correct by induction on the growing tree constructed by the algorithm. Let T_i denote the tree after i edges have been added. We assume that T_{i-1} is part of a minimal spanning tree of the graph and prove that the tree T_i constructed by Prim's algorithm must also be part of a minimal spanning tree. Then T_{n-1}, which contains all n = |V| vertices, is a minimal spanning tree of the whole graph.

T₀ is a single vertex, so it must be part of a minimal spanning tree.

We assume that T_{i-1} is part of a minimal spanning tree. Let T be a minimal spanning tree such that T_{i-1} is part of T. We prove by contradiction that T_i is part of a minimal spanning tree.

Let e_i = (v, u) be the edge found by Prim's algorithm, so that T_i = T_{i-1} + e_i, and assume that T_i is not part of any minimal spanning tree. Then e_i cannot be an edge of T.

Because T is a spanning tree of the whole graph and e_i is not in T, the graph T + e_i has exactly one cycle, and e_i is on it. The edge e_i connects a vertex in T_{i-1} to a vertex not in T_{i-1}, so the cycle must return through another edge (v', u') of T with v' in T_{i-1} and u' not in T_{i-1}. If we delete the edge (v', u') from T + e_i, the remaining graph, T + e_i - (v', u'), is another spanning tree. It still contains T_{i-1}, because (v', u') has an endpoint outside T_{i-1} and so is not an edge of T_{i-1}.

Both (v', u') and e_i connect a vertex in T_{i-1} to a vertex outside it, and e_i is the edge chosen by Prim's algorithm, so the weight of e_i must be less than or equal to the weight of (v', u'). So the total weight of T + e_i - (v', u') is less than or equal to the total weight of T. Therefore T + e_i - (v', u') is a minimal spanning tree of the whole graph that contains T_i. This contradicts the assumption that T_i is not part of any minimal spanning tree. So Prim's algorithm constructs a minimal spanning tree.

Cost

The set V - V_T is kept in a priority queue. The cost depends on the implementation of the priority queue.

Below is another version of Prim's algorithm with an explicit priority queue.

Algorithm Prim(G)

    V_T ← {v₀}

    E_T ← {} // empty set

    make priority queue of vertices, Q // Q is V - V_T

    initialize the key of each vertex with the weight of its minimal edge to v₀

    if the vertex is not adjacent to v₀ then its key is ∞

    for i ← 1 to |V| - 1 do

        u* ← remove minimum from Q // the optimal choice in V - V_T

        e* ← the edge (v*, u*), with v* in V_T, whose weight is the key of u*

        for each u adjacent to u* in Q do

            update u in Q with min(current key of u, w(u*, u))

        V_T ← V_T union {u*}

        E_T ← E_T union {e*}

    return E_T
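A heap-based version can be sketched in Python with the standard heapq module. heapq has no operation for updating a key in place, so this sketch swaps in a different technique: it pushes a new entry whenever a cheaper edge to a vertex is seen and skips stale entries when they are popped (lazy deletion). The heap may then hold up to 2m entries, but lg m ≤ 2 lg n, so each operation is still O(lg n). The function name prim_heap, the dict-of-dicts graph, and the example graph are assumptions made for illustration. Vertex names must be comparable so that ties in weight can be broken.

```python
import heapq


def prim_heap(graph, start):
    """MST edges using a binary heap with lazy deletion (stale entries skipped)."""
    in_tree = {start}   # V_T
    tree_edges = []     # E_T
    # Heap entries are (weight, tree_vertex, outside_vertex).
    heap = [(w, start, u) for u, w in graph[start].items()]
    heapq.heapify(heap)
    while heap and len(in_tree) < len(graph):
        w, v, u = heapq.heappop(heap)
        if u in in_tree:
            continue    # stale entry: u joined the tree through a cheaper edge
        in_tree.add(u)
        tree_edges.append((v, u, w))
        # Offer every edge from the new tree vertex to a vertex outside the tree.
        for x, wx in graph[u].items():
            if x not in in_tree:
                heapq.heappush(heap, (wx, u, x))
    return tree_edges


# Hypothetical example graph with distinct edge weights.
example = {
    "a": {"b": 3, "c": 1},
    "b": {"a": 3, "c": 2, "d": 4},
    "c": {"a": 1, "b": 2, "d": 5},
    "d": {"b": 4, "c": 5},
}
mst = prim_heap(example, "a")
print(mst)                        # [('a', 'c', 1), ('c', 'b', 2), ('b', 'd', 4)]
print(sum(w for _, _, w in mst))  # 7
```

A heap that supports updating keys in place, as the pseudocode assumes, keeps at most n entries in Q. The lazy version trades that for simpler code.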

Consider that Q is implemented with a heap. What is the cost of updating u in Q? lg |Q| ∈ O(lg n). Why? We assume that each vertex knows its location in Q, so it can be found in constant time, but after its key changes its position must be restored by bubbling up or down.

How many updates are performed?

The for-loop over adjacent vertices runs degree(u*) times. So for the complete algorithm the number of updates is at most

∑_{v ∈ V} degree(v) = 2m (2 times the number of edges). Why? Because every edge is counted twice in the sum, once at each endpoint.
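The degree-sum formula can be checked on a small graph. The helper names degree_sum and edge_count and the example graph are assumptions made for illustration. Each undirected edge appears in two adjacency lists, which is why the sum is 2m.

```python
def degree_sum(graph):
    """Sum of degree(v) over all vertices of an adjacency-dict graph."""
    return sum(len(neighbors) for neighbors in graph.values())


def edge_count(graph):
    """Number of undirected edges m; each edge {v, u} is counted once."""
    return len({frozenset((v, u)) for v in graph for u in graph[v]})


# Hypothetical example graph.
example = {
    "a": {"b": 3, "c": 1},
    "b": {"a": 3, "c": 2, "d": 4},
    "c": {"a": 1, "b": 2, "d": 5},
    "d": {"b": 4, "c": 5},
}
print(degree_sum(example), edge_count(example))  # 10 5
```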

Remember this formula

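The total cost of the algorithm in the worst case includes constructing the priority queue:

O(n lg n + m lg n) = O((n+m) lg n)

where n lg n is the cost of initializing Q and m lg n is the cost of the updates. The n remove-minimum operations also cost O(n lg n) in total, so they do not change the bound.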

If the graph is connected then m ≥ n - 1, so n ∈ O(m) and the algorithm is O(m lg n).