Dynamic Programming on Graphs: Shortest Path DP, DAG DP, and Tree DP (2025)

When to Use DP on Graphs

DP on graphs works when the graph has structure that prevents cycles in the DP recurrence: DAGs (directed acyclic graphs), trees, or layered graphs. For general graphs with cycles, DP requires encoding the visited set into the state (bitmask DP, as in TSP). Key insight: topological order on a DAG gives the computation order for DP — process nodes in topological order, referencing already-computed values for predecessors.

Bellman-Ford: DP on General Graphs

Shortest path with negative edges. dp[k][v] = minimum distance to reach v using at most k edges. Recurrence: dp[k][v] = min(dp[k-1][v], min over all edges (u,v): dp[k-1][u] + weight(u,v)). Run for k = 1..n-1 iterations. If dp[n][v] < dp[n-1][v] for any v: negative cycle detected. Time: O(V*E). Space optimization: use a single array and relax in place (but lose the "at most k edges" property). Bellman-Ford is a DP on the graph structure — each iteration relaxes one more "hop".

def bellman_ford(n, edges, src):
    # edges: list of (u, v, w) tuples; returns None if a negative cycle is detected
    dist = [float('inf')] * n
    dist[src] = 0
    for _ in range(n - 1):  # after pass k, dist is correct for paths of at most k edges
        for u, v, w in edges:
            if dist[u] + w < dist[v]:
                dist[v] = dist[u] + w
    # n-th pass: any further relaxation means a negative cycle
    for u, v, w in edges:
        if dist[u] + w < dist[v]:
            return None  # negative cycle
    return dist

DAG DP: Longest Path

Longest path in a DAG (NP-hard for general graphs, polynomial for DAGs). Topological sort first, then: dp[v] = max(dp[u] + weight(u,v)) for all predecessors u of v. The topological order ensures all predecessors are computed before v. Applications: critical path in project scheduling (longest path = minimum project duration), number of ways to reach a node (sum instead of max), longest increasing subsequence (reduce to DAG DP on sorted elements). LIS as DAG DP: edge from i to j if i < j and nums[i] < nums[j]; longest path = LIS length. But the O(n log n) patience sort is faster than explicit DAG DP.
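A minimal sketch of the topological-sort-then-DP pattern, using Kahn's algorithm for the ordering (the function name `dag_longest_path` and the `(u, v, w)` edge format are illustrative, not from the source):

```python
from collections import deque

def dag_longest_path(n, edges):
    # edges: list of (u, v, w) in a DAG with nodes 0..n-1
    adj = [[] for _ in range(n)]
    indeg = [0] * n
    for u, v, w in edges:
        adj[u].append((v, w))
        indeg[v] += 1
    # Kahn's algorithm: a node enters the queue only after all predecessors are done,
    # so dp[u] is final when u is dequeued
    q = deque(i for i in range(n) if indeg[i] == 0)
    dp = [0] * n  # dp[v] = longest path ending at v
    while q:
        u = q.popleft()
        for v, w in adj[u]:
            dp[v] = max(dp[v], dp[u] + w)
            indeg[v] -= 1
            if indeg[v] == 0:
                q.append(v)
    return max(dp) if n else 0
```

Replacing `max` with `sum`-style accumulation in the relaxation step gives the "number of paths to a node" variant mentioned above.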

Tree DP

Many tree problems require computing values that depend on subtree structure. Standard pattern: define dp[v] = some property of the subtree rooted at v. Compute by post-order DFS (leaves first). Example: maximum path sum (LC 124). For each node v, let gain(c) = max(0, one-sided downward path sum from child c) — negative subtrees contribute 0. The best path through v is val[v] + gain(left) + gain(right); update the global max with this. Return val[v] + max(gain(left), gain(right)) — the one-sided path usable by the parent (the parent clamps a negative return to 0).

def max_path_sum(root):
    res = [float('-inf')]
    def dp(node):
        # returns the best one-sided downward path sum starting at node
        if not node:
            return 0
        left = max(0, dp(node.left))    # drop negative contributions
        right = max(0, dp(node.right))
        res[0] = max(res[0], node.val + left + right)  # best path through node
        return node.val + max(left, right)
    dp(root)
    return res[0]

Tree Diameter

Diameter = longest path between any two nodes. Two approaches: (1) Two-pass BFS: BFS from any node to find the farthest node u; BFS from u to find the farthest node v; distance(u, v) = diameter. O(V). (2) Tree DP: for each node, the longest path through it = sum of the two longest downward paths into distinct children. Post-order DFS, maintain a global max. Both are O(V). The tree DP generalizes to weighted trees and is the basis for solving harder subtree problems.
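The tree-DP approach can be sketched for a general (not just binary) weighted tree given as an edge list — the function name `tree_diameter` and the adjacency-list representation are illustrative assumptions:

```python
def tree_diameter(n, edges):
    # edges: list of (u, v, w) forming a tree on nodes 0..n-1
    adj = [[] for _ in range(n)]
    for u, v, w in edges:
        adj[u].append((v, w))
        adj[v].append((u, w))
    best = 0
    def dfs(u, parent):
        nonlocal best
        top1 = top2 = 0  # two longest downward paths from u into distinct children
        for v, w in adj[u]:
            if v == parent:
                continue
            d = dfs(v, u) + w
            if d > top1:
                top1, top2 = d, top1
            elif d > top2:
                top2 = d
        best = max(best, top1 + top2)  # longest path through u
        return top1  # longest one-sided path, usable by the parent
    if n:
        dfs(0, -1)
    return best
```

Tracking the two best child paths (rather than just the best) is what captures paths that bend through a node.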

Bitmask DP: Traveling Salesman Problem

Visit all n cities exactly once, minimize total distance. dp[mask][v] = minimum cost to visit exactly the cities in mask, ending at v. Base case: dp[1 << 0][0] = 0 (start at city 0). Transition: for each v in mask and each u not in mask, dp[mask | (1 << u)][u] = min(dp[mask | (1 << u)][u], dp[mask][v] + dist[v][u]). Final answer: min over all v: dp[(1 << n) - 1][v] + dist[v][0]. Time: O(2^n * n^2). Space: O(2^n * n). Practical for n <= 20. Reconstruct the path by storing parent pointers alongside DP values. This pattern applies to any problem asking for minimum cost to cover all elements of a small set.
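The recurrence above can be sketched as follows (the function name `tsp` is illustrative; path reconstruction via parent pointers is omitted for brevity):

```python
def tsp(dist):
    # dist: n x n distance matrix; tour starts and ends at city 0
    n = len(dist)
    INF = float('inf')
    full = (1 << n) - 1
    dp = [[INF] * n for _ in range(1 << n)]
    dp[1][0] = 0  # base: only city 0 visited, standing at city 0
    for mask in range(1 << n):
        for v in range(n):
            if dp[mask][v] == INF or not (mask >> v) & 1:
                continue
            for u in range(n):
                if (mask >> u) & 1:
                    continue  # u already visited
                cand = dp[mask][v] + dist[v][u]
                if cand < dp[mask | (1 << u)][u]:
                    dp[mask | (1 << u)][u] = cand
    if n == 1:
        return 0
    return min(dp[full][v] + dist[v][0] for v in range(1, n))
```

Iterating masks in increasing order is valid because `mask | (1 << u)` is always numerically larger than `mask`, so every state is finalized before it is read.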

Floyd-Warshall: All-Pairs Shortest Path

dp[k][i][j] = shortest path from i to j using only nodes 0..k as intermediates. Recurrence: dp[k][i][j] = min(dp[k-1][i][j], dp[k-1][i][k] + dp[k-1][k][j]). Space optimization: update in-place (dp[i][j] = min(dp[i][j], dp[i][k] + dp[k][j])). O(V^3) time, O(V^2) space. Detects negative cycles: if dp[i][i] < 0 after running, there is a negative cycle through node i. Use when you need all-pairs shortest paths or when the graph is dense.
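The in-place space optimization described above looks like this as a sketch (function name `floyd_warshall` and the `(u, v, w)` edge-list input are illustrative):

```python
def floyd_warshall(n, edges):
    # edges: list of (u, v, w) directed edges on nodes 0..n-1
    INF = float('inf')
    dp = [[INF] * n for _ in range(n)]
    for i in range(n):
        dp[i][i] = 0
    for u, v, w in edges:
        dp[u][v] = min(dp[u][v], w)  # keep the cheapest parallel edge
    for k in range(n):          # allow node k as an intermediate
        for i in range(n):
            for j in range(n):
                if dp[i][k] + dp[k][j] < dp[i][j]:
                    dp[i][j] = dp[i][k] + dp[k][j]
    return dp  # dp[i][i] < 0 afterwards signals a negative cycle through i
```

Note the loop order: k must be the outermost loop, since each pass extends the set of allowed intermediate nodes.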

{
"@context": "https://schema.org",
"@type": "FAQPage",
"mainEntity": [
{
"@type": "Question",
"name": "What is the key insight for applying DP to graphs?",
"acceptedAnswer": {
"@type": "Answer",
"text": "DP on graphs requires a topological order so that each subproblem depends only on already-solved subproblems. DAGs have a natural topological order. For trees, post-order DFS (leaves first) is the topological order. For general graphs with cycles, bitmask DP encodes the visited set into the state, preventing cycles in the DP recurrence. The three main variants: DAG DP (topological sort, then DP), Tree DP (post-order DFS), Bitmask DP (TSP, coverage problems). Bellman-Ford is also DP: dp[k][v] = shortest path using at most k edges; the iteration index k imposes order."
}
},
{
"@type": "Question",
"name": "How does Tree DP work for the binary tree maximum path sum problem?",
"acceptedAnswer": {
"@type": "Answer",
"text": "Define dp(node) = maximum one-sided path sum starting at this node going downward (usable by the parent). Base case: dp(null) = 0. For each node: left_gain = max(0, dp(left)); right_gain = max(0, dp(right)). The path through this node = node.val + left_gain + right_gain — update the global maximum with this. Return node.val + max(left_gain, right_gain) to the parent (one-sided path). The key is the two-value pattern: a function returns the value usable by the parent (one-sided), while also updating a global maximum with the full path (two-sided) at each node. O(n) time, O(h) space for the recursion stack (h = tree height)."
}
},
{
"@type": "Question",
"name": "How does Bellman-Ford detect negative cycles?",
"acceptedAnswer": {
"@type": "Answer",
"text": "Bellman-Ford runs n-1 relaxation passes (where n = number of vertices). After n-1 passes, if the graph has no negative cycles, all shortest paths are finalized (a shortest path visits at most n-1 edges in a graph with n nodes). To detect negative cycles: run a final n-th pass. If any distance can still be reduced (dist[u] + w < dist[v] for some edge (u,v,w)), there is a negative cycle. A negative cycle means there is no finite shortest path — you can keep traversing the cycle to reduce the distance indefinitely. To find which nodes are affected by negative cycles: after detecting, BFS/DFS from the nodes where the n-th pass still relaxes to mark all reachable nodes as having distance -infinity."
}
},
},
{
"@type": "Question",
"name": "What is bitmask DP and how does it solve TSP?",
"acceptedAnswer": {
"@type": "Answer",
"text": "Bitmask DP uses an integer to represent a subset: bit i = 1 means city i has been visited. State: dp[mask][v] = minimum cost to visit exactly the cities in mask, currently at city v. Transition: dp[mask | (1<<u)][u] = min(dp[mask][v] + dist[v][u]) for all unvisited u (bit u not in mask). Base: dp[1<<src][src] = 0. Answer: min over all v of dp[(1<<n)-1][v] + dist[v][0] (return to start). Time: O(2^n * n^2). Space: O(2^n * n). Practical for n <= 20. Reconstruct the path by storing which city was chosen at each transition. The bitmask state is the key: it encodes which cities have been visited, preventing revisiting — the equivalent of a visited set, but usable in DP."
}
},
{
"@type": "Question",
"name": "How is Floyd-Warshall different from running Dijkstra from every node?",
"acceptedAnswer": {
"@type": "Answer",
"text": "Floyd-Warshall computes all-pairs shortest paths in O(V^3) time and O(V^2) space. It handles negative edge weights (but not negative cycles). Running Dijkstra from every node: O(V * (E + V) log V) with a binary heap. For dense graphs (E = V^2): Dijkstra from all nodes is O(V^3 log V), slower than Floyd-Warshall O(V^3). For sparse graphs: Dijkstra from all nodes is O(V*E log V), faster than Floyd-Warshall. Floyd-Warshall is simpler to implement (3 nested loops) and handles negative weights. Dijkstra cannot handle negative weights (use Bellman-Ford or Johnson's algorithm instead for sparse graphs with negative weights). Floyd-Warshall also detects negative cycles: if dp[i][i] < 0 after running, node i is on a negative cycle."
}
}
]
}

