Question 1

How does the KMP algorithm achieve O(n + m) time complexity?

Accepted Answer

KMP avoids rescanning characters by precomputing a Longest Proper Prefix which is also Suffix (LPS) array for the pattern. Here n is the text length and m is the pattern length. When a mismatch occurs after j matched characters, instead of resetting j to 0, KMP sets j = lps[j-1], falling back only as far as necessary. The text pointer i never moves backward. A text character can be compared more than once after a fallback, but every fallback decreases j, and j only increases when i advances, so the search makes at most 2n comparisons in total. Building the LPS table is O(m); the search loop is O(n). Total: O(n + m).
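The search loop described above can be sketched as follows. This is a minimal illustration, not a reference implementation: the function name `kmp_search` and the comparison counter are assumptions added here, and the LPS table is passed in precomputed so the example can focus on the fallback step.

```python
def kmp_search(text, pattern, lps):
    """Return (match_starts, comparisons) using a precomputed LPS table.

    `lps` must be the LPS array of `pattern`. The comparison counter is
    only here to make the linear bound visible; it is not part of KMP.
    """
    matches = []
    comparisons = 0
    if not pattern:
        return matches, comparisons

    i = 0  # index into text, never decreases
    j = 0  # number of pattern characters currently matched
    while i < len(text):
        comparisons += 1
        if text[i] == pattern[j]:
            i += 1
            j += 1
            if j == len(pattern):
                matches.append(i - j)  # full match ends just before i
                j = lps[j - 1]         # keep going to find overlapping matches
        elif j > 0:
            j = lps[j - 1]             # fall back in the pattern, keep i fixed
        else:
            i += 1                     # nothing matched, advance the text pointer
    return matches, comparisons


# LPS table for "abcabc" is [0, 0, 0, 1, 2, 3].
found, count = kmp_search("abcabcabcabd", "abcabc", [0, 0, 0, 1, 2, 3])
print(found, count)
```

Each loop iteration either advances i or shrinks j, which is why the counter should stay within 2 * len(text) for any input.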

Question 2

What is the LPS array in KMP and how do you build it?

Accepted Answer

The LPS (Longest Proper Prefix which is also Suffix) array stores, for each position i in the pattern, the length of the longest proper prefix of pattern[0..i] that is also a suffix of pattern[0..i]. For pattern "abcabc": lps = [0,0,0,1,2,3]. Building: set lps[0] = 0 and use two pointers, length (starting at 0) and i (starting at 1). If pattern[i] == pattern[length]: increment length, set lps[i] = length, and increment i. If mismatch and length > 0: length = lps[length-1] (fall back, do not increment i). If mismatch and length == 0: lps[i] = 0, increment i. This is O(m), because length can only fall back as many times as it has been incremented.
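The three cases above translate almost line for line into code. The sketch below is one way to write it; the function name `build_lps` is an assumption chosen for illustration.

```python
def build_lps(pattern):
    """Build the LPS array for `pattern` in O(m) time."""
    lps = [0] * len(pattern)
    length = 0  # length of the current longest proper prefix-suffix
    i = 1       # lps[0] is always 0, so start at the second character
    while i < len(pattern):
        if pattern[i] == pattern[length]:
            # Case 1: extend the current prefix-suffix by one character.
            length += 1
            lps[i] = length
            i += 1
        elif length > 0:
            # Case 2: fall back to the next shorter candidate, keep i fixed.
            length = lps[length - 1]
        else:
            # Case 3: no prefix-suffix ends at position i.
            lps[i] = 0
            i += 1
    return lps


print(build_lps("abcabc"))   # [0, 0, 0, 1, 2, 3]
print(build_lps("aabaaab"))  # [0, 1, 0, 1, 2, 2, 3]
```

The second pattern exercises the fallback case: at the sixth character the candidate length drops from 2 to 1 before matching again.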

Question 3

When should you use Rabin-Karp instead of KMP?

Accepted Answer

Use Rabin-Karp when: searching for multiple patterns of the same length simultaneously (hash each pattern into a hash set, then check each window hash against the set in expected O(1) per position), or when the problem is fundamentally about rolling window hashes (LC 187 Repeated DNA Sequences). Rabin-Karp is O(n+m) on average but O(nm) in the worst case, because hash collisions force character-by-character verification; use double hashing to reduce the collision probability. KMP is O(n+m) in the worst case and better for single-pattern search. For interview purposes, if the problem says "find all occurrences," KMP is usually the cleaner choice.
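A minimal sketch of the multi-pattern case, assuming all patterns share one length. The function name `rabin_karp_multi` and the `base` and `mod` values are illustrative assumptions, not fixed parts of the algorithm. It uses a single hash and verifies each hit by direct comparison rather than using double hashing.

```python
def rabin_karp_multi(text, patterns, base=256, mod=1_000_000_007):
    """Return (start, pattern) pairs for every match, in order of start.

    Assumes every pattern has the same length. base and mod are
    illustrative choices for the polynomial rolling hash.
    """
    if not patterns:
        return []
    m = len(patterns[0])
    if any(len(p) != m for p in patterns):
        raise ValueError("this sketch assumes all patterns share one length")
    if m == 0 or m > len(text):
        return []

    def poly_hash(s):
        h = 0
        for ch in s:
            h = (h * base + ord(ch)) % mod
        return h

    # Group patterns by hash so one lookup covers all of them.
    by_hash = {}
    for p in patterns:
        by_hash.setdefault(poly_hash(p), set()).add(p)

    high = pow(base, m - 1, mod)  # weight of the character leaving the window
    window = poly_hash(text[:m])
    hits = []
    for start in range(len(text) - m + 1):
        if window in by_hash:
            candidate = text[start:start + m]
            if candidate in by_hash[window]:  # verify to rule out collisions
                hits.append((start, candidate))
        if start + m < len(text):
            # Roll the hash: drop text[start], append text[start + m].
            window = (window - ord(text[start]) * high) % mod
            window = (window * base + ord(text[start + m])) % mod
    return hits


print(rabin_karp_multi("abracadabra", ["abr", "cad"]))
```

Verifying on every hash hit keeps the result exact; the cost is that adversarial inputs with many collisions push the running time toward the O(nm) worst case.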

Question 4

How does the Z-algorithm work for string matching?

Accepted Answer

The Z-array entry Z[i] stores the length of the longest substring starting at position i of string S that matches a prefix of S. To search for pattern P in text T: concatenate P + "$" + T, where the sentinel "$" must be a character that occurs in neither P nor T, so that no Z value can span the boundary. Compute the Z-array. Any position i in the combined string where Z[i] == len(P) is a match starting at i - len(P) - 1 in T. Computing the Z-array maintains a window [l, r], the rightmost segment known to match a prefix, and reuses previously computed values inside it, achieving O(n+m).
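The construction and the search can be sketched as below. The names `z_array` and `z_search` are assumptions for illustration, and this version tracks the window as a half-open range [l, r) rather than the closed [l, r] used in the description, which only shifts the bookkeeping by one.

```python
def z_array(s):
    """z[i] = length of the longest prefix of s that starts at position i."""
    n = len(s)
    z = [0] * n
    if n == 0:
        return z
    z[0] = n
    l = r = 0  # s[l:r] is the rightmost segment known to match a prefix
    for i in range(1, n):
        if i < r:
            # Reuse the value computed for the mirrored position i - l.
            z[i] = min(r - i, z[i - l])
        # Extend the match by direct comparison past what is already known.
        while i + z[i] < n and s[z[i]] == s[i + z[i]]:
            z[i] += 1
        if i + z[i] > r:
            l, r = i, i + z[i]
    return z


def z_search(text, pattern, sentinel="$"):
    """Return all start indices of pattern in text."""
    if not pattern:
        return []
    if sentinel in text or sentinel in pattern:
        raise ValueError("sentinel must not occur in text or pattern")
    combined = pattern + sentinel + text
    z = z_array(combined)
    m = len(pattern)
    # Position i in combined corresponds to index i - m - 1 in text.
    return [i - m - 1 for i in range(m + 1, len(combined)) if z[i] == m]


print(z_array("aabxaab"))
print(z_search("abababca", "aba"))
```

If the inputs may contain "$", pass a different `sentinel` that is known to be absent from both strings.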

Question 5

What is LC 28 (Find Index of First Occurrence) and how do you solve it with KMP?

Accepted Answer

LC 28 asks for the index of the first occurrence of needle in haystack, or -1 if not found. KMP solution: build the LPS table for needle, then run the KMP search, matching haystack characters against needle and using the LPS table to fall back on a mismatch. Return the start index of the first full match. O(n + m) time, O(m) space. Brute force is O(n*m) in the worst case and degrades on inputs with long runs of repeated characters, such as "aaaa...ab". KMP guarantees linear time, and interviewers often expect candidates to know it for this problem.
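A compact sketch of that solution follows. The name `str_str` is a hypothetical Python-style stand-in for the problem's function, and returning 0 for an empty needle is an assumption about the desired behavior rather than something stated above. It uses the for/while form of the same LPS logic.

```python
def str_str(haystack, needle):
    """Return the index of the first occurrence of needle, or -1."""
    if not needle:
        return 0  # assumption: an empty needle matches at index 0

    # Build the LPS table for needle.
    lps = [0] * len(needle)
    length = 0
    for i in range(1, len(needle)):
        while length > 0 and needle[i] != needle[length]:
            length = lps[length - 1]
        if needle[i] == needle[length]:
            length += 1
        lps[i] = length

    # Scan haystack once; j counts how many needle characters are matched.
    j = 0
    for i, ch in enumerate(haystack):
        while j > 0 and ch != needle[j]:
            j = lps[j - 1]
        if ch == needle[j]:
            j += 1
        if j == len(needle):
            return i - j + 1  # start index of the first full match
    return -1


print(str_str("sadbutsad", "sad"))
print(str_str("leetcode", "leeto"))
print(str_str("aaaaab", "aab"))
```

Returning as soon as j reaches len(needle) is what makes this a first-occurrence search; collecting indices instead and resetting j to lps[j - 1] would find all occurrences.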

String Search Algorithm Interview Patterns: KMP, Rabin-Karp, Z-Algorithm, and Rolling Hash (2025)

Naive String Search and Why to Avoid It

KMP (Knuth-Morris-Pratt) – O(n + m)

Rabin-Karp – Rolling Hash for Multiple Pattern Search

Z-Algorithm – Pattern Matching via Z-Array

When to Use Each Algorithm