Question 1

Why is string concatenation in a loop O(n^2) in Python and Java?

Accepted Answer

In Python and Java, strings are immutable. Concatenating string A (length a) with string B (length b) creates a new string of length a+b, copying all characters from both strings. In a loop of n iterations concatenating strings of average length L: total copies = L + 2L + 3L + ... + nL = O(n^2 * L). For n = 10,000 and L = 10: ~500 million character copies. Solution in Python: build a list of strings and join at the end -- join is O(total characters). Solution in Java: use StringBuilder.append() -- amortized O(1) per append, O(n) total. In C++: strings are mutable, += with a string literal may still copy, but std::ostringstream or reserve() + += is efficient.

Question 2

What is the expand-around-center technique for palindromes and when does it fail?

Accepted Answer

Expand-around-center: for each position i in the string, try to expand a palindrome centered at i (odd length: i, i) and (i, i+1) (even length). Expand while characters match on both sides. O(n^2) time, O(1) space. Better than O(n^2) DP (which uses O(n^2) space). It does NOT find all palindromic substrings -- it finds the longest one. Limitation: it works only for contiguous substrings. For palindromic subsequences (characters not necessarily adjacent), use DP. Manacher's algorithm finds all palindromic substrings in O(n) by reusing previously computed expansion results, but is complex to implement correctly in an interview -- expand-around-center is the preferred interview solution with a note about Manacher's for O(n).

Question 3

How do you detect if two strings are anagrams efficiently?

Accepted Answer

Method 1 (sorting): sort both strings and compare. O(n log n) time, O(n) space. Simple but not optimal. Method 2 (character frequency array): create an int[26] array (for lowercase English). Increment for each char in s, decrement for each char in t. If all zeros: anagrams. O(n) time, O(1) space (fixed alphabet size). Method 3 (prime product hash): assign each letter a prime number; multiply all primes for s and t. If products equal: anagrams. O(n) time, O(1) space, no array needed, but risk of integer overflow for long strings. For Group Anagrams: use the sorted string as the hashmap key -- all anagrams of the same word sort to the same key. Alternative: tuple(sorted(Counter(s).items())) as key for Unicode support.

Question 4

How do you encode a list of strings to a single string and decode it reliably?

Accepted Answer

The challenge: a naive delimiter (like comma) fails if strings contain the delimiter. Robust encoding: length-prefixed format. For each string s: encode as len(s)#s. The # is the separator between the length and the string content. Decoding: scan for #, read the integer before it (the length), then read exactly that many characters. The # in the content is safe because we read a fixed number of characters determined by the length, not by scanning for another #. This handles any string content including #, commas, newlines, null bytes, and Unicode. Edge cases: empty string encodes as 0#. Empty list encodes as empty string. This is the same technique used in Redis RESP (Redis Serialization Protocol) and many binary protocols.

Question 5

How does the KMP (Knuth-Morris-Pratt) algorithm achieve O(n+m) substring search?

Accepted Answer

Naive substring search: for each position in text (length n), compare pattern (length m) -- O(nm). KMP precomputes a "failure function" (or prefix table) for the pattern: lps[i] = length of the longest proper prefix of pattern[0..i] that is also a suffix. Build in O(m). During search: when a mismatch occurs at pattern position j after matching j characters, instead of restarting from 0, shift to lps[j-1] (reuse the longest matched prefix-suffix). The text pointer never goes backward. Total comparisons: O(n). Use cases in interviews: check if s2 contains s1 as a substring (is s1 a rotation of s2: check if s1 is a substring of s2+s2). For most interview problems, Python's in operator (which uses optimized Boyer-Moore-Horspool) or str.find() is sufficient -- mention KMP when asked about complexity.

String Manipulation Interview Patterns: Anagrams, Palindromes, and Substring Problems (2025)

String Fundamentals for Interviews

Valid Anagram and Group Anagrams

Longest Palindromic Substring (LC 5) — Expand Around Center

Palindrome Partitioning (LC 131)

Encode and Decode Strings (LC 271)

String Compression and Roman Numerals