Performance Optimization in Python
Optimize by measuring first, choosing the right algorithms/data structures, and leveraging libraries.
Profile first
Identify hotspots before changing code.
python -m cProfile -o prof.out app.py
snakeviz prof.out
Line and memory profilers: line_profiler, memory_profiler.
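cProfile can also be driven in-process when you want stats without writing a prof.out file. A minimal sketch, where the work function is a hypothetical hotspot:

```python
import cProfile
import io
import pstats

def work():
    # Hypothetical hotspot: sum of squares in a tight loop
    return sum(i * i for i in range(100_000))

# Profile the call and collect stats in memory instead of a file
profiler = cProfile.Profile()
profiler.enable()
result = work()
profiler.disable()

stream = io.StringIO()
stats = pstats.Stats(profiler, stream=stream)
stats.sort_stats("cumulative").print_stats(5)  # top 5 entries by cumulative time
report = stream.getvalue()
```

Sorting by cumulative time surfaces the call paths that dominate the run, which is usually where optimization effort pays off.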
Algorithmic improvements
Big-O dominates. Replace nested loops with maps/sets, early exits, and better structures.
# Build the set once (O(n)); each membership test is then O(1) on average
s = set(items)
if x in s: ...
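A before/after sketch of replacing a nested scan with a set lookup, using hypothetical ID lists:

```python
# Hypothetical data: find IDs present in both lists
a = list(range(0, 10_000, 2))   # even IDs
b = list(range(0, 10_000, 3))   # IDs divisible by 3

# Quadratic: each `x in b` scans the whole list, O(len(a) * len(b)) overall
slow = [x for x in a if x in b]

# Linear: build a set once, then O(1) average-time lookups
b_set = set(b)
fast = [x for x in a if x in b_set]
```

Both produce the same result; only the asymptotic cost changes.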
Data structures
Pick the right container for the job:
- list for ordered, append-heavy sequences
- deque for queue/stack ends
- set/dict for O(1) membership/lookup
- array/bytearray for compact numeric/byte data
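For example, deque makes both ends cheap, whereas list.pop(0) shifts every remaining element and costs O(n):

```python
from collections import deque

# deque supports O(1) appends and pops at either end
queue = deque()
queue.append("job1")
queue.append("job2")
first = queue.popleft()      # FIFO removal from the front
queue.appendleft("urgent")   # push to the front
```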
Vectorization and C extensions
Use NumPy/Pandas for vectorized numeric/data ops; offload hot loops to C/Cython/Numba.
import numpy as np
x = np.arange(1_000_000)
y = x * 2
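A rough sketch contrasting a Python-level loop with the vectorized form (the loop is limited to the first 1,000 elements to keep it cheap):

```python
import numpy as np

x = np.arange(1_000_000)

# Pure-Python loop: interpreter overhead on every iteration
loop_sum = 0
for v in x[:1000]:
    loop_sum += v * 2

# Vectorized: a single C-level operation over the whole array
y = x * 2
vec_sum = int(y[:1000].sum())
```

The vectorized version does the same arithmetic but dispatches it once to compiled code, which is where the speedup comes from.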
I/O and concurrency
Batch I/O, stream with iterators, and parallelize appropriately:
- Async IO for many network operations
- concurrent.futures for CPU-bound work (processes) or blocking I/O (threads)
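A minimal thread-pool sketch for blocking calls; fetch and the URLs here are hypothetical stand-ins, with sleep simulating network latency (swap in ProcessPoolExecutor for CPU-bound work):

```python
import time
from concurrent.futures import ThreadPoolExecutor

def fetch(url):
    # Stand-in for a blocking I/O call (e.g. an HTTP request)
    time.sleep(0.01)
    return f"response from {url}"

urls = [f"https://example.com/{i}" for i in range(8)]  # hypothetical URLs

# Threads overlap the blocking waits; map preserves input order
with ThreadPoolExecutor(max_workers=4) as pool:
    results = list(pool.map(fetch, urls))
```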
Caching and memoization
Memoize pure functions; be mindful of unbounded memory growth and ensure arguments are hashable.
from functools import lru_cache

@lru_cache(maxsize=1024)
def fib(n):
    return n if n < 2 else fib(n - 1) + fib(n - 2)
Micro-optimizations (sparingly)
- Avoid repeated attribute lookups in tight loops (bind to locals)
- Use comprehensions/generator expressions
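A sketch of the local-binding idea, using math.sqrt as the repeatedly looked-up attribute:

```python
import math

values = range(10_000)

# Attribute lookup math.sqrt happens on every iteration
def total_slow():
    out = 0.0
    for v in values:
        out += math.sqrt(v)
    return out

# Bind the function to a local name once; the loop body skips the lookup
def total_fast():
    sqrt = math.sqrt
    return sum(sqrt(v) for v in values)
```

Both return the same value; the difference only matters in genuinely hot loops, so apply this after profiling, not by default.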
Best practices
- Write clear code first; optimize where profiling shows value
- Add benchmarks/tests to prevent performance regressions
- Consider alternative interpreters (PyPy) for long-running workloads
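A minimal timeit sketch for benchmarking two hypothetical implementations of the same task, which can be checked into a test suite to catch regressions:

```python
import timeit

# Hypothetical task: build a string of n characters two different ways
def concat(n=1000):
    s = ""
    for _ in range(n):
        s += "x"
    return s

def join(n=1000):
    return "".join("x" for _ in range(n))

# timeit runs each callable `number` times and returns total seconds
t_concat = timeit.timeit(concat, number=100)
t_join = timeit.timeit(join, number=100)
```

Always verify the implementations produce identical results before comparing their timings.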
Summary
- Measure, then optimize with algorithms/structures, vectorization, and appropriate concurrency
- Cache wisely; keep code clear and validated with benchmarks