← Back to blog

What we learned indexing 10,000+ past papers

A short look at normalizing board names, session codes, and duplicate detection so search stays fast as the library grows.

Scaling to 10,000+ papers required stronger normalization and duplicate handling.

These indexing updates keep query speed consistent as archive size grows.

© 2025 Open Past Paper. All rights reserved.
Facebook logoInstagram logoX logoGithub logo