All Projects

Resume Parser

ai

An AI-powered resume parsing service that extracts structured candidate information from resumes in any format — PDF, DOCX, images — using a combination of OCR and GPT-based extraction.

The Problem

Recruiters received resumes in dozens of formats with inconsistent layouts. Manual data entry was slow and error-prone, and existing parsers had poor accuracy on non-standard resume formats.

Tech Stack

PythonFastAPIOpenAITesseract OCRPostgreSQL

Key Features

Multi-format support (PDF, DOCX, images)
AI-powered field extraction
Structured JSON output
Batch processing API
Confidence scoring

Challenges

  • Handling 50+ resume formats and layouts accurately
  • Optimizing OCR pipeline for scanned documents
  • Structuring unstructured text into consistent schemas

Lessons Learned

  • Combining OCR with LLMs yields better results than either alone
  • Schema validation is critical for downstream consumption