All Projects
Resume Parser
ai
An AI-powered resume parsing service that extracts structured candidate information from resumes in any format — PDF, DOCX, images — using a combination of OCR and GPT-based extraction.
The Problem
Recruiters received resumes in dozens of formats with inconsistent layouts. Manual data entry was slow and error-prone, and existing parsers had poor accuracy on non-standard resume formats.
Tech Stack
PythonFastAPIOpenAITesseract OCRPostgreSQL
Key Features
Multi-format support (PDF, DOCX, images)
AI-powered field extraction
Structured JSON output
Batch processing API
Confidence scoring
Challenges
- Handling 50+ resume formats and layouts accurately
- Optimizing OCR pipeline for scanned documents
- Structuring unstructured text into consistent schemas
Lessons Learned
- Combining OCR with LLMs yields better results than either alone
- Schema validation is critical for downstream consumption