AI-Powered HR Resume Screening System HLD
About This Architecture
AI-powered resume screening system combining Gemini LLM parsing, semantic embeddings, and multi-stage scoring to automate candidate evaluation. HR users upload job descriptions and resumes through a Streamlit web UI protected by WAF and CDN, with files validated through extension, MIME type, size, and PDF integrity checks before deduplication via SHA-256 hashing. The JD Parser Agent and Resume Parser Agent extract structured requirements and candidate profiles using Pydantic models, which feed into an Embedding Service using MiniLM-L6-v2 for semantic similarity scoring via cosine distance and skill overlap analysis. A Scoring Agent applies weighted rubrics with rule-based fallback scoring, bonus evaluation from GitHub achievements, and manual override capabilities before ranking and filtering candidates above a configurable threshold. The system generates JSON, HTML, and PDF reports while maintaining comprehensive audit logs and LLM result caching via LangChain prompt optimization. Fork this diagram to customize scoring weights, add additional data sources, or integrate with your ATS platform.
People also ask
How do you build an AI resume screening system that parses job descriptions and resumes, scores candidates semantically, and allows manual overrides?
This diagram shows a complete pipeline: HR users upload files through a Streamlit UI with WAF/CDN protection and multi-layer validation (extension, MIME, size, PDF integrity, deduplication). Gemini LLM agents parse JDs and resumes into structured Pydantic models, which are embedded using MiniLM-L6-v2 and scored via cosine similarity and skill overlap. A Scoring Agent applies weighted rubrics with
- Domain:
- Ml Pipeline
- Audience:
- HR tech leads and talent acquisition engineers building AI-powered resume screening systems
Generated by Diagrams.so — AI architecture diagram generator with native Draw.io output. Fork this diagram, remix it, or download as .drawio, PNG, or SVG.