As a Backend Engineer at the company, you will design and build a multi-agent Python pipeline that transforms unstructured exam papers (PDFs) into structured question–answer schemas used by our student-facing application. You will work with FastAPI, LLMs, OCR/Vision models, LangChain-style agent frameworks, and scalable cloud infrastructure.

This role sits at the intersection of GenAI engineering, backend systems design, and EdTech product development.

Key Responsibilities

1. Backend Engineering (FastAPI & Pipeline Architecture)

Architect and implement a modular, multi-stage Python pipeline covering OCR, content extraction, parsing, and question structuring.
Own, deploy, and maintain FastAPI microservices that integrate with LLMs, OCR engines, and internal data stores.
Build reliable logging, monitoring, and cost-tracking systems across all pipeline components and agents.

2. GenAI, LLMs & Multi-Agent Systems

Create multi-agent orchestration flows using frameworks like LangChain, LlamaIndex, or custom-built agent systems.
Implement LLM-driven extraction, structuring, and reasoning workflows for exam content.
Manage agent input/output schemas, reasoning traces, and message protocols.
Ensure consistently structured outputs across a wide range of exam formats.

3. OCR & Computer Vision

Integrate OCR and vision models for extracting text, diagrams, and images from PDFs.
Handle multimodal input types such as equations, tables, graphs, and visual elements.
Ensure reliable capture of mathematical notation and diagram-based questions.

4. Question Schema & Data Modelling

Design scalable schemas for:
- Questions and sub-questions
- Answer keys
- Diagrams, images, and mathematical expressions
- Metadata supporting automated/LLM-based grading
Ensure schema output supports both UI rendering and automated evaluation workflows.

5. Quality, Observability & Reliability

Implement structured, timestamped logging at each pipeline stage.
Build monitoring layers to track pipeline failures, false positives, and system health.
Develop automated tests for multi-stage and multi-agent workflows.

Technical Requirements

Core Skills

Strong proficiency in Python (3.9+)
Experience building production systems using FastAPI
Deep hands-on experience with LLMs, prompt engineering, and agentic workflows
Familiarity with OCR engines (Tesseract, PaddleOCR, Vision-Language models such as GPT-5, Gemini 3 Pro, Qwen3-VL, etc.)
Experience with PDF parsing libraries (pymupdf, pdfplumber, unstructured, etc.)
Understanding of LangChain BaseMessage schemas or equivalent custom agent designs
Ability to build and manage multi-stage pipelines with clean architecture

Software Engineering Best Practices

Strong focus on modular design, documentation, and maintainability
Experience implementing logging, monitoring, and analytics for backend systems
Comfort working with cloud environments for scaling inference workloads
Experience with CI/CD, unit testing, and integration testing
Experience modelling structured and unstructured data

Bonus Skills

Experience with:
- Vector databases (Pinecone, Weaviate, Chroma)
- Multi-agent orchestration frameworks
- Educational content processing / assessment workflows
Knowledge of math or science content (optional but helpful)

What You Will Build (Example Projects)

You will directly contribute to:

Automated OCR → Question Schema → Answer Key pipelines
LLM-based grading systems for handwritten or digital submissions
Multi-agent reasoning systems with detailed trace logging
Scalable ingestion frameworks capable of processing 10,000+ exam papers across regions and syllabi
Multimodal processing pipelines for diagrams, charts, graphs, and mathematical expressions

Intelligent Document Processing Engineer ( Backend)