AI Engineer / Senior AI Engineer ? Document Intelligence

Hyderabad, Telangana, India

4 hours ago

Applicants: 0

Apply Now

Analysis Multi-language Nougat

Salary Not Disclosed

4 weeks left to apply

Job Description

We are looking for someone obsessed with turning messy real-world documents into perfectly structured, actionable data. If you live and breathe document layout analysis, OCR post-processing, and visual + language models, this role is custom-built for you. Core Responsibilities Design and build production-grade Document Intelligence pipelines (invoices, contracts, forms, reports, handwritten, tables, multi-language, etc.) Train/fine-tune and deploy Layout-aware models (LayoutLMv3, Donut, LLaMA-Adapter, Nougat, etc.) Build and optimize Vision-Language models (VLLMs) on custom enterprise document datasets Improve OCR accuracy using layout context, post-correction with LLMs, and geometric reasoning Own the full stack: data preparation ? model training (PyTorch) ? evaluation ? ONNX/TensorRT optimization ? FastAPI deployment Push boundaries on table extraction, key-value pairing, nested hierarchies, and multi-page document understanding Must-Have Skills & Experience Very strong understanding of document layout analysis (bounding boxes, reading order, logical blocks, nested tables, headers/footers, multi-column detection) Hands-on experience with modern Document AI architectures: LayoutLMv1/v2/v3, DocFormer, LayoutXLM, Donut, Pix2Struct, Nougat, UDOP, etc. Vision-Language models (LLaVA, Qwen-VL, InternVL, PaliGemma, etc.) Deep experience fine-tuning and serving LLMs & VLLMs (Llama-3, Mistral, Phi-3-vision, Qwen, etc.) using PEFT (LoRA/QLoRA), vLLM, TGI, or Ollama Strong PyTorch proficiency (custom trainers, distributed training with DDP/FSDP, TorchCompile, mixed precision) Solid grasp of OCR ecosystems and post-processing (Tesseract, EasyOCR, PaddleOCR, AWS Textract/Google Document AI limitations and how to beat them) Experience building datasets from real enterprise documents (Labelling tools: UBIAI, Label Studio, Doccano, custom UI) Good applied math: Transformers, attention mechanisms, positional encodings (especially 2D layouts), RoPE, ALiBi Nice-to-Have (Big Bonus) Published research or open-source contributions in Document AI / VLLM space Experience with multimodal RAG over documents ONNX / TensorRT / DeepSpeed optimization for low-latency inference Kubernetes + GPU scheduling (we run our own bare-metal cluster) Who thrives here? You get excited when you see a 50-page scanned purchase order with overlapping stamps and handwritten notes ? because you already know exactly how you?re going to destroy it. Perks Work directly on enterprise deals worth crores ? your model = real revenue impact Unlimited GPU access (A100s & H100s in-house)

Required Skills

Analysis Multi-language Nougat

Additional Information

Company Name: Piazza Consulting Group
Industry: N/A
Department: N/A
Role Category: AI Engineer
Job Role: Mid-Senior level
Education: No Restriction
Job Types: On-site
Gender: No Restriction
Notice Period: Less Than 30 Days
Year of Experience: 1 - Any Yrs
Job Posted On: 4 hours ago
Application Ends: 4 weeks left to apply

Similar Jobs

1 month ago

Senior AI Applications Engineer

Oracle

Design, Oracle, CRM +2

3 days ago

Middle QA Automation Engineer (Contract - Longtime) - Remote

Uplers

Cucumber, Manual Testing, Automation +2

1 week ago

Azure devops

Virtusa

PowerShell, Python, Analysis

1 month ago

Developer L3

Wipro

JOB DESCRIPTION, Software development life cycle, Analysis

1 week ago

Developer L4

Wipro

JOB DESCRIPTION, Software development life cycle, Analysis

3 days ago

Developer L3

Wipro

JOB DESCRIPTION, Software development life cycle, Analysis

3 days ago

RPG Developer

Ashley Global Capability Center

RPG, Analysis

1 month ago

Developer L4

Wipro

JOB DESCRIPTION, Software development life cycle, Analysis

1 month ago

Data Analyst, AVP

NatWest Group

Data extraction, Analysis

1 month ago

Developer L3

Wipro

JOB DESCRIPTION, Software development life cycle, Analysis