Position - Intelligent Document Processing (IDP) EngineerLocation: Gurgaon/Houston (US)Experience – 3+ Years
The Opportunity:We are seeking a highly skilled and innovative Intelligent Document Processing (IDP)Engineer to join our dynamic team. In this role, you will be instrumental in designing,developing, and deploying advanced IDP solutions that automate the extraction,classification, and validation of data from various unstructured and semi-structureddocuments. You will leverage cutting-edge technologies in Artificial Intelligence, MachineLearning, and Natural Language Processing to build robust and scalable systems that driveefficiency and accuracy for our clients.
What You will do Design, develop, and implement end-to-end IDP solutions, from document ingestionto data extraction, validation, and integration with downstream systems. Utilize and integrate various AI/ML techniques, including OCR, computer vision,natural language processing (NLP), and deep learning, for document understandingand data extraction. Develop and train machine learning models for document classification, entityrecognition, and data validation. Work with a variety of document types, including invoices, contracts, forms, purchaseorders, and other business documents. Collaborate with product managers, data scientists, and other engineers tounderstand requirements, define technical specifications, and deliver high-qualitysolutions. Optimize and fine-tune IDP models for performance, accuracy, and scalability. Implement data quality checks and validation rules to ensure the integrity of extractedinformation. Stay up-to-date with the latest advancements in IDP, AI, ML, and NLP technologiesand recommend their application. Participate in code reviews, testing, and deployment processes. Troubleshoot and resolve issues related to IDP systems.
Why Join Us? Opportunity to work on cutting-edge AI/ML technologies and shape the future ofdocument processing. Collaborative and supportive team environment. Competitive salary and benefits package. Opportunities for professional growth and development.
What You'll Bring: Bachelor's or Master's degree in Computer Science, Engineering, ArtificialIntelligence, Data Science, or a related field. 3+ years of experience in developing and deploying Intelligent Document Processing(IDP) or similar data extraction solutions. Strong programming skills in Python (or similar languages like Java, C#) withexperience in relevant libraries (e.g., TensorFlow, PyTorch, scikit-learn, OpenCV,SpaCy, NLTK). In-depth understanding of OCR technologies and their practical applications. Proven experience with machine learning and deep learning concepts, particularly inthe context of computer vision and natural language processing. Experience with document parsing, information extraction, and data normalizationtechniques. Familiarity with cloud platforms (AWS, Azure, GCP) and their relevant services forAI/ML and data processing. Experience with version control systems (e.g., Git). Strong problem-solving skills and an analytical mindset. Excellent communication and collaboration skills.
Bonus Points If You Have: Experience with specific IDP platforms or tools (e.g., UiPath DocumentUnderstanding, Automation Anywhere IQ Bot, ABBYY FlexiCapture, Kofax, or open-source alternatives). Knowledge of data engineering principles and ETL processes. Experience with containerization technologies (Docker, Kubernetes). Familiarity with agile development methodologies. Experience working with large-scale data sets and distributed systems.