Senior ML Engineer/Researcher

Meet your recruiter Joao Lima
Vacancy details
AI/ML Engineering
Machine Learning Engineer
Senior
Bulgaria, 
Croatia, 
Poland, 
Portugal, 
Spain, 
Ukraine
Remote

Let’s breathe life into great tech ideas! With 3,000 people globally, Intellias is a company where benchmark technological solutions are born. Join in and take your part in digitalizing the world.

What project we have for you

We are actively experimenting with OCR and metadata extraction from the PDF documents. OCR is one of the very hot topics these days with open models actively competing for the leading places – DeepSeek OCR, LightOn OCR, etc. 

We are looking for someone with the experience of running OSS models on vLLM with focus on document intelligence – computer vision that results in PDF -> Markdown or PDF -> HTML conversion with high precision for complex documents

What you will do

  • Research, evaluate, and fine-tune open-source OCR and document intelligence models for text and layout extraction from complex PDFs.
  • Develop end-to-end solutions for PDF-to-Markdown / PDF-to-HTML conversion with high accuracy in text structure, formatting, and layout retention.
  • Build tools for data preprocessing, annotation, and quality evaluation of OCR outputs.
  • Implement techniques for post-processing, text alignment, and metadata extraction to enhance model precision.
  • Collaborate with research and engineering teams to integrate OCR pipelines into production-grade systems.
  • Stay up to date with the latest developments in document AI, multimodal learning, and OCR research.

What you need for this

Tech Stack:

  • Python
  • vLLM
  • Hugging Face (inference)
  • Computer Vision
  • PyTorch 

Requirements:

  • 5+ years of experience in Machine Learning, with at least 2+ years focused on OCR, Document AI, or vision-language models.
  • Strong hands-on expertise with Python, PyTorch, and Hugging Face Transformers (training, fine-tuning, inference).
  • Practical experience deploying LLM / VLM models on vLLM or equivalent high-performance inference frameworks.
  • Solid understanding of OCR pipelines, layout parsing, and document structure recognition (PDFs, scanned docs, tables, mixed content).
  • Understanding of cloud infrastructure and GPU-based inference pipelines.
  • Research mindset with the ability to experiment, analyze, and iterate quickly.
  • Strong communication and documentation skills; ability to clearly present findings and proposed improvements.

What it’s like to work at Intellias

At Intellias, where technology takes center stage, people always come before processes. By creating a comfortable atmosphere in our team, we empower individuals to unlock their true potential and achieve extraordinary results. That’s why we offer a range of benefits that support your well-being and charge your professional growth.
We are committed to fostering equity, diversity, and inclusion as an equal opportunity employer. All applicants will be considered for employment without discrimination based on race, color, religion, age, gender, nationality, disability, sexual orientation, gender identity or expression, veteran status, or any other characteristic protected by applicable law.
We welcome and celebrate the uniqueness of every individual. Join Intellias for a career where your perspectives and contributions are vital to our shared success.

Skills

LLM
OCR
Python
PyTorch

Have not found the most
suitable position
yet?

Leave your resume and we will select a cool option for you.
Good news!
Link copied
Good news!
You did it.
Bad news!
Something went wrong. Please try again.