At Variant Group, we are at the forefront of transforming everyday tasks through the power of cutting-edge AI. Our B2C SaaS products empower users to achieve complex objectives—from creating standout resumes to generating intricate legal contracts—using intuitive, accessible technology.
We’re now looking for a Machine Learning Engineer with deep expertise in Natural Language Processing (NLP) and Large Language Models (LLMs). This is a hybrid role that blends custom model development with LLM API integration to ship intelligent, production-ready features. You’ll work across the full lifecycle—from preparing training data and fine-tuning models, to designing retrieval pipelines and deploying performant inference systems in the cloud.
Variant Products
resume.co Live since May 2023, ±1M MAU
pdf.net Live since Dec 2024, ±400k MAU
contracts.net Live since January 2023, ±100k MAU
mealplan.co – Summer 2025
More coming soon!
What You’ll Do
- Integrate hosted LLM APIs (e.g. OpenAI, Anthropic) + custom models to support intelligent in-product behavior.
- Build and fine-tune transformer models using PyTorch, HuggingFace
- Design and deploy retrieval-augmented generation (RAG) pipelines with vector databases (e.g., pgvector) and graph-based reasoning (e.g., Neo4j).
- Develop scalable inference systems using vLLM, speculative decoding, and optimized serving techniques.
- Build modular, production-grade pipelines for training, evaluation, and deployment.
- Collaborate closely with product, design, and full-stack teams to ship features that bring AI to end users.
- Own infrastructure around Docker, Cloud Run, and GCP, ensuring speed, reliability, and observability.
What You Bring
- Strong Python engineering background with clean, tested, and maintainable code.
- Proven experience building with transformer-based models, including custom training and fine-tuning.
- Deep familiarity with HuggingFace, PyTorch, tokenization, and evaluation frameworks.