Senior ML Engineer
$152,000–$228,000 year
Remote · United States or Canada
Job Summary
Senior ML Engineer responsible for leading the productionization of Invoca's ML stack, including model serving, inference optimization, and API/pipeline development. Drive end-to-end MLOps, manage CI/CD for ML artifacts, and optimize deployment on Triton, Baseten, and Kubernetes GPU infrastructure. Collaborate with Data Scientists and AI Engineers to build scalable ML systems powering agentic AI products; require strong Python, PyTorch, NLP model deployment experience, and familiarity with fine-tuning SLMs/LLMs (LoRA/QLoRA/PEFT). Remote-first with US and Canada coverage. Salary range $152k–$228k plus bonus/equity.
Required Qualifications
- 5+ years of ML Engineering experience with a strong production focus
- Advanced Python and deep learning proficiency (PyTorch, HuggingFace Transformers, spaCy)
- Demonstrated track record deploying and maintaining transformer-based NLP models in production
- Hands-on experience fine-tuning SLMs/LLMs (LoRA, QLoRA, PEFT) and optimizing models via quantization, batching, and throughput tuning
- Proficiency with inference infrastructure: Triton, Baseten, vLLM, TGI, SageMaker, Vertex AI, or similar
- Experience building production-grade APIs that expose ML models to downstream consumers
- Familiarity with MLOps tooling, model monitoring, and eval platforms (Braintrust, MLflow, or equivalent)
- B.S. in Computer Science, Engineering, Statistics, or equivalent; advanced degree a plus
- Familiarity with RLHF or preference training is a bonus
- Location flexibility as remote-first with US/Canada coverage
Additional Requirements
- No visa sponsorship available for this role
- Must be based within ~2 hour drive of listed US metro areas or within Toronto for Canada role
Apply with one swipe on Sorce. We auto-fill applications and apply on your behalf — no cover letters, no 40-minute forms.
Hiring someone like this?
Get your role in front of qualified candidates on Sorce.