Back


Job Detail

ML Infrastructure Engineer

Altis Recruitment

Toronto, Ontario

ML Infrastructure Engineer

Altis Recruitment

Toronto, Ontario
 
Salary: From 90000 to 110000 per Year
 

About the Role

We are hiring an ML Infrastructure Engineer with deep DevOps and platform engineering expertise to build and manage scalable infrastructure for real-time AI systems. This is not a model development role — we are specifically looking for someone who can design, deploy, and manage robust ML platforms that support production-grade AI workloads across cloud environments.

You will be responsible for cloud infrastructure, container orchestration, CI/CD automation, and GPU inference optimization — working closely with AI researchers and media engineering teams to support high-throughput, low-latency systems.


What You Will Own

Platform Engineering & Architecture

  • Architect and maintain a production-ready ML platform, integrating training, deployment, and monitoring pipelines.

  • Design and implement cloud-native infrastructure that is reliable, secure, and scalable (primarily on AWS using EKS).

  • Build modular systems that are optimized for real-time performance, reproducibility, and automation.

DevOps, Automation & IaC

  • Build and manage CI/CD pipelines for model packaging, deployment, and rollback using GitHub Actions, ArgoCD, or Jenkins.

  • Use Terraform or other Infrastructure as Code (IaC) tools to define and provision infrastructure consistently and securely.

  • Implement model observability and performance diagnostics using Prometheus, Grafana, or equivalent tooling.

Model Serving & Inference Optimization

  • Deploy and manage inference endpoints using tools like TensorRT, ONNX Runtime, or TorchScript.

  • Optimize resource utilization and latency in GPU-powered environments.

  • Handle version control, rollback strategies, and reproducibility standards for deployed models.


Must-Have Qualifications

  • 3+ years of experience as a DevOps, ML Infrastructure, or Platform Engineer (not a research/modeling role).

  • Strong hands-on experience with:

    • Cloud platforms (preferably AWS, including EKS)

    • Kubernetes and Docker in production environments

    • Terraform or other IaC tools

    • CI/CD systems for ML pipelines (GitHub Actions, ArgoCD, Jenkins, etc.)

  • Experience supporting real-time ML inference systems using ONNX Runtime, TensorRT, or similar tools.

  • Strong understanding of platform monitoring, alerting, and observability best practices

Nice-to-Haves

  • Experience deploying generative AI or LLMs at scale.

  • Familiarity with media processing, VFX systems, or GPU-intensive real-time workloads.

  • Exposure to security best practices (IAM, encryption, container hardening, etc.).



We’re an equal opportunity employer committed to increasing diversity and inclusion in today’s workforce. All qualified applicants will receive consideration for employment without regard to race, color, religion, gender, gender identity or expression, sexual orientation, national origin, genetics, disability, age, or veteran status. Minorities, women, LGBTQ candidates, and individuals with disabilities are encouraged to apply. If you require an accommodation, please review our accessibility policy and reach out to our accessibility officer with any questions.

 

We are committed to hiring military and Veteran spouses and encourage you to identify your connection with the MSEN when reaching out to us or applying to any of our open roles.

 

Have questions or want to learn more about us? We would love to hear from you!

 Altis Recruitment Team

Email: militaryfamilies@altis.com

613-230-3700

 

 

About Altis Recruitment

Welcome on behalf on the Altis Recruitment team! Altis has a long-standing business relationship with the Defence community. For more than 30 years, we have been grateful to work alongside the Department of National Defence and countless military professionals. We know that family members of military personnel often make many personal sacrifices to support their loved ones. We understand that it can be difficult to pursue a career when embracing sudden changes like relocation and deployment. For some, this has meant putting a pause on career goals or professional development. We would like to provide you with everything you need for a successful and confident job search – in addition to access to job opportunities. Download the checklists our experts have created to help you be at your best from application to interview.