Engineering Manager, Model Routing & Inference
Get more other jobs in your inbox
Verified daily — no ghost listings.
About This RoleAI processing…
Our mission is to automate coding. The first step in our journey is to build the best tool for professional programmers, using a combination of inventive research, design, and engineering. Our organization is very flat, and our team is small and talent dense. We particularly like people who are truth-seeking, passionate, and creative. We enjoy spirited debate, crazy ideas, and shipping code.
Key Responsibilities
- 1Building and evolving our inference gateway, a single abstraction over every provider's API semantics, so model onboarding becomes a config change.
- 2Building the systems that dynamically select the best model for each request based on cost, latency, and quality.
- 3Managing GPU cluster utilization and capacity planning across providers, optimizing for cost and performance.
- 4Designing routing backpressure and admission control so traffic spikes don't cascade into providers.
- 5Hiring and growing the team: sourcing, interviewing, and closing top inference and systems talent, while developing your engineers through coaching, mentorship, and high-leverage project assignments.
Requirements
- You have led engineering teams building high-throughput, low-latency distributed systems, especially in inference serving, traffic routing, or real-time data pipelines.
- You have strong software engineering fundamentals and enjoy shipping production systems that handle millions of requests.
- Experience with model serving frameworks (vLLM, TensorRT-LLM, TGI), load balancing, or building resilient multi-provider architectures is a plus.
- You may be a fit if You have led engineering teams building high-throughput, low-latency distributed systems, especially in inference serving, traffic routing, or real-time data pipelines.
Perks & BenefitsTypical for this role
Apply to This Job in Minutes
Generate ATS-optimized resume + cover letter + interview prep with Jobease.ca AI. Complete your application faster.
75% of AI Resumes Get Rejected
Beat the ATS with Jobease.ca's AI Resume Builder. Optimized for real hiring systems.
Build My ResumeProfile Match
Loading…Checking your profile against this job…
Job Overview
Share This Job
Track All Your Applications
Never lose track again. Jobease.ca organizes every application, interview, and follow-up.
Organize My Search