CU

Engineering Manager, Model Routing & Inference

cursor· 25 open roles

Location TBD On-siteFullTime2 months ago
Salaryest.
$150,000 - $250,000
Experience
Mid
Job Type
FullTime
Posted
2 months ago
Apply Now

Get more other jobs in your inbox

Verified daily — no ghost listings.

About This RoleAI processing…

Our mission is to automate coding. The first step in our journey is to build the best tool for professional programmers, using a combination of inventive research, design, and engineering. Our organization is very flat, and our team is small and talent dense. We particularly like people who are truth-seeking, passionate, and creative. We enjoy spirited debate, crazy ideas, and shipping code.

Key Responsibilities

  • 1
    Building and evolving our inference gateway, a single abstraction over every provider's API semantics, so model onboarding becomes a config change.
  • 2
    Building the systems that dynamically select the best model for each request based on cost, latency, and quality.
  • 3
    Managing GPU cluster utilization and capacity planning across providers, optimizing for cost and performance.
  • 4
    Designing routing backpressure and admission control so traffic spikes don't cascade into providers.
  • 5
    Hiring and growing the team: sourcing, interviewing, and closing top inference and systems talent, while developing your engineers through coaching, mentorship, and high-leverage project assignments.

Requirements

  • You have led engineering teams building high-throughput, low-latency distributed systems, especially in inference serving, traffic routing, or real-time data pipelines.
  • You have strong software engineering fundamentals and enjoy shipping production systems that handle millions of requests.
  • Experience with model serving frameworks (vLLM, TensorRT-LLM, TGI), load balancing, or building resilient multi-provider architectures is a plus.
  • You may be a fit if You have led engineering teams building high-throughput, low-latency distributed systems, especially in inference serving, traffic routing, or real-time data pipelines.

Perks & BenefitsTypical for this role

Competitive compensation aligned with experience and market rate
Health, dental, and vision coverage
Paid time off and company holidays
Remote-friendly or hybrid working arrangements where applicable
Learning and professional development support
Modern tools and equipment to do your best work

Apply to This Job in Minutes

Generate ATS-optimized resume + cover letter + interview prep with Jobease.ca AI. Complete your application faster.

Get Started Free

Similar Jobs

RE

Client Account Manager, Mid-Market (App Dev - Acquisitions)

redditRemote
View
RE

Client Account Manager, Large Customer Sales (Tech)

redditRemote
View
RE

Analytics Engineer

redditRemote
View