1. Home
  2. Jobs
  3. Low-Latency Model Inference

Low-Latency Model Inference Jobs

Browse 18 Low-Latency Model Inference jobs on FDE Jobs.

18 jobs
OpenAI logoOP

Software Engineer, Inference - Performance Optimization

OpenAI

San Francisco, California, United States (On-site)

$295K – $555K Yearly16h ago
OpenAI logoOP

TL, Research Inference

OpenAI

San Francisco, California, United States (On-site)

$380K – $555K Yearly1d ago
Cartesia logoCA

Inference Engineer

Cartesia

San Francisco, California, United States (On-site)

$180K – $250K Yearly16h ago
Cohere logoCO

Audio Inference Engineer, Model Efficiency

Cohere

Canada + 4 more (Remote)

16h ago
OpenAI logoOP

Software Engineer, Model Inference

OpenAI

San Francisco, California, United States (On-site)

$295K – $555K Yearly16h ago
Anthropic logoAN

Performance Engineer, Inference Systems

Anthropic

San Francisco, California, United States (Hybrid)

$350K – $850K Yearly16h ago
Perplexity logoPE

AI Inference Engineer (London)

Perplexity

London, England, United Kingdom (On-site)

16h ago
OpenAI logoOP

Inference Technical Lead, Sora

OpenAI

San Francisco, California, United States (Hybrid)

$380K – $380K Yearly16h ago
Together AI logoTA

Research Intern, Inference (Fall 2026)

Together AI

San Francisco, California, United States (On-site)

$58 – $63 Hourly1d ago
fal.ai logoFA

Staff Technical Lead for Inference & ML Performance

fal.ai

San Francisco, California, United States (On-site)

1d ago
Together AI logoTA

Forward Deployed Engineer (Inference & Post-Training)

Together AI

San Francisco, California, United States (On-site)

$270K – $300K Yearly1w ago
Together AI logoTA

LLM Inference Frameworks and Optimization Engineer

Together AI

San Francisco, California, United States (On-site)

$160K – $230K Yearly1d ago
Perplexity logoPE

AI Inference Engineer (San Francisco)

Perplexity

San Francisco, California, United States (On-site)

$220K – $485K Yearly16h ago
OpenAI logoOP

Software Engineer, Inference – AMD GPU Enablement

OpenAI

San Francisco, California, United States (On-site)

$295K – $555K Yearly16h ago
OpenAI logoOP

Inference Technical Lead, On-Device Transformers

OpenAI

San Francisco, California, United States (Hybrid)

$445K – $445K Yearly1d ago
Anthropic logoAN

Senior Software Engineer, Inference

Anthropic

Dublin, Leinster, Ireland (Hybrid)

€235K – €295K Yearly16h ago
Subscribe to this search

Get email updates when new jobs match this search.