Inference Optimization Jobs in San Francisco, California, United States

Browse 22 Inference Optimization jobs in San Francisco, California, United States on FDE Jobs.

22 jobs
OpenAI logoOP

Software Engineer, Inference - Performance Optimization

OpenAI

San Francisco, California, United States (On-site)

$295K – $555K Yearly13h ago
OpenAI logoOP

Inference Technical Lead, Sora

OpenAI

San Francisco, California, United States (Hybrid)

$380K – $380K Yearly13h ago
Together AI logoTA

Forward Deployed Engineer (Inference & Post-Training)

Together AI

San Francisco, California, United States (On-site)

$270K – $300K Yearly1w ago
Together AI logoTA

Research Intern, Inference (Fall 2026)

Together AI

San Francisco, California, United States (On-site)

$58 – $63 Hourly21h ago
Together AI logoTA

LLM Inference Frameworks and Optimization Engineer

Together AI

San Francisco, California, United States (On-site)

$160K – $230K Yearly21h ago
fal.ai logoFA

Staff Technical Lead for Inference & ML Performance

fal.ai

San Francisco, California, United States (On-site)

21h ago
OpenAI logoOP

TL, Research Inference

OpenAI

San Francisco, California, United States (On-site)

$380K – $555K Yearly21h ago
Cohere logoCO

Audio Inference Engineer, Model Efficiency

Cohere

Canada + 4 more (Remote)

13h ago
OpenAI logoOP

Software Engineer, Model Inference

OpenAI

San Francisco, California, United States (On-site)

$295K – $555K Yearly13h ago
Anthropic logoAN

Performance Engineer, Inference Systems

Anthropic

San Francisco, California, United States (Hybrid)

$350K – $850K Yearly13h ago
Cartesia logoCA

Inference Engineer

Cartesia

San Francisco, California, United States (On-site)

$180K – $250K Yearly13h ago
Perplexity logoPE

AI Inference Engineer (San Francisco)

Perplexity

San Francisco, California, United States (On-site)

$220K – $485K Yearly13h ago
OpenAI logoOP

Software Engineer, Inference – AMD GPU Enablement

OpenAI

San Francisco, California, United States (On-site)

$295K – $555K Yearly13h ago
OpenAI logoOP

Inference Technical Lead, On-Device Transformers

OpenAI

San Francisco, California, United States (Hybrid)

$445K – $445K Yearly21h ago
Together AI logoTA

Research Engineer, Core ML

Together AI

San Francisco, California, United States (On-site)

$200K – $280K Yearly13h ago
Anthropic logoAN

Technical Program Manager, Inference Performance

Anthropic

San Francisco, California, United States (Hybrid)

$290K – $365K Yearly13h ago
Anthropic logoAN

Engineering Manager, Inference

Anthropic

San Francisco, California, United States (Hybrid)

$425K – $560K Yearly13h ago
Red Hat Canada Limited (f.k.a Cygnus Solutions Canada Limited) logoRH

Forward Deployed Engineer, AI Inference (vLLM and Kubernetes)

Red Hat Canada Limited (f.k.a Cygnus Solutions Canada Limited)

United States (Remote)

$184.9K – $342.5K Yearly1w ago
Together AI logoTA

Research Intern RL & Post-Training Systems, Turbo (Fall 2026)

Together AI

San Francisco, California, United States (On-site)

$58 – $63 Hourly21h ago
Subscribe to this search

Get email updates when new jobs match this search.