GPU Inference Jobs
Browse 18 GPU Inference jobs on FDE Jobs.
Software Engineer, Inference – AMD GPU Enablement
OpenAI
San Francisco, California, United States (On-site)
Inference Technical Lead, Sora
OpenAI
San Francisco, California, United States (Hybrid)
TL, Research Inference
OpenAI
San Francisco, California, United States (On-site)
AI Inference Engineer (London)
Perplexity
London, England, United Kingdom (On-site)
AI Inference Engineer (San Francisco)
Perplexity
San Francisco, California, United States (On-site)
Software Engineer, Model Inference
OpenAI
San Francisco, California, United States (On-site)
Audio Inference Engineer, Model Efficiency
Cohere
Canada + 4 more (Remote)
Performance Engineer, Inference Systems
Anthropic
San Francisco, California, United States (Hybrid)
Inference Engineer
Cartesia
San Francisco, California, United States (On-site)
System Engineer (Token Factory)
Nebius
Netherlands + 5 more (Remote)
Director, Engineering - Inference Serving Engine
DigitalOcean
Bengaluru, Karnataka, India (Hybrid)
Inference Technical Lead, On-Device Transformers
OpenAI
San Francisco, California, United States (Hybrid)
Systems Research Engineer Intern - GPU Programming (Fall 2026)
Together AI
San Francisco, California, United States (On-site)
Systems Research Engineer, GPU Programming
Together AI
San Francisco, California, United States (On-site)
LLM Inference Frameworks and Optimization Engineer
Together AI
San Francisco, California, United States (On-site)
Staff Technical Lead for Inference & ML Performance
fal.ai
San Francisco, California, United States (On-site)
Software Engineer, Inference - Performance Optimization
OpenAI
San Francisco, California, United States (On-site)
Forward Deployed Infrastructure Engineer
Hyperbolic Labs
Worldwide (Remote)