1. Home
  2. Jobs
  3. GPU Inference

GPU Inference Jobs

Browse 18 GPU Inference jobs on FDE Jobs.

18 jobs
OpenAI logoOP

Software Engineer, Inference – AMD GPU Enablement

OpenAI

San Francisco, California, United States (On-site)

$295K – $555K Yearly16h ago
OpenAI logoOP

Inference Technical Lead, Sora

OpenAI

San Francisco, California, United States (Hybrid)

$380K – $380K Yearly16h ago
OpenAI logoOP

TL, Research Inference

OpenAI

San Francisco, California, United States (On-site)

$380K – $555K Yearly1d ago
Perplexity logoPE

AI Inference Engineer (London)

Perplexity

London, England, United Kingdom (On-site)

16h ago
Perplexity logoPE

AI Inference Engineer (San Francisco)

Perplexity

San Francisco, California, United States (On-site)

$220K – $485K Yearly16h ago
OpenAI logoOP

Software Engineer, Model Inference

OpenAI

San Francisco, California, United States (On-site)

$295K – $555K Yearly16h ago
Cohere logoCO

Audio Inference Engineer, Model Efficiency

Cohere

Canada + 4 more (Remote)

16h ago
Anthropic logoAN

Performance Engineer, Inference Systems

Anthropic

San Francisco, California, United States (Hybrid)

$350K – $850K Yearly16h ago
Cartesia logoCA

Inference Engineer

Cartesia

San Francisco, California, United States (On-site)

$180K – $250K Yearly16h ago
Nebius logoNE

System Engineer (Token Factory)

Nebius

Netherlands + 5 more (Remote)

16h ago
DigitalOcean logoDI

Director, Engineering - Inference Serving Engine

DigitalOcean

Bengaluru, Karnataka, India (Hybrid)

1d ago
OpenAI logoOP

Inference Technical Lead, On-Device Transformers

OpenAI

San Francisco, California, United States (Hybrid)

$445K – $445K Yearly1d ago
Together AI logoTA

Systems Research Engineer Intern - GPU Programming (Fall 2026)

Together AI

San Francisco, California, United States (On-site)

$58 – $63 Hourly1d ago
Together AI logoTA

Systems Research Engineer, GPU Programming

Together AI

San Francisco, California, United States (On-site)

$160K – $230K Yearly1d ago
Together AI logoTA

LLM Inference Frameworks and Optimization Engineer

Together AI

San Francisco, California, United States (On-site)

$160K – $230K Yearly1d ago
fal.ai logoFA

Staff Technical Lead for Inference & ML Performance

fal.ai

San Francisco, California, United States (On-site)

1d ago
OpenAI logoOP

Software Engineer, Inference - Performance Optimization

OpenAI

San Francisco, California, United States (On-site)

$295K – $555K Yearly16h ago
Subscribe to this search

Get email updates when new jobs match this search.