1. Home
  2. Jobs
  3. GPU Inference Optimization

GPU Inference Optimization Jobs

Browse 22 GPU Inference Optimization jobs on FDE Jobs.

22 jobs
OpenAI logoOP

Software Engineer, Inference – AMD GPU Enablement

OpenAI

San Francisco, California, United States (On-site)

$295K – $555K Yearly9h ago
OpenAI logoOP

Inference Technical Lead, Sora

OpenAI

San Francisco, California, United States (Hybrid)

$380K – $380K Yearly9h ago
OpenAI logoOP

TL, Research Inference

OpenAI

San Francisco, California, United States (On-site)

$380K – $555K Yearly17h ago
Perplexity logoPE

AI Inference Engineer (San Francisco)

Perplexity

San Francisco, California, United States (On-site)

$220K – $485K Yearly9h ago
Cohere logoCO

Audio Inference Engineer, Model Efficiency

Cohere

Canada + 4 more (Remote)

9h ago
Together AI logoTA

LLM Inference Frameworks and Optimization Engineer

Together AI

San Francisco, California, United States (On-site)

$160K – $230K Yearly17h ago
Perplexity logoPE

AI Inference Engineer (London)

Perplexity

London, England, United Kingdom (On-site)

9h ago
Snowflake logoSN

AI System Research and Development Engineer - Optimization

Snowflake

Bellevue, Washington, United States (On-site)

$200K – $265K Yearly17h ago
OpenAI logoOP

Software Engineer, Model Inference

OpenAI

San Francisco, California, United States (On-site)

$295K – $555K Yearly9h ago
fal.ai logoFA

Staff Technical Lead for Inference & ML Performance

fal.ai

San Francisco, California, United States (On-site)

17h ago
Anthropic logoAN

Performance Engineer, Inference Systems

Anthropic

San Francisco, California, United States (Hybrid)

$350K – $850K Yearly9h ago
OpenAI logoOP

Inference Technical Lead, On-Device Transformers

OpenAI

San Francisco, California, United States (Hybrid)

$445K – $445K Yearly17h ago
Together AI logoTA

Forward Deployed Engineer (Inference & Post-Training)

Together AI

San Francisco, California, United States (On-site)

$270K – $300K Yearly1w ago
OpenAI logoOP

Software Engineer, Inference - Performance Optimization

OpenAI

San Francisco, California, United States (On-site)

$295K – $555K Yearly9h ago
Together AI logoTA

Systems Research Engineer Intern - GPU Programming (Fall 2026)

Together AI

San Francisco, California, United States (On-site)

$58 – $63 Hourly17h ago
Together AI logoTA

Research Intern, Inference (Fall 2026)

Together AI

San Francisco, California, United States (On-site)

$58 – $63 Hourly17h ago
DigitalOcean logoDI

Director, Engineering - Inference Serving Engine

DigitalOcean

Bengaluru, Karnataka, India (Hybrid)

17h ago
Together AI logoTA

Systems Research Engineer, GPU Programming

Together AI

San Francisco, California, United States (On-site)

$160K – $230K Yearly17h ago
Nebius logoNE

System Engineer (Token Factory)

Nebius

Netherlands + 5 more (Remote)

8h ago
Subscribe to this search

Get email updates when new jobs match this search.