AI Performance Software Engineer Job at Signify Technology, San Francisco, CA

Y0dLWGZzSGNkV1pEWjBVd2RjVUlmUEhkRUE9PQ==
  • Signify Technology
  • San Francisco, CA

Job Description

AI Performance Engineer – CUDA & PyTorch Focus

Location: San Fransisco, CA

Compensation: $200,000-$300,000

A stealth-mode AI systems company is reimagining how large-scale inference is done. With generative AI workloads scaling rapidly, inference efficiency has become a critical bottleneck. We're building an integrated hardware-software platform that brings breakthrough performance and usability to production-scale LLM applications.

This is an opportunity to work on a highly technical team spun out of top-tier academic research, focused on the cutting edge of AI, distributed systems, and performance optimization.

What You’ll Do:

  • Drive core research and implementation of performance optimizations for modern AI models
  • Implement advanced techniques like FlashAttention, KV caching, quantization, and model compression
  • Design and build scalable, distributed compute strategies across GPU-based systems
  • Profile, benchmark, and optimize CUDA kernels and AI runtime performance across inference stacks
  • Work across frameworks like PyTorch, ONNX, and vLLM to improve end-to-end efficiency

What We're Looking For:

  • Strong background in CUDA and low-level GPU performance tuning
  • Proven experience building with PyTorch and deploying high-performance ML models
  • Proficiency in Python and C++
  • Experience with large-scale distributed systems in cloud environments (AWS, GCP, or Azure)
  • Exposure to AI compilers or frameworks like MLIR is a plus
  • Interest in system design, scalability, and accelerating LLM workloads in real production environments

If you’ve spent your time making large models faster, leaner, and more efficient—and want to solve hard technical problems at the core of GenAI infrastructure—this role is for you.

Reach out to learn more.

Job Tags

Similar Jobs

Evans

Architectural Intern Job at Evans

 ...customers. Looking for an Architectural Technologist (Level 2) to join our expanding Construction Team in Dallas, TX. The Architectural Intern will work under the supervision of the Project Architect as an integral member of a Design Team specializing in the development of... 

Gpac

Steel Detailing Project Manager Job at Gpac

Miscellaneous Metals Manager: GPAC is partnered with a well-established steel fabricator to hire a Steel Detailer that has experience as a Project Manager to run the miscellaneous metals department. Due to an increasing work load the company needs to hire an experienced... 

Jibble Group

Freelance Content Writer Job at Jibble Group

Our Mission To help businesses save time and money, and unleash their human potential. Our vision is to power and empower millions of businesses with our software. About Jibble Group We're a scale-up in the Workforce Management space that has fully embraced remote...

Macrosoft

Creative Manager Job at Macrosoft

Title: Creative Manager Location: Dallas, TX (Hybrid) Duration: 36 Months Required Skills: Experience developing 3D displays that bring the brand to life in the retail environment Be well-versed in all the main Creative Suite or creative tool applications...

Mara Talent

Recruitment Consultant (Graduate Scheme) Job at Mara Talent

Trainee Recruitment Consultant Entry-Level Opportunity - Austin, Texas Why not get ahead of the summer rush with an exciting new position and earn some serious cash in 2025.. About the Company Our client, a fast-growing and innovative recruitment agency,...