Software Engineer

 

Description:

 

This role requires deep technical expertise combined with the ability to influence architecture, engineering practices, and organizational priorities across multiple teams. You will partner closely with Fleet Engineering, Infrastructure, Product, Hardware, and AI Platform teams to ensure CoreWeave delivers industry-leading performance, reliability, and efficiency for GPU workloads at hyperscale.

What You'll Do
 

  • Define the long-term technical strategy and architecture for CoreWeave's GPU performance testing and validation platform.
  • Lead the design and implementation of scalable systems for validating performance, reliability, and health across CoreWeave's global infrastructure footprint.
  • Drive cross-functional initiatives spanning infrastructure testing, hardware qualification, fleet provisioning, and AI infrastructure performance optimization.
  • Architect and develop backend services, APIs, and automation frameworks in Go and/or Python that support large-scale testing and validation workflows.
  • Design and oversee Kubernetes-native testing platforms, operators, and controllers used across thousands of GPUs and clusters.
  • Establish performance benchmarks, testing methodologies, and operational standards for new hardware platforms and infrastructure deployments.
  • Influence engineering standards, deployment strategies, observability practices, and reliability frameworks across multiple teams.
  • Identify and solve systemic performance bottlenecks impacting customer workloads, infrastructure efficiency, and fleet utilization.
  • Partner with hardware vendors and internal stakeholders to evaluate emerging technologies and shape future infrastructure investments.
  • Mentor senior engineers and act as a technical leader across the organization through design reviews, architecture discussions, and strategic initiatives.
  • Serve as a key technical decision-maker during critical incidents involving performance, scalability, and infrastructure reliability.

Who You Are

  • 8+ years of software engineering experience, including experience leading large-scale technical initiatives.
  • Strong proficiency in Go and/or Python.
  • Deep hands-on experience operating Kubernetes-based infrastructure at production scale.
  • Proven track record of architecting distributed systems and driving technical direction across multiple teams.
  • Experience leading cross-functional efforts with significant business and engineering impact.
  • Strong systems-level understanding of infrastructure performance, reliability, and scalability.

Organization CoreWeave
Industry IT / Telecom / Software Jobs
Occupational Category Software Engineer
Job Location California,USA
Shift Type Morning
Job Type Full Time
Gender No Preference
Career Level Experienced Professional
Experience 8 Years
Posted at 2026-06-05 12:52 am
Expires on 2026-07-20