Job Description

Director of Engineering

Locations:
San Diego, CA
Austin, TX
Remote

Cirrascale Cloud Services provides high-performance cloud infrastructure purpose-built for deep learning, generative AI, and large-scale AI inference workloads. We specialize in dedicated GPU cloud solutions tailored to the unique needs of startups, research labs, and enterprise AI teams. Our mission is to accelerate AI innovation by combining powerful hardware with white-glove service and flexible, custom-built environments.

Location: Austin, Texas or San Diego, California

Position Overview

We are seeking a Director of Engineering responsible for defining, delivering and operating the organization’s core technology infrastructure and internal developer platforms. This leader will work with the Vice President of Engineering to drive strategy, architecture and execution for scalable, secure and high-performing systems that empower engineering teams, enable rapid product delivery and support business growth. The role requires a balance of hands-on technical expertise, strategic vision and people leadership, along with deep knowledge of modern storage architectures and network infrastructure.

Key Responsibilities

• Align infrastructure and platform initiatives with business objectives, ensuring technology investments deliver measurable value.

• Oversee the design, implementation and operation of scalable, reliable and secure systems across cloud and on-prem environments.

• Lead the development of internal developer platforms and tools to improve engineering productivity and release velocity.

• Lead timely and effective incident response for production infrastructure, with particular focus on storage-related failures and performance issues.

• Oversee the creation, maintenance and accessibility of technical documentation for infrastructure, platform tools, operational procedures and incident response playbooks.

• Direct strategy and execution for software-defined storage platforms, ensuring high data durability, scalability and performance.

• Build, mentor and retain high-performing infrastructure and platform engineering teams.

• Ensure compliance with regulatory, security and data privacy requirements.

Required Qualifications

• Bachelor’s degree in Computer Science, Engineering or a related field (or equivalent experience).

• 10+ years in infrastructure, platform or DevOps engineering roles, including 5+ years in leadership.

• Proven success leading large-scale bare-metal HPC/AI accelerator infrastructure initiatives.

• Deep expertise in software-defined storage, erasure coding techniques and object store technologies (Ceph).

• Experience with schedulers such as Slurm and Kubernetes.

• Strong understanding of CI/CD, observability tooling and modern software delivery practices.

• Experience overseeing incident management processes, including production troubleshooting and postmortems.

• Proven ability to write, review and maintain clear technical documentation for both technical and non-technical audiences.

• Track record of building high-performing engineering teams and delivering results in a complex, fast-paced environment.

Preferred Qualifications

• Experience with AI model training workflows, including storage I/O patterns in multi-node GPU clusters.

• Familiarity with storage solutions from WEKA, Ceph and cloud-native offerings.

• Experience with monitoring tools (Prometheus, Grafana) and configuration management (Ansible).

• Knowledge of data governance and compliance standards in AI environments.

• Certifications such as CCNA, CCNP or equivalent experience.

• Expertise in HPC, data center networking and AI/ML infrastructure.

Benefits and Compensation

Comprehensive benefits package, including health, dental, and vision insurance, retirement plans, paid time off, and opportunities for professional development.

The base salary range for the Director of Engineering position is $173,400 to $255,000. This range reflects the broad, minimum-to-maximum pay range for this job in the location for which it has been posted. Compensation decisions depend on several factors including, but not limited to, an individual’s qualifications, the location where the role is to be performed, internal equity, and alignment with market data.

Why Join Cirrascale?

Join a growing team that's pushing the boundaries of AI infrastructure. At Cirrascale, you’ll contribute to projects powering next-generation AI applications while working with top-tier hardware in a collaborative and innovative environment. From custom deployments to hands-on customer support, every role here plays a part in enabling breakthroughs in AI.

Apply Now:  careers@cirrascale.com

Interested in applying? Submit your resume and cover letter through the button below.

Apply Now