[Remote] Platform Engineer (GPU)

Work from home Full-time role Hiring

Note: The job is a remote job and is open to candidates in USA. Vero is an exciting AI infrastructure startup that collaborates closely with NVIDIA and other key organizations to shape the future of data centers. The Platform Engineer (GPU) will be responsible for the operation, optimization, and reliability of large-scale GPU clusters supporting AI/ML and HPC workloads, focusing on performance tuning and systems management.

Responsibilities

Support the reliability, performance, and day-to-day operations of large-scale GPU infrastructure supporting AI/ML and HPC workloads
Optimize Kubernetes platforms to maximize efficiency, utilization, and stability in production
Develop reusable Terraform and Ansible modules to enable scalable, low-drift deployments
Maintain high availability through strong observability, SLO/SLI ownership, and incident response practices
Troubleshoot complex cross-layer issues and manage platform lifecycle (upgrades, scaling, security, multi-tenancy) in production environments

Skills

3+ years of experience in Platform Engineering, SRE, DevOps or infrastructure roles
Robust experience with GPU infrastructure & HPC clusters
Proven experience operating and scaling large distributed systems in high-availability environments
Kubernetes
Terraform & Ansible
Strong background in monitoring, observability and incident response (Prometheus, Grafana, etc.)
Slurm (or similar workload schedulers)

Benefits

Huge equity upside
Medical, dental, and vision insurance for the employee and family
Equity Scheme
Bonus
401(k) with a generous employer match
Company-paid Life Insurance
Flexible Spending Account
Mental Wellness Benefits
Flexible PTO

Company Overview

We help founders and leaders build high-impact teams by connecting them with exceptional talent globally, with a focus in the US. It was founded in 2019, and is headquartered in London, City of London, GB, with a workforce of 11-50 employees. Its website is https://www.wearevero.io/.

Apply To This Job

Apply

[Remote] Platform Engineer (GPU)

More remote roles to explore

[Remote] Account Executive (High-Ticket Sales Closer)

[Remote] Senior Contract Analyst

[Remote] Remote Sales- No Experience Needed, Will Train, NO COLD CALLING

[Remote] Internal Audit Operations Specialist

[Remote] Reimbursement Coordinator I Non-Medicare PDGM

[Remote] Sales - Sales and Outreach Intern

[Remote] Research Development Mechanical Engineer

[Remote] Virtual Finance Manager

[Remote] Head of Strategic Accounts

[Remote] Senior Financial Analyst – OCI Finance – Supply Chain

Navigation Client Leader

Customer Success Manager | FinTech | Global Customers

Business Support Analyst, Parts & Service Business Planning

Manager of Sales Compensation

Experienced Customer Support Representative – Remote Chat Support Opportunities

Przedstawiciel Naukowy ds. Szczepień Dorosłych (K/M/N) Koszalin Słupsk

Senior Director, Enterprise Risk Management

CDI Specialist

Experienced Remote Data Entry Specialist – Flexible Work Opportunity at arenaflex

Experienced Data Entry Operator – Remote Opportunity at arenaflex