Back to all jobs

Site Reliability/Platform Engineer (Linux/ Kubernetes / Python) - 180-190K

Work from home Full-time role Hiring

Site Reliability/Platform Engineer (Linux/ Kubernetes / Python) Location: Reston, VA (onsite 3 days a week, but only 9 days a month) Salary: 180-90K + 10% Bonus Must have the following: Kubernetes, Red Hat Linux, Python & Bash scripting, Platform Engineering, On-prem infrastructure, Observability, Incident response, either Bare-Metal or VM is fine... OpenShift experience will be considered a big advantage. Responsibilities:

  • Own the stability, performance, and reliability of core platform infrastructure across on-premise environments and future Azure cloud.
  • Manage and optimize Kubernetes/OpenShift clusters, ensuring high availability and scalability.
  • Lead incident response, root-cause analysis, and long-term remediation efforts in a high-stakes production environment.
  • Drive continuous improvement of platform performance, reliability, and automation.
  • Build and enhance observability frameworks using tools like Prometheus, Grafana, and Datadog.
  • Develop and maintain automation scripts and tooling using Python and Bash to reduce manual intervention.
  • Partner with engineering and development teams to troubleshoot deployment, configuration, and infrastructure issues.
  • Support and enhance CI/CD pipelines and platform delivery processes.
  • Administer and optimize Linux-based systems, primarily within Red Hat environments.
  • Maintain documentation, runbooks, and operational procedures.
  • Participate in on-call rotation supporting critical systems.

Requirements:

  • Bachelor's degree in computer science or related field, or equivalent experience.
  • 5+ years of experience in Site Reliability Engineering or Platform Engineering roles.
  • Strong hands-on experience with Kubernetes in production environments (OpenShift preferred but not required).
  • Solid experience with Red Hat Enterprise Linux system administration.
  • Strong scripting experience with Python and Bash.
  • Experience managing on-premises infrastructure environments.
  • Experience with observability tools such as Prometheus, Grafana, or Datadog.
  • Strong troubleshooting experience across distributed systems, logs, metrics, and traces.
  • Experience working in high-performance, high-availability environments.
  • Exposure to Azure cloud services is a plus.
  • Strong communication and documentation skills.

Site Reliability Engineer, SRE, OpenShift engineer, Kubernetes engineer, Azure cloud engineer, platform engineer, DevOps engineer, observability, Grafana, Prometheus, Datadog, HashiCorp Vault, Kafka, AMQ, Redis, CI/CD, automation, Bash scripting, Python scripting, cloud infrastructure, hybrid cloud, data center, reliability engineering, incident response, root cause analysis, container platform, cluster management, Azure infrastructure, production support, platform reliability, DevOps, monitoring tools, automation engineer, enterprise infrastructure, platform services, site reliability, cloud platform, OpenShift administrator, Kubernetes troubleshooting Apply tot his job Apply To this Job

More remote roles to explore

Site Reliability Engineering Manager

Work from home Full-time role

Site Reliability Engineer – SkillBridge Intern

Work from home Full-time role

DevOps Engineer - Kubernetes, AWS & Docker Skills Required (Fully Remote )

Work from home Full-time role

FSO Audit LABS - Kubernetes DevOps Engineer - Senior - Bay Area

Work from home Full-time role

Team Lead, Site Reliability Engineering - Storage Layer Service

Work from home Full-time role

Site Reliability Engineer-SkillBridge Intern

Work from home Full-time role

SRE Architect + Strong Dynatrace exp

Work from home Full-time role

Software Engineer – Java, Spring Boot, Kubernetes, AWS

Work from home Full-time role

Senior DevOps Engineer (Kubernetes, Docker, Jenkins)

Work from home Full-time role

Staff Software Development Engineer-Kubernetes

Work from home Full-time role

Experienced Data Entry Operator | Data Quality Specialist – arenaflex

Work from home Full-time role

Fractional CMO Needed for B2B SaaS Growth Strategy

Work from home Full-time role

Senior Managing Editor

Work from home Full-time role

Recruiter- hybrid

Work from home Full-time role

Franchise Business Coach

Work from home Full-time role

Outbound Sales, Growth Hacker – SDR/BDR

Work from home Full-time role

Experienced Seasonal Senior Customer Service Representative – Las Vegas Airport

Work from home Full-time role

Experienced Customer Service Representative – Pet Parent Support Specialist (Remote in Texas)

Work from home Full-time role

Dynamic Customer Service Sales Representative – Travel Solutions & Revenue Growth Specialist at arenaflex

Work from home Full-time role

Senior Software Engineer, Windows/Desktop Applications - Ho Chi Minh City, Vietnam

Work from home Full-time role