Site Reliability Engineer (SRE)
Job Description:
We are seeking a Senior Site Reliability Engineer (SRE) to lead the design, automation, and operation of reliable, scalable systems. This role combines software engineering with infrastructure expertise to ensure high availability and performance across both production and lab environments. You will work closely with development, infrastructure, and operations teams to drive reliability, observability, and continuous improvement.
Key Responsibilities:
Design, build, and maintain resilient infrastructure across cloud and Kubernetes (Talos Linux-based) environments
Build and maintain lab infrastructure for development, testing, and validation, including networking, hardware integration, and automation
Define and monitor SLIs, SLOs, and error budgets to guide reliability efforts
Develop automation tools and scripts in Python, Bash, or Go to reduce manual toil and improve system operations
Improve observability using Prometheus, Grafana, OpenTelemetry, and other monitoring/logging solutions
Manage incident response, perform root cause analysis, and lead postmortem processes
Optimize systems for performance, scalability, and fault tolerance
Contribute to infrastructure as code (IaC) using Terraform, Ansible, or Helm
Collaborate with engineering teams to ensure systems are designed for operational excellence
Requirements:
Bachelor's degree in Software Engineering or Software Development
8+ years of experience as an SRE, DevOps Engineer, or Systems Engineer
Strong expertise in Kubernetes (Talos Linux preferred), cloud platforms (AWS, GCP, Azure), and Linux
Hands-on experience with monitoring, logging, and incident management tools
Proficiency in Python, Bash, or Go for scripting and automation
Experience with building and maintaining lab environments, including physical and virtual infrastructure
Solid knowledge of networking, distributed systems, and performance optimization
Familiarity with CI/CD workflows and Infrastructure as Code practices
Strong communication skills and ability to work cross-functionally
Preferred Qualifications:
Experience in optical systems (e.g., optical networking, photonic devices)
Exposure to or interest in quantum computing platforms and environments
QuEra is committed to cultivating a diverse work environment and is proud to be an equal opportunity employer. We highly value diversity in our current and future employees and do not discriminate (including in our hiring and promotion practices) based on race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status, or any other characteristic protected by law.