- Home
- Remote Jobs
- Senior Site Reliability Engineer Jobs
Senior Site Reliability Engineer Jobs - Remote Work From Home & Flexible
Job Search Results
Job Icon Guide
- Candidates Everywhere
- Candidates in US
- Australia Candidates
- Canada Candidates
- a Certain City or State
Featured Companies are employers who have come directly to FlexJobs, been approved by our staff, and have directly posted their jobs to the FlexJobs site.
- Hybrid Remote Work
- Full-Time
- Employee
- Gurugram, HR, India
Build and maintain infrastructure, automate system engineering tasks, and ensure service request SLAs are met. Solid experience in cloud-based rapid delivery environments, programming languages, and SRE or Production Support engineering teams.
- 100% Remote Work
- Full-Time
- Employee
- Poland
Engage with teams to improve service delivery and reliability. Measure and monitor production systems for availability and system health. Drive teams towards better operational excellence and lobby for changes to improve reliability and observability.
- 100% Remote Work
- Full-Time
- Employee
- Durham, NC
Design, build, and maintain core infrastructure. Create and iterate on workloads in Google Kubernetes Engine. Collaborate with engineering teams to improve scalability and performance. Ensure uptime and reliability of infrastructure. Participate in on...
- 100% Remote Work
- Full-Time
- Employee
- London, United Kingdom
Design, build and maintain infrastructure estate on public cloud providers. Support multiple development teams with CI/CD platform to deliver high-quality builds. Automate operational tasks via Go, Python and serverless solutions.
- 100% Remote Work
- Full-Time
- Employee
- 118,000 - 209,000 USD Annually
- Bulgaria, Croatia, Cyprus, Czechia, Egypt, Israel, Kenya, Lebanon, Luxembourg, Malta, Nigeria, Oman, Qatar, Romania, Saudi Arabia, Serbia, South Africa, Ukraine, United Arab Emirates, United Kingdom, Canada
Develop and maintain reliable infrastructure, build test automation tooling, Kubernetes, and execution tracing. Troubleshoot, debug, work with microservices, Cloud Native, Kubernetes/EKS. Plan capacity, maintain security.
- 100% Remote Work
- Full-Time
- Employee
- 65,000 - 115,000 USD Annually
- Columbus, OH
Manage production and pre-production environments, analyze performance, troubleshoot issues, and automate deployment and incident response processes. Work closely with cross-functional teams to ensure optimal system reliability and scalability.
- 100% Remote Work
- Full-Time
- Employee
- Canada
Engage with teams to improve service delivery and reliability, measure and monitor all production systems, seek out the cause of errors, and drive teams towards better operational excellence. Identify and drive down toil with creative innovation and ..
- 100% Remote Work
- Full-Time
- Employee
- Philippines
Review architecture and software components. Ensure best practices are consistent. Own and ensure SLOs and SLAs are met. Monitor operational metrics and lead improvement plans. Develop runbooks and other technical assets. Implement DR plans.
- Hybrid Remote Work
- Full-Time
- Employee
- Bangalore, KA, India, Mohali, PB, India
Improve system reliability by building scalable process frameworks, improving infrastructure and tooling, and reducing manual work through automation. Ensure adoption of SRE best practices and seamless transfer of support responsibilities.
- Full-Time
- Employee
- 175,000 - 230,000 USD Annually
- New York, NY
Consult with Product Engineers on software development, design and implement software infrastructure solutions, author, review, and optimize Terraform code, provide training and mentoring, participate in support rotation.
- 100% Remote Work
- Full-Time
- Employee
- 127,000 - 223,000 USD Annually
- Canada, or US National
Design, build, and maintain products and services, advocate Site Reliability Engineering principles, identify sources of toil and eliminate them, take ownership of projects, and participate in a 24x7 on-call rotation.
- Hybrid Remote Work
- Full-Time
- Employee
- Bangalore, KA, India
Improve infrastructure, tooling, and process improvements. Support and troubleshoot large-scale distributed software applications and networks. Promote best practices for reliability within the Engineering Department.
- 100% Remote Work
- Full-Time
- Employee
- Australia
Lead design, implementation, and maintenance of highly available and scalable infrastructure and services. Optimize AWS usage to align with security best practices. Develop best practices for monitoring, alerting, and incident response to ensure syst..
- 100% Remote Work
- Alternative Schedule
- Employee
- Bulgaria, Croatia, Cyprus, Czechia, Egypt, Israel, Kenya, Lebanon, Luxembourg, Malta, Nigeria, Oman, Qatar, Romania, Saudi Arabia, Serbia, South Africa, Ukraine, United Arab Emirates, United Kingdom
Develop self-healing, automated systems to ensure 24/7 service uptime, monitor SLOs and mission-critical metrics, build robust monitoring and alerting capabilities, and work with other teams to guarantee security and high availability of services.
- 100% Remote Work
- Full-Time
- Employee
- Portugal
Responsible for ensuring the smooth operation of user-facing services and production systems. Define and build Service Level Objectives and Indicators, automate processes, and participate in incident response. Strong technical skills in AWS and obser..
- 100% Remote Work
- Full-Time
- Employee
- Austria, Belgium, Denmark, Finland, France, Germany, Greece, Hungary, Ireland, Italy, Netherlands, Norway, Poland, Portugal, Romania, Spain, Sweden, Switzerland, Ukraine, United Kingdom
Drive continuous improvement and delivery within the team, maintain and develop existing cloud infrastructure, implement automation, testing and deployment pipelines, and own the reliability and security of cloud infrastructure and services.
- 100% Remote Work
- Full-Time
- Employee
- 212,000 - 212,000 USD Annually
- US National
Design, build, and support reliable core systems and infrastructure. Collaborate with cross-functional teams. Improve reliability, scalability, and performance. Debug complex issues and mentor other engineers.
- 100% Remote Work
- Full-Time
- Employee
- 173,400 - 298,000 USD Annually
- US National
Re-engineering core product for global scalability and latency optimization. Building monitoring and observability stack. Investing in security posture, automation, and operational challenges. Strong Linux systems and networking experience required.
- 100% Remote Work
- Full-Time
- Employee
- US National
Maintain and operate monitoring systems and infrastructure, collaborate with development and operations teams, conduct audits and assessments of monitoring systems, and represent the team in global incidents resolution.
- Full-Time
- Employee
- Amsterdam, Netherlands, Berlin, Germany, London, United Kingdom, Munich, Germany, Valencia, Spain
Communicate effectively with the team, assist team members, provide knowledge and counsel to other engineers, foster a culture of open sharing, and lead technical decision-making and incident management.
- 100% Remote Work
- Alternative Schedule
- Employee
- 100,000 - 201,000 USD Annually
- Bulgaria, Croatia, Cyprus, Czechia, Egypt, Israel, Kenya, Lebanon, Luxembourg, Malta, Nigeria, Oman, Qatar, Romania, Saudi Arabia, Serbia, South Africa, Ukraine, United Arab Emirates, United Kingdom
Assist in designing and improving processes, tools, and solutions for building, deploying, monitoring, and maintaining production systems. Collaborate with teams for events such as production releases and incident management. Investigate and...
- 100% Remote Work
- Full-Time
- Employee
- 143,000 - 198,196 USD Annually
- BC, Canada, ON, Canada, or US National
Improve reliability and stability of customer-facing, production infrastructure. Guide and empower engineers to introduce new features in a safe and sustainable way. Raise the bar on observability practices and participate in incident response processes.
- 100% Remote Work
- Full-Time
- Employee
- 160,000 - 180,000 USD Annually
- Australia, Bangladesh, China, Hong Kong, India, Indonesia, Japan, Malaysia, New Zealand, Pakistan, Philippines, Singapore, Sri Lanka, Taiwan, Thailand, Vietnam
Design, build, and maintain core infrastructure pieces, plan infrastructure growth, automate deployment process, debug production issues, and lead technical discussions. Preferably experience in site reliability engineering and/or software..
- 100% Remote Work
- Full-Time
- Employee
- London, ENG, United Kingdom
Advise engineering teams on designing and developing highly performant systems. Automate tasks to improve efficiency. Diagnose and fix network, system, and service-level issues. Optimize performance and user experience.
- Hybrid Remote Work
- Freelance
- 65.00 - 85.00 USD Hourly
- Phoenix, AZ
Facilitate code transition to production, oversee new code deployment, manage infrastructure incidents, and containerization/orchestration using Docker/Kubernetes. Requires DevOps/SRE/cloud experience and managing/creating CI/CD pipelines.
- 100% Remote Work
- Full-Time
- Employee
- 160,000 - 180,000 USD Annually
- Argentina, Bolivia, Brazil, Chile, Colombia, Ecuador, Guyana, Paraguay, Peru, Suriname, Uruguay, Venezuela, Canada, Mexico, or Work from Anywhere
Design, build, and maintain core infrastructure, plan infrastructure growth, automate deployment process, and ensure adequate observability. Debug production issues, lead technical discussions, coach junior SREs, and take on long-term projects.
- 100% Remote Work
- Full-Time
- Freelance
- São Paulo, Brazil, Argentina, Bolivia, Brazil, Chile, Colombia, Ecuador, Guyana, Paraguay, Peru, Suriname, Uruguay, Venezuela
Collaborate with development team, design and operate critical systems, drive reliability projects, troubleshoot system operation issues, provide infrastructure support, take ownership of software infrastructure projects, seek and give constructive f..
- 100% Remote Work
- Full-Time
- Employee
- 135,000 - 150,000 USD Annually
- US National
Design and implement SRE practices for production systems, manage infrastructure through automation, handle emergency response, and improve codebase. 8+ years of experience as a software engineer and 5+ years as a Site Reliability Engineer required.
- 100% Remote Work
- Full-Time
- Employee
- Ireland
Automate and improve reliability of core cloud products. Collaborate with product developers to ensure functional and non-functional requirements are met. Support and maintain critical production systems.
- Hybrid Remote Work
- Full-Time
- Employee
- Mexico City, CDMX, Mexico
As a Senior Site Reliability Engineer, you'll manage production environments, contribute to infrastructure and software solutions, and work closely with technical and non-technical teams. You'll also lead and execute strategic objectives for the team.
- 100% Remote Work
- Full-Time
- Employee
- 165,000 - 220,000 USD Annually
- Canada, New York, NY, or US National
Support the infrastructure platform, scale AWS footprint, build tools for problem identification, shape incident management practices, improve monitoring and alerting platform, collaborate with engineering team for optimal performance and scalability.
- 100% Remote Work
- Full-Time
- Employee
- 165,000 - 220,000 USD Annually
- NYC, NY, Canada, or US National
Maintain reliable, secure, scalable, and highly available infrastructure and applications. Scale AWS footprint, build tools for engineers, and drive incident management practices. Spread SRE culture and collaborate with the engineering team.
- 100% Remote Work
- Full-Time
- Employee
- Canada
Support reliable, secure, and scalable infrastructure platform development and maintenance. Collaborate with product and engineering teams, drive incident management, improve monitoring and alerting, and spread SRE culture. Stay current with industry..
- 100% Remote Work
- Full-Time
- Employee
- Austin, TX, Reston, VA
Design and develop platform software to increase product reliability and efficiency. Guide reliability practice through the SSDLC, maintain service and platform health, and improve operational processes continuously.
- 100% Remote Work
- Full-Time
- Employee
- 170,000 - 276,000 USD Annually
- US National (Not hiring in AK, HI, MT, ND, SD, NE, IA, WV)
Design, build, maintain, and operate the next-gen infrastructure platform. Extend it by building tools and apps, deploying cloud native open source tools, and instituting resilient infrastructure through Infrastructure as code. Foster a collaborative...
- 100% Remote Work
- Full-Time
- Employee
- Philippines
Review and ensure best practices for architecture and software components, monitor metrics, manage security controls, implement compliance standards, conduct performance tests, lead incident response, develop technical assets, collaborate with teams...
- 100% Remote Work
- Full-Time
- Employee
- 165,000 - 175,000 USD Annually
- US National
Serve as the primary cybersecurity resource for the company, staying abreast of the latest threats and implementing measures to protect our systems and data. Experience conducting security audits and vulnerability assessments.
- Hybrid Remote Work
- Full-Time
- Employee
- Mexico City, MX, Mexico
Manage production-ready features for community members. Work with technical and non-technical teams to operate large-scale, secure, and performant distributed systems. Plan, lead, and execute strategic objectives for the team.
- 100% Remote Work
- Freelance
- Bulgaria, Croatia, Cyprus, Czechia, Egypt, Israel, Kenya, Lebanon, Luxembourg, Malta, Nigeria, Oman, Qatar, Romania, Saudi Arabia, Serbia, South Africa, Ukraine, United Arab Emirates, United Kingdom
Co-own production service designs, drive reliability and observability improvements, build internal tools and automation software, champion reliability-focused practices. 4+ years of experience in Infrastructure, SRE, DevOps or System Administration ..
- 100% Remote Work
- Full-Time
- Employee
- Bulgaria, Croatia, Cyprus, Czechia, Egypt, Israel, Kenya, Lebanon, Luxembourg, Malta, Nigeria, Oman, Qatar, Romania, Saudi Arabia, Serbia, South Africa, Ukraine, United Arab Emirates, United Kingdom
Design, deploy and maintain infrastructure-as-code across private cloud, Kubernetes, and application clusters. Work with a team of talented engineers to drive upgrades and provide mission-critical services for global customers.
- Hybrid Remote Work
- Full-Time
- Employee
- Bangalore, India
Work with your SRE and other engineering counterparts for building more scalable, resilient and reliable systems. Collaborate with Engineering organizations to build and automate tooling. Work independently with a minimal level of guidance...
- Hybrid Remote Work
- Full-Time
- Employee
- Bengaluru, KA, India
Improve customer environments' availability, scalability, latency, and efficiency. Design & build modern tools for the SRE and other teams. Engage with other engineering teams across discovery & delivery phases of engagements, advisory, design & impl...
- Hybrid Remote Work
- Full-Time
- Employee
- 200,000 - 275,000 USD Annually
- New York, NY
Build and maintain an observability platform for critical systems, collaborate with cross-functional teams to optimize resource utilization, drive automation and infrastructure provisioning, reduce toil, and ensure cost, performance, and reliability ..
- 100% Remote Work
- Full-Time
- Employee
- Romania
Build and support private and public cloud infrastructure using OpenStack. Contribute to software development, troubleshoot code-level problems, and engage with engineers and partners to handle disaster recovery and security challenges.
- Hybrid Remote Work
- Full-Time
- Employee
- Glendale, CA, Los Angeles, CA, New York, NY
Partner with developers to deliver software products, introduce DevOps mindset, build security designs, automate processes, collaborate with cross-functional teams, participate in incident response, develop tools for capacity planning, and implement ..
- 100% Remote Work
- Full-Time
- Employee
- Durham, NC
Design, build, and maintain infrastructure, participate in solution design, handle production incidents, monitor performance and costs, collaborate with engineering teams, and ensure uptime and reliability of infrastructure.
- 100% Remote Work
- Full-Time
- Employee
- London, ENG, United Kingdom
Design, build and maintain infrastructure primitives for the company's next generation cloud platform. Perform infrastructure standardization and unification across all business units and geographies. Design, build and support CI/CD pipelines to deli..
- 100% Remote Work
- Full-Time
- Employee
- London, United Kingdom
Build robust, easy-to-use foundational platforms and tools. Exemplify cloud-native site reliability best practices. Write performant, maintainable, and clear code. Debug problems in cloud native distributed systems.
- Hybrid Remote Work
- Full-Time
- Employee
- Boca Raton, FL
Automate processes and apply industry-standard site reliability principles. Manage secure, scalable cloud infrastructure and services, monitor performance, and troubleshoot issues. Collaborate with cross-functional teams and stay updated on new cloud..
- 100% Remote Work
- Full-Time
- Employee
- 65,000 - 115,000 USD Annually
- Boston, MA
Manage production and pre-production environments, security, change management, deployment, architecture, and tools. Analyze performance and ensure scalability and reliability of applications hosted in AWS. Automate deployment, monitoring, and incide..
Want a Great Remote
or Flexible Job?
Save time and find higher-quality jobs than on other sites, guaranteed.
Join FlexJobs Now!FlexJobs in the News
Success Stories Just In!
Weekly Newsletter
Get new job postings, the latest job search tips, trends, news, and exclusive promotions!
Sign Up Today!