Senior Site Reliability Engineer Jobs - Remote Work From Home & Flexible

Job Search Results

1 to 50 of 38,095 for "senior site reliability engineer"
  • Hybrid Remote Work
  • Full-Time
  • Employee
  • Gurugram, HR, India

Build and maintain infrastructure, automate system engineering tasks, and ensure service request SLAs are met. Solid experience in cloud-based rapid delivery environments, programming languages, and SRE or Production Support engineering teams.

  • 100% Remote Work
  • Full-Time
  • Employee
  • Poland

Engage with teams to improve service delivery and reliability. Measure and monitor production systems for availability and system health. Drive teams towards better operational excellence and lobby for changes to improve reliability and observability.

  • 100% Remote Work
  • Full-Time
  • Employee
  • Durham, NC

Design, build, and maintain core infrastructure. Create and iterate on workloads in Google Kubernetes Engine. Collaborate with engineering teams to improve scalability and performance. Ensure uptime and reliability of infrastructure. Participate in on...

Inc 500
  • 100% Remote Work
  • Full-Time
  • Employee
  • London, United Kingdom

Design, build and maintain infrastructure estate on public cloud providers. Support multiple development teams with CI/CD platform to deliver high-quality builds. Automate operational tasks via Go, Python and serverless solutions.

  • 100% Remote Work
  • Full-Time
  • Employee
  • 118,000 - 209,000 USD Annually
  • Bulgaria, Croatia, Cyprus, Czechia, Egypt, Israel, Kenya, Lebanon, Luxembourg, Malta, Nigeria, Oman, Qatar, Romania, Saudi Arabia, Serbia, South Africa, Ukraine, United Arab Emirates, United Kingdom, Canada

Develop and maintain reliable infrastructure, build test automation tooling, Kubernetes, and execution tracing. Troubleshoot, debug, work with microservices, Cloud Native, Kubernetes/EKS. Plan capacity, maintain security.

FlexJobs Top 100 Remote
  • 100% Remote Work
  • Full-Time
  • Employee
  • 65,000 - 115,000 USD Annually
  • Columbus, OH

Manage production and pre-production environments, analyze performance, troubleshoot issues, and automate deployment and incident response processes. Work closely with cross-functional teams to ensure optimal system reliability and scalability.

Red100
Barrons 400
Forbes 2000
  • 100% Remote Work
  • Full-Time
  • Employee
  • Canada

Engage with teams to improve service delivery and reliability, measure and monitor all production systems, seek out the cause of errors, and drive teams towards better operational excellence. Identify and drive down toil with creative innovation and ..

  • 100% Remote Work
  • Full-Time
  • Employee
  • Philippines

Review architecture and software components. Ensure best practices are consistent. Own and ensure SLOs and SLAs are met. Monitor operational metrics and lead improvement plans. Develop runbooks and other technical assets. Implement DR plans.

  • Hybrid Remote Work
  • Full-Time
  • Employee
  • Bangalore, KA, India, Mohali, PB, India

Improve system reliability by building scalable process frameworks, improving infrastructure and tooling, and reducing manual work through automation. Ensure adoption of SRE best practices and seamless transfer of support responsibilities.

Glassdoor Employees Choice - Best Place to Work
Forbes 2000
  • Full-Time
  • Employee
  • 175,000 - 230,000 USD Annually
  • New York, NY

Consult with Product Engineers on software development, design and implement software infrastructure solutions, author, review, and optimize Terraform code, provide training and mentoring, participate in support rotation.

  • 100% Remote Work
  • Full-Time
  • Employee
  • 127,000 - 223,000 USD Annually
  • Canada, or US National

Design, build, and maintain products and services, advocate Site Reliability Engineering principles, identify sources of toil and eliminate them, take ownership of projects, and participate in a 24x7 on-call rotation.

Red100
  • Hybrid Remote Work
  • Full-Time
  • Employee
  • Bangalore, KA, India

Improve infrastructure, tooling, and processes. Support and troubleshoot large-scale distributed software applications and networks. Promote best practices for reliability within the Engineering Department.

Glassdoor Employees Choice - Best Place to Work
Forbes 2000
  • 100% Remote Work
  • Full-Time
  • Employee
  • Australia

Lead design, implementation, and maintenance of highly available and scalable infrastructure and services. Optimize AWS usage to align with security best practices. Develop best practices for monitoring, alerting, and incident response to ensure syst..

  • 100% Remote Work
  • Alternative Schedule
  • Employee
  • Bulgaria, Croatia, Cyprus, Czechia, Egypt, Israel, Kenya, Lebanon, Luxembourg, Malta, Nigeria, Oman, Qatar, Romania, Saudi Arabia, Serbia, South Africa, Ukraine, United Arab Emirates, United Kingdom

Develop self-healing, automated systems to ensure 24/7 service uptime, monitor SLOs and mission-critical metrics, build robust monitoring and alerting capabilities, and work with other teams to guarantee security and high availability of services.

  • 100% Remote Work
  • Full-Time
  • Employee
  • Portugal

Responsible for ensuring the smooth operation of user-facing services and production systems. Define and build Service Level Objectives and Indicators, automate processes, and participate in incident response. Strong technical skills in AWS and obser..

  • 100% Remote Work
  • Full-Time
  • Employee
  • Austria, Belgium, Denmark, Finland, France, Germany, Greece, Hungary, Ireland, Italy, Netherlands, Norway, Poland, Portugal, Romania, Spain, Sweden, Switzerland, Ukraine, United Kingdom

Drive continuous improvement and delivery within the team, maintain and develop existing cloud infrastructure, implement automation, testing and deployment pipelines, and own the reliability and security of cloud infrastructure and services.

  • 100% Remote Work
  • Full-Time
  • Employee
  • 212,000 - 212,000 USD Annually
  • US National

Design, build, and support reliable core systems and infrastructure. Collaborate with cross-functional teams. Improve reliability, scalability, and performance. Debug complex issues and mentor other engineers.

  • 100% Remote Work
  • Full-Time
  • Employee
  • 173,400 - 298,000 USD Annually
  • US National

Re-engineering core product for global scalability and latency optimization. Building monitoring and observability stack. Investing in security posture, automation, and operational challenges. Strong Linux systems and networking experience required.

  • 100% Remote Work
  • Full-Time
  • Employee
  • US National

Maintain and operate monitoring systems and infrastructure, collaborate with development and operations teams, conduct audits and assessments of monitoring systems, and represent the team in global incident resolution.

Human Rights Campaign Best Places to Work
Forbes 2000
Glassdoor Employees Choice - Best Place to Work
  • Full-Time
  • Employee
  • Amsterdam, Netherlands, Berlin, Germany, London, United Kingdom, Munich, Germany, Valencia, Spain

Communicate effectively with the team, assist team members, provide knowledge and counsel to other engineers, foster a culture of open sharing, and lead technical decision-making and incident management.

  • 100% Remote Work
  • Alternative Schedule
  • Employee
  • 100,000 - 201,000 USD Annually
  • Bulgaria, Croatia, Cyprus, Czechia, Egypt, Israel, Kenya, Lebanon, Luxembourg, Malta, Nigeria, Oman, Qatar, Romania, Saudi Arabia, Serbia, South Africa, Ukraine, United Arab Emirates, United Kingdom

Assist in designing and improving processes, tools, and solutions for building, deploying, monitoring, and maintaining production systems. Collaborate with teams for events such as production releases and incident management. Investigate and...

FlexJobs Top 100 Remote
  • 100% Remote Work
  • Full-Time
  • Employee
  • 143,000 - 198,196 USD Annually
  • BC, Canada, ON, Canada, or US National

Improve reliability and stability of customer-facing, production infrastructure. Guide and empower engineers to introduce new features in a safe and sustainable way. Raise the bar on observability practices and participate in incident response processes.

  • 100% Remote Work
  • Full-Time
  • Employee
  • 160,000 - 180,000 USD Annually
  • Australia, Bangladesh, China, Hong Kong, India, Indonesia, Japan, Malaysia, New Zealand, Pakistan, Philippines, Singapore, Sri Lanka, Taiwan, Thailand, Vietnam

Design, build, and maintain core infrastructure pieces, plan infrastructure growth, automate deployment process, debug production issues, and lead technical discussions. Preferably experience in site reliability engineering and/or software..

  • 100% Remote Work
  • Full-Time
  • Employee
  • London, ENG, United Kingdom

Advise engineering teams on designing and developing highly performant systems. Automate tasks to improve efficiency. Diagnose and fix network, system, and service-level issues. Optimize performance and user experience.

Entrepreneur's Best Small and Medium Workplaces
Harris Reputation Quotient Survey
FlexJobs Top 100 Remote
  • Hybrid Remote Work
  • Freelance
  • 65.00 - 85.00 USD Hourly
  • Phoenix, AZ

Facilitate code transition to production, oversee new code deployment, manage infrastructure incidents, and handle containerization/orchestration using Docker/Kubernetes. Requires DevOps/SRE/cloud experience and experience managing/creating CI/CD pipelines.

FlexJobs Top 100 Remote
  • 100% Remote Work
  • Full-Time
  • Employee
  • 160,000 - 180,000 USD Annually
  • Argentina, Bolivia, Brazil, Chile, Colombia, Ecuador, Guyana, Paraguay, Peru, Suriname, Uruguay, Venezuela, Canada, Mexico, or Work from Anywhere

Design, build, and maintain core infrastructure, plan infrastructure growth, automate deployment process, and ensure adequate observability. Debug production issues, lead technical discussions, coach junior SREs, and take on long-term projects.

  • 100% Remote Work
  • Full-Time
  • Freelance
  • São Paulo, Brazil, Argentina, Bolivia, Brazil, Chile, Colombia, Ecuador, Guyana, Paraguay, Peru, Suriname, Uruguay, Venezuela

Collaborate with development team, design and operate critical systems, drive reliability projects, troubleshoot system operation issues, provide infrastructure support, take ownership of software infrastructure projects, seek and give constructive f..

  • 100% Remote Work
  • Full-Time
  • Employee
  • 135,000 - 150,000 USD Annually
  • US National

Design and implement SRE practices for production systems, manage infrastructure through automation, handle emergency response, and improve codebase. 8+ years of experience as a software engineer and 5+ years as a Site Reliability Engineer required.

  • 100% Remote Work
  • Full-Time
  • Employee
  • Ireland

Automate and improve reliability of core cloud products. Collaborate with product developers to ensure functional and non-functional requirements are met. Support and maintain critical production systems.

Glassdoor Employees Choice - Best Place to Work
  • Hybrid Remote Work
  • Full-Time
  • Employee
  • Mexico City, CDMX, Mexico

As a Senior Site Reliability Engineer, you'll manage production environments, contribute to infrastructure and software solutions, and work closely with technical and non-technical teams. You'll also lead and execute strategic objectives for the team.

  • 100% Remote Work
  • Full-Time
  • Employee
  • 165,000 - 220,000 USD Annually
  • Canada, New York, NY, or US National

Support the infrastructure platform, scale AWS footprint, build tools for problem identification, shape incident management practices, improve the monitoring and alerting platform, and collaborate with the engineering team for optimal performance and scalability.

  • 100% Remote Work
  • Full-Time
  • Employee
  • 165,000 - 220,000 USD Annually
  • NYC, NY, Canada, or US National

Maintain reliable, secure, scalable, and highly available infrastructure and applications. Scale AWS footprint, build tools for engineers, and drive incident management practices. Spread SRE culture and collaborate with the engineering team.

  • 100% Remote Work
  • Full-Time
  • Employee
  • Canada

Support reliable, secure, and scalable infrastructure platform development and maintenance. Collaborate with product and engineering teams, drive incident management, improve monitoring and alerting, and spread SRE culture. Stay current with industry..

  • 100% Remote Work
  • Full-Time
  • Employee
  • Austin, TX, Reston, VA

Design and develop platform software to increase product reliability and efficiency. Guide reliability practice through the SSDLC, maintain service and platform health, and improve operational processes continuously.

  • 100% Remote Work
  • Full-Time
  • Employee
  • 170,000 - 276,000 USD Annually
  • US National (Not hiring in AK, HI, MT, ND, SD, NE, IA, WV)

Design, build, maintain, and operate the next-gen infrastructure platform. Extend it by building tools and apps, deploying cloud native open source tools, and instituting resilient infrastructure through Infrastructure as code. Foster a collaborative...

Great Places to Work - Best Workplaces for Millennials
Entrepreneur's Best Small and Medium Workplaces
  • 100% Remote Work
  • Full-Time
  • Employee
  • Philippines

Review and ensure best practices for architecture and software components, monitor metrics, manage security controls, implement compliance standards, conduct performance tests, lead incident response, develop technical assets, collaborate with teams...

  • 100% Remote Work
  • Full-Time
  • Employee
  • 165,000 - 175,000 USD Annually
  • US National

Serve as the primary cybersecurity resource for the company, staying abreast of the latest threats and implementing measures to protect our systems and data. Experience conducting security audits and vulnerability assessments.

Inc 500
  • Hybrid Remote Work
  • Full-Time
  • Employee
  • Mexico City, MX, Mexico

Manage production-ready features for community members. Work with technical and non-technical teams to operate large-scale, secure, and performant distributed systems. Plan, lead, and execute strategic objectives for the team.

  • 100% Remote Work
  • Freelance
  • Bulgaria, Croatia, Cyprus, Czechia, Egypt, Israel, Kenya, Lebanon, Luxembourg, Malta, Nigeria, Oman, Qatar, Romania, Saudi Arabia, Serbia, South Africa, Ukraine, United Arab Emirates, United Kingdom

Co-own production service designs, drive reliability and observability improvements, build internal tools and automation software, champion reliability-focused practices. 4+ years of experience in Infrastructure, SRE, DevOps or System Administration ..

  • 100% Remote Work
  • Full-Time
  • Employee
  • Bulgaria, Croatia, Cyprus, Czechia, Egypt, Israel, Kenya, Lebanon, Luxembourg, Malta, Nigeria, Oman, Qatar, Romania, Saudi Arabia, Serbia, South Africa, Ukraine, United Arab Emirates, United Kingdom

Design, deploy and maintain infrastructure-as-code across private cloud, Kubernetes, and application clusters. Work with a team of talented engineers to drive upgrades and provide mission-critical services for global customers.

  • Hybrid Remote Work
  • Full-Time
  • Employee
  • Bangalore, India

Work with your SRE and other engineering counterparts for building more scalable, resilient and reliable systems. Collaborate with Engineering organizations to build and automate tooling. Work independently with a minimal level of guidance...

  • Hybrid Remote Work
  • Full-Time
  • Employee
  • Bengaluru, KA, India

Improve customer environments' availability, scalability, latency, and efficiency. Design & build modern tools for the SRE and other teams. Engage with other engineering teams across discovery & delivery phases of engagements, advisory, design & impl...

Inc 500
  • Hybrid Remote Work
  • Full-Time
  • Employee
  • 200,000 - 275,000 USD Annually
  • New York, NY

Build and maintain an observability platform for critical systems, collaborate with cross-functional teams to optimize resource utilization, drive automation and infrastructure provisioning, reduce toil, and ensure cost, performance, and reliability ..

  • 100% Remote Work
  • Full-Time
  • Employee
  • Romania

Build and support private and public cloud infrastructure using OpenStack. Contribute to software development, troubleshoot code-level problems, and engage with engineers and partners to handle disaster recovery and security challenges.

Fortune 100
Great Places to Work - Best Workplaces for Millennials
Human Rights Campaign Best Places to Work
  • Hybrid Remote Work
  • Full-Time
  • Employee
  • Glendale, CA, Los Angeles, CA, New York, NY

Partner with developers to deliver software products, introduce DevOps mindset, build security designs, automate processes, collaborate with cross-functional teams, participate in incident response, develop tools for capacity planning, and implement ..

  • 100% Remote Work
  • Full-Time
  • Employee
  • Durham, NC

Design, build, and maintain infrastructure, participate in solution design, handle production incidents, monitor performance and costs, collaborate with engineering teams, and ensure uptime and reliability of infrastructure.

Inc 500
  • 100% Remote Work
  • Full-Time
  • Employee
  • London, ENG, United Kingdom

Design, build and maintain infrastructure primitives for the company's next generation cloud platform. Perform infrastructure standardization and unification across all business units and geographies. Design, build and support CI/CD pipelines to deli..

Barrons 400
ComputerWorld - Best Places to Work in IT
FlexJobs Top 100 Remote
  • 100% Remote Work
  • Full-Time
  • Employee
  • London, United Kingdom

Build robust, easy-to-use foundational platforms and tools. Exemplify cloud-native site reliability best practices. Write performant, maintainable, and clear code. Debug problems in cloud native distributed systems.

Barrons 400
ComputerWorld - Best Places to Work in IT
FlexJobs Top 100 Remote
  • Hybrid Remote Work
  • Full-Time
  • Employee
  • Boca Raton, FL

Automate processes and apply industry-standard site reliability principles. Manage secure, scalable cloud infrastructure and services, monitor performance, and troubleshoot issues. Collaborate with cross-functional teams and stay updated on new cloud..

Red100
Best Companies to Work for in Florida
30+ days ago
  • 100% Remote Work
  • Full-Time
  • Employee
  • 65,000 - 115,000 USD Annually
  • Boston, MA

Manage production and pre-production environments, security, change management, deployment, architecture, and tools. Analyze performance and ensure scalability and reliability of applications hosted in AWS. Automate deployment, monitoring, and incide..

Red100
Barrons 400
Forbes 2000