Site Reliability Jobs - Remote Work From Home & Flexible

More Filters

Job Search Results

1 to 50 of 22,915 for "site reliability"
  • 100% Remote Work
  • Full-Time
  • Employee
  • Poland

Engage with teams to improve service delivery and reliability. Measure and monitor production systems for availability and system health. Drive teams towards better operational excellence and lobby for changes to improve reliability and observability.

  • 100% Remote Work
  • Full-Time
  • Employee
  • 144,000 - 278,000 USD Annually
  • USA, or US National

Engage with teams to improve service delivery and reliability, measure and monitor production systems, identify and drive improvements in reliability and observability, and automate processes to reduce toil.

  • 100% Remote Work
  • Full-Time
  • Employee
  • 147,500 - 227,500 USD Annually
  • Atlanta, GA, or US National

Design, build, and maintain infrastructure estate on public cloud providers. Support development teams with CI/CD platform, automate tasks, manage Kubernetes clusters, and collaborate with Security team.

  • 100% Remote Work
  • Full-Time
  • Employee
  • 147,500 - 227,500 USD Annually
  • Phoenix, AZ, or US National

Design, build, and maintain infrastructure estate, support development teams with CI/CD platform, automate operational tasks, manage and monitor Kubernetes clusters, collaborate with security team, mentor team members, ensure exceptional customer...

  • 100% Remote Work
  • Full-Time
  • Employee
  • 140,250 - 165,000 USD Annually
  • US National

Build and maintain secure system architectures for a diverse fleet of clients. Provide subject matter expertise and support to IT, Security, and Engineering teams. Facilitate incident response and maintain documentation.

Forbes 2000
FlexJobs Top 100 Remote
Fortune 500
Forbes 2000,
FlexJobs Top 100 Remote,
Fortune 500
  • 100% Remote Work
  • Full-Time
  • Freelance
  • Argentina, Bolivia, Brazil, Chile, Colombia, Ecuador, Guyana, Paraguay, Peru, Suriname, Uruguay, Venezuela

Provide senior-level site reliability engineering services in LATAM, including incident response, infrastructure maintenance, monitoring, automation, scalability planning, and product roadmap influence. Requires 5+ years of site-reliability experienc..

Fast Company - Most Innovative Company
Fast Company - Most Innovative Company
  • 100% Remote Work
  • Full-Time
  • Employee
  • London, United Kingdom

Design, build and maintain infrastructure estate on public cloud providers. Support multiple development teams with CI/CD platform to deliver high-quality builds. Automate operational tasks via Go, Python and serverless solutions.

  • 100% Remote Work
  • Full-Time
  • Employee
  • 40,000 - 60,000 USD Annually
  • US National

Use Apps to analyze their impact on government and citizen interaction. Troubleshoot and document application, service, and system issues. Own escalated issues reported by customers and internal teams.

Inc 500
Inc 500
  • 100% Remote Work
  • Full-Time
  • Employee
  • 118,000 - 209,000 USD Annually
  • Bulgaria, Croatia, Cyprus, Czechia, Egypt, Israel, Kenya, Lebanon, Luxembourg, Malta, Nigeria, Oman, Qatar, Romania, Saudi Arabia, Serbia, South Africa, Ukraine, United Arab Emirates, United Kingdom, Canada

Develop and maintain reliable infrastructure, build test automation tooling, Kubernetes, and execution tracing. Troubleshoot, debug, work with microservices, Cloud Native, Kubernetes/EKS. Plan capacity, maintain security.

FlexJobs Top 100 Remote
FlexJobs Top 100 Remote
  • Hybrid Remote Work
  • Full-Time
  • Employee
  • Gurugram, HR, India

Build and maintain infrastructure, automate system engineering tasks, and ensure service request SLAs are met. Solid experience in cloud-based rapid delivery environments, programming languages, and SRE or Production Support engineering teams.

  • 100% Remote Work
  • Full-Time
  • Employee
  • Austria, Belgium, Denmark, Finland, France, United Kingdom, Ukraine, Switzerland, Brazil, Mexico, Puerto Rico

Work with a team of professionals to ensure site reliability and security. Improve infrastructure, monitor cloud infrastructure, and perform security sweeps. Aid in reconfiguring architecture for rapid deployments.

  • Hybrid Remote Work
  • Full-Time
  • Employee
  • Paris, France

Lead team of Site Reliability Engineers for reliability and scalability of Search Products. Collaborate with senior leadership to define technical direction and strategy. Provide leadership, guidance, mentorship, and enforce engineering processes and..

Inc 500
Inc 500
  • Hybrid Remote Work
  • Full-Time
  • Employee
  • San Francisco, CA, Portland, OR, MI, NY, London, United Kingdom, Melbourne, Australia

Implementing and maintaining systems that monitor networks, server health, and application performance. Configuring infrastructure systems to provide load balancing, application firewalls, reverse proxying, and related services.

  • 100% Remote Work
  • Full-Time
  • Employee
  • Seattle, WA, Boston, MA, San Francisco, CA, New York, NY

Respond to, investigate and fix service issues, whether they are deep in the OS kernel or in the application code. Design, build and maintain the infrastructure we need to support orders of magnitude more customers. 5+ years of experience...

Forbes 2000
Forbes 2000
  • 100% Remote Work
  • Full-Time
  • Employee
  • 120,000 USD Annually
  • US National

Build CI and CD pipelines. Optimize and scale workloads. Secure containers and web services. We like you to know Docker, Kubernetes, GCP, AWS, Go, Postgres, Redis, familiarity with JavaScript, excellent communication skills (English)...

Remote.co Remote-First Company
Remote.co Remote-First Company
  • 100% Remote Work
  • Full-Time
  • Employee
  • Jacksonville, FL

Facilitating the development process and operations. Identifying setbacks and shortcomings. Creating suitable DevOps channels across the organization. Establishing continuous build environments to speed up software development. Designing efficient...

  • 100% Remote Work
  • Full-Time
  • Employee
  • 65,000 - 115,000 USD Annually
  • Columbus, OH

Manage production and pre-production environments, analyze performance, troubleshoot issues, and automate deployment and incident response processes. Work closely with cross-functional teams to ensure optimal system reliability and scalability.

Red100
Barrons 400
Forbes 2000
...
Red100,
Barrons 400,
Forbes 2000
  • Hybrid Remote Work
  • Full-Time
  • Employee
  • 120,000 - 200,000 USD Annually
  • New York, NY

Maintain availability of Linux servers, design and operate infrastructure, collaborate with product teams on requirements, troubleshoot and solve network and systems issues, automate operational tasks.

  • Hybrid Remote Work
  • Full-Time
  • Employee
  • Washington, DC

Maintain availability of cloud and physical Linux servers, design and deploy infrastructure, collaborate with product teams on requirements, troubleshoot and solve network and systems issues, automate routine operational tasks.

  • 100% Remote Work
  • Full-Time
  • Employee
  • 120,000 - 250,000 USD Annually
  • New York, NY

Experienced DevOps and Site Reliability Engineers needed to build and support infrastructure for real estate data services. Work with cutting-edge technology and tools, including Google Cloud, Kubernetes, and Terraform. 7-10 years of engineering expe..

  • 100% Remote Work
  • Full-Time
  • Employee
  • France

Design, develop, deploy, and maintain reliable and scalable infrastructure. Manage large Kubernetes clusters. Measure and optimize system performance. Provide primary operational support and engineering for multiple teams. 7+ years of experience in a...

  • 100% Remote Work
  • Full-Time
  • Employee
  • US National

Install, upgrade and manage systems powering customer infrastructure running Circonus software. Communicate with management and customers regarding aberrant system's behavior. Participate in an on-call schedule.

  • 100% Remote Work
  • Full-Time
  • Employee
  • Durham, NC

Design, build, and maintain core infrastructure. Create and iterate on workloads in Google Kubernetes Engine. Collaborate with engineering teams to improve scalability and performance. Ensure uptime and reliability of infrastructure. Participate in on...

Inc 500
Inc 500
  • 100% Remote Work
  • Full-Time
  • Employee
  • Romania

Build and support private and public cloud infrastructure using OpenStack. Contribute to software development, troubleshoot code-level problems, and engage with engineers and partners to handle disaster recovery and security challenges.

Fortune 100
Great Places to Work - Best Workplaces for Millennials
Human Rights Campaign Best Places to Work
...
Fortune 100,
Great Places to Work - Best Workplaces for Millennials,
Human Rights Campaign Best Places to Work
  • 100% Remote Work
  • Full-Time
  • Employee
  • Argentina, Bolivia, Brazil, Chile, Colombia, Ecuador, Guyana, Paraguay, Peru, Suriname, Uruguay, Venezuela

Work with a team of DevOps/SRE and DBA professionals. Improve infrastructure and processes. Monitor and maintain cloud infrastructure. Aid in reconfiguring existing architecture for rapid deployments.Take ownership and responsibility for our cloud oper...

  • 100% Remote Work
  • Full-Time
  • Employee
  • Hanoi, Vietnam

Manage IT infrastructure, upgrade and install hardware and software, troubleshoot IT issues, maintain networks and servers, act as a cloud system admin, automate alerting and monitoring system logs, implement security protocols, mentor IT department ..

  • Hybrid Remote Work
  • Full-Time
  • Employee
  • Austin, TX

Collaborate with Cloud engineers, Backend and Front Developers, Release Engineers and Security Officers. Operate our AWS Cloud Infrastructure. Monitor and proactively maintain our Service Levels (availability...

  • Hybrid Remote Work
  • Full-Time
  • Employee
  • Krakow, Poland

Lead the design and implementation of solutions to improve availability and resiliency. Define operational maturity by implementing SLIs, SLOs, and proactive failure mitigation. Drive incident response, retrospectives, and user experience improvements.

  • 100% Remote Work
  • Full-Time
  • Employee
  • Canada, Mexico, Argentina, Bolivia, Brazil, Chile, Colombia, Ecuador, Guyana, Paraguay, Peru, Suriname, Uruguay, Venezuela, Australia, Bangladesh, China, Hong Kong, India, Indonesia, Japan, Malaysia, New Zealand, Pakistan, Philippines, Singapore, Sri Lanka, Taiwan, Thailand, Vietnam, or US National

Optimize, automate, and improve performance of the cloud environment. Develop solutions to enhance key performance indicators. Gather and analyze metrics for system optimization and fault resolution. Drive innovation and scalability of the platform.

  • Hybrid Remote Work
  • Full-Time
  • Employee
  • Bangalore, India

Driving the incident management process and support a blameless post-mortem culture. Participating in application design consulting and capacity planning. Defining and formalizing SRE practices and help guide the overall reliability engineering...

  • 100% Remote Work
  • Full-Time
  • Freelance
  • Denver, CO

Optimize CI/CD pipelines for various content management systems and web applications. Migrate CI/CD processes to Buddy Works and configure Prometheus for monitoring. Integrate CI/CD pipelines with Kubernetes and Rancher. Provide support for CMS and...

  • Hybrid Remote Work
  • Full-Time
  • Employee
  • Bengaluru, India

Responsible for resolving technical issues, building/testing code, deploying apps, measuring performance, and optimizing infrastructure. Requires 4-7 yrs of relevant experience and proficiency in scripting languages, cloud platforms, containerization,..

  • Hybrid Remote Work
  • Full-Time
  • Employee
  • Boston, MA

Lead and manage SRE team, implement best practices for site reliability, collaborate with cross-functional teams, develop incident response plans, provide technical guidance, and ensure security and compliance.

  • 100% Remote Work
  • Full-Time
  • Employee
  • Bucharest, Romania

Manage IT infrastructure, troubleshoot issues, optimize performance, monitor and maintain networks and servers, automate alerting and monitoring system logs, implement security protocols and procedures, mentor IT department employees.

  • 100% Remote Work
  • Full-Time
  • Employee
  • Sao Paulo, Brazil

Manage IT infrastructure, upgrade/install hardware and software, troubleshoot and maintain networks and servers. Act as a cloud system admin, automate alerting and monitoring system logs, and implement security protocols. Bachelor's degree and 2+ year..

  • 100% Remote Work
  • Full-Time
  • Employee
  • Kinshasa

Manage IT infrastructure, troubleshoot and resolve issues, maintain networks and servers, optimize performance, automate alerting and monitoring system logs, implement security protocols, upgrade and install hardware and software.

  • 100% Remote Work
  • Full-Time
  • Employee
  • Islamabad, Pakistan

Manage company's IT infrastructure, upgrade/install hardware and software, troubleshoot and maintain networks/servers, automate alerting/monitoring, implement security protocols, mentor IT employees, stay updated with IT advancements.

  • 100% Remote Work
  • Full-Time
  • Employee
  • Columbo, Sri Lanka

Manage company's IT infrastructure, troubleshoot and resolve IT issues, maintain networks and servers, upgrade and install hardware and software, implement security protocols, document processes, mentor IT department employees.

  • Hybrid Remote Work
  • Full-Time
  • Employee
  • 125,000 - 150,000 USD Annually
  • Pinehurst, NC, Tampa, FL

Automate infrastructure-as-code for deploying virtual machines, containers, and services. Collaborate with platform developers to build solutions for customer use cases. Deploy software to classified networks and triage critical system faults.

  • 100% Remote Work
  • Alternative Schedule
  • Employee
  • Bulgaria, Croatia, Cyprus, Czechia, Egypt, Israel, Kenya, Lebanon, Luxembourg, Malta, Nigeria, Oman, Qatar, Romania, Saudi Arabia, Serbia, South Africa, Ukraine, United Arab Emirates, United Kingdom

Develop self-healing, automated systems to ensure 24/7 service uptime, monitor SLOs and mission-critical metrics, build robust monitoring and alerting capabilities, and work with other teams to guarantee security and high availability of services.

  • Hybrid Remote Work
  • Full-Time
  • Employee
  • 92,290 - 156,860 USD Annually
  • Manchester, NH, San Diego, CA

Implement advanced technological solutions, automate infrastructure, and monitor performance. Collaborate with cross-functional teams to deliver seamless service. Solve complex requests and respond to occasional off-hours service disruptions.

Military-Friendly Employer
The UK Times Top Employers for Women
Forbes World's Most Innovative Companies
...
Military-Friendly Employer ,
The UK Times Top Employers for Women,
Forbes World's Most Innovative Companies
  • 100% Remote Work
  • Full-Time
  • Employee
  • Work from Anywhere

"Troubleshoot, maintain, and improve compute infrastructure. Work with Linux, Docker, Kubernetes, Python, message queues, CI/CD pipelines, and more. Familiarity with network platforms, RDBMS, time-series data stores, and remote collaboration tools."

  • 100% Remote Work
  • Full-Time
  • Employee
  • Romania

Build and improve observability platform, ensure seamless communication between services, collaborate with engineering team to establish best practices, work with product engineering teams to ensure high performance and observability of services.

Fortune 100
Great Places to Work - Best Workplaces for Millennials
Human Rights Campaign Best Places to Work
...
Fortune 100,
Great Places to Work - Best Workplaces for Millennials,
Human Rights Campaign Best Places to Work
  • 100% Remote Work
  • Full-Time
  • Employee
  • 165,750 - 195,000 USD Annually
  • San Jose, CA, or US National

Operational duties for cloud-based products, including deployments, on-call support, and incident management. Management of cloud infrastructure elements and improvement of monitoring systems. Contribution to and implementation of DevOps best practices.

Glassdoor Employees Choice - Best Place to Work
Forbes 2000
Glassdoor Employees Choice - Best Place to Work,
Forbes 2000
  • 100% Remote Work
  • Full-Time
  • Employee
  • 120,000 - 140,000 USD Annually
  • Chicago, IL, or US National

Develop and manage distributed, fault-tolerant cloud systems at scale. Refine and fine-tune monitoring solutions and collaborate with tech teams to enhance client reliability experience. Stay up-to-date with industry trends and emerging technologies ..

  • Hybrid Remote Work
  • Full-Time
  • Employee
  • Barcelona, Spain

Develop software and fixes, ensure code quality, test and distribute code updates, and monitor server health and stability. Meet KPIs, identify preventative measures, mentor junior engineers, liaise between teams, and possess excellent technical and ..

  • 100% Remote Work
  • Full-Time
  • Employee
  • Hyderabad, TG, India

Ensure high uptime and reliability for distributed systems, manage Linux servers in a multi-cloud environment, develop monitoring resources and alerting systems, manage SQL and NOSQL database systems, and coordinate with DBA and developers. Minimum...

Fast Company - Most Innovative Company
Fast Company - Most Innovative Company
  • Hybrid Remote Work
  • Full-Time
  • Employee
  • Bengaluru, India

Operate, maintain and administer customer infrastructure solutions. Provide Root Cause Analysis reports for outages/incidents. Collaborate with teammates to contribute to the continuous improvement of our working culture.

  • 100% Remote Work
  • Full-Time
  • Employee
  • Australia

Lead design, implementation, and maintenance of highly available and scalable infrastructure and services. Optimize AWS usage to align with security best practices. Develop best practices for monitoring, alerting, and incident response to ensure syst..

  • 100% Remote Work
  • Full-Time
  • Employee
  • 113,923 - 170,884 USD Annually
  • US National

Architect, design, and implement AWS infrastructure services including network, load balancers, and containers. Lead operational health, maintenance, optimization, and security. Coordinate with teams for end-to-end software delivery.

FlexJobs Top 100 Remote
FlexJobs Top 100 Remote
rocket ship image

Want a Great Remote
or Flexible Job?

Save time and find higher-quality jobs than on other sites, guaranteed.

Join FlexJobs Now!

Currently Hiring on FlexJobs

Coalition Technologies G2i
Intact Financial Corporation Vista

Success Stories Just In!

FlexJobs is very informative and efficient! So many great resources
Ebony A., Decatur, GA
Customer Service Representative
Jul 1, 2024
Weekly Newsletter icon

Weekly Newsletter

Get new job postings, the latest job search tips, trends, news, and exclusive promotions!

Sign Up Today!
MPR Resume Builder