Compañía

Phoenix Recruitment LlcVer más

addressDirecciónSanta Fe, Santa Fe
CategoríaAlimentos y restaurantes

Descripción del trabajo

Esta oferta de trabajo no se encuentra disponible en tu país.

This is a remote position.

Title - Site Reliability Engineer , 1 year of project experience

Employment Type : Full-time

Base Salary : $60K-$70K

Phoenix Recruitment offers a variety of recruiting services to assist both employers and employees. They specialize in marketing open positions, recruiting, and helping employers to find qualified candidates across various industries.

Phoenix Recruitment has expertise in streamlining the hiring process. They can help ensure that the process is efficient, well organized, and compliant with relevant regulations.

About The Job

As an SRE, you'll troubleshoot and resolve technical issues, optimize performance, and establish reliability-based release management processes.

The SRE role is the practical implementation of DevOps principles, where speed and stability are carefully balanced, and the team acts as versatile problem solvers, filling gaps in knowledge and expertise to ensure efficient software operations.

You will :

Apply SRE principles to maintain the reliability, availability, and performance of software systems.Automate deployment processes, configuration management, and CI / CD pipelines to streamline software development and delivery.

Planned and assisted with the migration of Windows and Linux-based machines to containerized machines.Plan and Assist with the overall Disaster Recovery (DR) of the infrastructure and operations (InfraOps).

Manage and maintain software infrastructure, ensuring proper configuration, security, and scalability.Perform system administration tasks, monitor system performance, troubleshoot issues, and apply necessary fixes.

Act as a versatile problem solver, filling gaps in team knowledge and expertise to ensure smooth and efficient software operations.

Facilitate smooth team and project transitions, providing guidance, training, and support for development teams to manage their infrastructure independently.

Develop a reliability rating system to assess team and project performance, collecting and analyzing metrics to evaluate adherence to best practices.

Respond quickly and effectively to critical incidents, conducting post-incident reviews to identify root causes and implement preventive measures.

Develop and maintain automation tools and scripts to improve operational efficiency.Identify performance bottlenecks and implement optimizations to enhance system response times and resource utilization.

Stay up to date with the latest industry trends, technologies, and best practices related to SRE, DevOps, and infrastructure management.

Collaborate effectively with cross-functional teams and communicate technical concepts and recommendations clearly to both technical and non-technical stakeholders.

Implement a reliability-based release management process, allowing teams with higher reliability scores to perform quick and frequent releases.

Proactively identify potential issues and implement preventive measures to reduce incidents and outages.Implement observability practices to detect abnormal behaviors in the software and collect information for effective problem resolution.

Set and monitor critical metrics to gain insights into system reliability, including latency, traffic, errors, and saturation levels.

Establish Service-Level Objectives (SLOs) and measure Service-Level Indicators (SLIs) to assess the quality-of-service delivery and reliability.

Planned, participated, and managed on-call rotations to ensure prompt response to reported software issues.Utilize incident response tools to categorize the severity of reported cases and handle them promptly.

Implement configuration management tools to automate software workflows and enhance team productivity.

Projects you could work on :

Implementing automated CI / CD pipelines for smooth software deployment.Setting up and maintaining a reliable and scalable cloud infrastructure.

Designing and implementing the migration of physical machines to virtual machines.Designing incident response procedures and post-incident review processes.

Developing automation tools to streamline repetitive tasks and improve team productivity.Analyzing system performance metrics and optimizing resources for better efficiency.

Establishing observability practices to detect and resolve software issues proactively.Defining SLOs and SLIs to assess service quality and reliability across projects.

Planning and managing on-call rotations to ensure timely issue resolution.Configuring and maintaining software workflows using configuration management tools.

Why Phoenix Recruitment LLC?

Phoenix Recruitment often has an extensive network of employers and candidates. This network allows them to tap into a pool of qualified candidates and connect them with suitable job opportunities.

They can also leverage their connections to help employers find the right talent efficiently. Outsourcing the recruitment process to a specialized agency can save your time and resources, avoid delays, reduce administrative burdens, and increase the chances of finding the right fit for your organization.

J-18808-Ljbffr#J-18808-Ljbffr

Refer code: 561905. Phoenix Recruitment Llc - El día anterior - 2024-02-16 12:38

Phoenix Recruitment Llc

Santa Fe, Santa Fe

Compartir trabajos con amigos

Trabajos relacionados

Site Reliability Engineer

Site Reliability Engineer

Pwc South Africa

Rosario, Santa Fe

5 Hace meses - visto

Sr. Site Reliability Engineer - Terraform, AWS, K8s

Tenable Network Security

Santa Fe, Santa Fe

5 Hace meses - visto

Site Reliability Engineer  

PwC

Rosario, Santa Fe

5 Hace meses - visto