Senior Site Reliability Engineer (SRE)

Company: Myskysys

Job function: Engineering

Job Details

Role: Senior Site Reliability Engineer
Position Type: Full-Time Contract (40 hrs/week)
Contract Duration: 6-8 Months+
Work Hours: Eastern Standard Time (EST)
Work Schedule: 8 hours/day (Mon-Fri)
Location: Hybrid (combination of on-site and remote work), on site 2-3 times a month

Overview

The Site Reliability Engineer (SRE) plays a critical role in ensuring the reliability, scalability, and performance of Client's digital platforms and infrastructure. As part of a global team of highly skilled engineers, the SRE will work on challenging and impactful projects that directly contribute to the company's core business activities. Client is committed to fostering a culture of innovation, collaboration, and continuous learning, providing the SRE with an opportunity to grow and develop their skills while making a positive impact on the world.

Main Accountabilities

- Troubleshoot and resolve infrastructure issues and incidents in a timely manner.
- Design, implement, and maintain reliable and scalable infrastructure solutions to support Client's digital platforms and applications.
- Monitor and analyze system performance, identify potential issues, and take proactive measures to prevent outages and disruptions.
- Collaborate with cross-functional teams, including software engineers, product managers, and operations personnel, to ensure seamless integration of infrastructure and application components.
- Develop and implement automation scripts and tools to streamline infrastructure management tasks and improve operational efficiency.
- Stay up to date with industry best practices and emerging technologies in the field of site reliability engineering.
- Cooperate closely with DevOps and Cloud engineers.

Impact/Dimensions

- Contributes to the reliability and uptime of Client's digital platforms, which are critical for the company's global operations and customer satisfaction.
- Works on projects that have a direct impact on Client's revenue and profitability.
- Has a significant impact on the efficiency and effectiveness of Client's technology operations and is responsible for driving continuous improvement initiatives that save the company time and money.

Key Performance Indicators (KPIs)

- Mean Time to Repair (MTTR) for critical systems
- System uptime and availability
- Number of incidents and outages prevented
- Customer satisfaction with infrastructure performance

Major Opportunities And Decisions

- Identifying and mitigating potential risks to infrastructure stability and performance.
- Making decisions on infrastructure investments and resource allocation to optimize cost-effectiveness and scalability.
- Balancing the need for innovation with the requirement for stability and reliability in infrastructure operations.

Management/Leadership

- Leads and mentors a team of junior SREs and infrastructure engineers.
- Provides technical guidance to cross-functional teams on infrastructure-related matters.
- Actively participates in shaping the company's infrastructure strategy and roadmap.

Key Relationships, Stakeholders & Interfaces (External & Internal)

Works closely with software engineering teams to ensure seamless integration of infrastructure and application components.

- Development teams
- Infrastructure teams
- Business stakeholders
- Vendors and partners

Knowledge And Technical Competencies

- Strong understanding of SRE and DevOps principles and practices.
- Experience with CI/CD on the Azure DevOps platform.
- Knowledge of infrastructure management tools such as Ansible, Puppet, or Chef.
- Solid experience with containerization tools such as Docker and orchestration tools such as Kubernetes.
- Solid knowledge of security in cloud and on-premises environments.
- Proficiency in scripting languages such as Python or Bash.
- Experience with cloud computing platforms such as AWS and Azure; GCP is preferred.
- Experience with monitoring software such as Datadog, Zabbix, Kibana, etc.
- Hands-on experience coding, deploying, and supporting large-scale serverless architectures.
- Infrastructure provisioning with Terraform or CloudFormation (infrastructure as code).
- Experience with Linux and Windows operating systems.
- Strong problem-solving and analytical skills.
- Excellent communication and interpersonal skills.

Education/Experience

- Bachelor's degree in computer science or a related field.
- 5+ years of experience in DevOps engineering.
- Experience leading teams and managing projects.
- Very good command of English.


Source: Whatjobs_Ppc



Built at: 2024-10-01T09:37:59.367Z