Senior Site Reliability Engineer, Isbn Cloud Ops

Senior Site Reliability Engineer, Isbn Cloud Ops
Empresa:

Sap


Detalhes da Vaga

```html
We help the world run better. Our company culture is focused on helping our employees enable innovation by building breakthroughs together. How? We focus every day on building the foundation for tomorrow and creating a workplace that embraces differences, values flexibility, and is aligned to our purpose-driven and future-focused work. We offer a highly collaborative, caring team environment with a strong focus on learning and development, recognition for your individual contributions, and a variety of benefit options for you to choose from. Apply now!
As a Senior Site Reliability Engineer (SRE), you will be part of a high-performance team which:

Continuously improves the reliability of critical systems, working closely with development and operations teams.
Is responsible for monitoring, troubleshooting, and developing tooling and automation to optimize system performance and efficiency.
Challenging the status quo.

To be successful in this role, you will need to thrive in an agile environment where teams work together toward a common goal:

Identify engineering defects in the existing code base and continuously improve the code quality.
Collaborate with development teams to implement and deploy new features and enhancements, ensuring they meet reliability and performance standards.
Perform code reviews and pair program with other engineers on the team.
Define and implement efficient end-to-end provisioning of automation solutions.
Build CI/CD pipeline configurations to orchestrate provisioning and deployment.
Automate monitoring tools to monitor system health and reliability to support high uptime requirements.
Automate repetitive tasks and processes to improve efficiency and reduce manual intervention.
Collaborate with cross-functional teams to define and establish service level indicators (SLIs), service level objectives (SLOs), and key software engineering metrics.
Automate infrastructure in AWS and in private data centers with CloudFormation, Terraform, Ansible, and AWS DevOps tools.
Conduct post-incident analyses to identify root causes and implement preventive measures to avoid future incidents.
Perform capacity planning and resource allocation to ensure optimal system performance and scalability.
Stay up to date with industry best practices, new technologies, and emerging trends in site reliability engineering.
Create and maintain documentation for system architecture, configuration, and troubleshooting procedures.

Requirements:

Full understanding of DevOps, SRE, and agile software development roles and concepts.
Senior level ability to use one or more of these languages: Python, Typescript/Javascript, Golang, Java, or C#.
Full understanding of Git (code version control) and software development best practices (GitOps).
Strong knowledge of Linux/Unix systems and command line tools.
Senior level knowledge of IaC and Configuration Management, using technologies such as Cloud Formation, Terraform, Puppet, and Ansible.
Senior level knowledge of AWS main resources (VPC, EC2, IAM, API Gateway, autoscaling, availability zones, Lambda...) and deploying and running systems at scale.
Full understanding of microservices architecture (concepts).
Full understanding of observability best practices, and monitoring and logging tools such as Dynatrace, New Relic, Prometheus, Grafana, ELK stack, Splunk…
Senior level knowledge of Jenkins, AWS Code Deploy or similar CI/CD tools and pipelines.
Prior experience with containerized deployments.

```
#J-18808-Ljbffr


Fonte: Whatjobs_Ppc

Função de trabalho:

Requisitos

Senior Site Reliability Engineer, Isbn Cloud Ops
Empresa:

Sap


Estágio Em Tecnologia Da Informação

Auxiliar no cadastramento de produtos de nossos clientes em bancos de dados, bem como identificarprodutos e fazer marcações em nossos sistemas. assegurar a ...


Rio Grande do Sul

Publicado 2 days ago

Técnico Instalação Cftv

rea de atuao: OutrosLocalizao: Alvorada-RSAtribuies: Atribuies: Instalao de sistemas eltricos, cmeras, alarmes e rede lgica;Executar tarefas de aperfeioament...


Rio Grande do Sul

Publicado 2 days ago

Operador Monitoramento

rea de atuao: OperaesLocalizao: Porto Alegre-RSAtribuies: Atribuies: Atendimento aos clientes, prestadores de servio e visitantes com oservio de portaria rem...


Rio Grande do Sul

Publicado 2 days ago

Técnico De Suporte

rea de atuao: FiscalLocalizao: Porto Alegre-RSAtribuies: Atribuies: Prestar suporte do mdulo fiscal e contbil entre outros via telefone e web do Sistema Domn...


Rio Grande do Sul

Publicado 2 days ago

Built at: 2024-09-20T00:52:46.060Z