Site Reliability Engineer

voltar aos resultados

Empresa:

Turnkey Tech Staffing

Lugar:

Detalhes da Vaga

About the Company YipitData is the leading market research firm for the disruptive economy and recently raised $475M from The Carlyle Group at a valuation of over $1B. We analyze billions of data points every day to provide accurate, detailed insights on ridesharing, e-commerce marketplaces, payments, and more. Our data team uses proprietary technology to identify, license, clean, and analyze the data many of the world's largest investment funds and corporations depend on. For three years, we have been recognized as one of Inc's Best Workplaces. We are a fast-growing technology company backed by Norwest Venture Partners and The Carlyle Group. We cultivate a strong people-centric culture focused on mastery, ownership, and transparency. We are committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, marital status, disability, gender, gender identity or expression, or veteran status. We are proud to be an equal-opportunity employer. About the Role We are looking for a Software Engineer - SRE to join our engineering infrastructure team. This team is responsible for developing and maintaining the infrastructure and systems that support the availability, reliability, scalability and performance of our software applications and tools. The team works closely with development and infrastructure teams to identify and resolve issues in our systems, increase reliability, measure reliability indicators, implement automation and optimize performance. Our AWS cloud infrastructure and developer tools allow engineers to quickly develop & release solutions that are secure, reliable, scalable, and cost-efficient. Our responsibilities include: Providing standard architectures that are reliable, scalable, and secure Providing robust infrastructure abstractions Providing guidance on application development, deployment, reliability, scalability, and security Hosting training sessions to increase engineers' impact & mastery Our Infrastructure department has two teams that work closely to enable other engineering teams to build software as efficiently as possible: Infra & Tools: develop our infrastructure platform and development tools SRE: support engineering teams in building and operating systems sustainably with high reliability and availability In This Role You Will: Work together with SRE Lead and other team members to measure reliability of systems in the cloud using SLIs (Service Level Indicators) and SLOs (Service Level Objectives) Leverage Python and AWS services (such as AWS CloudFormation or AWS CDK) to automate the provisioning and management of infrastructure resources. Participate in incidents and help teams follow the established incident management processes and iteratively improve them Get involved in maintenance activities for various cloud resources and keep track of them Implement robust monitoring solutions to track the health, performance and availability of existing systems using tools like Datadog and AWS CloudWatch. Help improve the CI/CD pipeline using tools like GitHub, CircleCI and Python scripts Collaborate with other Engineering teams and act as an embedded SRE to help them gain reliability objectives for their systems Contribute to our cloud infrastructure platform and developer tools You Are Likely To Succeed If: 3 years of experience in software engineering 3 years experience with Python 1 years of experience with SRE/DevOps discipline You have performance engineering skills. You can dig into our perf tools to understand where the bottlenecks are AND how to improve them Good knowledge and experience (3 years) with AWS services, including CloudFormation, EC2, ECS, RDS, S3, SQS, VPC, IAM, CloudTrail, CloudWatch, etc. Familiarity with containerization technologies such as Docker and container orchestration platforms like ECS You are comfortable working with new cloud technologies and learning new skills You are a team player with exceptional verbal and written communication skills You have strong problem-solving and troubleshooting skills. Excellent communication and collaboration abilities to work effectively in a team environment. Nice to have: experience with Django, React, Terraform, Databricks, Datadog, Airflow For You: You'll work with a New York-based team You will have the opportunity to acquire new skills through an internal training program Flex PTO for any reason, including sick days (no specified limits), flexible work schedule Personal laptop Health and wellness package Remote work

Fonte: Adzuna_Ppc

Função de trabalho:

Tecnologia da informação

Requisitos

Site Reliability Engineer

Empresa:

Turnkey Tech Staffing

Lugar:

Brasil

Função de trabalho:

Tecnologia da informação

Denunciar esta vaga

Vagas Semelhantes

Ver mais vagas semelhantes

Datastage Developer

This position is a Remote position to work from on in LATAM working with US clients. You will be working as a consultant directly with clients US Clients.We ...

Desde Allianceit Inc - Brasil

Publicado 11 days ago

Scrum Master Sr

Job DescriptionA Genesis Consulting tem uma oportunidade imediata para Scrum Masters experientes e com verdadeira paixão por crescimento profissional e trans...

Desde Genesis Consulting Partners, Llc - Brasil

Publicado 11 days ago

Devops Engineer (Linux, Gcp) - Remote

ITTConnect is seeking a Senior DevOps Engineer (Linux) to work remotely for a client in the US. This is a position with a global leader in consulting, digita...

Desde Ittconnect - Brasil

Publicado 11 days ago

Analista De Secops Cloud

DESCRIÇÃO DA VAGANossa tecnologia dita o ritmo do mercado. Afinal, 25% do PIB brasileiro passa pelos softwares presentes em mais de 80 mil empresas clientes ...

Desde Totvs - Brasil

Publicado 11 days ago

Built at: 2024-06-29T11:37:48.801Z