Site Reliability Engineer

Company:

Turnkey Tech Staffing


Location:

Brazil


Job function:

Information technology

Job details

About the Company

YipitData is the leading market research and analytics firm for the disruptive economy and recently raised up to $475M from The Carlyle Group at a valuation over $1B. We analyze billions of alternative data points every day to provide accurate, detailed insights on ridesharing, e-commerce marketplaces, payments and more. Our on-demand insights team uses proprietary technology to identify, license, clean and analyze the data many of the world's largest investment funds and corporations depend on.

For three years and counting, we have been recognized as one of Inc's Best Workplaces. We are a fast-growing technology company backed by The Carlyle Group and Norwest Venture Partners. Our offices are located in NYC, Austin, Miami, Denver, Mountain View, Seattle, Hong Kong, Shanghai, Beijing, Guangzhou, and Singapore. We cultivate a people-centric culture focused on mastery, ownership, and transparency.

We are committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, marital status, disability, gender, gender identity or expression, or veteran status. We are proud to be an equal-opportunity employer.

About the Role

We are looking for a Senior Software Engineer – SRE to join our Infrastructure engineering team. This team is responsible for developing and maintaining the infrastructure and tools that support the availability, reliability, scalability and performance of our software applications. The team works closely with other Engineering teams to identify and resolve issues in our systems, increase reliability, measure reliability indicators, implement automation, and optimize performance.

This team has been building cloud platforms and developer tools for almost a decade and is one of the reasons our Engineering group has stayed so lean as our company scales. We own common infrastructure and abstractions so teams can manage their resources independently and build apps efficiently. Because of our work, engineers are able to independently:

- Manage applications through our custom PaaS
- Build and deploy custom infrastructure using our library of CDK constructs
- Manage CI/CD pipelines using our guidelines and templates
- Use web development best practices through our Python libraries

Day-to-day work on this team is interdisciplinary and can range from implementing new platform features to performing infrastructure updates with limited downtime and supporting Engineering team initiatives. The Infrastructure team is a mission-critical team that enables multiple Engineering teams to be efficient and productive as our engineering practice grows. Our responsibilities include:

- Providing standard architectures that are reliable, scalable, and secure
- Providing robust infrastructure abstractions
- Providing guidance on application development, deployment, reliability, scalability, and security
- Hosting training sessions to increase engineers' impact and mastery

In This Role You Will:

- Work together with the Infra Team Lead and other team members to build better developer tooling and environments
- Mentor others and lead key, highly visible initiatives
- Leverage Python and AWS services (such as AWS CloudFormation or AWS CDK) to automate the provisioning and management of infrastructure resources
- Participate in incidents, help teams follow the established incident management processes, and iteratively improve those processes
- Implement robust monitoring solutions to track the health, performance, and availability of existing systems using tools like Datadog and AWS CloudWatch
- Help resolve performance bottlenecks across our Django backend and React frontend infrastructure
- Help improve the CI/CD pipeline using tools like GitHub, CircleCI, and Python scripts
- Collaborate with other Engineering teams and act as an embedded SRE to help them meet reliability objectives for their systems by establishing SLAs, SLIs, and SLOs
- Contribute to our cloud infrastructure platform and developer tools

You Are Likely To Succeed If You Have:

- A Bachelor's or Master's degree in Computer Science or a related STEM field
- 5 years of experience with Python
- 3 years of experience with the SRE/DevOps discipline. You get why we need SLOs/SLIs and error budgets. You might have read the SRE Handbook end-to-end
- 3 years of hands-on experience with AWS, including CloudFormation, EC2, ECS, RDS, S3, SQS, VPC, IAM, CloudTrail, CloudWatch, etc. You've done things beyond manually using the AWS console. You're comfortable using the AWS CLI and boto3, and writing IaC using CDK or CDKTF. You wouldn't be surprised if I asked you when to use Fargate vs EC2 for the AWS ECS service
- 2 years of experience with Django
- 2 years of experience with SQL
- Performance engineering skills: you can dig into our perf tools to understand where the bottlenecks are AND how to improve them. Given a slow Django endpoint, you can tell me all the steps you would take to improve the performance. You can take it further and set up monitoring and alerting to know if we are exhausting our error budgets. You know how to create and run load tests, stress tests, etc. to ensure new code does not degrade the performance of the endpoint you just improved
- Familiarity with containerization technologies such as Docker and container orchestration platforms like ECS
- Comfort working with new cloud technologies and learning new skills
- A team-player mindset and exceptional verbal and written communication skills
- Strong problem-solving and troubleshooting skills

Nice to have: experience with Kubernetes, React, Terraform, Databricks, Datadog, Airflow

For You:

- You'll work with a New York-based team
- You will have the opportunity to acquire new skills through an internal training program
- Flex PTO for any reason, including sick days (no specified limits), and a flexible work schedule
- Personal laptop
- Health and wellness package
- Remote work


Source: Adzuna_Ppc
