Site Reliability Engineer

Job Details

About the Company

YipitData is the leading market research and analytics firm for the disruptive economy and recently raised up to $475M from The Carlyle Group at a valuation over $1B.
We analyze billions of alternative data points every day to provide accurate, detailed insights on ridesharing, e-commerce marketplaces, payments and more.
Our on-demand insights team uses proprietary technology to identify, license, clean and analyze the data many of the world's largest investment funds and corporations depend on.
For three years and counting, we have been recognized as one of Inc.'s Best Workplaces.
We are a fast-growing technology company backed by The Carlyle Group and Norwest Venture Partners.
Our offices are located in NYC, Austin, Miami, Denver, Mountain View, Seattle, Hong Kong, Shanghai, Beijing, Guangzhou, and Singapore.
We cultivate a people-centric culture focused on mastery, ownership, and transparency.
We are committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, marital status, disability, gender, gender identity or expression, or veteran status.
We are proud to be an equal-opportunity employer.

About the Role
We are looking for a Senior Software Engineer – SRE to join our Infrastructure engineering team.
This team is responsible for developing and maintaining the Infrastructure and Tools that support the availability, reliability, scalability and performance of our software applications.
The team works closely with other Engineering teams to identify and resolve issues in our systems, increase reliability, measure reliability indicators, implement automation, and optimize performance.
This team has been building cloud platforms and developer tools for almost a decade and is one of the reasons our Engineering group has been so lean as our company scales.
We own common infrastructure and abstractions so teams can manage their resources independently and build apps efficiently.
Because of our work, engineers are able to independently:
Manage applications through our custom PaaS
Build and deploy custom infrastructure using our library of CDK constructs
Manage CI/CD pipelines using our guidelines and templates
Use web development best practices through our Python libraries
Day-to-day work on this team is interdisciplinary and ranges from implementing new platform features to performing infrastructure updates with limited downtime and supporting Engineering team initiatives.
The Infrastructure team is a mission-critical team that helps multiple Engineering teams stay efficient and productive as our engineering practice grows.
Our responsibilities include:
Providing standard architectures that are reliable, scalable, and secure
Providing robust infrastructure abstractions
Providing guidance on application development, deployment, reliability, scalability, and security
Hosting training sessions to increase engineers' impact & mastery

In This Role You Will:
Work together with the Infra Team Lead and other team members to build better developer tooling and environments
Mentor teammates and lead key, high-visibility initiatives
Leverage Python and AWS services (such as AWS CloudFormation or AWS CDK) to automate the provisioning and management of infrastructure resources
Participate in incidents and help teams follow the established incident management processes and iteratively improve them
Implement robust monitoring solutions to track the health, performance, and availability of existing systems using tools like Datadog and AWS CloudWatch
Help resolve performance bottlenecks across our Django backend and React frontend infrastructure
Help improve the CI/CD pipeline using tools like GitHub, CircleCI, and Python scripts
Collaborate with other Engineering teams and act as an embedded SRE to help them meet reliability objectives for their systems by establishing SLAs, SLIs, and SLOs
Contribute to our cloud infrastructure platform and developer tools

You Are Likely To Succeed If:
Bachelor's or Master's degree in Computer Science or a related STEM field
5+ years of experience with Python
3+ years of experience in the SRE/DevOps discipline.
You get why we need SLOs/SLIs and error budgets.
You might have read the SRE Handbook end-to-end.
3+ years of hands-on experience with AWS, including CloudFormation, EC2, ECS, RDS, S3, SQS, VPC, IAM, CloudTrail, CloudWatch, etc.
You've done things beyond manually using the AWS console.
You're comfortable using the AWS CLI, boto3, and writing IaC using CDK or CDKTF.
You wouldn't be surprised if I asked you when to use Fargate vs. EC2 with AWS ECS.
2+ years of experience with Django
2+ years of experience with SQL
You have Performance Engineering skills: you can dig into our perf tools to understand where the bottlenecks are and how to fix them.
Given a slow Django endpoint, you can tell me all the steps you would take to improve the performance.
You can take it further and set up monitoring and alerting to know if we are exhausting our error budgets.
You know how to create and run load tests, stress tests, etc., to ensure new code does not degrade the performance of the endpoint you just improved.
Familiarity with containerization technologies such as Docker and container orchestration platforms like ECS
You are comfortable working with new cloud technologies and learning new skills
You are a team player with exceptional verbal and written communication skills
You have strong problem-solving and troubleshooting skills
Nice to have: experience with Kubernetes, React, Terraform, Databricks, Datadog, Airflow

For You:
You'll work with a New York-based team
You will have the opportunity to acquire new skills through an internal training program
Flexible PTO for any reason, including sick days (no specified limits), and a flexible work schedule
Personal laptop
Health and wellness package
Remote work

Nominal Salary: To be agreed

Source: Appcast_Ppc

Job function: Information Technology
