About the Company
YipitData is the leading market research and analytics firm for the disruptive economy and recently raised up to $475M from The Carlyle Group at a valuation over $1B.
We analyze billions of alternative data points every day to provide accurate, detailed insights on ridesharing, e-commerce marketplaces, payments and more.
Our on-demand insights team uses proprietary technology to identify, license, clean and analyze the data many of the world's largest investment funds and corporations depend on.
For three years and counting, we have been recognized as one of
Inc's Best Workplaces
.
We are a fast-growing technology company backed by The Carlyle Group and Norwest Venture Partners.
Our offices are located in NYC, Austin, Miami, Denver, Mountain View, Seattle, Hong Kong, Shanghai, Beijing, Guangzhou, and Singapore.
We cultivate a people-centric culture focused on mastery, ownership, and transparency.
We are committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, marital status, disability, gender, gender identity or expression, or veteran status.
We are proud to be an equal-opportunity employer.
About the Role
We are looking for a Senior Software Engineer – SRE to join our Infrastructure engineering team.
This team is responsible for developing and maintaining the Infrastructure and Tools that support the availability, reliability, scalability and performance of our software applications.
The team works closely with other Engineering teams to identify and resolve issues in our systems, increase reliability, measure reliability indicators, implement automation, and optimize performance.
This team has been building cloud platforms and developer tools for almost a decade and is one of the reasons our Engineering group has been so lean as our company scales.
We own common infrastructure and abstractions so teams can manage their resources independently and build apps efficiently.
Because of our work, engineers are able to independently:
Manage applications through our custom PaaS
Build and deploy custom infrastructure using our library of CDK constructs
Manage CI/CD pipelines using our guidelines and templates
Use web development best practices through our Python libraries
Day-to-day work on this team is interdisciplinary and can range between implementing new features to the platform, performing Infra updates with limited downtime, and supporting Engineering team initiatives.
The Infrastructure team is a mission critical team that enables multiple Engineering teams to be efficient and productive for our growing engineering practice.
Our responsibilities include:
Providing standard architectures that are reliable, scalable, and secure
Providing robust infrastructure abstractions
Providing guidance on application development, deployment, reliability, scalability, and security
Hosting training sessions to increase engineers' impact & mastery
In This Role You Will:
Work together with the Infra Team Lead and other team members to build better developer tooling and environments
Mentor and be a project leader on key initiatives with a high level of visibility
Leverage Python and AWS services (such as AWS CloudFormation or AWS CDK) to automate the provisioning and management of infrastructure resources
Participate in incidents and help teams follow the established incident management processes and iteratively improve them
Implement robust monitoring solutions to track the health, performance, and availability of existing systems using tools like Datadog and AWS CloudWatch
Help improve performance bottlenecks across our Django backend and React frontend infrastructure
Help improve the CI/CD pipeline using tools like GitHub, CircleCI, and Python scripts
Collaborate with other Engineering teams and act as an embedded SRE to help them gain reliability objectives for their systems by establishing SLAs, SLIs, and SLOs
Contribute to our cloud infrastructure platform and developer tools
You Are Likely To Succeed If:
Bachelor's or Master's degree in Computer Science or related STEM fields
5+ years experience with
Python
3+ years of experience with SRE/DevOps discipline.
You get why we need SLOs/SLIs and error budgets.
You might have read the
SRE Handbook
end-to-end
3+ years of hands-on experience with
AWS
, including CloudFormation, EC2, ECS, RDS, S3, SQS, VPC, IAM, CloudTrail, CloudWatch, etc.
You've done things beyond manually using the AWS console.
You're comfortable using the AWS CLI, boto3, and writing IaC using CDK or CDKTF.
You wouldn't be surprised if I asked you when to use Fargate vs EC2 for the AWS ECS service
2+ years of experience with
Django
2+ years of experience with SQL
You have
Performance Engineering skills
, you can dig into our perf tools to understand where the bottlenecks are AND how to improve them.
Given a slow Django endpoint, you can tell me all the steps you would take to improve the performance.
You can take it further and set up monitoring and alerting to know if we are exhausting our error budgets.
You know how to create and run load tests, stress tests, etc.
to ensure new code does not degrade the performance of the endpoint you just improved
Familiarity with containerization technologies such as Docker and container orchestration platforms like ECS
You are comfortable working with new cloud technologies and learning new skills
You are a team player with exceptional verbal and written communication skills
You have strong problem-solving and troubleshooting skills
Nice to have: experience with Kubernetes, React, Terraform, Databricks, Datadog, Airflow
For You:
You'll work with a New York-based team
You will have the opportunity to acquire new skills through an internal training program
Flex PTO for any reason, including sick days (no specified limits), flexible work schedule
Personal laptop
Health and wellness package
Remote work