About the Company YipitData is the leading market research firm for the disruptive economy and recently raised $475M from The Carlyle Group at a valuation of over $1B. We analyze billions of data points every day to provide accurate, detailed insights on ridesharing, e-commerce marketplaces, payments, and more. Our data team uses proprietary technology to identify, license, clean, and analyze the data many of the world's largest investment funds and corporations depend on. For three years, we have been recognized as one of Inc's Best Workplaces. We are a fast-growing technology company backed by Norwest Venture Partners and The Carlyle Group. We cultivate a strong people-centric culture focused on mastery, ownership, and transparency. We are committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, marital status, disability, gender, gender identity or expression, or veteran status. We are proud to be an equal-opportunity employer. About the Role We are looking for a Software Engineer - SRE to join our engineering infrastructure team. This team is responsible for developing and maintaining the infrastructure and systems that support the availability, reliability, scalability and performance of our software applications and tools. The team works closely with development and infrastructure teams to identify and resolve issues in our systems, increase reliability, measure reliability indicators, implement automation and optimize performance. Our AWS cloud infrastructure and developer tools allow engineers to quickly develop & release solutions that are secure, reliable, scalable, and cost-efficient. Our responsibilities include: Providing standard architectures that are reliable, scalable, and secure Providing robust infrastructure abstractions Providing guidance on application development, deployment, reliability, scalability, and security Hosting training sessions to increase engineers' impact & mastery Our Infrastructure department has two teams that work closely to enable other engineering teams to build software as efficiently as possible: Infra & Tools: develop our infrastructure platform and development tools SRE: support engineering teams in building and operating systems sustainably with high reliability and availability In This Role You Will: Work together with SRE Lead and other team members to measure reliability of systems in the cloud using SLIs (Service Level Indicators) and SLOs (Service Level Objectives) Leverage Python and AWS services (such as AWS CloudFormation or AWS CDK) to automate the provisioning and management of infrastructure resources. Participate in incidents and help teams follow the established incident management processes and iteratively improve them Get involved in maintenance activities for various cloud resources and keep track of them Implement robust monitoring solutions to track the health, performance and availability of existing systems using tools like Datadog and AWS CloudWatch. Help improve the CI/CD pipeline using tools like GitHub, CircleCI and Python scripts Collaborate with other Engineering teams and act as an embedded SRE to help them gain reliability objectives for their systems Contribute to our cloud infrastructure platform and developer tools You Are Likely To Succeed If: 3 years of experience in software engineering 3 years experience with Python 1 years of experience with SRE/DevOps discipline You have performance engineering skills. You can dig into our perf tools to understand where the bottlenecks are AND how to improve them Good knowledge and experience (3 years) with AWS services, including CloudFormation, EC2, ECS, RDS, S3, SQS, VPC, IAM, CloudTrail, CloudWatch, etc. Familiarity with containerization technologies such as Docker and container orchestration platforms like ECS You are comfortable working with new cloud technologies and learning new skills You are a team player with exceptional verbal and written communication skills You have strong problem-solving and troubleshooting skills. Excellent communication and collaboration abilities to work effectively in a team environment. Nice to have: experience with Django, React, Terraform, Databricks, Datadog, Airflow For You: You'll work with a New York-based team You will have the opportunity to acquire new skills through an internal training program Flex PTO for any reason, including sick days (no specified limits), flexible work schedule Personal laptop Health and wellness package Remote work