Position Overview: We are looking for a talented and motivated Site Reliability Engineer (SRE) to join our team remotely from LATAM. The ideal candidate will have strong technical skills and exceptional problem-solving abilities. As an SRE, you will ensure the reliability, availability, and performance of critical systems and applications. Key Responsibilities: Provide technical support for resolving complex issues and ensuring timely resolutions. Utilize Service Cloud (Salesforce) or Zendesk for managing support tickets and communication. Implement and monitor alerting and monitoring tools (e.g., Signalfx, Datadog) to ensure system health and availability. Support and optimize AWS services, including Athena , ensuring data reliability and accessibility. Troubleshoot and resolve issues related to applications, infrastructure, and APIs. Develop and maintain automation scripts using Shell Scripting , Groovy , or YAML for system processes and deployments. Collaborate with development teams to manage CI/CD pipelines using tools like Jenkins and Git/Bitbucket . Perform root cause analysis and ensure incident management aligns with ITIL/ITSM practices. Manage and troubleshoot monitoring tools to provide proactive system insights and enhance system reliability. Provide support for REST and WEB API integration and related issues. Document solutions, processes, and best practices for future reference. Requirements: Must-Have Skills: Linux: Proficiency in using and troubleshooting Linux systems. Shell Scripting: Strong experience in writing and debugging scripts. ITIL/ITSM: Knowledge of incident, problem, and change management processes. PL/SQL: Experience in database troubleshooting and query optimization. Monitoring Tools: Hands-on experience with tools like Signalfx or Datadog . Jenkins - CI/CD: Basic experience with pipelines and automation. Groovy Scripting/YAML: Ability to create and maintain scripts and configuration files. Git/Bitbucket: Familiarity with version control tools and workflows. REST and WEB API Support: Knowledge of API integration and troubleshooting. AWS Expertise: Experience with AWS services, including Athena . Application Troubleshooting: Proven ability to resolve issues in a production environment. Good-to-Have Skills: Configuration Management Tools: Experience with tools like Ansible or Chef . Java Project Troubleshooting: Ability to debug and resolve issues in Java-based applications. Soft Skills: English Proficiency: Strong verbal and written communication skills.