
Posted 16 days ago
No clicks
Provide production support and SRE-style operations for cloud and Linux-based data solution applications, troubleshooting user and technology issues and owning major production incidents. Maintain observability and monitoring (CloudWatch, Datadog, Geneos, etc.), automate toil with self-service tooling, and coordinate CI/CD-driven deployments. Support lifecycle management of cloud services (AWS: S3, EC2, Glue, Spark, RDS, Snowflake, Argo) and liaise with development and infrastructure teams for release management. Use scripting (Python/Shell) and monitoring/APM tools to drive reliability and incident resolution.
- Compensation
- Not specified
- City
- Ciudad Autónoma de Buenos Aires
- Country
- Argentina
Currency: Not specified
Full Job Description
Location: Ciudad Autónoma de Buenos Aires, Argentina
Job responsibilities:
- Provide resolution of all user and technology issues on the Production instance of the data solution application suite that involves tech troubleshooting on various systems and tools.
- Bring SRE mind-set to the application support Org. Follow 70-30 Prod Support & SRE hybrid model.
- Contribute on traceability and observability of the cloud based applications using CloudWatch and Datadog etc.
- Focus on Toil automation and Self-service tooling to avoid repetitive manual tasks.
- Support and Technology lifecycle management of the various Cloud and Linux based applications.
- Coordinate and execute all software deployments into Production environments with the help of automated CI/CD pipeline.' and 'Own major production incidents, working closely with other technology teams in order to resolve business-impacting issues
- Develop specialist knowledge in relevant systems, sharing that knowledge with team members and global partners
- Liaising with application development and infrastructure teams targeting on-going development, production activity and release management
Required qualifications, capabilities, and skills:
- 3 years of experience or equivalent expertise troubleshooting, resolving, and maintaining information technology services
- Demonstrated knowledge of applications or infrastructure in a large-scale technology environment both on premises and public cloud
- Experience in observability and monitoring tools and techniques
- Exposure to processes in scope of the Information Technology Infrastructure Library (ITIL) framework
- AWS/Cloud based monitoring
- Experience supporting AWS cloud based applications
- Python/Shell Scripting
- AWS services– S3, EC2, Glue, Apache Spark, Snowflake, Argo Workflows, RDS
- Application Performance monitoring tools : Datadog/Splunk/AppDynamic/Dynatrace/Grafana/Splunk
- Schedulers - Glue/Bridge/Step/Control-M / Autosys
- Monitoring - Geneos, Datadog / CloudWatch
Preferred qualifications, capabilities, and skills:
- Strong written and verbal communication skills
- Strong problem solving skills- Ability to understand complex component/system/service interactions and look for causes and provide solutions.
- Self-driven & proactive
- Customer Service Orientation - build positive working relationships with customers at all times providing realistic information and managing their expectations
- Prior knowledge in JAVA is a plus
- Advanced English level






