Lead Application Reliability Engineer
“Job Overview
The selected candidate will become the key engineer in supporting and advancing the platform used for threat-modeling process in Citi. The responsibilities will cover (among others) maintaining and supporting the threat-modeling application as well as developing relevant tools used throughout the threat-modeling process. The application is comprised of web servers and backend data storage databases and supporting it requires understanding of middleware, database, container, and AWS cloud environment as well as change-control and compliance processes. We are seeking a highly skilled and dedicated Lead Application Reliability Engineer to ensure the continuous availability, optimal performance, and security of a critical threat-modeling application.
Required Skills & Qualifications
– 6+ years of relevant experience in an Engineering role, preferably in Financial Services or a large, complex, and/or global environment.
– Experience managing and troubleshooting Linux Operating Systems , including (e.g., Red Hat Enterprise Linux (RHEL), CentOS, Ubuntu) System Administration Tasks like and User Management, Service Restarts, – File System Checks. Must Have.
– Proficiency in Scripting for Automation and with (e.g., Bash, Python) Configuration Management Tools for system administration and infrastructure automation – (e.g., Ansible, Puppet, Chef). Must Have.
– Experience with container orchestration using Helm and Kubernetes on platforms like or AWS EKS, GCP GKE, – OpenShift. Must Have.
– Working knowledge of Relational Databases , including basic querying – (e.g., PostgreSQL). Must Have.
– Proven track record of maintaining applications and their technology stacks compliant with security and configuration requirements, including successfully passing internal and external security audits by demonstrating secure configuration of applications and infrastructure and ensuring continuous compliance with regulatory standards (e.g., implementing least privilege access, hardening OS, managing firewall rules) through automated checks and reporting – (e.g., SOX, GDPR). Must Have.
– Demonstrated adherence to strict change control procedures, executing all changes through a formalized change management process (e.g., code deployments, infrastructure updates) with proper documentation and approvals – (e.g., ITSM, ServiceNow). Must Have.
– Experience with Ticketing Systems – (e.g., Jira, ServiceNow). Must Have.
Note
👉 Please reference you found the job on https://topdevopsjobs.com/, this helps us get more companies to post here, thanks!
When applying for jobs, you should NEVER have to pay to apply. You should also NEVER have to pay to buy equipment which they then pay you back for later. Also never pay for trainings you have to do. Those are scams! NEVER PAY FOR ANYTHING! Posts that link to pages with ‘how to work online’ are also scams. Don’t use them or pay for them. Also always verify you’re actually talking to the company in the job post and not an imposter. A good idea is to check the domain name for the site/email and see if it’s the actual company’s main domain name.
“

