Job Title: Resiliency Architect
Experience: 10+ yrs
Notice Period: Immediate to 30 days
Location: Hyderabad
Job Description:
Principal Responsibilities
• Understand business goals and drivers and translate those into an appropriate technical solution.
• Gather technical requirements, assess capabilities, and provide and engineer appropriate resilient solution recommendations.
• Design and implement full stack applications that can be used to demonstrate and validate various resiliency and integration patterns.
• Validate and test resiliency solutions in the Cloud, on containers and on premises.
• Focus on continuous improvement practices as required to meet system resiliency imperatives.
• Define high availability and resilience standard and best practices for adopting new and existing technologies for applications across platforms.
• Establish the appropriate monitoring and alerting of solution events related to performance, scalability, availability, and reliability.
• Support for the adoption of DevOps methodology and Agile project management.
• Communicates complicated technical concepts effectively to a broad group of stakeholders.
• Provide mentoring, knowledge transfer and assist in training for other team members.
Experience
• Minimum of 10 years’ experience in the design & implementation of distributed applications
• Minimum of 5 years’ experience in networking, infrastructure, middleware and database architecture
• Minimum of 5 years’ experience in highly available architecture and solution implementation
• Minimum of 5 years’ experience with industry patterns, methodologies, and techniques across the disaster recovery disciplines.
Knowledge and Skills
• Ability to be curious, solve problems and engineer solutions that meet resiliency requirements
• Ability to work independently, with minimal supervision.
• Strong knowledge of AWS cloud environment is a plus
• Experience with performance analysis, tuning and engineering is a plus
• Knowledge of monitoring tools (cloud watch, cloud trail, Splunk, and other application monitoring.)
• Knowledge of SSL concepts, Authentication and Authorization, AWS Identity Access Management (IAM)
• In-depth, hands-on expertise in Java, SQL, Linux
• Must be comfortable working in an open, highly collaborative team
• Strong troubleshooting skills
• Ability to write scripts (Bash, PHP, Python) for automation of solution resiliency validation and verification.
• Excellent oral and written communication skills along with and ability to communicate at all levels.
• Chaos engineering experience a huge plus.
• Bachelor’s Degree in a technical discipline or equivalent work experience