Position Overview:
We are seeking a highly skilled Automation Specialist with comprehensive knowledge of telecommunications systems and infrastructure. The ideal candidate will have expertise in systems administration, middleware, AWS services administration, databases, Linux, networking, and application flows. The primary responsibility of the RRT Specialist will be to respond rapidly and efficiently to technical issues, implement proactive monitoring solutions, and automate processes to enhance operational efficiency.
Key Responsibilities:
Technical Expertise: Demonstrate proficiency in systems, infrastructure, middleware, AWS services administration, database management, Linux, and network technologies. Possess in-depth knowledge of application flows, especially legacy systems, focusing on understanding the "why" rather than the "how".
Dashboard Development: Develop automated dashboards for APIs and ELK-based monitoring and analytics, including proactive alerting mechanisms for application teams.
Problem Resolution: Debug existing P3 (Priority 3) PM (Problem Management) issues, actively working to reduce their aging, and automate end-to-end user journeys.
Impact Analysis: Assist in conducting thorough impact analysis, pinpointing issues, and detecting blast radius to facilitate efficient problem resolution.
Production Handover: Initiate production handovers, ensuring a seamless transition and operational continuity.
Continuous Improvement: Continuously learn and share expertise with application and infrastructure teams. Review and improve Standard Operating Procedures (SOPs) for quality and effectiveness.
Education and Training: Educate application teams on best practices and facilitate knowledge exchange between teams. Conduct RCA (Root Cause Analysis) investigations for Problem Records (PRBs) and implement solutions to reduce aging.
Log Analysis and Maintenance: Analyze log files to identify gaps or areas for improvement. Perform regular housekeeping and log retention checks to maintain system integrity.
Performance Optimization: Optimize database indexing to address performance issues. Establish standard configurations and categories in applications for efficient management via ServiceNow.
Monitoring and Documentation: Ensure adequate monitoring is in place for all services and applications. Centralize flow diagrams, SOPs, and architecture diagrams for quick reference and accessibility.
Must-have skills: DevOps with automation, ELK, CI/CD, Jenkins, Docker, AWS