• 6 years of strong SRE experience, along with knowledge of core Azure services, IoT/Event Hub, and Databricks
• Must have 3 years of experience with Kubernetes and Docker
• Implement and manage monitoring (ELK), alerting, and logging systems to ensure proactive identification and resolution of issues
• Engage in and contribute to system monitoring, incident management, performance tuning, and fault finding
• Must have scripting experience in Python, PowerShell, or another scripting language
• Must communicate effectively, with excellent logic and problem-solving skills and a drive to make a difference
• Good to have: experience with AI/ML Ops, release management, and CI/CD using tools such as GitHub, Black Duck Hub, Coverity, and container signing, with a good understanding of software configuration management
• Ability to understand and communicate customer issues
• Experience in developing and supporting enterprise applications
• Good written and verbal communication skills, with the ability to document and communicate technical information to IT professionals