Nuance is putting together an SRE team to own and manage a new Cloud environment for one of it’s high profile service offering.
Modern software delivery process:
Other than the traditional SRE responsibilities, we see the role as an integral part of our Software delivery and we’re looking for people that have a “shift level mindset”. You’ll work closely with the devOps team and provide requirements for efficient workflows. As a consumer of the infrastructure services, you’re also a key stakeholder and provide feedback to the Infrastructure team for exposing build out and configuration pipelines along with the necessary tool sets to monitor and scale the environment. Environment build out as code along with common pipelines is key for production stability and reproducibility on lower environments.
Security front and center:
Our customers are large Enterprises that require different set of compliances and have security at the forefront of every deal. Every decisions, tasks and operations need to consider the impact on PCI, HIPPA, PII, GDRP, FedRAMP compliances and overall security of the solution. Traceability for audits and quick access to security metrics needs to be part of the day to day.
Skills and Technologies:
Expert in Azure Cloud management and services (Azure app-insigts, active-directory (AD), role-based-access-control (RBAC), JIT, KeyVault, etc.)
Exposure Azure DevOps (ADO)
Monitoring of Cloud services performance and alarms (Azure Monitor, Grafana, Jager, Kibana, …)
Kubernetes Orchestration and Container services echo system
Cloud architecture, networking and topologies (including Geo Redundancy and failover, Cloud to Cloud communication, )
Experience with Compliance accreditation (FedRAMP, PCI, PII, GDRP, HIPPA, …)
Security Reviews (Pen Testing, security environment scans, threat modeling, …)
Networking (TLS, mTLS, Service mesh, Loadbalancer, proxy, API Gateways, App Gateways, Express Route, … )
Creative and strives to automate routine tasks (scripts, dashboards, workflows, microservices)
Tracing and network debugging (OpenTelemetry, Wireshark, …)
Responsibilities:
Setting and Documenting SLO (Service Level Objectives)
Implementing SLI (Service Level Indicators)
On call for escalations regarding cloud services performance.
Leading debugging sessions and pulling in necessary subject matter experts to determine root cause
Digesting the customer contract and hosting requirement to design the appropriate hosting solution architecture
Provisioning and scaling environment to meet production traffic variations (networking and HPA)
Gather and analyze metrics from cloud resources to assist in performance tuning and fault finding
Engage in and improve the whole lifecycle of services from inception and design, throughout development, capacity planning, and launch reviews - to deployment, operation, and refinement
Scale systems sustainably through mechanisms such as automation; evolve systems reliability, efficiency, and velocity by pushing for change
Nice to Have:
Experience with other cloud vendors (AWS, GCS)
IPV6
#LI-HYBRID
Job Type: Full-time
Pay: $150,000.00 - $195,000.00 per year
Benefits:
401(k)
401(k) matching
Dental insurance
Employee assistance program
Employee discount
Flexible schedule
Flexible spending account
Health insurance
Health savings account
Life insurance
Paid time off
Professional development assistance
Referral program
Tuition reimbursement
Vision insurance
Schedule:
8 hour shift
Monday to Friday
On call
Supplemental Pay:
Bonus pay
Education:
Bachelor's (Preferred)
Experience:
Kubernetes: 5 years (Preferred)
container: 5 years (Preferred)
Docker: 5 years (Preferred)
DevOps: 5 years (Preferred)
site reliability: 5 years (Preferred)
Azure: 10 years (Preferred)
AWS: 10 years (Preferred)
Security clearance:
Secret (Preferred)
Willingness to travel:
25% (Preferred)
Work Location: Multiple locations