NielsenIQ

Senior Site Reliability Engineer

Pune, MH, IN

7 days ago
Save Job

Summary

At NielsenIQ, we’re passionate about building software that solves problems across B2B and B2C business. We count on our site reliability engineers (SREs) to empower users with a rich feature set, high availability, and stellar performance level to pursue their missions. We’re seeking an experienced SRE to deliver insights from massive-scale data in real time. Specifically, we’re searching for someone who has fresh ideas and a unique viewpoint, and who enjoys collaborating with a cross-functional team to develop real-world solutions and positive user experiences for every interaction.


Objectives of this role

  • Run the production environment by monitoring availability and taking a holistic view of system health
  • Build software and systems to manage platform infrastructure and applications
  • Improve reliability, quality, and time-to-market of our suite of software solutions
  • Measure and optimize system performance, with an eye toward pushing our capabilities forward, getting ahead of customer needs, and innovating for continual improvement
  • Provide primary operational support and engineering for multiple large-scale distributed software applications


Responsibilities

  • Gather and analyze metrics from operating systems as well as applications to assist in performance tuning and fault finding
  • Partner with development teams to improve services through rigorous testing and release procedures
  • Participate in system design consulting, platform management, and capacity planning
  • Create sustainable systems and services through automation and uplifts
  • Balance feature development speed and reliability with well-defined service-level objectives


Qualifications


Required skills and qualifications

  • Bachelor's degree in computer science with 8+ years of experience in an enterprise environment.
  • Master’s degree desirable in Computer Science and related fields.
  • Relevant AWS/Azure certification is required.
  • Ability to program (structured and OOP) using one or more high-level languages, such as Python, Java, C/C++, Ruby, and JavaScript
  • Proactive approach to identifying problems, performance bottlenecks, and areas for improvement

Desired Experience

  • Site Reliability Engineer (SRE) experience for Azure/AWS Cloud with a large account is a must.
  • Experience working in an agile development environment and project management
  • Familiarity with cloud frameworks, containers and Kubernetes services
  • Understanding of cloud infrastructure and the compute stack based on Linux, and/or Windows, the management tools, and the network integration.
  • Hands-on experience with 2 or more automation tools like Terraform, Ansible, Cloud Formation, ARM Templates
  • Experience with distributed storage technologies such as NFS, HDFS, Ceph, and Amazon S3, as well as dynamic resource management frameworks (Apache Mesos, Kubernetes, Yarn)
  • Hands on experience with scripting with Unix/Powershell.
  • Excellent written and verbal communication skills in English.
  • Experience in all aspects of cloud computing (e.g. infrastructure, storage, platforms, data etc.)
  • Experience of architecture, design, and implementation of complex, highly available and highly scalable Cloud solutions
  • Strong knowledge in Cloud-best practices and hands-on experience in design, implementation, and/or support of cloud migrations and operations

How strong is your resume?

Upload your resume and get feedback from our expert to help land this job

People also searched: