HyperVerge

DevOps Engineer

Bengaluru, KA, IN

8 days ago
Save Job

Summary

About Us


HyperVerge is a global deep tech AI company serving clients across India, Africa, ASEAN, and US regions. Our market-leading AI technology is ranked among the best in globally reputed leaderboards such as NIST. Our products are trusted by large enterprises like Jio, Vodafone, Airbus, SBI, Bajaj, ICICI Securities, and unicorns such as Grab, Groww, Cred, and Slice for verifying customer identities and preventing fraud.

We’ve powered over 1 Billion (~12% of the world’s population) ID checks in the last four years!


About the role:


  • Design, build and maintain infrastructure‑as‑code automation stacks on AWS using Terraform, spanning multiple environments, regions and MLOps workflows (training pipelines, model registry, and inference platforms).
  • Ensure high levels of availability, observability, performance, scalability and security for both application and ML workloads by architecting resilient architectures, defining SLOs/SLA-aware monitoring and enforcing DevSecOps controls.
  • Enable highly reliable, fast and operationally efficient release pipelines for services and models through GitOps‑driven CI/CD (GitHub Actions / GitLab CI / Argo CD), supporting various deployment strategies.
  • Continuously enhance developer efficiency, system reliability, resilience, and cost optimization by architecting robust leak‑prevention guardrails.


Key Skills and Strengths Required:


  • 1–4 years of experience in a 24×7 production environment as a DevOps/Platform Engineer.
  • Strong expertise in AWS (EC2, VPC, IAM, ALB, Auto Scaling, S3, CloudWatch) with a focus on building resilient and scalable systems.
  • Deep hands-on experience with Terraform and infrastructure as code for managing complex environments efficiently.
  • Solid understanding of security fundamentals: TLS, encryption, secrets management, and access controls.
  • Proficient in Python or Go, along with shell scripting, for building automation, tooling, and integrations.
  • Experience with GitOps-based CI/CD (GitLab CI, Argo CD) and deployment strategies
  • Strong Linux administration and troubleshooting skills.
  • Familiarity with HTTP, DNS, and reverse proxy configurations for production traffic handling.
  • Hands-on with Kubernetes (EKS), Helm, and Istio for container orchestration.
  • Comfortable working with open-source tooling, AWS-native services, and implementing cloud cost optimization practices.
  • Exposure to LLM workloads: model serving, inference, vector DBs, and RAG pipelines.
  • Outcome-oriented mindset with a strong bias for action, ownership, and delivering measurable results.


Good to have:


  • Exposure to GPU infrastructure for ML/DL training and inference workloads.
  • Experience managing and scaling ELK for centralized logging and observability.
  • Familiarity with database systems such as Postgres, MongoDB, Redis, and Redshift.
  • Knowledge of service reliability tooling such as chaos engineering, circuit breakers, and auto-healing mechanisms.
  • Experience setting up internal developer platforms (IDPs)
  • Exposure to performance testing, load generation tools and resource profiling in production.


This Role is for you If :


  • You’re passionate about technology and thrive in fast-paced, ever-evolving environments, staying current with tools, trends, and best practices.
  • You take initiative, question the status quo, and aren’t afraid to ask “Why?” or explore “What if?” scenarios to uncover better solutions.
  • You’re a continuous learner, always seeking to sharpen your skills and grow technically and professionally.
  • You’re proactive and action-oriented — you don’t wait for direction, you make things happen.
  • You enjoy diving deep into complex systems, understanding how things work, and finding ways to improve performance, reliability, and efficiency.
  • You’re outcome-focused — you care about the business impact of your work and align engineering efforts toward measurable goals.
  • You have strong problem-solving skills — you can break down ambiguity, troubleshoot effectively, and deliver thoughtful, scalable solutions.


How strong is your resume?

Upload your resume and get feedback from our expert to help land this job

People also searched: