IMC Trading

HPC Storage Engineer

Chicago, IL, US

about 1 month ago
Save Job

Summary

At IMC, technology is at the core of everything we do, and how we build and maintain our storage infrastructure is key to our global success. Our Storage Engineering team scales our storage infrastructure to support growing data volumes and multiple global research initiatives. As a Storage Engineer, you will also strengthen our global team by providing expert support, balancing workloads across regions, and ensuring our systems can efficiently handle current and future storage demands.


Your Core Responsibilities:

  • Design, deploy, and manage our global storage platform, ensuring high performance, massive scalability, reliability, and future-proof solutions
  • Collaborate with cross-functional research and engineering teams, enabling them to leverage high-performance storage solutions, optimize data workflows, and accelerate computational workloads across global infrastructures
  • Diagnose and resolve complex storage, Linux, and networking challenges in a fast-paced environment
  • Integrate storage systems with distributed computing clusters, ensuring seamless compatibility with hardware and software across multiple locations
  • Participate in ‘follow-the-sun’ support, responding proactively to critical storage issues and ensuring continuous uptime for our operations worldwide


Your Skills and Experience:

  • 5+ years of experience in system storage architecture and system design within a large-scale compute environment
  • Storage system design and optimization with systems like Lustre, GPFS, S3, BeeGFS, Weka, Ceph, Vast, PowerScale, DDN, MinIO
  • Strong Linux Engineering skills, including bare-metal
  • Understanding of the implementation of storage stack from the kernel to user space (including file systems, block storage, I/O schedulers, VFS)
  • Storage benchmarking and performance tuning, with experience analyzing throughput, latency, IOPS, and workload-specific optimizations
  • Ability to manage large-scale, performance-critical environments, including capacity planning, scaling, and optimization
  • Knowledge of hardware components critical to storage systems, including NVMe, CPU/GPU/xPU architectures, PCIe, power utilization, NIC (eth/ib), PMEM, SCM, RedFish and how they impact performance and scalability
  • Proficiency in programming with languages such as Rust, Python, Go, or Bash
  • Nice to haves: Kubernetes, CSI, Slurm, AWS, GCP, HDFS, AI/ML frameworks, Prometheus and Terraform/Ansible
  • Exceptional communication skills with the ability to explain complex technical concepts to non-technical audiences


How strong is your resume?

Upload your resume and get feedback from our expert to help land this job

People also searched: