CPGIO

Senior Software Engineer

Addison, IL, US

$140k
4 days ago

Job description


About CPGIO:

At CPGIO, we're revolutionizing commerce intelligence. With a legacy as one of the top eCommerce shops globally and bootstrapped revenue of $100M/year, we're now pivoting toward high-tech, agent-reasoning information products that empower over 500 leading household brands. Our platform converges project management, analytical data, and supply chain information into actionable insights, helping brand managers and eCommerce leaders make smarter, faster decisions without ever compromising security or customer trust.


Role Overview:

We’re looking for a dynamic Senior Software Engineer to join our small, agile team and report directly to our Chief Product and Technology Officer. In this role, you will drive the design, implementation, and continuous improvement of the reliability, performance, and security of our next-generation commerce intelligence platform. You'll be instrumental in safeguarding our data flows and building transformative workflows that underpin the way brands manage their business.


Key Responsibilities:


1. System Design & Architecture:

  • Innovative Infrastructure: Architect and implement resilient, scalable, and secure systems that support real-time insights and intelligent agent-driven workflows.
  • Disaster Recovery & Resilience: Develop and refine failover strategies, ensuring minimal disruption and rapid recovery.
  • Collaborative Design: Partner with product and development teams to integrate reliability best practices seamlessly into every stage of our product lifecycle.


2. Monitoring & Alerting:

  • Proactive Observability: Build robust monitoring frameworks using tools such as Grafana, Loki, Tempo, and Mimir to detect anomalies and ensure system health.
  • Real-Time Alerts: Develop automated alerting systems that empower teams to respond swiftly to potential issues before they impact customers.


3. Automation & Tooling:

  • Streamlined Operations: Implement infrastructure-as-code solutions and automation scripts to manage deployments, scaling, and routine system checks.
  • CI/CD Enhancement: Optimize continuous integration and continuous deployment pipelines to accelerate delivery cycles while maintaining high standards of quality and security.


4. Incident Management & Response:

  • Leadership in Crisis: Lead incident response efforts, conducting root cause analysis and driving post-mortem reviews to prevent future disruptions.
  • Playbook Development: Establish and maintain comprehensive incident response procedures that ensure a coordinated and efficient resolution process.


5. Capacity Planning & Performance Optimization:

  • Scalability: Forecast and manage system capacity needs to accommodate rapid growth and dynamic workloads.
  • Performance Tuning: Identify bottlenecks and fine-tune systems to maintain optimal performance and low latency across all services.


6. Security & Compliance:

  • Data Protection: Enforce and enhance security best practices to protect sensitive customer data and ensure the integrity of data flows.
  • Compliance: Collaborate with cross-functional teams to ensure adherence to industry standards and regulatory requirements.


Skills & Qualifications:


Technical Expertise:

  • Programming Languages: Proficient in Python, Node.js, and JavaScript.
  • Artificial Intelligence: Experience with machine learning, LLM management, and AI training is a big plus.
  • Database Systems: Hands-on experience with PostgreSQL and other relational databases.
  • Cloud Platforms: Proven expertise in AWS, Neon, and similar cloud environments.
  • Infrastructure Abstraction: Familiarity with platforms like Railway, Vercel, Netlify, and Firebase.
  • CI/CD Tools: Experience with GitHub Actions, Jenkins, or CircleCI.


System & Architecture:

  • Distributed Systems: Deep understanding of microservices architectures, containerization (Docker), and orchestration (Kubernetes).
  • Networking: Strong grasp of networking concepts, security protocols, and system administration in Linux environments.


Soft Skills:

  • Analytical Thinking: Exceptional problem-solving and analytical abilities.
  • Collaboration: Excellent communication skills with a track record of successful cross-functional collaboration.
  • Adaptability: Thrives in a fast-paced, evolving environment with a proactive approach to challenges.


Experience:

  • 4-7 years of experience in Site Reliability Engineering or a similar role.
  • Bachelor’s degree or equivalent work experience preferred.
  • Demonstrated experience managing high-traffic, complex systems with a focus on reliability and performance.
  • Proven leadership in incident management and system optimization initiatives.


What We Offer:

  • Culture & Environment: Enjoy the laid-back vibe of Addison with perks like summer hours, WFH Fridays, flexible start times, and generous PTO. We offer a comprehensive benefits package including medical, dental, vision, life insurance, a robust 401(k) with company match, and HSA contributions. Employees enjoy discounts on household items, access to an on-site gym, and a supportive team that values collaboration, learning, and professional growth. Plus, we host quarterly events, team outings, and recognition programs to celebrate your success.
  • Impact & Innovation: Be at the forefront of developing transformative technologies that redefine how brand managers and eCommerce stakeholders operate.
  • Competitive Benefits: Salary DOE ranging from $95,000 to $140,000, along with a comprehensive benefits package.


Join Us:

If you're passionate about building secure, high-performance systems that drive industry-leading innovation, we’d love to hear from you. Apply now to be part of a team that’s setting the pace for the future of commerce intelligence.


CPG.IO
