About the Company
My client is a fast-growing technology company operating the largest online energy marketplace in Japan. Since its founding in 2019, the platform has captured over 95% of the domestic market and now handles nearly ¥2 trillion in annual transaction volume.
In 2024, the company raised ¥6 billion in Series B funding from top global investors, including venture capital firms, financial institutions, and major energy trading houses. With this funding and institutional support, the company is actively scaling its engineering team to meet the growing demand for secure, reliable, and scalable infrastructure.
Role Overview
This is a hands-on engineering position focused on ensuring system reliability, performance, and scalability across a growing suite of energy trading and data services. The successful candidate will join a small, high-performing engineering team and contribute to infrastructure architecture, observability, DevOps, and SRE practices in a high-impact environment.
Key Responsibilities
- Design and implement monitoring and observability systems
- Lead architecture improvements to support service modularization and scale
- Define and manage service-level objectives (SLOs)
- Improve CI/CD pipelines using GitHub Actions, Cloud Build, and Argo CD
- Build and maintain shared Terraform modules and infrastructure as code
- Maintain and document incident response protocols and on-call practices
- Conduct regular failure drills and system resilience testing
- Ensure security best practices across infrastructure and deployment pipelines
- Create and maintain internal engineering documentation and technical standards
Required Skills
- Experience with team-based software development using Git
- CI/CD pipeline design using tools like GitHub Actions or CircleCI
- Building and maintaining observability/monitoring platforms
- Infrastructure-as-code expertise (Terraform or similar)
- Experience operating public or private cloud environments (GCP, AWS, Azure, OpenStack)
- Solid understanding of Unix/Linux systems and container technologies (Docker)
- Knowledge of networking and system security fundamentals
- Ability to create technical documentation and build alignment across stakeholders
Preferred Qualifications
- Go development experience
- Experience diagnosing and solving performance bottlenecks and SPOFs
- Kubernetes architecture design and implementation
- Practical experience with SLOs and capacity planning
- Experience with Datadog, Prometheus, and BI tools (e.g., Looker Studio)
- Hands-on experience with production RDBMS (PostgreSQL, RDS)
- Security auditing and infrastructure hardening experience
- Background with microservices design and distributed systems architecture
- Exposure to building or operating ML infrastructure is a plus
Selling Points
- Market Dominance: You'll be building infrastructure for a platform used by nearly all of Japan’s energy providers
- High Growth, High Scale: The team is scaling infrastructure to support exponential growth in a regulated, mission-critical market
- Modern Tech Stack: Work hands-on with GCP, Kubernetes, Terraform, Go, Argo, and GitHub Actions in production
- Strong Funding & Backing: Backed by DCM Ventures and other major investors across the energy and financial industries
- Engineering-Led Culture: Flat structure with strong ownership, open technical discussions, and autonomy to drive decisions
If you're an experienced SRE or platform engineer looking to work on high-scale, mission-critical systems in a high-trust, engineering-led environment, this is a unique opportunity with one of Japan’s most impactful tech companies.
📩 Interested candidates are encouraged to apply or reach out directly for more details.