Rapyuta Robotics
Infrastructure Operations Engineer
インフラストラクチャオペレーションエンジニア
Tags: Full-time, 2 YOE, Business Japanese
Chennai, Tamil Nadu, India / Tokyo, Tokyo, Japan・Fetched 30+ days ago
Job Description
Team: Engineering
Rapyuta Robotics, an ETH Zurich startup headquartered in Tokyo, aspires to become the global leader in making robots more accessible. We currently lead the pick-assist AMR market in Japan and have secured investments from reputable backers, including Goldman Sachs, Sony, and Yaskawa.
As we establish a global hub in Chennai as an independent center to facilitate our global expansion, we're seeking an Infrastructure Operations Engineer. Your role will be crucial in ensuring the reliability, security, and optimal performance of our network infrastructure, involving the maintenance of hardware, software, and network systems. Collaboration with DevOps, Site Reliability, Robotics Software, and Fullstack Engineers is integral to the role.
Responsibilities:
- Ownership of Operational Metrics:
- Taking responsibility for critical operational metrics, including Mean Time to Detect (MTTD), Mean Time to Resolve (MTTR), Mean Time to Acknowledge (MTTA), and meeting SLAs for resolution.
- Incident Management:
- Acting as the definitive source of information during major incidents, providing real-time insights into current and future developments.
- Facilitating Pre-Launch Support:
- Validation and environment setup activities.
- Post-Launch Operational Maintenance:
- Ensuring ongoing operational efficiency by continuously measuring and monitoring site availability, network performance, and the overall health of the product.
- Lifecycle Enhancement:
- Improving the entire lifecycle of deployments and updates by leveraging operational knowledge to expedite processes.
- Collaboration with Project Management:
- Working closely to monitor progress and ensure the successful implementation of initiatives.
- Operations Handbook Development:
- Leading the development and evolution of the operations handbook, ensuring it remains comprehensive and up-to-date.
Requirements
Minimum qualifications:
- At least 2 years of relevant work experience.
- Proficiency in any general-purpose programming language such as Python or Go
- Strong understanding of Linux/Unix fundamentals.
- Strong analytical and debugging skills.
- Experience with Configuration Management, Docker, Kubernetes, IaaS, PaaS, Continuous Delivery, Continuous Integration, and the latest DevOps practices.
- Knowledge of Linux networking, routing concepts, DNS and DHCP.
- Strong communication skills in English
Preferred Qualifications:
- Ability to write clean, maintainable code, preferably in Python, and familiarity with common libraries, frameworks, and REST APIs to streamline deployment and release management.
- Hands-on experience with Configuration Management systems, particularly Ansible and Terraform.
- Expertise in provisioning servers and implementing disaster recovery and backup strategies for data from edge sites.
- Proficient in Docker, Kubernetes, and related technologies.
- Capability to actively participate in incidents, debug issues, and provide support to developers and product managers.
Benefits
- Competitive Compensation: Receive a salary package commensurate with your expertise.
- Leading Technology Environment: Contribute to groundbreaking advancements in robotics technology within a cutting-edge work environment.
- Elite Engineering Team: Collaborate with a team of highly skilled engineers, fostering an atmosphere of innovation and creativity.
- Global Workplace Culture: Engage in a multinational work culture that values diversity, offering a range of perspectives.
- Comprehensive Insurance: Access comprehensive insurance coverage for you and your family, ensuring peace of mind.
- Flexible Work Hours: Enjoy a flexible work schedule that accommodates personal needs while promoting work-life balance.