Senior Software Engineer, Site Reliability, Cloud Platform
Argo AI
Munich, Germany
3 days ago

Company: Argo AI GmbH

Who we are:

Argo AI is in the business of building self-driving technology you can trust. With experienced leaders in the field and collaborative partnerships with some of the world’s largest automakers, we’re building self-driving technology that is engineered to scale globally and transform mobility for millions.

Talented individuals join our team because they share our purpose to make it safe, easy, and enjoyable for everyone to get around cities.

We aspire to impact key industries that move people and goods, from ride hailing to deliveries.

Meet the team:

Argo AI Site Reliability Engineers are responsible for building and running our mission-critical systems. Through the implementation of monitoring and automation, our SREs constantly ensure the health, reliability, scalability, and performance of Argo AI’s infrastructure.

The Site Reliability team works together with engineering teams, IT, and Security to address unique business challenges through comprehensive solutions while taking into account system uptime, reliability, and maintainability.

Members of the team are expected to promote the importance of resiliency patterns to other teams within Argo AI, as well as contribute to a culture of continuous learning.

What you’ll do:

  • Design and implement scalable distributed systems to facilitate the development of self-driving vehicles
  • Monitor and maintain mission-critical production services to ensure maximum uptime
  • Document actions to build a comprehensive library of runbooks, which will act as a knowledge base and foundation for automation
  • Scale the reliability and velocity of our systems and processes through increased automation
  • Participate in an on-call rotation and contribute to a culture of continuous improvement through blameless postmortems

What you’ll need to succeed:

  • Degree in Computer Engineering, Computer Science, Electrical Engineering, Robotics or a related field
  • Expertise in at least one scripting language (e.g. Bash, Python)
  • Fundamental understanding of Linux operating system internals, TCP/IP networking, and storage subsystems
  • Strong experience scaling and securing services in the cloud (AWS, GCP) or cloud native environments
  • Experience using infrastructure-as-code principles to automate the creation of infrastructure resources (e.g. Terraform, CloudFormation)
  • Understanding of engineering design limitations and ability to provide guidance to teams to scale their services to achieve desired performance within budget
  • Experience implementing and debugging cloud native and open source tools such as Kubernetes, etcd, Prometheus, Fluentd, and Istio
  • Strong communication skills and the ability to work effectively in a diverse and distributed team

What we offer you:

  • Competitive compensation packages
  • 30 vacation days
  • Subsidized daily lunches, beverages, and snacks
  • Professional development reimbursement
  • Global employee assistance program (offerings include work-life balance support, mindfulness programs, life coaching, new parent coaching, and more)
  • Local and global discount programs
  • Company and team bonding outlets: employee resource groups, quarterly team activity stipend, and wellness initiatives