Mission - Your mission is to build a Data Ecosystem as the foundation for Data-Driven Product Development as part of PwC's Digital Factory.
Impact & Responsibilities - You run our infrastructure with Ansible, Terraform and Kubernetes. You are responsible for monitoring and alerting, and you alert on symptoms and not on outages.
You document every action so your findings turn into repeatable actions and then into automation. You also improve the deployment process to make it as boring as possible.
Your responsibilities also include debugging production issues across services and levels of the stack, as well as planning the growth of PwC's infrastructure.
Work together - You mentor and train other team members on design techniques and coding standards. You work with internal stakeholders to understand their needs.
You are also responsible for implementing best practices and providing feedback to team members through peer reviews.
You have more than 5 years of experience in SRE, Software Engineering or Operations Engineering roles and know your way around Linux and the Unix Shell.
You have strong programming skills with experience in Go, Java or Python.
You like to think about systems: edge cases, failure modes, behaviors and specific implementations. You have worked with Docker, Kubernetes, Helm, Terraform, Ansible or similar technologies, and you understand the purpose of configuration management systems such as Ansible (the one we use).
You are enthusiastic, have a go-for-it attitude and want to deliver quickly and iterate fast. When you see something broken, you can't help but fix it.
You like to collaborate and communicate asynchronously.
You are a team player and enjoy collaborating with cross-functional teams. You like to share your knowledge and experience, and you document everything so you don't need to learn the same thing twice.