Overview

For our partner, a fast-growing company with multiple software and infrastructure projects running in parallel and great teams based in Romania, France, Switzerland, Spain and the UK, we are looking for a DevOps Engineer.

Their systems are complex, and the high-load nature of the business brings many challenges. They write object-oriented, unit- and functionally-tested code and deploy to build, test, preprod and production environments, using Jenkins for continuous integration. They deploy to production at least weekly. They use Zabbix, Graylog2, Site24x7 and some internally developed tools to monitor production, and have a "fix first, fix once" policy: any error that occurs is fixed, and the solution is automated and monitored so that it never has to be worried about again.

 

What we offer:

  • Office Location: Aurel Vlaicu, near Promenada Mall
  • Modern office environment with all the work and play amenities
  • An amazing platform for learning and development.
  • Freedom to choose testing technologies or develop new ones.
  • Freedom to execute numerous Social Engineering scenarios.
  • Flexible working schedule.

 

What the role will involve:

  • Monitoring production using our monitoring systems (Zabbix, Graylog, etc.)
  • Operational analysis (learning and understanding the business domain, and monitoring business domain metrics)
  • Communicating with our internal users about issues they are seeing with our production systems, understanding their concerns, diagnosing and replicating the issues.
  • Analysing issues across multiple application and infrastructure boundaries by investigating logs, network traffic, server performance and configuration
  • Replicating issues with these systems, and handing over the resolution to our development & systems administration teams
  • On-call support (two weeks), which involves acting as Level 2 support for our production systems both during and outside working hours, receiving push notifications and alerts from our monitoring solutions, and responding to queries on official communication channels, both synchronous (instant messaging) and asynchronous (email), while respecting agreed SLAs

Required traits and experience:

  • A naturally helpful and communicative person who loves solving problems
  • Very good analytical skills
  • Linux power user: very good systems administration knowledge of, and work experience with, Linux (Debian/CentOS), MySQL, bash, networking and firewalls
  • Knowledge of monitoring systems (mainly Zabbix), including concepts, operations, maintenance, workflows, reports, analysis, improvements, automation and templating, for both state and trend monitoring
  • Ability to respond professionally in crisis situations while maintaining transparency
  • Experience with at least one version control system (preferably Git)
  • Experience with continuous integration (preferably Jenkins)
  • Automation tools, e.g. Ansible, SaltStack
  • Virtualisation solutions such as Vagrant, XenServer
  • Knowledge of AWS
  • Proficient in English

Preferred, though not required, experience:

  • Python, PHP (scripting) experience
  • Experience with NoSQL systems like Redis, Couchbase, Cassandra
  • Experience with High Availability setups (keepalived, heartbeat)