Mid-Level (2-5 Years)
For our partner, a fast-growing company with multiple software and infrastructure projects running in parallel, with great teams situated in Romania, France, Switzerland, Spain and the UK we are looking for Data Engineer.
The successful candidate will work with Data Analysts who build statistical models to predict the value we can expect from each hotel under various different scenarios. Your job is the help to take those predictions, combine them with other information we have, and translate those predictions into software which will define how much we bid for traffic, what traffic we process and how it is priced.
Help design, build, maintain and operate the data pipeline which takes input from different systems, feeds it through our Spark data processing engine, and outputs it to our data analysis platform in AWS Redshift.
Build rapid prototypes to enable quick feedback through our A: B test framework
Design, train and maintain machine learning models
Analyze performance and results of the statistical and machine learning models to feedback into new iterations of the models
Think of new ways to visualize our data in Tableau and come up with ideas for new sources of insight
We are not looking for someone who wants a well-defined specification – we are looking for someone who wants to work with the rest of the team to come up with ideas and create new models.
You will be working in an ever-changing, agile environment – our partners are constantly changing their models, and we need to react quickly to this.
Degree with strong analytical focus (Computer Science, Mathematics, Engineering, Statistics etc)
A passion for working with databases and large scale data processing systems
Fluent in SQL
Fluent in Python
Excellent problem-solving and analytical skills combined with the ability to explain concepts to both technical and non-technical audiences
Results/task orientated with excellent attention to detail
Happy working in a fast paced environment
Familiarity with python’s core big data/data science libraries: e.g. pandas, sci-kit-learn etc
Experience doing ETL and a solid understanding of database design
Experience with MySQL and PostgreSQL / Redshift
Experience working with Scala / Spark