Big Data Engineer

  • Fospha
  • London, UK
  • 23 Feb, 2018
Full Time Engineering

Job Description

Experience in designing, building and maintaining large scale systems that take advantage of Big Data technologies Experience in batch and real-time data processing frameworks Experience in designing and implementing complex services, often made of separate APIs. Experience in software development and deployment, taking a test-driven approach to working with distributed systems and/or high performance computing, Experience in Python and experience in the creation of Python services using its backend frameworks Understanding of the multiple types of distributed data stores and their trade-offs. Proven ability to develop and ship robust machine learning solutions to solve business problems. Proven ability to clearly communicate findings to technical and non-technical audiences - verbal and written.


Python, SQL, API, ETL, ELT



We are looking for a Senior who has considerable experience in automation and productionisation of data science products; machine learning implementations, predictive models and real time pattern/trigger based actioning. Big Data technologies, such as Apache Spark, is a must.

Fospha is a London based big data software start-up, solving pan-industry business pains. Fospha's verticals include: Finance, Government, Insurance, Media, Online, Tech, and E-coms.

Fospha is part of the Blenheim Chalcot Group a venture incubator with combined sales in excess of £350m, £500m of assets under management. We are based in their 30,000sft venture hub offices, home to a buzzing mix of fast growth businesses. Think a large building packed with some of the smartest digital minds in London.

It's a unique environment with a support eco-system, that allows us to build a disruptive business at an unprecedented speed.

Fospha specialises in collecting, integrating, storing, activating, and operationalising business data. As part of the Data science team you will be involved in research and development of innovative data led solutions to business problems. The data science team,

Builds machine learning solutions for some of the largest internet era companies Builds IP for fledgling software companies where data is at the core of their product offering Builds rapid prototypes as proof of concept and proof of value add for Fospha clients Interfaces with Fospha software development teams to build repeatable and scalable software solutions Translates business problems into analytically tractable research problems, and develops the requirements and mathematical tools necessary to see solutions. Interfaces with key business stakeholders to bring data solutions into the heart of businesses, disrupting the status quo and driving growth. Integrates disparate data sets and performs deep analytics to unlock actionable insights and growth opportunities for Fospha clients. Consults on data science strategies and growth opportunities at c-suite level, both internally and externally.

As part of the Engineering team you will be involved in designing, building, and deploying scalable technologies with a focus on data science solutions.

As a big data engineer working with the data science team, you will be delivery focused and consistently challenged to architect and design solutions that can rapidly deploy data science models and scale analysis efforts while. The role will involve optimisation of models to suit available hardware and data requirements, and thus an understanding and willingness to learn the inner workings of data science developed machine learning algorithms and objectives is a must.

At the Fospha data science team you will be part of a team of mathematicians, physicists, statisticians, and engineers. We work on problems together and see them through from concept to operational workhorses; our solutions grow with us and we grow with our solutions.

Do you ask questions like, 'Can this algorithm be written as a sketch?', 'How do I get around the memory management here?', 'What's missing from the best of yesterday's technology that we need for today's problem?', 'What's the trade-off in moving this algorithm to a GPU?', 'What does the system look like for the problem, and is that the same system needed for the solution?', 'Why do we need Spark?', 'Do we need Secondary indexes, MapReduce, or both?', 'Is the best solution a graph model here?', 'How do I productionise data science models with firehose outputs?'

If so, we have a home for you in both the Data Science and Engineering Team


Designing, building and maintaining large scale systems to productionise data science products (e.g deploying Spark) Gather and process raw data at scale (including writing scripts, web scraping, calling APIs, write SQL queries, writing applications, etc.). Process unstructured data into a form suitable for analysis, utilizing custom applications and modern ETL/ELT. Understanding data science algorithmic design and their optimisation for large scale deployment.

About You

Delivery focused and experienced enough to push back on any technical status quo in order to deliver data science objectives. Deep expertise and understanding of performance constraints, memory management in your preferred language/environment We don't expect you to be an expert in all of these technologies but would anticipate varied experience and a curiosity for expanding your skills - Python, R, (No)SQL, C++, Java, ... Strong analytical, learning and problem solving skills with personal interest in subjects such as math/statistics, machine learning, AI and analytics. Database and data warehousing experience, design and methods for efficiently retrieving data - both in RDBMS and NoSQL environments Distributed computing, its advantages and disadvantages in Big Data analytical applications Experience building large scale data science pipelines end to end, understand and tune data science models in variety of languages (Python, R, Scala) Interest in working with technologies: Kafka, Spark, Hadoop, Hive, Impala, Ansible, Scala, Java, R, Python, Kylo, NiFi, Kinesis

Perks & Benefits

Flexible pension plan Flexible Working Maternity & Paternity Leave Sick Pay 25 days annual leave Bereavement Leave Bike to Work Scheme Discounted Café Communal area and meeting rooms Free work events Discounted Gym Membership Local food discounts Discounted wellbeing Yoga classes Discounted childcare in Hammersmith Discounted health and beauty 10 seconds from Hammersmith station Free fruit Free drinks (soft drinks and alcohol)