Senior Data Engineer (AI / ML)
Description
Our client (a Multinational Pharmaceutical) is looking for a Data Engineer.
Mission
Advanced Analytics and AI are high on the agenda at our client and they are looking to strengthen the internal team of AI experts with a particular focus on sales & marketing. In this context the client is looking for an outstanding data engineer with strong Python & Spark skills to contribute to the development of analytics workflows focused on insights generation, prescriptive analytics and decision support apps.
Responsibilities
-
Develop and operate data pipelines processing large, complex datasets as input for analytics and machine learning;
-
Help to define the analytical scope and data for projects, including investigating data sources, designing new features and data integration flows;
-
Identify, design, and implement internal process improvements: automating manual processes, optimizing data delivery, re-designing infrastructure for greater scalability, etc;
-
Create data tools for analytics and data scientist team members that assist them in building and optimizing their results;
-
Utilizing a diverse array of technologies and data science toolsets as needed, primarily Python, Spark and Pandas, but also Jupyter, Denodo, Azure ML, Azure DevOps, Docker, Databricks, GIT, SQL, ...;
-
Communicate ideas, approaches and results with peers and stakeholders.
​​
Skills / Requirements
​
-
Mastery of Python, Spark and Pandas to create ETL pipelines for data scientists to use; knowledge of one or more data pipelines frameworks is a plus;
-
At least 3 years of intensive hands-on experience as a full-stack Python data engineer: Python, Spark, Pandas, NumPy, SciPy, visualization (matplotlib), machine learning (scikit- learn), data pipeline orchestration (e.g. kedro);
-
Good knowledge and experience with versioning systems (GIT);
-
Good knowledge and experience with databases;
-
Advanced degree in a relevant discipline such as: Statistics, Applied Mathematics, Operations Research/Optimization, Computer Science, Computational/Theoretical Physics, Data Science/visualization, Machine Learning, Electrical/Computer Engineering or Health Sciences (e.g. Bioengineering / Bioinformatics) ;
-
Experience in extracting, cleaning, preparing and modeling data. Experience with command-line scripting, data structures, and algorithms;
-
Ability to work across structured, semi-structured, and unstructured data;
-
Strong presentation and communication skills towards peer data scientists and non-technical stakeholders;
-
Ability to work individually and in teams (agile);
-
Experience with the healthcare / pharmaceutical industry is a plus;
-
Experience with sales & marketing analytics is a plus.
Additional Information
-
Hours per week: Full time
-
Duration of the contract: 6 months (followed by extensions)
-
Start date: ASAP
-
Location: full remote
​​​
Do you want to apply for this job ? Let us know and send your CV to hello@akindra.ro
​