Senior Data Engineer Full-time Job
1 week ago IT & Telecoms Dubai 45 views Reference: 34098Job Details
Roles & Responsibilities:
Data discovery, ingestion and features.
Timely delivery of data pipelines, features to support Data Scientist and Analytics Delivery lead in use case delivery.
Work with analytics delivery lead and Lead Data Engineer to understand the business needs and build impactful analytics solutions.
Contribute to cross-functional problem-solving sessions with your squad (data scientists specially) and provide tangible solutions using your expertise in data engineering and domain data knowledge.
Data onboarding and access process – work with business analysts and analytics delivery lead to manage stakeholders that are affected by changes.
Use state of the art technologies to acquire, ingest and transform big datasets.
Design and build high-performance, secure, and scalable data pipelines to support data science projects following software engineering best practices.
Map data to hypotheses, curate, wrangle, prepare data and feature engineering to be used in machine learning models
Design and develop the data and analytics platform selecting the right technologies for each problem at hand (big-data stack, SQL, no-SQL, etc.)
Build modular pipeline to construct features and modelling tables.
Build a sense of trust and rapport that creates a comfortable & effective workplace and experience in working as part of an agile squad.
Contribute to formulate data architecture strategy based on business goals and problem at hand.
Conduct knowledge sharing sessions internally within the CoE as well as externally in meetups, conferences etc.
Participates in and helps plan cultural initiatives for CoE.
Independently taken charge of delivery of workstream including testing and maintenance.
Collaborate with data engineers, UI/UX designers, programmers to deploy solutions at scale.
Required qualifications:
Degree in computer science, engineering, mathematics, or equivalent experience.
2+ years of previous commercial experience as a leader in a data-driven role.
2+ years of hands-on experience building data pipelines in production and ability to work across structured, semi-structured and unstructured data.
2+ years of experience in ML pipeline for streaming/batch workflow.
Ability to write clean, maintainable, and robust code in Python.
Understanding and expertise of software engineering concepts and best practices.
Knowledge of testing frameworks and libraries.
Experience with analytics (descriptive, predictive, EDA), feature engineer, algorithms, anomaly detection, data quality assessment and python visualization libraries - e.g. matplotlib, seaborn or other.
Comfortable with notebook and source code development - Jupyter, Pycharm/VScode.
Hands-on experience of technologies like Python, Spark/Pyspark, Hadoop/MapReduce/HIVE, Pandas etc.
Familiarity with query languages and database technologies, CI/CD, testing and validation of data and software.
Collaborative - able to build strong relations that enable robust debate, and resolve periodic disagreements regarding priorities.
Excellent interpersonal, and communication skills.
Ability to communicate effectively with technical and non-technical audience.
Ability to work under pressure with a solid sense for setting priorities.
Ability to lead technical work with strong sense of ownership.
Strong command of English language (both verbal and written)
Tech stack and activities that you would use and preform on a daily basis