Job description
Project Role : Data Engineer
Project Role Description : Design, develop and maintain data solutions for data generation, collection, and processing. Create data pipelines, ensure data quality, and implement ETL (extract, transform and load) processes to migrate and deploy data across systems.
Must have skills : PySpark
Good to have skills : Apache Spark
Minimum 3 year(s) of experience is required
Educational Qualification : 15 years full time education
Summary:
As a Data Engineer, a typical day involves designing, developing, and maintaining comprehensive data solutions that support the generation, collection, and processing of data. This role requires creating efficient data pipelines to facilitate smooth data flow and ensuring the integrity and quality of data throughout its lifecycle. The position also involves implementing extract, transform, and load processes to enable seamless migration and deployment of data across various systems, contributing to the overall data infrastructure and operational efficiency within the organization.
Roles & Responsibilities:
- Expected to perform independently and become an SME.
- Required active participation/contribution in team discussions.
- Contribute in providing solutions to work related problems.
- Collaborate with cross-functional teams to understand data requirements and deliver scalable solutions.
- Monitor and optimize data workflows to improve performance and reliability.
- Document processes and maintain clear communication regarding data pipeline status and issues.
- Support junior team members by sharing knowledge and assisting with technical challenges.
Professional & Technical Skills:
- Must To Have Skills: Proficiency in PySpark, Apache Spark.
- Experience in building and managing large-scale data processing systems.
- Strong knowledge of data pipeline architecture and ETL frameworks.
- Ability to troubleshoot and resolve data quality and performance issues.
- Familiarity with distributed computing concepts and big data technologies.
Additional Information:
- The candidate should have minimum 3 years of experience in PySpark.
- This position is based at our Mumbai office.
- A 15 years full time education is required.
This job post has been translated by AI and may contain minor differences or errors.