About this role
Our client is seeking a skilled Data Engineer to join their team in Bangalore on a contract basis. The ideal candidate will be responsible for designing, developing, and maintaining scalable data pipelines using PySpark and distributed computing frameworks.
Key Responsibilities:
- Design and implement ETL processes to integrate data from structured and unstructured sources into cloud data warehouses.
- Work across Azure or AWS cloud ecosystems to deploy and manage big data workflows.
- Optimize the performance of SQL queries and develop stored procedures for data transformation and analytics.
- Collaborate with Data Scientists and Analysts to ensure data availability and quality.
- Monitor and troubleshoot data pipeline performance and reliability.
Required Skills & Qualifications:
- Proficiency in PySpark and experience with distributed computing frameworks.
- Strong knowledge of cloud platforms, specifically Azure or AWS.
- Experience with SQL and database management.
- Familiarity with data warehousing concepts and ETL tools.
- Excellent problem-solving skills and ability to work collaboratively in a team environment.
Experience:
- 5-8 years of experience in data engineering or a related field.
What we offer:
- Opportunity to work on innovative projects in a dynamic environment.
- Collaborative team culture with a focus on professional growth and development.
This role is managed by AI-First Talent on behalf of our client. Your application is reviewed directly by our talent team.