Data Engineer - Senior Associate
Job Description
Responsibilities
- Design and deploy data architectures and systems that enable efficient data processing and analytics
- Build and maintain data pipelines, integration, and transformation workflows to meet client needs
- Leverage Amazon Web Services (AWS) and Azure Data Factory to strengthen data engineering capabilities
- Apply data architecture development and database management expertise to optimize solutions
- Utilize Apache Airflow and Apache Hadoop for scalable processing and workflow orchestration
- Construct and oversee data lakes and data warehouses to support large-scale storage and retrieval
- Ensure data quality and validation through rigorous testing and performance tuning
- Collaborate with clients to capture data requirements and deliver actionable insights
- Employ the Databricks Unified Data Analytics Platform for advanced analytics and visualization
- Apply data security best practices to safeguard sensitive information and ensure compliance
- Use dimensional modeling and directed acyclic graphs (DAGs) to organize and process data efficiently
- Support the development of data strategies that drive business growth and informed decision-making
Requirements
- Bachelor's degree required
- Minimum of 2 years of experience
Technologies
- AWS
- Azure Data Factory
- Apache Airflow
- Apache Hadoop
- Databricks Unified Data Analytics Platform
Benefits
- Medical
- Dental
- Vision
- 401k
- Holiday pay
- Vacation
- Personal and family sick leave
What Sets You Apart
- Educational background in MIS, Computer and Information Science, Systems Engineering, Electrical Engineering, Chemical Engineering, Industrial Engineering, Mathematics, Statistics, or Mathematical Statistics is preferred
- Proficiency with data engineering platforms such as Databricks
- Experience with cloud platforms including AWS and Microsoft Azure
- Strong skills in data architecture development and data modeling
- Experience implementing data pipelines and data integration strategies
- Ability to navigate complex data environments using Apache Hadoop and Apache Airflow
- Demonstrates critical thinking to address data-related challenges