Data Engineer
Job Description
Based in Burlington, Vermont, this onsite data engineering role centers on designing, building, and maintaining scalable data pipelines and architectures that support data-driven decision making. The position requires the ability to obtain a security clearance.
Overview
We are seeking a data engineer to join our team and play a central role in turning data into actionable insights. You will integrate diverse data sources, optimize data workflows, and maintain the pipelines and architectures that enable large-scale and advanced analytics. This role suits professionals who thrive in fast-paced environments and are passionate about data.
Responsibilities
- Design, implement, and optimize reliable ETL pipelines using Informatica, Talend, and shell scripting to move data smoothly across systems.
- Create and maintain scalable data warehouses and data lakes on platforms such as Azure Data Lake and the Hadoop ecosystem to support large-scale analytics.
- Develop and manage complex SQL databases including Microsoft SQL Server and Oracle, ensuring performance, security, and reliability.
- Collaborate with cross-functional teams to translate data requirements into technical solutions using Python, Java, Bash (Unix shell), and RESTful APIs.
- Leverage big data technologies like Apache Hive, Spark, and Hadoop to process large datasets efficiently while upholding data quality and consistency.
- Integrate linked data sources to enrich datasets for comprehensive analysis, and use Looker for visualization and reporting.
- Support model training and analysis activities by delivering clean, well-structured datasets, and contribute to iterative improvements of data models through testing.
- Participate in Agile development cycles to deliver high-quality solutions swiftly and document processes for ongoing maintenance.
Requirements
- Eligibility to obtain a security clearance is required.
- Proven experience designing and implementing large-scale data pipelines with AWS services such as AWS Glue or S3; familiarity with Azure Data Lake is a plus.
- Strong programming skills in Python, Java, VBA, or Bash (Unix shell scripting) for automation and customization tasks.
- Extensive knowledge of SQL databases, including Microsoft SQL Server and Oracle, and experience with data warehouse concepts.
- Hands-on experience with big data frameworks such as Spark and Hadoop ecosystem components (HDFS, Hive).
- Proficiency with ETL tools like Talend or Informatica; understanding of RESTful API integration for data exchange.
- Familiarity with analytics platforms such as Looker for creating dashboards and reports that drive business insights.
- Ability to design efficient database schemas and optimize query performance; strong analytical skills to interpret complex datasets.
- Knowledge of model training techniques for predictive analytics; experience with Agile development methodologies is preferred.
Technologies
- Informatica
- Talend
- Shell Scripting
- Azure Data Lake
- Hadoop
- HDFS
- Hive
- Spark
- Microsoft SQL Server
- Oracle
- Python
- Java
- VBA
- Bash (Unix shell)
- RESTful APIs
- Looker
- AWS Glue
- S3
- AWS
Benefits
- 401(k)
- Dental insurance
- Health insurance
- Paid time off
- Tuition reimbursement
- Vision insurance
Pay
USD 114,880.73 - 138,350.98 per year
Work Location
Burlington, VT (onsite)