EngineerJobs.io
← Back to all jobs

Job Description

Within the OIS/CXI Analytics group, this Data Engineer role centers on building scalable data pipelines and ML-ready data infrastructure to empower AI-driven operational insights across Amazon’s fulfillment and operations network. The position emphasizes production-grade ETL/ELT development, feature engineering workflows, data governance, GenAI-enabled reporting, and close collaboration with ML engineers, data scientists, and stakeholders to deliver reliable, data-driven decisions.

Details

  • Location: Nashville, TN (onsite)
  • Salary: USD 125,500 - 169,800 per year
  • Minimum experience: 3 years
  • Education: Bachelor's degree or higher

Responsibilities

  • Architect and maintain production grade ETL/ELT pipelines and large-scale data infrastructure to support OTS operational intelligence
  • Develop feature engineering workflows and ML-ready data pipelines to enable data science experimentation and production model serving
  • Contribute to data governance and quality standards across analytical and ML data products
  • Assist in implementing GenAI solutions for automated reporting, diagnostics, predictive and prescriptive analytics
  • Construct and manage semantic layers and dashboard data models that inform global operations decisions
  • Collaborate with Program Managers, BI teams, ML engineers, data scientists, and operational stakeholders to prioritize work aligned with OTS goals
  • Adhere to and contribute to data engineering best practices, including code reviews, testing, monitoring, and documentation

Requirements

  • 3+ years of data engineering experience
  • 3+ years designing and operating large-scale BI data structures with data modeling experience
  • Experience in data modeling, data warehousing, and building ETL pipelines
  • Hands-on with AWS technologies such as Redshift, S3, AWS Glue, EMR, Kinesis, Firehose, Lambda, and IAM roles/permissions
  • Background in data warehouse architectures, data modeling, infrastructure components, ETL/ELT and reporting/analytic tools, data structures, and practical SQL coding
  • Bachelor's degree or higher in computer science, engineering, or related fields, or equivalent experience building and maintaining data flows
  • Proficiency in Python and SQL; experience with PySpark or Apache Spark
  • Experience with infrastructure-as-code (CDK, CloudFormation) and CI/CD pipelines for data and ML systems
  • Experience with data modeling and designing relational and non-relational databases

Technologies

  • Python
  • SQL
  • PySpark
  • Apache Spark
  • Redshift
  • S3
  • AWS Glue
  • EMR
  • Kinesis
  • FireHose
  • Lambda
  • IAM
  • CDK
  • CloudFormation

Benefits

  • Medical, Dental, and Vision Coverage
  • Maternity and Parental Leave Options
  • Paid Time Off (PTO)
  • 401(k) Plan

Preferred Qualifications

  • Experience with non-relational databases and data stores such as object storage, document or key-value stores, graph databases, and column-family databases
  • Master's degree or higher in computer science, engineering, analytics, mathematics, statistics, IT or equivalent

Similar Jobs

Get Job Alerts

New jobs delivered to your inbox.