Sr. AI & Data Engineer–Trading Analytics
Job Description
Shell seeks a Senior AI & Data Engineer, Trading Analytics, in Houston, TX (hybrid) to design and deliver AI driven front office analytics and GenAI/agent based solutions for trading.
Responsibilities
- Design, implement, and deploy AI driven analytics for traders and analysts, including seasonality analyses, correlation studies, regression models, forecasting, and scenario modeling over market pricing and fundamentals data
- Collaborate with traders and analysts to convert vague business questions into well defined analytical problems and practical solutions
- Present analytical outputs and AI generated insights to commercial stakeholders in a concise, actionable format
- Build and maintain scalable, reusable data pipelines on Databricks using PySpark/Spark, SQL, Delta Lake, and Unity Catalog
- Support ingestion, modelling, and transformation of large scale time series pricing and fundamentals datasets
- Optimize pipelines for performance, reliability, and cost efficiency in line with platform and data governance standards
- Develop GenAI and agent based solutions to support trading analytics, including:
- Retrieval Augmented Generation (RAG)
- Prompt engineering
- Agent orchestration using frameworks such as LangGraph
- Tool calling and guardrails
- Integrate LLM based workflows with structured trading and market data to augment analysis, insight generation, and decision support
- Prototype solutions quickly, gather user feedback, and harden selected use cases for production deployment
- Contribute to production ready analytics and AI solutions with testing, documentation, versioning, and basic observability
- Adhere to CI/CD and DevOps practices, including Git based workflows and automated testing
- Support governance requirements such as PII handling, data lineage, and auditability in line with Trading & Supply standards
Requirements
- Must have legal authorization to work in the US on a full-time basis for anyone other than the current employer
- Bachelor’s degree or equivalent relevant years of experience
- At least 10 years of relevant experience
- Hands on experience with Databricks and/or Spark (PySpark, SQL, Delta Lake; Unity Catalog desirable)
- Proven data engineering skills, including pipeline development, data modelling, and performance optimization
- Strong foundation in statistics, econometrics, or data science, with experience applying these techniques to time series or market style datasets
- Practical experience building or contributing to LLM based solutions, including prompt engineering and retrieval based approaches
- Familiarity with GenAI frameworks and tooling (e.g., LangGraph or similar orchestration patterns)
- Experience working in collaborative engineering teams using Git and CI/CD pipelines
- Strong communication skills and the ability to work directly with analysts, traders, and other business stakeholders
Technologies
- Databricks
- PySpark
- Spark
- SQL
- Delta Lake
- Unity Catalog
- LangGraph
Benefits
- Medical coverage
- Dental coverage
- Vision coverage
- Life Insurance
- Business Travel Accident Insurance
- Occupational Accidental Death Benefit
- Company pension plan
- 401(k) plan
- Paid vacation time (up to 6 weeks)
- Paid holidays (up to 11)
- Parental leave (16 weeks birthing; 8 weeks non-birthing)
- Short-term disability leave (up to 26 weeks at 100% or 50%)
- Long-Term Disability insurance
- Financial reimbursement for adoption, wellness, education, and personal learning expenses
- Discretionary long-term incentives
Additional Preferred Qualifications
- Exposure to commodity or financial trading environments
- Understanding of market fundamentals, supply-demand dynamics, or risk concepts
- Experience with MLflow, feature stores, or vector databases
- Familiarity with working in regulated or risk-sensitive environments