Senior Scientific Data Engineer
Job Description
Senior Scientific Data Engineer at Lawrence Berkeley National Laboratory in the San Francisco Bay Area, focusing on core scientific data systems, data workflows, and AI-ready data pipelines.
Responsibilities
- Contribute to the design, development, and enhancement of JGI’s core scientific data and compute capabilities within a skilled engineering team.
- Architect, build, deploy, and operate production automated systems, APIs, and workflows for genomic data movement, metadata management, job orchestration, data access, and large‑scale scientific computing.
- Identify technical issues and integration gaps, driving continuous improvements across systems and processes.
- Increase reliability, scalability, observability, interoperability, and maintainability of shared production data platforms while enabling sustainable operations.
- Promote engineering best practices through technical reviews, knowledge sharing, and team process optimization.
Requirements
- Bachelor’s degree (or equivalent knowledge/training) in Computer Science or related field, plus at least 8 years of relevant experience delivering production software and data systems that support metadata management, workflow orchestration, data lifecycle operations, and broad user data access, or an equivalent combination of education and experience.
- Strong foundation in software and data engineering for data‑intensive distributed systems, including system design, concurrency, performance, and testing.
- Experience with database and data storage technologies, including relational databases, object storage, and systems handling structured, semi‑structured, and large‑scale data.
- Hands‑on experience with data engineering and event‑driven technologies such as Airflow or Kafka.
- Experience using AI coding assistants (for example Claude Code, Codex, Cursor) with demonstrated ability to review and validate generated software for correctness, quality, security, maintainability, and production suitability.
- Proficiency in Python and experience with one or more additional programming languages.
- Excellent communication skills, including the ability to present complex technical information to internal teams and stakeholders.
- Proven ability to collaborate effectively with users, stakeholders, and engineering teams to deliver results in a multidisciplinary environment.
Technologies
- Python
- Airflow
- Kafka
- Claude Code
- Codex
- Cursor
- WDL
- Nextflow
Benefits
- Comprehensive health coverage and retirement options including pension or 401K‑style plans
- Supportive culture with focus on belonging and team investment
- Winter Holiday Shutdown each year
- Parental bonding leave for both mothers and fathers
- Pet insurance
- Relocation assistance
Desired Qualifications
- Master’s degree (or equivalent knowledge) in Computer Science or related field
- Experience with genomics, bioinformatics, and/or next‑generation sequencing data
- Proficiency with scientific workflow languages or systems such as WDL and Nextflow
- Experience with full‑stack or front‑end application development
- Background working in high performance computing environments
Additional Information
- Application deadline: Priority consideration for resumes and cover letters by June 1, 2026; applications accepted until the posting is removed
- Appointment type: Full‑time, exempt from overtime, 2‑year term with benefits eligibility; potential for extension or conversion to a Career appointment based on performance, funds, and needs
- Salary range: Budgeted range of $139,440 to $174,312 annually; aligns with the broader $139,440 to $235,308 range for this job code; final salary depends on qualifications and experience
- Background check: Required; convictions reviewed for relevance to responsibilities; a history does not automatically disqualify
- Work modality: Hybrid schedule (remote and on‑site) at 1 Cyclotron Road, Berkeley, CA; must reside within 150 miles; flexible telework possibilities rare
- Relocation assistance: Eligible
- Work authorization: Must be legally authorized to work in the United States; no visa sponsorship provided
- Misconduct disclosure: Finalist must disclose any relevant misconduct decisions within the last seven years