Research Data Engineer II- Cabin
Job Description
The University of Rochester is seeking a Research Data Engineer II to design, implement, and sustain data engineering and analytics infrastructure that supports MRI research and neuroimaging workflows within the URMC CABIN environment. The role emphasizes building scalable data pipelines and integration frameworks for multimodal research data.
Location
Rochester, NY (onsite)
Compensation
USD 77,216 - 115,824 per year
Responsibilities
- Design, develop, and maintain data engineering and analytics infrastructure to support MRI research, neuroimaging workflows, and multimodal data analysis within the URMC CABIN environment.
- Create and sustain scalable data pipelines and software systems enabling researchers to collect, process, manage, and analyze structured and unstructured data produced by MRI scanners, imaging tools, and related platforms.
- Develop data integration frameworks to consolidate information from imaging systems, research databases, clinical systems, and analysis environments.
- Support the deployment and ongoing maintenance of research data infrastructure, including data repositories, data lakes, and workflow automation systems used in MRI and neuroscience studies.
- Collaborate with researchers, engineers, and IT staff to deliver reliable, scalable, and reproducible data and software solutions that enable scientific discovery and advanced imaging analysis workflows.
Requirements
- Bachelor's degree in data science, computer science, biomedical informatics, bioinformatics, statistics, engineering, or a related field (required).
- At least two years of experience in data engineering, research computing, or data-intensive scientific environments (required).
- An equivalent combination of education and related experience may be considered (required).
- Proficiency in SQL and at least one additional language such as Python, R, or Java (required).
- Experience designing and maintaining ETL pipelines and research data workflows (required).
Technologies
- SQL
- Python
- R
- Java
- Docker
- Singularity
- Git
- REDCap
- Electronic lab notebooks
- Biospecimen management systems
- DICOM
- NIfTI
- Linux
- High Performance Computing (HPC)
- Cloud infrastructure
- Data lakes
- Data repositories
Knowledge, Skills and Abilities
- Strong analytical and problem-solving abilities.
- Ability to design scalable and maintainable data architectures.
- Strong organizational and project coordination skills.
- Ability to work effectively in collaborative, matrix research environments.
- Excellent written and verbal communication for interactions with researchers and technical teams.
- Ability to present technical concepts clearly to both technical and non-technical stakeholders.
- Attention to detail and a commitment to high-quality data and software practices.