
Senior Data Engineer - Digital Pathology (Remote)
Quick Summary
Mayo Clinic is top-ranked in more specialties than any other care provider according to U.S. News & World Report. We prioritize patient needs while investing in our employees through competitive compensation, comprehensive benefit plans, and continuous education and advancement opportunities, ensuring a long and successful career.
Benefits Highlights
- Medical: Multiple plan options.
- Dental: Delta Dental or reimbursement account for flexible coverage.
- Vision: Affordable plan with national network.
- Pre-Tax Savings: HSA and FSAs for eligible expenses.
- Retirement: Competitive retirement package to secure your future.
Responsibilities
Join the Digital Biology team, the advanced technology group for Mayo Clinic Digital Pathology. We seek a Senior Data Engineer to drive the technical vision for our shared engineering pod. This role involves building, deploying, and optimizing scalable, multimodal data pipelines (pathology, -omics, imaging) that feed our biological foundation models and AI Virtual Cells.
You will collaborate directly with AI pods and bioinformaticians, taking ownership of data reliability and velocity. You will transform complex raw biological information into the high-quality training assets essential for developing life-changing diagnostic tools.
Core duties include developing and deploying data pipelines, integrations, and transformations to support analytics and machine learning applications within an assigned product team. This requires using open-source programming languages and vended software to achieve the desired design functionality. The role demands independent judgment and maintaining expertise in current solutions, coding languages, and tools. You may also provide consultative services to departments/divisions and leadership committees.
Demonstrated experience in designing, building, and installing data systems applied to the Department of Data & Analytics technology framework is required. You will partner with product owners and Analytics and Machine Learning delivery teams to:
- Identify and retrieve data.
- Conduct exploratory analysis.
- Pipeline and transform data.
- Identify and visualize trends.
- Build and validate analytical models.
- Translate assessments into actionable insights.
Qualifications
This role requires a Bachelor's degree in engineering, mathematics, computer science, information technology, health science, or another analytical/quantitative field and a minimum of five years of professional or research experience in data visualization, data engineering, or analytical modeling techniques; OR an Associate's degree in a relevant field and a minimum of seven years of professional or research experience in these areas. In-depth business or practice knowledge will also be considered.
The incumbent must manage a varied workload with multiple priorities, stay current on healthcare trends, and adapt to enterprise changes. Required skills include:
- Interpersonal and time management skills.
- Demonstrated experience working on cross-functional teams.
- Strong analytical skills and the ability to identify and recommend solutions.
- Commitment to customer service.
- Excellent verbal and written communication skills, attention to detail, and a high capacity for learning and problem resolution.
Required Technical Expertise
- Advanced experience in SQL.
- Strong experience in programming and scripting languages such as Python, JavaScript, PHP, C++, or Java, and in API integration.
- Experience in hybrid data processing methods (batch and streaming) with tools such as Apache Spark, Hive, Pig, and Kafka.
- Experience with big data, statistics, and machine learning.
- Ability to navigate Linux and Windows operating systems.
Preferred Technical Expertise
- Knowledge of workflow scheduling (Apache Airflow, Google Cloud Composer).
- Containerization and infrastructure as code (Kubernetes, Docker).
- CI/CD (Jenkins, GitHub Actions).
- Experience in DataOps/DevOps and agile methodologies.
- Experience with hybrid data virtualization such as Denodo.
- Working knowledge of Tableau, Power BI, SAS, ThoughtSpot, Dash, D3, React, Snowflake, SSIS, and Google BigQuery.
- Google Cloud Platform (GCP) certification is preferred.
Preferred Candidate Experience
- SQL
- Python
- Google Cloud Dataflow (Apache Beam)
- Google Cloud BigQuery
- GCP Professional Data Engineer Certification
Compensation and Details
- Exemption Status: Exempt
- Compensation Detail: $138,257.60 - $200,512.00/year. Education, experience, and tenure may be considered along with internal equity when job offers are extended.
- Benefits Eligible: Yes
- Schedule: Full Time (80 Hours/Pay Period)
- Schedule Details: M-F, daytime hours. 100% remote role; the employee must live within the US.
- Weekend Schedule: NA
- International Assignment: No
Site Description
Mayo Clinic locations include three major campuses in Phoenix/Scottsdale, Arizona; Jacksonville, Florida; and Rochester, Minnesota, as well as Mayo Clinic Health System campuses throughout Midwestern communities and international locations. Each location offers a unique environment for employees to thrive.
Equal Opportunity
All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, gender identity, sexual orientation, national origin, protected veteran status, or disability status. Learn more about "EOE is the Law". Mayo Clinic participates in E-Verify and may provide the Social Security Administration and, if necessary, the Department of Homeland Security with information from each new employee's Form I-9 to confirm work authorization.
Recruiter: Laura Percival

