ProPharma Logo

ProPharma

Senior Bioinformatics Data Engineer (Consultant)

Posted 4 Days Ago
Remote
Hiring Remotely in United States
Senior level
Remote
Hiring Remotely in United States
Senior level
The Senior Bioinformatics Data Engineer will build data pipelines, implement clinical data ingestion, and optimize platform infrastructure while ensuring data quality and compliance with legal standards.
The summary above was generated by AI

For the past 25 years, ProPharma has improved the health and wellness of patients by providing advice and expertise that empowers biotech, med device, and pharmaceutical organizations of all sizes to confidently advance scientific breakthroughs and introduce new therapies. ProPharma partners with its clients through an advise-build-operate model across the complete product lifecycle. With deep domain expertise in regulatory sciences, clinical research solutions, quality & compliance, pharmacovigilance, medical information, and R&D technology, ProPharma offers an end-to-end suite of fully customizable consulting solutions that de-risk and accelerate our partners’ most high-profile drug and device programs.

Key Responsibilities
  • Build and maintain Dagster-orchestrated ingestion pipelines for genomics vendors (Caris, Predicine, Tempus, Olink, CellCarta), including IO managers, Iceberg writers, and row-level accounting.
  • Develop and harden dbt Silver-to-Gold transformations: real-data test coverage, store-failures patterns, staging/intermediate/mart models, and macro consolidation.
  • Implement clinical data ingestion paths (SDTM and ADaM), reconciliation logic, and subject-dimension routing.
  • Deliver platform infrastructure: FastAPI endpoints, CI/CD pipelines, containerized deployments, observability instrumentation, and Redshift performance tuning.
  • Extract transformation rules from legacy R and PySpark code and reconcile against new platform implementations.
  • Identify repetitive processes and convert them into automated workflows, guardrails, or reusable tooling.
  • Participate in adversarial design and code reviews, identifying edge cases and pushing back on suboptimal patterns.
  • Collaborate with the lead engineer on design decisions and jointly own delivery velocity through paired working sessions and PR reviews.
  • Ensure all work meets reproducibility standards: CI on every PR, automated tests, no ad-hoc notebook-based production processes.
Minimum Qualifications
  • AI-native engineering practice: demonstrated experience building systems and workflows around AI coding agents (Claude Code, Cursor, Codex, or equivalent) - not just prompting them. You recognize when a repeated process should become an automated pipeline, when agent output needs guardrails, and when to build infrastructure that makes future work faster. Surface-level tool usage is insufficient.
  • Education: Bachelor's or master's degree in computer science, Data Engineering, Bioinformatics, or related field.
  • Experience: 5+ years of professional experience in data engineering with shipped production pipelines on AWS (S3, ECS/Fargate, Redshift or equivalent MPP).
  • Strong proficiency in Python and SQL with working knowledge of modern data engineering libraries.
  • Advanced proficiency with dbt and a workflow orchestration tool (Dagster, Airflow, or Prefect).
  • Data quality instinct: track record of catching silent failures, questioning data correctness assumptions, and noticing lossy joins or incomplete deliveries.
  • Solid understanding of lakehouse architecture patterns, ETL processes, and schema design for complex multi-modal datasets.
  • Ability to handle PHI-adjacent clinical data under Incyte's contractor policy (background check, compliance training, VPN access).
  • Willingness to work within legacy codebases (R, PySpark) to extract business rules and validate new implementations.
  • Excellent communication skills and ability to work in an embedded pair model with tight feedback loops.
Preferred Qualifications
  • Direct experience with Apache Iceberg, AWS Glue Catalog, or lakehouse table formats.
  • Comfort reading genomic data (VAF, HGVS nomenclature, VCFs, CNV/fusion semantics) or demonstrated ability to ramp on unfamiliar scientific domains quickly.
  • Familiarity with clinical data standards including SDTM, ADaM, and CDISC.
  • Pharma, clinical research, or life sciences background.
  • Experience with containerization (Docker/ECS) and infrastructure-as-code (CloudFormation).
  • Proficiency in R for interoperability with bioinformatics teams.

We celebrate our differences and strive to create a workplace where each person can be their authentic self. We are committed to diversity, equity, and inclusion. Employees are encouraged to unleash their innovative, collaborative, and entrepreneurial spirits. With a holistic approach as an Equal Opportunity Employer, we provide a safe space where all employees feel empowered to succeed.

All applications to roles at ProPharma are personally reviewed by a member of our recruitment team. We do not rely on AI screening tools to support our hiring process. You will always receive an outcome to your application so that you have an answer from us - whether you're successful or not.

Whilst ProPharma supports remote working, we also recognise the value that comes from in person collaboration. As such, we encourage any new hires that are based within a reasonably short commute of one of our offices to work on a hybrid basis and spend some time working from that office location, as agreed with your manager. All applications will be treated on their own merit and candidates will not be at any advantage or disadvantage based on their proximity to an office.

***ProPharma Group does not accept unsolicited resumes from recruiters/third parties. Please, no phone calls or emails to anyone regarding this posting.***

Similar Jobs

10 Hours Ago
In-Office or Remote
Chicago, IL, USA
145K-260K Annually
Expert/Leader
145K-260K Annually
Expert/Leader
Artificial Intelligence • Cloud • Enterprise Web • Information Technology • Software • Analytics • Business Intelligence
The Enterprise Account Executive drives sales in a new market, develops strategic account plans, engages with C-Suite executives, and maintains client relationships, ensuring customer needs are met while exceeding sales quotas.
Top Skills: Crm SoftwareMicrosoft Office SuiteSales ToolsSalesforce
11 Hours Ago
Easy Apply
Remote or Hybrid
Easy Apply
119K-170K Annually
Senior level
119K-170K Annually
Senior level
Cloud • Information Technology • Security • Software • Cybersecurity
The Senior Product Manager will lead data classification capabilities in a Data Security platform, overseeing product strategy and collaborating with various teams to drive accuracy and compliance.
Top Skills: AIData ClassificationData PrivacyData SecurityLarge Language ModelsMachine LearningNlp
11 Hours Ago
Remote
United States
74K-110K Annually
Senior level
74K-110K Annually
Senior level
Beauty • Robotics • Design • Appliances • Manufacturing
The Senior Consumer Insights Analyst leads consumer insights within cross-functional teams by managing research processes, synthesizing data, and delivering actionable findings to influence product development and strategy.
Top Skills: Consumer ResearchData Analysis

What you need to know about the Chicago Tech Scene

With vibrant neighborhoods, great food and more affordable housing than either coast, Chicago might be the most liveable major tech hub. It is the birthplace of modern commodities and futures trading, a national hub for logistics and commerce, and home to the American Medical Association and the American Bar Association. This diverse blend of industry influences has helped Chicago emerge as a major player in verticals like fintech, biotechnology, legal tech, e-commerce and logistics technology. It’s also a major hiring center for tech companies on both coasts.

Key Facts About Chicago Tech

  • Number of Tech Workers: 245,800; 5.2% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: McDonald’s, John Deere, Boeing, Morningstar
  • Key Industries: Artificial intelligence, biotechnology, fintech, software, logistics technology
  • Funding Landscape: $2.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Pritzker Group Venture Capital, Arch Venture Partners, MATH Venture Partners, Jump Capital, Hyde Park Venture Partners
  • Research Centers and Universities: Northwestern University, University of Chicago, University of Illinois Urbana-Champaign, Illinois Institute of Technology, Argonne National Laboratory, Fermi National Accelerator Laboratory

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account