Axle Informatics Logo

Axle Informatics

Data Platform Engineer

Posted 2 Days Ago
Easy Apply
Remote
Hiring Remotely in USA
125K-150K Annually
Mid level
Easy Apply
Remote
Hiring Remotely in USA
125K-150K Annually
Mid level
The Senior Data Architect will develop and maintain the core data infrastructure for health research, focusing on data pipelines, orchestration, and quality systems. Responsibilities include coding, data modeling, and supporting data ingestion and transformation processes.
The summary above was generated by AI

(ID: 2026-1524)

 

Axle is a bioscience and information technology company that offers advancements in translational research, biomedical informatics, and data science applications to research centers and healthcare organizations nationally and abroad. With experts in biomedical science, software engineering, and program management, we focus on developing and applying research tools and techniques to empower decision-making and accelerate research discoveries. We work with some of the top research organizations and facilities in the country including multiple institutes at the National Institutes of Health (NIH).


Benefits We Offer:

  • 100% Medical, Dental & Vision Coverage for Employees
  • Paid Time Off and Paid Holidays
  • 401K match up to 5%
  • Educational Benefits for Career Growth
  • Employee Referral Bonus
  • Flexible Spending Accounts:
    • Healthcare (FSA)
    • Parking Reimbursement Account (PRK)
    • Dependent Care Assistant Program (DCAP)
    • Transportation Reimbursement Account (TRN)

About the Mission
Join the team at the forefront of revolutionizing medical research in the United States. We are building and maintaining the foundational infrastructure of the National Clinical Cohort Collaborative (N3C)—the nation’s largest and most significant public repository of harmonized electronic health record (EHR) data.

What began as a critical response to the COVID-19 pandemic has evolved into a multi-disease, terabyte-scale data resource that enables researchers across the country to accelerate discovery and improve public health outcomes. The platform integrates EHRs, claims, registries, and other data sources in a secure, regulated environment to support thousands of scientists.

This role is an opportunity to contribute to the core data platform that makes this research possible.


The Role
We are seeking a mid-level Data Platform Engineer to help build and operate the core data infrastructure that powers large-scale, regulated healthcare and research datasets. This role is ideal for an engineer who has moved beyond “entry level,” understands how production systems behave, and wants to grow into owning complex pipelines, orchestration logic, and platform reliability.

You’ll work alongside senior engineers and informatics experts to design, implement, and maintain ingestion, transformation, orchestration, and data quality systems that are reliable, observable, and secure.


What You’ll Do

Build Production-Grade Data Systems

  • Write clean, modular, well-tested Python code for data pipelines and platform services.
  • Use decorators, context managers, and unit tests to ensure correctness and maintainability.
  • Contribute to shared libraries and reusable components across the platform.


Design and Maintain Data Models

  • Implement relational data models aligned with medallion architectures (bronze/silver/gold).
  • Support schema evolution and backward-compatible changes.
  • Work with modern table formats such as Apache Iceberg.

Data Orchestration & Ingestion

  • Build and maintain data workflows using Dagster (preferred) or Airflow.
  • Manage sensors, schedules, and complex job dependencies.
  • Implement ingestion pipelines using Airbyte or similar ELT tools.

Transformation & Data Quality

  • Implement idempotent transformation logic using SQLMesh/Tobiko (preferred) or dbt.
  • Add data quality checks and validation gates using frameworks like Great Expectations.
  • Partner with upstream and downstream users to diagnose and resolve data issues.


Containerization & CI/CD

  • Build, debug, and optimize Docker images for local and production environments.
  • Contribute to CI/CD pipelines supporting automated testing and deployment.
  • Follow modern Git workflows including branching strategies, pull requests, and code reviews.

Infrastructure, Cloud & Security

  • Read and modify infrastructure-as-code using Terraform.
  • Work with AWS primitives (S3, Lambda, Glue, Fargate), with a focus on portability and migration toward open-source, cloud-agnostic alternatives.
  • Apply least-privilege and identity-based access concepts (OIDC/IAM).
  • Operate comfortably within regulated environments (HIPAA, FedRAMP).

Documentation & Collaboration

  • Document data flows, system architecture, and operational procedures clearly.
  • Collaborate closely with senior engineers, informaticists, and project stakeholders.
  • Participate in design reviews and contribute ideas for improving platform reliability and scalability.

What You’ll Bring
Required

  • 2–4 years of experience in Data Engineering or Backend Software Engineering.
  • Strong proficiency in Python and SQL.
  • Solid understanding of relational theory and data modeling.
  • Experience working with orchestration tools (Dagster, Airflow, or similar).
  • Familiarity with containerization and Docker-based workflows.
  • Experience working with version control, CI/CD, and collaborative development practices.
  • Ability to write clear technical documentation.

Nice to Have

  • Experience with Iceberg, Airbyte, Great Expectations, SQLMesh, or dbt.
  • Prior work on regulated data platforms (healthcare, government, finance).
  • Interest in data platform architecture and long-term system evolution.

 


Disclaimer: The above description is meant to illustrate the general nature of work and level of effort being performed by individuals assigned to this position or job description. This is not restricted as a complete list of all skills, responsibilities, duties, and/or assignments required. Individuals may be required to perform duties outside of their position, job description or responsibilities as needed.

The diversity of Axle’s employees is a tremendous asset. We are firmly committed to providing equal opportunity in all aspects of employment and will not tolerate any illegal discrimination or harassment based on age, race, gender, religion, national origin, disability, marital status, covered veteran status, sexual orientation, status with respect to public assistance, and other characteristics protected under state, federal, or local law and to deter those who aid, abet, or induce discrimination or coerce others to discriminate.

Accessibility: If you need an accommodation as part of the employment process please contact: [email protected]

This role has a market-competitive salary with an anticipated base compensation range listed below. Actual salaries will vary depending on a candidate’s experience, qualifications, skills, and location.

#IND

Salary Range
$125,000$150,000 USD

Top Skills

Airflow
Aws,S3,Lambda,Glue,Fargate,Sqlmesh,Dbt,Great Expectations
Dagster
Docker
Python
SQL
Terraform

Similar Jobs

38 Minutes Ago
Remote
USA
129K-207K Annually
Senior level
129K-207K Annually
Senior level
Artificial Intelligence • Cloud • Fintech • Professional Services • Software • Analytics • Financial Services
Lead the design and execution of data solutions that enhance customer-facing applications and analytics, partnering with stakeholders to align data strategies with product features.
Top Skills: DbtKafkaPythonSnowflakeSQL
12 Hours Ago
Easy Apply
Remote
USA
Easy Apply
186K-219K Annually
Senior level
186K-219K Annually
Senior level
Artificial Intelligence • Blockchain • Fintech • Financial Services • Cryptocurrency • NFT • Web3
Design, build, and operate low-latency indexing and streaming services. Lead initiatives to improve latency and reliability and collaborate on defining data contracts for SDKs and platform primitives.
Top Skills: ClickhouseGoGrpcKafkaMongoDBRedisS3
12 Hours Ago
Easy Apply
Remote
USA
Easy Apply
186K-219K Annually
Senior level
186K-219K Annually
Senior level
Artificial Intelligence • Blockchain • Fintech • Financial Services • Cryptocurrency • NFT • Web3
The role involves developing and maintaining backend services for a blockchain platform, collaborating on integrations, troubleshooting challenges, and ensuring high-quality code.
Top Skills: APIsBlockchainGoRuby

What you need to know about the Chicago Tech Scene

With vibrant neighborhoods, great food and more affordable housing than either coast, Chicago might be the most liveable major tech hub. It is the birthplace of modern commodities and futures trading, a national hub for logistics and commerce, and home to the American Medical Association and the American Bar Association. This diverse blend of industry influences has helped Chicago emerge as a major player in verticals like fintech, biotechnology, legal tech, e-commerce and logistics technology. It’s also a major hiring center for tech companies on both coasts.

Key Facts About Chicago Tech

  • Number of Tech Workers: 245,800; 5.2% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: McDonald’s, John Deere, Boeing, Morningstar
  • Key Industries: Artificial intelligence, biotechnology, fintech, software, logistics technology
  • Funding Landscape: $2.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Pritzker Group Venture Capital, Arch Venture Partners, MATH Venture Partners, Jump Capital, Hyde Park Venture Partners
  • Research Centers and Universities: Northwestern University, University of Chicago, University of Illinois Urbana-Champaign, Illinois Institute of Technology, Argonne National Laboratory, Fermi National Accelerator Laboratory

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account