Worth AI Logo

Worth AI

Principal Data Engineer

Posted 17 Days Ago
In-Office or Remote
4 Locations
Expert/Leader
In-Office or Remote
4 Locations
Expert/Leader
The Principal Data Engineer will own the data architecture, design and build scalable pipelines, enforce data quality, and lead collaboration across teams to enable BI and ML.
The summary above was generated by AI

Worth AI, a leader in the computer software industry, is looking for a talented and experienced Principal Data Engineer to join their innovative team. At Worth AI, we are on a mission to revolutionize decision-making with the power of artificial intelligence while fostering an environment of collaboration, and adaptability, aiming to make a meaningful impact in the tech landscape.. Our team values include extreme ownership, one team and creating reaving fans both for our employees and customers.

Worth is looking for a Principal Data Engineer to own the company-wide data architecture and platform. Design and scale reliable batch/streaming pipelines, institute data quality and governance, and enable analytics/ML with secure, cost-efficient systems. Partner with engineering, product, analytics, and security to turn business needs into durable data products.

Responsibilities

What you will do:

  • Architecture & Strategy
    • Define end-to-end data architecture (lake/lakehouse/warehouse, batch/streaming, CDC, metadata).
    • Set standards for schemas, contracts, orchestration, storage layers, and semantic/metrics models.
    • Publish roadmaps, ADRs/RFCs, and “north star” target states; guide build vs. buy decisions.
  • Platform & Pipelines
    • Design and build scalable, observable ELT/ETL and event pipelines.
    • Establish ingestion patterns (CDC, file, API, message bus) and schema-evolution policies.
    • Provide self-service tooling for analysts/scientists (dbt, notebooks, catalogs, feature stores).
    • Ensure workflow reliability (idempotency, retries, backfills, SLAs).
  • Data Quality & Governance
    • Define dataset SLAs/SLOs, freshness, lineage, and data certification tiers.
    • Enforce contracts and validation tests; deploy anomaly detection and incident runbooks.
    • Partner with governance on cataloging, PII handling, retention, and access policies.
  • Reliability, Performance & Cost
    • Lead capacity planning, partitioning/clustering, and query optimization.
    • Introduce SRE-style practices for data (error budgets, postmortems).
    • Drive FinOps for storage/compute; monitor and reduce cost per TB/query/job.
  • Security & Compliance
    • Implement encryption, tokenization, and row/column-level security; manage secrets and audits.
    • Align with SOC 2 and privacy regulations (e.g., GDPR/CCPA; HIPAA if applicable).
  • ML & Analytics Enablement
    • Deliver versioned, documented datasets/features for BI and ML.
    • Operationalize training/serving data flows, drift signals, and feature-store governance.
    • Build and maintain the semantic layer and metrics consistency for experimentation/BI.
  • Leadership & Collaboration
    • Provide technical leadership across squads; mentor senior/staff engineers.
    • Run design reviews and drive consensus on complex trade-offs.
    • Translate business goals into data products with product/analytics leaders.

Requirements
    • 10+ years in data engineering (including 3+ years as staff/principal or equivalent scope).
    • Proven leadership of company-wide data architecture and platform initiatives.
    • Deep experience with at least one cloud (AWS) and a modern warehouse or lakehouse (e.g., Snowflake, Redshift, Databricks).
    • Strong SQL and one programming language (Python or Scala/Java).
    • Orchestration (Airflow/Dagster/Prefect), transformations (dbt or equivalent), and streaming (Kafka/Kinesis/PubSub).
    • Data modeling (3NF, star, data vault) and semantic/metrics layers.
    • Data quality testing, lineage, and observability in production environments.
    • Security best practices: RBAC/ABAC, encryption, key management, auditability.

** All Remote Hires - will be required to travel to Orlando, Florida at least twice per year for Town Halls and team collaboration in addition to orientation in Orlando, Florida.

Nice to Have

    • Feature stores and ML data ops; experimentation frameworks.
    • Cost optimization at scale; multi-tenant architectures.
    • Governance tools (DataHub/Collibra/Alation), OpenLineage, and testing frameworks (Great Expectations/Deequ).
    • Compliance exposure (SOC 2, GDPR/CCPA; HIPAA/PCI where relevant).
    • Model features sourced from complex 3rd-party data (KYB/KYC, credit bureaus, fraud detection APIs)

Benefits
    • Health Care Plan (Medical, Dental & Vision)
    • Retirement Plan (401k, IRA)
    • Life Insurance
    • Unlimited Paid Time Off
    • 9 paid Holidays
    • Family Leave
    • Work From Home
    • Free Food & Snacks (Access to Industrious Co-working Membership!)
    • Wellness Resources

Top Skills

Airflow
AWS
Dagster
Databricks
Dbt
Java
Kafka
Kinesis
Prefect
Pubsub
Python
Redshift
Scala
Snowflake

Similar Jobs

6 Days Ago
Easy Apply
Remote
USA
Easy Apply
75K-150K Annually
Senior level
75K-150K Annually
Senior level
Big Data • Information Technology
Design and scale PaaS and Data Infrastructure for SaaS products. Lead technical execution of distributed systems and automation pipelines while mentoring engineers and enforcing standards for reliability and performance.
Top Skills: DbtElkFlinkGoGrafanaHelmJavaKafkaKubernetesOpentelemetryPostgresPrometheusPythonSnowflakeSparkTrinoTypescript
13 Days Ago
Easy Apply
Remote
United States
Easy Apply
189K-236K Annually
Senior level
189K-236K Annually
Senior level
Artificial Intelligence • Fintech • Hardware • Information Technology • Sales • Software • Transportation
The Principal Data Platform Engineer will lead the development of scalable data management systems, focusing on ingestion, processing, and analytics platforms while collaborating with cross-functional teams.
Top Skills: AirflowApache KafkaSparkAWSDockerKubernetesPython
14 Days Ago
In-Office or Remote
2 Locations
145K-188K Annually
Senior level
145K-188K Annually
Senior level
Healthtech
The role involves designing and operating modern data and ML platforms, building scalable pipelines, implementing MLOps systems, ensuring compliance, and enabling AI solutions in public health.
Top Skills: AirflowAWSAws GovcloudAzure MlCloudFormationDatabricksDbtEvent HubsGlueGrafanaKafkaKinesisMlflowOpentelemetryPrometheusPythonRedshiftSagemakerSnowflakeSQLStep FunctionsSynapseTerraformVertex Ai

What you need to know about the Chicago Tech Scene

With vibrant neighborhoods, great food and more affordable housing than either coast, Chicago might be the most liveable major tech hub. It is the birthplace of modern commodities and futures trading, a national hub for logistics and commerce, and home to the American Medical Association and the American Bar Association. This diverse blend of industry influences has helped Chicago emerge as a major player in verticals like fintech, biotechnology, legal tech, e-commerce and logistics technology. It’s also a major hiring center for tech companies on both coasts.

Key Facts About Chicago Tech

  • Number of Tech Workers: 245,800; 5.2% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: McDonald’s, John Deere, Boeing, Morningstar
  • Key Industries: Artificial intelligence, biotechnology, fintech, software, logistics technology
  • Funding Landscape: $2.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Pritzker Group Venture Capital, Arch Venture Partners, MATH Venture Partners, Jump Capital, Hyde Park Venture Partners
  • Research Centers and Universities: Northwestern University, University of Chicago, University of Illinois Urbana-Champaign, Illinois Institute of Technology, Argonne National Laboratory, Fermi National Accelerator Laboratory

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account