SumerSports

Data Engineer

Reposted 15 Days Ago
Remote
Hiring Remotely in United States
Mid level
As a Data Engineer, you'll build and maintain data pipelines, work with ML teams, and ensure data integrity for analysis and AI-driven products.

SumerSports is a leading football intelligence technology company that specializes in providing an innovative suite of products for football fans and NFL clubs. We are a collection of executives, engineers, data scientists, and visionaries from NFL clubs, technology startups, finance, and academia. 


Our data-driven platform empowers teams with insights and tools to make informed decisions within salary cap constraints. The platform also serves the NCAA, offering insights around the transfer portal and more.


What sets us apart is our unique blend of big tech talent, data scientists, and former NFL personnel, who have a combined 600+ years of NFL experience. Our domain knowledge is augmented by AI and machine learning technologies to create a unique view into many aspects of football.

As a Data Engineer, you’ll design, build, and maintain the data pipelines that power our deep learning and LLM systems. You’ll work across ingestion, transformation, and orchestration layers — from real-time feeds to analytics-ready datasets. 


Your mission is to make data reliable, discoverable, and scalable for model training, analytics, and AI-driven products across multiple sports. You’ll collaborate closely with our MLOps, LLMOps, and Sports Data teams to ensure seamless integration between data and AI.


Responsibilities:

  • Build and operate robust data pipelines for ingestion, cleaning, and transformation using Databricks, Airflow, or Dagster. 
  • Develop efficient ETL/ELT workflows in Python and SQL to support both batch and streaming workloads.
  • Collaborate with ML and AI teams to deliver high-quality datasets for training, evaluation, and production features.
  • Model and maintain structured data assets (Delta, Parquet, Iceberg) for reliability, versioning, and lineage tracking. 
  • Implement orchestration and monitoring — schedule jobs, track dependencies, and automate recovery from failures. 
  • Ensure data quality and compliance through validation frameworks, schema enforcement, and audit logging. 
  • Contribute to data platform evolution — evaluate tools, standardize best practices, and improve developer experience.
  • Support performance and cost optimization across compute, storage, and orchestration systems.

Qualifications:

  • 3–6 years of experience as a Data Engineer or ETL Developer in a production environment. 
  • Proficiency in Python and SQL; strong familiarity with Databricks, Spark, or equivalent big-data frameworks. 
  • Experience with workflow orchestration tools such as Airflow, Dagster, Luigi or Prefect. 
  • Deep understanding of data modeling, data warehousing, and distributed data processing. 
  • Knowledge of modern data lakehouse architectures (Delta, Parquet, Iceberg). 
  • Familiarity with CI/CD, GitHub Actions, and data pipeline testing frameworks. 
  • Comfort working in a cross-functional environment with ML, product, and analytics teams. 

Nice to Have:

  • Experience with sports, telemetry, or sensor data pipelines. 
  • Familiarity with streaming frameworks (Kafka, Spark Structured Streaming, Flink). 
  • General knowledge of American football, the NFL, and college football.
  • Background in data governance, lineage, and observability tools (Monte Carlo, Great Expectations, Unity Catalog, OpenLineage). 
  • Experience with cloud infrastructure (AWS, GCP, or Azure) and containerization (Docker, Kubernetes). 
  • Exposure to best practices in machine-learning model management and MLOps.

Benefits:

  • Competitive Salary and Bonus Plan
  • Comprehensive health insurance plan
  • Retirement savings plan (401(k)) with company match
  • Remote working environment
  • A flexible, unlimited time off policy
  • Generous paid holiday schedule: 13 holidays in total, including the Monday after the Super Bowl
