Sayari Logo

Sayari

Principal Data Engineer

Reposted 5 Days Ago
Remote
Hiring Remotely in United States
200K-220K Annually
Senior level
Remote
Hiring Remotely in United States
200K-220K Annually
Senior level
The Principal Data Engineer will lead the Data Resolution team, focusing on complex data challenges using Spark, system architecture, and mentorship, to optimize graph data pipelines.
The summary above was generated by AI
About Sayari: 

Sayari is the leader in Agentic Systems of Work for economic security and risk. Powered by the Sayari Commercial World Model : a digital twin of global commerce resolving 10.6B+ primary-source records from 250+ jurisdictions : Sayari transforms risk and investigative teams from manual data gatherers into decisive mission leaders. By unifying corporate ownership, trade data, and risk intelligence into a single graph, Sayari uncovers connections and typologies that legacy watchlist, adverse media, and point solutions miss, enabling prescriptive execution at scale. Trusted by the world’s most demanding regulators, including U.S. Customs and Border Protection, the U.S. Treasury, and Fortune 500 enterprises, Sayari delivers the evidence-based transparency needed to prove decisions, satisfy regulators and protect global commerce. Headquartered in Washington, D.C., Sayari is used by thousands of professionals across 35+ countries to secure supply chains and dismantle illicit networks.

Our company culture is defined by a dedication to our mission of using open data to prevent illicit commercial and financial activity, a passion for finding novel approaches to complex problems, and an understanding that diverse perspectives create optimal outcomes. We embrace cross-team collaboration, encourage training and learning opportunities, and reward initiative and innovation. If you like working with supportive, high-performing, and curious teams, Sayari is the place for you.

POSITION DESCRIPTION

We are looking for a Principal Data Engineer to join our Data Resolution team and serve as a technical anchor for our most complex data challenges. In this role, you will be a "player-coach," spending the majority of your time (70%) hands-on with Spark and graph data logic while dedicating the remainder of your time to system architecture, design planning, and technical mentorship. You will be instrumental in evolving our graph build pipelines, optimizing our cloud footprint, and overseeing the long-term planning and execution of major data pipeline re-architectures. This is a high-impact role where your work directly powers the data products used by global systems defenders.


JOB RESPONSIBILITIES
  • Design and implement complex Spark data logic, focusing on performance optimization, data volume tuning, and robust execution.
  • Own the architectural design of graph build pipelines, ensuring they are scalable, automated, and highly resilient.
  • Plan and oversee the strategic re-architecture of data pipelines to meet evolving business needs and scale.
  • Optimize infrastructure-as-code and schema designs to reduce cloud costs and improve pipeline latency.
  • Act as a technical consultant for the team, fostering a collaborative and engineer-led approach to design decisions.
  • Support the development of the engineering team through code reviews, design docs, and architectural best practices.
  • Ensure the accuracy of mission-critical data outputs.
SKILLS & EXPERIENCE

Required Skills & Experience

  • 8+ years of experience in the big data space, with a proven track record of implementing large-scale features and leading process redesigns.
  • Expert-level mastery of Apache Spark for large-scale data processing.
  • Strong experience with orchestration tools (Airflow) and cloud computing environments.
  • Hands-on experience architecting and managing data flows into databases such as Elasticsearch, Memgraph, and Cassandra.
  • Demonstrated ability in system architecture, including Infrastructure as Code (IaC) and schema design.
  • A "builder" mindset with experience evolving and improving existing architectures to meet new scale requirements.

Preferred Skills & Experience

  • Experience working specifically with graph data or graph databases.
  • Prior experience with entity resolution or identity resolution systems.
  • Experience evaluating and selecting modern analytical database architectures.

The target base salary for this position is $200,000-$220,000 plus company bonus and equity. Final offer amounts are determined by multiple factors including location, local market variances, candidate experience and expertise, internal peer equity, and may vary from the amounts listed above.


Benefits: 
  • 100% fully paid medical, vision, and dental for employees and their dependents
  • Generous time off; we observe all US federal holidays, close our office for a winter break (12/24-12/31), in addition to granting 18 PTO days and 10 sick days
  • Outstanding compensation package; competitive commissions for revenue roles and bonuses for non-revenue positions
  • A strong commitment to diversity, equity, and inclusion
  • Eligibility to participate in additional benefits such as 401k match up to 5%, 100% paid life insurance (up to $100,000 coverage),, and parental leave
  • A collaborative and positive culture - your team will be as smart and driven as you
  • Limitless growth and learning opportunities
 
Sayari is an equal opportunity employer and strongly encourages diverse candidates to apply. We believe diversity and inclusion mean our team members should reflect the diversity of the United States. No employee or applicant will face discrimination or harassment based on race, color, ethnicity, religion, age, gender, gender identity or expression, sexual orientation, disability status, veteran status, genetics, or political affiliation. We strongly encourage applicants of all backgrounds to apply.
Pay Range
$200,000$220,000 USD

Similar Jobs

3 Days Ago
Remote or Hybrid
United States
Senior level
Senior level
Fintech • Software
The Principal Data Platform Engineer leads the design of data architectures, implements data platform patterns, and optimizes system performance, ensuring data strategy operates at scale.
Top Skills: Azure FabricData FactoryDelta LakeDockerKubernetesOnelakePower BIPysparkPythonSQL
15 Days Ago
Remote or Hybrid
160K-200K Annually
Expert/Leader
160K-200K Annually
Expert/Leader
AdTech • Cloud • Digital Media • Information Technology • News + Entertainment • App development
The Principal Data Engineer develops and maintains advertising technology, architecting backend services using microservices and cloud technologies, integrating AI capabilities, and guiding the team in best practices.
Top Skills: SparkAWSDockerFlinkJavaKafkaKinesisNoSQLPythonRest ApiScalaSQL
2 Days Ago
In-Office or Remote
113K-193K Annually
Senior level
113K-193K Annually
Senior level
Artificial Intelligence • Big Data • Healthtech • Information Technology • Machine Learning • Software • Analytics
The Principal Data Engineer will lead data engineering efforts on public cloud, design scalable solutions, build data pipelines, and drive deliverables with a high level of technical expertise in relevant technologies.
Top Skills: AdfAzureCi/CdContainersDatabricksDelta LakeDockerJenkinsKafkaSnowflakeSparkTerraform

What you need to know about the Chicago Tech Scene

With vibrant neighborhoods, great food and more affordable housing than either coast, Chicago might be the most liveable major tech hub. It is the birthplace of modern commodities and futures trading, a national hub for logistics and commerce, and home to the American Medical Association and the American Bar Association. This diverse blend of industry influences has helped Chicago emerge as a major player in verticals like fintech, biotechnology, legal tech, e-commerce and logistics technology. It’s also a major hiring center for tech companies on both coasts.

Key Facts About Chicago Tech

  • Number of Tech Workers: 245,800; 5.2% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: McDonald’s, John Deere, Boeing, Morningstar
  • Key Industries: Artificial intelligence, biotechnology, fintech, software, logistics technology
  • Funding Landscape: $2.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Pritzker Group Venture Capital, Arch Venture Partners, MATH Venture Partners, Jump Capital, Hyde Park Venture Partners
  • Research Centers and Universities: Northwestern University, University of Chicago, University of Illinois Urbana-Champaign, Illinois Institute of Technology, Argonne National Laboratory, Fermi National Accelerator Laboratory

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account