RevenueBase

Senior Data & AI Platform Engineer (AWS, Snowflake, Vector Search)

Posted 2 Days Ago
Remote
Hiring Remotely in USA
Senior level
Build production-grade AI-powered data tooling: extract data from Snowflake, generate and store embeddings, enable semantic search, design enrichment pipelines using LLM APIs, optimize AWS infrastructure, and create reusable services and SDKs for scalable, observable data and AI workflows.
The summary above was generated by AI
RevenueBase:
  • We're building the data infrastructure that makes AI agents trustworthy instead of error-prone.

  • We provide continuously refreshed, verified B2B data for autonomous AI agents and GTM workflows.

  • We've tripled growth while maintaining 100% gross dollar retention and staying cashflow positive.

  • We power AI agents for Clay, ZoomInfo, Dun & Bradstreet, and the next generation of AI GTM tools.

About the Role

We are looking for a Senior Data & AI Platform Engineer to build internal tools and services on top of our large-scale data infrastructure. Your primary focus will be developing systems that leverage vector embeddings, LLM APIs, and semantic search to unlock value from structured and unstructured data.

This is a hands-on engineering role for someone who enjoys building practical AI-powered tools — not just experiments — and shipping them into production in a fast-moving startup environment.

What You’ll Do
  • Design and build data-driven tools that operate on large datasets stored in S3 and Snowflake

  • Implement pipelines that:

    • Extract specific columns or datasets from Snowflake

    • Generate vector embeddings via APIs such as OpenAI

    • Store and manage embeddings in vector databases like Pinecone

    • Enable semantic search and similarity-based retrieval

  • Develop enrichment workflows that:

    • Query structured data

    • Use LLM APIs to generate new derived columns

    • Write enriched results back into Snowflake

  • Build reusable internal services and SDKs around embedding generation, prompt orchestration, and data augmentation

  • Optimize performance and cost across AWS infrastructure

  • Work closely with product and data teams to turn use cases into scalable engineering solutions

  • Ensure reliability, observability, and maintainability of AI-powered pipelines

Example Projects
  • Tool to extract a single Snowflake column, generate embeddings, push to Pinecone, and expose a semantic search API

  • Batch enrichment pipeline that queries records from Snowflake, calls OpenAI APIs for structured enrichment, and writes new columns back

  • Internal framework for LLM-based data transformation and validation

  • Query abstraction layer to make AI-enhanced analytics accessible to non-engineering teams
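
To make the first example project concrete, here is a minimal, self-contained sketch of the extract, embed, index, and search flow. It is illustrative only and not RevenueBase's implementation: `toy_embed` is a hypothetical stand-in for a call to an embedding API, `InMemoryIndex` is a hypothetical stand-in for a vector database such as Pinecone, and the `rows` list stands in for an id column and a text column selected from Snowflake.

```python
import hashlib
import math
from collections import Counter

DIM = 256  # toy embedding size (assumption); real embedding APIs return larger vectors


def toy_embed(text: str) -> list[float]:
    """Hypothetical stand-in for an embedding API: hashed bag-of-words, L2-normalized."""
    vec = [0.0] * DIM
    for token, count in Counter(text.lower().split()).items():
        # md5 gives a stable bucket across runs, unlike the built-in hash().
        bucket = int(hashlib.md5(token.encode("utf-8")).hexdigest(), 16) % DIM
        vec[bucket] += count
    norm = math.sqrt(sum(v * v for v in vec))
    return [v / norm for v in vec] if norm else vec


class InMemoryIndex:
    """Hypothetical stand-in for a vector database: maps ids to vectors."""

    def __init__(self) -> None:
        self._items: dict[str, list[float]] = {}

    def upsert(self, item_id: str, vector: list[float]) -> None:
        self._items[item_id] = vector

    def query(self, vector: list[float], top_k: int = 3) -> list[tuple[str, float]]:
        # Stored vectors are unit-length, so the dot product is the cosine similarity.
        scored = [
            (item_id, sum(a * b for a, b in zip(vector, stored)))
            for item_id, stored in self._items.items()
        ]
        return sorted(scored, key=lambda pair: pair[1], reverse=True)[:top_k]


def build_index(rows: list[tuple[str, str]]) -> InMemoryIndex:
    """Embed each (id, text) row and store it in the index."""
    index = InMemoryIndex()
    for row_id, text in rows:
        index.upsert(row_id, toy_embed(text))
    return index


def semantic_search(index: InMemoryIndex, query_text: str, top_k: int = 3) -> list[tuple[str, float]]:
    """Embed the query the same way as the rows, then rank rows by similarity."""
    return index.query(toy_embed(query_text), top_k=top_k)


if __name__ == "__main__":
    # Sample rows standing in for a warehouse query result (made-up data).
    rows = [
        ("a", "cloud data warehouse platform"),
        ("b", "organic dog food delivery"),
        ("c", "vector search database"),
    ]
    index = build_index(rows)
    for row_id, score in semantic_search(index, "vector search database", top_k=2):
        print(row_id, round(score, 3))
```

In a production version, each stand-in would be replaced by the corresponding real client, and the search function would sit behind an API endpoint. The key design point the sketch preserves is that queries and stored rows must be embedded by the same model so their vectors are comparable.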

Required Qualifications
  • 5+ years of software engineering experience

  • Strong backend engineering skills (Python preferred; other modern languages acceptable)

  • Solid experience with:

    • AWS (IAM, Lambda, ECS/EKS, S3, networking, security best practices)

    • Data warehousing (Snowflake preferred)

    • API design and distributed systems

  • Hands-on experience working with LLM APIs (e.g., OpenAI) and embedding workflows

  • Experience with vector databases (Pinecone or similar)

  • Strong understanding of data modeling, ETL/ELT patterns, and performance optimization

  • Production experience in at least one startup environment

  • Ability to operate independently and ship high-impact systems end-to-end

Nice to Have
  • Experience building internal developer platforms or data tooling

  • Familiarity with prompt engineering and evaluation pipelines

  • Experience with orchestration frameworks (Airflow, Prefect, Dagster)

  • Exposure to retrieval-augmented generation (RAG) systems

  • Infrastructure-as-code experience (Terraform, CDK)

  • Experience managing large-scale embedding refresh and re-indexing workflows

What Success Looks Like
  • Engineers and analysts can easily leverage AI-powered data enrichment

  • Embedding-based search works reliably at scale

  • New AI use cases can be implemented quickly using shared internal tooling

  • Systems are robust, observable, and cost-efficient

Why Join Us?
  • Work on practical, production-grade AI systems

  • Direct impact on how data is leveraged across the company

  • Startup speed with real ownership and autonomy

  • Opportunity to define the internal AI platform from the ground up

Top Skills

Python, AWS, S3, IAM, Lambda, ECS, EKS, Snowflake, OpenAI, Pinecone, LLM APIs, Vector Databases, Embeddings, Semantic Search
