Careerflow.ai Logo

Careerflow.ai

AI/ML Software Engineer (RL Environments) (Contract)

Posted 7 Hours Ago
Remote
Hiring Remotely in United States
Mid level
Remote
Hiring Remotely in United States
Mid level
Design and build reinforcement-learning training environments and diverse tasks to evaluate and improve LLM agents; iterate rapidly on task designs from customer feedback, deliver high-quality outputs with minimal supervision, and maintain PST overlap for collaboration.
The summary above was generated by AI
About the Role

We're seeking experienced Machine Learning Engineers and Software Engineers with ML experience to design and build high-quality RL training environments for LLM agents. As an RL Environment Engineer, you'll create diverse machine learning tasks that challenge and improve language models, working with minimal supervision to deliver consistent, quality outputs.

What You'll Do
  • Design and build tasks for machine learning domains that target specific language models and difficulty distributions

  • Iterate rapidly on task designs based on customer feedback, with 24-hour turnaround times

  • Create diverse, challenging scenarios that test language model capabilities and expose their limitations

  • Hit the ground running with minimal onboarding time

What We're Looking For
  • Strong machine learning background through coursework, previous work experience, or personal projects

  • Python fluency: you write clean, efficient Python code regularly

  • Heavy LLM user who understands current model capabilities and failure modes through daily hands-on experience

  • Self-directed and creative. You can generate novel ML task ideas in your domain without constant guidance

  • High responsibility and integrity. You deliver quality work consistently and meet deadlines

  • Availability overlap with PST 9am-5pm (minimum 3 hours required)

Work Details
  • Location: Remote

  • Type: Contractor

Time Commitment: 40 hours a week. Must have at least 3 hours of overlap with PST business hours (9am-5pm)

Selection Process:
  1. Screening

  2. Hacker rank assessment

  3. 1 Week paid task

  4. Full time

Similar Jobs

3 Minutes Ago
Remote
United States
145K-171K Annually
Mid level
145K-171K Annually
Mid level
Healthtech • Social Impact • Software • Telehealth
Build and maintain scalable platforms, services, and internal tooling for Revenue Cycle Management (payments, claims, operational workflows). Collaborate with engineering, operations, and business stakeholders to improve efficiency, reliability, and cost effectiveness across the revenue cycle. Apply strong engineering fundamentals, product thinking, and continuous improvement to plan, launch, and maintain systems.
Top Skills: AWSKafkaNode.jsReactTypescript
20 Minutes Ago
Remote
United States
270K-315K Annually
Senior level
270K-315K Annually
Senior level
Security • Software • Cybersecurity • Automation
Own and grow enterprise accounts by prospecting, building pipeline, and closing new business. Consult senior executives, develop partnership strategies, exceed revenue targets, and provide market feedback to improve product-market fit.
An Hour Ago
Easy Apply
Remote
USA
Easy Apply
150K-230K Annually
Senior level
150K-230K Annually
Senior level
Artificial Intelligence • Cloud • Software • Infrastructure as a Service (IaaS)
Lead product marketing for Runpod's AI infrastructure platform: refine positioning, own product-line PMM and launches, produce technical long-form content, run competitive intelligence and win/loss programs, enable sales with battlecards and assets, build AI-assisted content workflows, and synthesize customer and usage data to inform GTM and product priorities.
Top Skills: Ai ToolsAi/Ml WorkflowsAnalytics ToolsCloudDistributed SystemsGpu InfrastructureHpcInfrastructure-As-A-ServiceSeoServerless ArchitectureSQL

What you need to know about the Chicago Tech Scene

With vibrant neighborhoods, great food and more affordable housing than either coast, Chicago might be the most liveable major tech hub. It is the birthplace of modern commodities and futures trading, a national hub for logistics and commerce, and home to the American Medical Association and the American Bar Association. This diverse blend of industry influences has helped Chicago emerge as a major player in verticals like fintech, biotechnology, legal tech, e-commerce and logistics technology. It’s also a major hiring center for tech companies on both coasts.

Key Facts About Chicago Tech

  • Number of Tech Workers: 245,800; 5.2% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: McDonald’s, John Deere, Boeing, Morningstar
  • Key Industries: Artificial intelligence, biotechnology, fintech, software, logistics technology
  • Funding Landscape: $2.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Pritzker Group Venture Capital, Arch Venture Partners, MATH Venture Partners, Jump Capital, Hyde Park Venture Partners
  • Research Centers and Universities: Northwestern University, University of Chicago, University of Illinois Urbana-Champaign, Illinois Institute of Technology, Argonne National Laboratory, Fermi National Accelerator Laboratory

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account