Back to jobs/Anthropic

Regional hiringpublishedExternal employer

Anthropic•AI Research

Software Engineer, RL Data

Location

London, City of London, United Kingdom

Work type

Hybrid

Employment

Full Time

Experience

7-10 years

Compensation

$320K - $485K per year

Posted

1 month ago

Summary and responsibilities

Role overview

Summary

This senior Software Engineer role on Anthropic's RL Data team involves making architectural decisions and building robust systems for high-quality reinforcement learning data. Responsibilities include developing data collection pipelines, improving QA frameworks, and hardening execution environments, requiring end-to-end ownership and collaboration with research teams.

About Anthropic

Anthropic’s mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems.

About the role

This is a senior, foundational role on a new team: you'll make architecture decisions the rest of the team builds on, and help shape what we build first. The work is hands-on and varied. Some weeks you'll be deep in pipeline or infrastructure engineering; others you'll be tuning prompts until the output is good, or sitting with a research team that depends on your systems and shipping the fixes they need. We're looking for experienced engineers who own outcomes end-to-end — down to reading transcripts, supporting users, and wrangling vendors.

Anthropic's RL Data team builds the systems that produce high-quality reinforcement learning data for Claude: data collection pipelines, human feedback tooling, the execution environments RL tasks run in, and the quality assurance that keeps training data trustworthy at scale. Our goal is to make Claude great at real work — especially the work that matters most, like AI safety research and beneficial deployments of AI. (To be upfront: this is dual-use work — it advances general capabilities too.)

Key responsibilities

Own significant parts of our stack end-to-end, from technical architecture through the unglamorous operational work that makes it succeed.
Build data collection pipelines, read the transcripts they produce, and iterate on prompts, evals, and graders until the output is good.
Develop and improve QA frameworks to catch reward hacking and ensure environment quality.
Build interfaces that make collecting human data fast and painless for the people providing it.
Harden execution environments — sandboxing, snapshotting, tool coverage — so tasks hold up at training scale.
Embed with the teams and domain experts who use our systems day-to-day, and work with operations, security, and compliance partners to roll our systems out to new users and vendors.

Minimum qualifications

A track record of owning major projects end-to-end in fast-paced, ambiguous environments — for example as a founder or CTO, forward deployed engineer, tech lead, founding engineer at a startup, or creator of a substantial open-source project.
Trusted to run key projects: you lead and inspire others, plan workstreams effectively, collaborate with cross-functional stakeholders, and proactively eliminate or escalate blockers.
Strong software engineering skills in at least one modern programming language — we mostly use Python and TypeScript, but care more that you pick new tools up quickly than that you know our exact stack. Familiarity with Docker, Kubernetes, and common cloud infrastructure is a plus.
Effective use of AI tools in your own day-to-day work.
Care about the societal impacts of your work.

Preferred qualifications

Experience with reinforcement learning on LLMs, particularly on the data side: creating evals, environments, rewards, graders, or training data.
Experience helping organizations use AI more effectively, including integrating with third-party tools via APIs, CLIs, and MCP servers.
Strong data engineering skills: pipelines that handle large volumes reliably in production, LLM-powered enrichment steps, and a focus on improving data quality.
Experience shipping user-facing products or internal platforms people love: interviewing users, hunting down friction, measurably improving the experience.
Basic familiarity with AI safety or security research.

Logistics

Minimum education: Bachelor’s degree or an equivalent combination of education, training, and/or experience

Required field of study: A field relevant to the role as demonstrated through coursework, training, or professional experience

Minimum years of experience: Years of experience required will correlate with the internal job level requirements for the position

Location-based hybrid policy: Currently, we expect all staff to be in one of our offices at least 25% of the time. However, some roles may require more time in our offices.

Visa sponsorship: We do sponsor visas! However, we aren't able to successfully sponsor visas for every role and every candidate. But if we make you an offer, we will make every reasonable effort to get you a visa, and we retain an immigration lawyer to help with this.

How we're different

We believe that the highest-impact AI research will be big science. At Anthropic we work as a single cohesive team on just a few large-scale research efforts. And we value impact — advancing our long-term goals of steerable, trustworthy AI — rather than work on smaller and more specific puzzles. We view AI research as an empirical science, which has as much in common with physics and biology as with traditional efforts in computer science. We're an extremely collaborative group, and we host frequent research discussions to ensure that we are pursuing the highest-impact work at any given time. As such, we greatly value communication skills.

The easiest way to understand our research directions is to read our recent research. This research continues many of the directions our team worked on prior to Anthropic, including: GPT-3, Circuit-Based Interpretability, Multimodal Neurons, Scaling Laws, AI & Compute, Concrete Problems in AI Safety, and Learning from Human Preferences.

Come work with us!

Anthropic is a public benefit corporation headquartered in San Francisco. We offer competitive compensation and benefits, optional equity donation matching, generous vacation and parental leave, flexible working hours, and a lovely office space in which to collaborate with colleagues. Guidance on Candidates' AI Usage: Learn about our policy for using AI in our application process.

Updated 1 month ago

Candidate fit

Skills and qualifications

Additional skills

Software Engineering • 1+ yrs

Python • 1+ yrs

TypeScript • 1+ yrs

Docker • 1+ yrs

Kubernetes • 1+ yrs

Cloud Infrastructure • 1+ yrs

Data Engineering • 1+ yrs

Reinforcement Learning • 1+ yrs

Experience

7-10 years

How this role is positioned

Role classification

Job domains

Software Engineering

Industries

Technology & IT

Employment

Full Time

Contract duration

Permanent

Hiring type

Direct

Global hiring

Location specific

Offer details

Compensation and benefits

Compensation

$320K - $485K per year

VisibilityShared on listing

CurrencyUSD

PeriodYearly

Benefits and perks

Paid Parental Leave

Flexible Working Hours

Visa Sponsorship

Location, schedule, and role shape

Work setup

Work conditions

Primary locationLondon, City of London, United Kingdom

Work typeHybrid

Global hiringNo

Bandwidth profile

peopleMedium • 7/10

physicalLow • 2/10

cognitiveHigh • 9/10

executionHigh • 9/10

creativityMedium • 7/10

uncertaintyHigh • 8/10

communicationHigh • 8/10

Context on the employer

Company snapshot

Company

Anthropic

Team size

Growing team

Location

London, City of London, United Kingdom

Visit website

Similar jobs in Software Engineering

Natural scientist, methods and algorithm development

Confidential Client•Oberkochen, Baden-Württemberg, Germany

Full Time5-8 yrs

€70K - €83K /yr

Hybrid

Wildlife & ConservationOperationsSoftware EngineeringOptimizationSimulationData ScienceProcess ModelingMachine LearningMATLABPythonC#+5

This role involves developing and implementing methods and algorithms for high-precision optics, applying modern simulation techniques, and optimizing optical processes. The position also includes managing interdisciplinary projects, developing machine learning algorithms, and building scientific networks.

...

External

Associate Data Solutions Engineer

Higher Logic•Remote, New York, United States

Full Time0-2 yrs

Compensation not disclosed

Remote

Technology & ITSoftware & SaaSSoftware EngineeringSQLPythonETL/ELTData ModelingDBTIcebergData WarehousingCommunication+5

The Associate Data Solutions Engineer supports Higher Logic’s strategic data initiatives by designing, developing, and scaling data pipelines and models. This role is critical in building the data warehouse and enabling AI capabilities, working closely with internal stakeholders to ensure data is reliable, accessible, and actionable.

...

External

Data Associate Engineer

CareFirst BlueCross BlueShield•Baltimore, Maryland, United States

Full Time2-6 yrs

$62.9K - $125K /yr

Remote

Healthcare & PharmaSoftware & SaaSSoftware EngineeringSQLNoSQLPythonData TransformationETLData ModelingDatabase DesignCommunication+5

The Data Associate Engineer understands data needs, advises on technological resources, and partners with senior team members to aggregate and analyze data for actionable insights. This role involves developing reports, dashboards, and tools for business users, as well as creating technical solutions to improve data access and usage. The engineer will also develop and execute ETL code, ensuring reliable data loading processes and high data quality.

...

External

Associate Data Engineer

UST•Frisco, Texas, United States

Full Time0-3 yrs

$50 - $55 /hr

On-site

Technology & ITSoftware & SaaSSoftware EngineeringPythonSQLReactData EngineeringETLData PipelinesCloudGit+5

This Associate Data Engineer role focuses on building and maintaining scalable, data-driven full-stack applications and platforms. The position involves collaborating with data engineers, scientists, and architects, ensuring code quality, and adhering to modern engineering practices.

...

External

Data Solutions Engineer - Data Pipelines, Regulatory Reporting

Randstad Digital•Columbus, Ohio, United States

Contract2+ yrs

$55 - $70 /hr

On-site

Technology & ITSoftware & SaaSSoftware EngineeringData PipelinesData ModelingSoftware DevelopmentTroubleshootingData QualityAutomationClient Relationship ManagementCollaboration+5

As a Data Solutions Engineer, you will be responsible for building and operating governed, auditable data pipelines and models for regulatory reporting, audit compliance, and executive insights. This role involves integrating data from various enterprise platforms and external RTO systems to improve data quality, traceability, and automation across the analytics lifecycle.

...

External

Ecosystems Digital Twin Researcher II - Edge AI, Embedded ML

San Diego Zoo Wildlife Alliance•Escondido, California, United States

Contract3-8 yrs

$99.5K - $111.9K /yr

On-site

Wildlife & ConservationOperationsSoftware EngineeringPythonPyTorchTensorFlowEdge MLEmbedded SoftwareQuantizationLinuxScience Communication+5

The Researcher II develops programs for scientific research, conducts independent and collaborative research in biological or social sciences, and supervises assigned research/laboratory operations. This role involves overseeing data collection, analysis, and interpretation for conservation programs, publishing findings, and securing funding.

...

External

More jobs from Technology & IT

Project Manager

Kraemer Design + Production•Cincinnati, Ohio, United States

Full Time3-7 yrs

Compensation not disclosed

Hybrid

Technology & ITOperationsDesign & CreativeProject ManagementBudget ManagementSchedule ManagementClient CommunicationRisk ManagementProblem SolvingMicrosoft OfficeFabrication Coordination+5

Kraemer Design + Production (KD+P) is seeking a highly organized Project Manager to lead interactive projects from kickoff through installation. This role involves aligning internal teams and external partners, managing schedules and budgets, and coordinating client communication to ensure projects are completed safely, professionally, and to KD+P’s quality standards in complex, custom-built environments.

...

External

Design Professional

Cushing Terrell•Minneapolis, Minnesota, United States

Full Time0-6 yrs

$60K - $65K /yr

On-site

Technology & ITDesign & CreativeArchitectural DesignConstruction DocumentsConstruction AdministrationRevitAutodeskSketchUpAdobe Creative SuiteCollaboration+4

As a Design Professional at Cushing Terrell, you will collaborate with project teams on architectural design, project development, construction documents, and construction administration. You will develop design and technical solutions under the direct supervision of an experienced architect, coordinating design with technical teams and managing project segments.

...

External

Data Analytics Intern

Zelestra•Seville, Spain

Internship0-1 yrs

Compensation not disclosed

Hybrid

Technology & ITSoftware & SaaSIT & System AdministrationOperationsData AnalysisPower BIPythonMicrosoft OfficeReportingProcess AutomationContinuous ImprovementCommunication+6

The Data Analytics Intern will support the analysis of operational and production data, identify deviations, and prepare reports for clients. This role involves contributing to the development and automation of analysis tools and proposing continuous improvement initiatives to enhance efficiency and support decision-making.

...

External

Business Development & Strategy Intern

Anderson Global•Paris, France

Internship0-1 yrs

Compensation not disclosed

On-site

Technology & ITOperationsBusiness DevelopmentStrategic PlanningHubSpotProspectingCommercial Performance TrackingPartnership ManagementInterpersonal SkillsAgility+4

This 6-month internship involves working directly with the General Manager to build the commercial engine of Companow, a company assisting international entrepreneurs in France. Responsibilities include business development, partnership management, attending international events, and contributing to CRM structuring.

...

External

Associate Data Solutions Engineer

Higher Logic•Remote, New York, United States

Full Time0-2 yrs

Compensation not disclosed

Remote

Technology & ITSoftware & SaaSSoftware EngineeringSQLPythonETL/ELTData ModelingDBTIcebergData WarehousingCommunication+5

...

External

Associate Data Engineer

UST•Frisco, Texas, United States

Full Time0-3 yrs

$50 - $55 /hr

On-site

Technology & ITSoftware & SaaSSoftware EngineeringPythonSQLReactData EngineeringETLData PipelinesCloudGit+5

...

External

Popular Domains

Explore opportunities across specialized functional areas.

Administration & OfficeRoles providing organizational, secretarial, clerical, and executive support functions.

Customer Success & SupportRoles managing customer onboarding, retention, satisfaction, and technical support.

Data Science & AnalyticsRoles using data modeling, statistics, and visualization to derive business insights.

Design & CreativeRoles focused on visual design, UX/UI, branding, illustration, and creative production.

Education AdministrationRoles managing educational institutions, programs, curriculum, and student affairs.

Finance & AccountingRoles managing financial reporting, budgeting, auditing, tax, and investment activities.

Gigs & Flexible TasksShort-term, contract, or freelance task-based work across any domain.

Healthcare & MedicalRoles in clinical care, medical practice, patient management, and health services delivery.

Trending Industries

Discover roles in the world's most innovative sectors.

Aerospace & Space Tech

Agency & Consulting Services

Agriculture & AgriTech

Automotive

Biotech & Life Sciences

Blockchain & Web3

Construction & Infrastructure

Cybersecurity

Software Engineer, RL Data

London, City of London, United Kingdom • Full Time