Back to jobs/Anthropic

Regional hiringpublishedExternal employer

Anthropic•Artificial Intelligence

Research Engineer, Knowledge Foundations

Location

San Francisco, California, United States

Work type

Hybrid

Employment

Full Time

Experience

5-15 years

Compensation

$350K - $850K per year

Posted

1 month ago

Summary and responsibilities

Role overview

Summary

As a Research Engineer on the Knowledge Work team, you will design and execute experiments to enhance Claude's ability to search, retrieve, and reason over information at scale. This involves developing training environments, curating data, and building evaluations to improve model behavior in real-world professional workflows.

About the role

The Knowledge Work team builds the training environments and evaluations that make Claude effective at real-world professional workflows — searching, analyzing, and creating across the tools and documents knowledge workers use every day. As that work scales, the systems behind it need to be as rigorous as the research itself.

As a Research Engineer on Knowledge, you'll design and run experiments that improve how Claude searches, retrieves, and reasons over information at scale. The work spans environment design, data curation, RL training, evaluation, and the infrastructure that supports it all. You'll move fluidly between these depending on what's blocking progress. You'll partner closely with researchers and other RL teams to ship capabilities that show up directly in Claude's behavior.

As our training and evaluations continue to scale, we see a strong synergy between the capabilities our models learn, the tools we build for them to use, and the tools we build for ourselves to understand it all. We own the science behind superhuman epistemics and we ensure the quality of the stack that drives it. We understand that real ownership and impact comes as much through hardening and iterating on environments as it does creating new ones.

Responsibilities

Design, build, and iterate on training environments and data pipelines that improve Claude's ability to reason over knowledge-intensive tasks
Run experiments end-to-end: form a hypothesis, build the infrastructure, train models, analyze results, and decide what to try next
Develop evaluations that meaningfully capture progress on search, retrieval, and reasoning quality
Identify failure modes in current model behavior and translate them into concrete training signals
Collaborate closely with researchers across RL Data, post-training, and product teams to align on priorities and ship improvements
Contribute to shared infrastructure and tooling that compounds the team's velocity over time
Own a clean, canonical set of evaluation tools and processes for Knowledge Work capabilities, including the process used for model releases
Build and automate observability, dashboards, and operational tooling for our training environments and evaluation systems, with an emphasis on high signal-to-noise: a small set of trusted metrics and alerts rather than sprawling instrumentation

You may be a good fit if you

Are a highly experienced Python engineer who ships reliable, well-instrumented code that teammates trust in production
Experience designing, running, and analyzing ML experiments
Ability to work across the stack — from data pipelines to model training to evaluation
Have 5+ years of experience operating ML or distributed systems at scale
Comfort working with ambiguity and choosing the most impactful problem to tackle next
Clear written and verbal communication, especially when collaborating across time zones
Find genuine satisfaction and impact in making existing critical systems dependable

Preferred qualifications

Hands-on experience training, fine-tuning, or doing RL on large language models
Experience building evaluations for LLMs, particularly in open-ended or knowledge-intensive domains
Prior work in a research-heavy environment such as a frontier AI lab, quant research firm, or domain-focused AI startup
Published research on LLMs, RL, retrieval, or related areas
Experience with distributed training systems
Are comfortable being the long-term, context-rich owner of a system and its operational health

Representative projects

Building a training environment that teaches Claude to plan and execute multi-step research tasks against real document corpora
Designing an evaluation suite that distinguishes genuine reasoning over evidence from plausible-sounding pattern matching
Scaling long-running evals and fickle training environments that use many different tools
Curating and validating a high-quality dataset of expert research workflows for use in post-training
Diagnosing why Claude fails on a class of long-horizon retrieval tasks and proposing a training intervention, tool, or infrastructure change to fix it

Updated 1 month ago

Candidate fit

Skills and qualifications

Additional skills

Python • 1+ yrs

ML experiments • 1+ yrs

Distributed systems • 1+ yrs

Data pipelines • 1+ yrs

Model training • 1+ yrs

Evaluation • 1+ yrs

Large Language Models • 1+ yrs

Communication • 1+ yrs

Experience

5-15 years

How this role is positioned

Role classification

Job domains

Software Engineering

Industries

Technology & IT

Software & SaaS

Employment

Full Time

Contract duration

Permanent

Hiring type

Direct

Global hiring

Location specific

Offer details

Compensation and benefits

Compensation

$350K - $850K per year

VisibilityShared on listing

CurrencyUSD

PeriodYearly

Benefits and perks

Paid Parental Leave

Flexible Working Hours

Visa Sponsorship

Location, schedule, and role shape

Work setup

Work conditions

Primary locationSan Francisco, California, United States

Work typeHybrid

Global hiringNo

Bandwidth profile

peopleMedium • 7/10

physicalLow • 2/10

cognitiveHigh • 9/10

executionHigh • 8/10

creativityHigh • 9/10

uncertaintyHigh • 8/10

communicationHigh • 8/10

Context on the employer

Company snapshot

Company

Anthropic

Team size

Growing team

Location

San Francisco, California, United States

Anthropic’s mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems.

Visit website

Similar jobs in Software Engineering

Natural scientist, methods and algorithm development

Confidential Client•Oberkochen, Baden-Württemberg, Germany

Full Time5-8 yrs

€70K - €83K /yr

Hybrid

Wildlife & ConservationOperationsSoftware EngineeringOptimizationSimulationData ScienceProcess ModelingMachine LearningMATLABPythonC#+5

This role involves developing and implementing methods and algorithms for high-precision optics, applying modern simulation techniques, and optimizing optical processes. The position also includes managing interdisciplinary projects, developing machine learning algorithms, and building scientific networks.

...

External

Associate Data Solutions Engineer

Higher Logic•Remote, New York, United States

Full Time0-2 yrs

Compensation not disclosed

Remote

Technology & ITSoftware & SaaSSoftware EngineeringSQLPythonETL/ELTData ModelingDBTIcebergData WarehousingCommunication+5

The Associate Data Solutions Engineer supports Higher Logic’s strategic data initiatives by designing, developing, and scaling data pipelines and models. This role is critical in building the data warehouse and enabling AI capabilities, working closely with internal stakeholders to ensure data is reliable, accessible, and actionable.

...

External

Data Associate Engineer

CareFirst BlueCross BlueShield•Baltimore, Maryland, United States

Full Time2-6 yrs

$62.9K - $125K /yr

Remote

Healthcare & PharmaSoftware & SaaSSoftware EngineeringSQLNoSQLPythonData TransformationETLData ModelingDatabase DesignCommunication+5

The Data Associate Engineer understands data needs, advises on technological resources, and partners with senior team members to aggregate and analyze data for actionable insights. This role involves developing reports, dashboards, and tools for business users, as well as creating technical solutions to improve data access and usage. The engineer will also develop and execute ETL code, ensuring reliable data loading processes and high data quality.

...

External

Associate Data Engineer

UST•Frisco, Texas, United States

Full Time0-3 yrs

$50 - $55 /hr

On-site

Technology & ITSoftware & SaaSSoftware EngineeringPythonSQLReactData EngineeringETLData PipelinesCloudGit+5

This Associate Data Engineer role focuses on building and maintaining scalable, data-driven full-stack applications and platforms. The position involves collaborating with data engineers, scientists, and architects, ensuring code quality, and adhering to modern engineering practices.

...

External

Data Solutions Engineer - Data Pipelines, Regulatory Reporting

Randstad Digital•Columbus, Ohio, United States

Contract2+ yrs

$55 - $70 /hr

On-site

Technology & ITSoftware & SaaSSoftware EngineeringData PipelinesData ModelingSoftware DevelopmentTroubleshootingData QualityAutomationClient Relationship ManagementCollaboration+5

As a Data Solutions Engineer, you will be responsible for building and operating governed, auditable data pipelines and models for regulatory reporting, audit compliance, and executive insights. This role involves integrating data from various enterprise platforms and external RTO systems to improve data quality, traceability, and automation across the analytics lifecycle.

...

External

Ecosystems Digital Twin Researcher II - Edge AI, Embedded ML

San Diego Zoo Wildlife Alliance•Escondido, California, United States

Contract3-8 yrs

$99.5K - $111.9K /yr

On-site

Wildlife & ConservationOperationsSoftware EngineeringPythonPyTorchTensorFlowEdge MLEmbedded SoftwareQuantizationLinuxScience Communication+5

The Researcher II develops programs for scientific research, conducts independent and collaborative research in biological or social sciences, and supervises assigned research/laboratory operations. This role involves overseeing data collection, analysis, and interpretation for conservation programs, publishing findings, and securing funding.

...

External

More jobs from Technology & IT

Project Manager

Kraemer Design + Production•Cincinnati, Ohio, United States

Full Time3-7 yrs

Compensation not disclosed

Hybrid

Technology & ITOperationsDesign & CreativeProject ManagementBudget ManagementSchedule ManagementClient CommunicationRisk ManagementProblem SolvingMicrosoft OfficeFabrication Coordination+5

Kraemer Design + Production (KD+P) is seeking a highly organized Project Manager to lead interactive projects from kickoff through installation. This role involves aligning internal teams and external partners, managing schedules and budgets, and coordinating client communication to ensure projects are completed safely, professionally, and to KD+P’s quality standards in complex, custom-built environments.

...

External

Design Professional

Cushing Terrell•Minneapolis, Minnesota, United States

Full Time0-6 yrs

$60K - $65K /yr

On-site

Technology & ITDesign & CreativeArchitectural DesignConstruction DocumentsConstruction AdministrationRevitAutodeskSketchUpAdobe Creative SuiteCollaboration+4

As a Design Professional at Cushing Terrell, you will collaborate with project teams on architectural design, project development, construction documents, and construction administration. You will develop design and technical solutions under the direct supervision of an experienced architect, coordinating design with technical teams and managing project segments.

...

External

Data Analytics Intern

Zelestra•Seville, Spain

Internship0-1 yrs

Compensation not disclosed

Hybrid

Technology & ITSoftware & SaaSIT & System AdministrationOperationsData AnalysisPower BIPythonMicrosoft OfficeReportingProcess AutomationContinuous ImprovementCommunication+6

The Data Analytics Intern will support the analysis of operational and production data, identify deviations, and prepare reports for clients. This role involves contributing to the development and automation of analysis tools and proposing continuous improvement initiatives to enhance efficiency and support decision-making.

...

External

Business Development & Strategy Intern

Anderson Global•Paris, France

Internship0-1 yrs

Compensation not disclosed

On-site

Technology & ITOperationsBusiness DevelopmentStrategic PlanningHubSpotProspectingCommercial Performance TrackingPartnership ManagementInterpersonal SkillsAgility+4

This 6-month internship involves working directly with the General Manager to build the commercial engine of Companow, a company assisting international entrepreneurs in France. Responsibilities include business development, partnership management, attending international events, and contributing to CRM structuring.

...

External

Associate Data Solutions Engineer

Higher Logic•Remote, New York, United States

Full Time0-2 yrs

Compensation not disclosed

Remote

Technology & ITSoftware & SaaSSoftware EngineeringSQLPythonETL/ELTData ModelingDBTIcebergData WarehousingCommunication+5

...

External

Associate Data Engineer

UST•Frisco, Texas, United States

Full Time0-3 yrs

$50 - $55 /hr

On-site

Technology & ITSoftware & SaaSSoftware EngineeringPythonSQLReactData EngineeringETLData PipelinesCloudGit+5

...

External

Popular Domains

Explore opportunities across specialized functional areas.

Administration & OfficeRoles providing organizational, secretarial, clerical, and executive support functions.

Customer Success & SupportRoles managing customer onboarding, retention, satisfaction, and technical support.

Data Science & AnalyticsRoles using data modeling, statistics, and visualization to derive business insights.

Design & CreativeRoles focused on visual design, UX/UI, branding, illustration, and creative production.

Education AdministrationRoles managing educational institutions, programs, curriculum, and student affairs.

Finance & AccountingRoles managing financial reporting, budgeting, auditing, tax, and investment activities.

Gigs & Flexible TasksShort-term, contract, or freelance task-based work across any domain.

Healthcare & MedicalRoles in clinical care, medical practice, patient management, and health services delivery.

Trending Industries

Discover roles in the world's most innovative sectors.

Aerospace & Space Tech

Agency & Consulting Services

Agriculture & AgriTech

Automotive

Biotech & Life Sciences

Blockchain & Web3

Construction & Infrastructure

Cybersecurity

Research Engineer, Knowledge Foundations

San Francisco, California, United States • Full Time