
Regional hiring • Published • External employer

Anthropic•AI Research

Research Engineer / Research Scientist

Pre-training, Large Language Models, Multimodal AI

Location

Zürich, Switzerland

Work type

Hybrid

Employment

Full Time

Experience

5-10 years

Compensation

Fr280K - Fr680K per year

Posted

1 month ago

Summary and responsibilities

Role overview

Summary

As a Research Engineer/Scientist on the Pre-training team, you will contribute to developing the next generation of large language models with multimodal capabilities. This role involves conducting cutting-edge research, implementing solutions in areas like model architecture and algorithms, and optimizing training infrastructure for safe and steerable AI systems.

About Anthropic

Anthropic’s mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems.

About the team

We are seeking passionate Research Scientists and Engineers to join our growing Pre-training team in Zurich. We are involved in developing the next generation of large language models. The team primarily focuses on multimodal capabilities: giving LLMs the ability to understand and interact with modalities other than text.

In this role, you will work at the intersection of cutting-edge research and practical engineering, contributing to the development of safe, steerable, and trustworthy AI systems.

Responsibilities

Conduct research and implement solutions in areas such as model architecture, algorithms, data processing, and optimizer development
Independently lead small research projects while collaborating with team members on larger initiatives
Design, run, and analyze scientific experiments to advance our understanding of large language models
Optimize and scale our training infrastructure to improve efficiency and reliability
Develop and improve dev tooling to enhance team productivity
Contribute to the entire stack, from low-level optimizations to high-level model design

Qualifications & Experience

Degree (BA required, MS or PhD preferred) in Computer Science, Machine Learning, or a related field
Strong software engineering skills with a proven track record of building complex systems
Expertise in Python and deep learning frameworks
Experience with high-performance, large-scale ML systems, particularly in the context of language modeling
Familiarity with ML accelerators, Kubernetes, and large-scale data processing
Strong problem-solving skills and a results-oriented mindset
Excellent communication skills and ability to work in a collaborative environment

You'll thrive in this role if you

Have significant software engineering experience
Are able to balance research goals with practical engineering constraints
Are happy to take on tasks outside your job description to support the team
Enjoy pair programming and collaborative work
Are eager to learn more about machine learning research
Are enthusiastic to work at an organization that functions as a single, cohesive team pursuing large-scale AI research projects
Have ambitious goals for AI safety and general progress in the next few years, and you’re excited to create the best outcomes over the long term

Sample Projects

Optimizing the throughput of novel attention mechanisms
Proposing Transformer variants, and experimentally comparing their performance
Preparing large-scale datasets for model consumption
Scaling distributed training jobs to thousands of accelerators
Designing fault tolerance strategies for training infrastructure
Creating interactive visualizations of model internals, such as attention patterns

If you're excited about pushing the boundaries of AI while prioritizing safety and ethics, we want to hear from you!

Logistics

Minimum education: Bachelor’s degree or an equivalent combination of education, training, and/or experience
Required field of study: A field relevant to the role as demonstrated through coursework, training, or professional experience
Minimum years of experience: Years of experience required will correlate with the internal job level requirements for the position
Location-based hybrid policy: Currently, we expect all staff to be in one of our offices at least 25% of the time. However, some roles may require more time in our offices.
Visa sponsorship: We do sponsor visas! However, we aren't able to successfully sponsor visas for every role and every candidate. But if we make you an offer, we will make every reasonable effort to get you a visa, and we retain an immigration lawyer to help with this.

How we're different

We believe that the highest-impact AI research will be big science. At Anthropic we work as a single cohesive team on just a few large-scale research efforts. And we value impact — advancing our long-term goals of steerable, trustworthy AI — rather than work on smaller and more specific puzzles. We view AI research as an empirical science, which has as much in common with physics and biology as with traditional efforts in computer science. We're an extremely collaborative group, and we host frequent research discussions to ensure that we are pursuing the highest-impact work at any given time. As such, we greatly value communication skills.

The easiest way to understand our research directions is to read our recent research. This research continues many of the directions our team worked on prior to Anthropic, including: GPT-3, Circuit-Based Interpretability, Multimodal Neurons, Scaling Laws, AI & Compute, Concrete Problems in AI Safety, and Learning from Human Preferences.

Come work with us!

Anthropic is a public benefit corporation headquartered in San Francisco. We offer competitive compensation and benefits, optional equity donation matching, generous vacation and parental leave, flexible working hours, and a lovely office space in which to collaborate with colleagues.

Guidance on Candidates' AI Usage: Learn about our policy for using AI in our application process.

Updated 1 month ago

Candidate fit

Skills and qualifications

Additional skills

Software engineering • 1+ yrs

Python • 1+ yrs

Deep learning frameworks • 1+ yrs

Large-scale ML systems • 1+ yrs

Language modeling • 1+ yrs

ML Accelerators • 1+ yrs

Kubernetes • 1+ yrs

Problem-solving • 1+ yrs

Experience

5-10 years

How this role is positioned

Role classification

Job domains

Software Engineering

Industries

Technology & IT

Employment

Full Time

Contract duration

Permanent

Hiring type

Direct

Global hiring

Location specific

Offer details

Compensation and benefits

Compensation

Fr280K - Fr680K per year

Visibility: Shared on listing

Currency: CHF

Period: Yearly

Benefits and perks

Paid Parental Leave

Flexible Working Hours

Visa Sponsorship

Location, schedule, and role shape

Work setup

Work conditions

Primary location: Zürich, Switzerland

Work type: Hybrid

Global hiring: No

Bandwidth profile

People: High • 8/10

Physical: Low • 2/10

Cognitive: High • 9/10

Execution: High • 8/10

Creativity: High • 9/10

Uncertainty: High • 8/10

Communication: High • 8/10

Context on the employer

Company snapshot

Company

Anthropic

Team size

Growing team

Location

Zürich, Switzerland

