Research Engineer, Scalable Interpretability

The Effective Altruism

Opportunities Board

Work on the world's most pressing problems. Browse jobs, fellowships, internships, courses, and more at high-impact organisations.

Get notified

Submit opportunity Send feedback

Get notified about new roles

Research Engineer, Scalable Interpretability

TransluceSan Francisco, CA

San Francisco, CA

1 month ago

AI safety & policy

Full-time

Routes to impact

Direct high impact on an important cause

Skill-building & building career capital

Description

Develop scalable interpretability systems to improve oversight of advanced AI models.

Build evaluations for undesirable model behaviors
Design architectures and training objectives for interpretability assistants
Scale training and inference for frontier models
Conduct research on model activations and behavior prediction

This text was generated by AI. If you notice any inconsistencies, please let us know using this form.

View opportunity

Related opportunities

Data Scientist

Innovations for Poverty ActionKenya / Ghana / Colombia / Peru

Kenya / Ghana / Colombia / Peru

3 weeks ago

Applied AI Data Scientist

AE StudioRemote / US

Remote / US

2 days ago

Applied AI Data Scientist

AE StudioFlorianopolis, Brazil / Remote

Florianopolis, Brazil / Remote

2 days ago

Software Engineer

FutureSearchRemote

Remote

3 days ago

Research Engineer, Evals

White CircleParis, France / London, UK

Paris, France / London, UK

1 week ago

Research Scientist, AI Behaviours

White CircleParis, France / London, UK

Paris, France / London, UK

1 week ago

Researcher, Benchmark Reviews

Epoch AIRemote

Remote

2 weeks ago

Senior Machine Learning Data Processing Developer

LawZeroMontreal, Canada

Montreal, Canada

2 weeks ago

Join 60k subscribers and sign up for the EA Newsletter, a monthly email with the latest ideas and opportunities

View past editions