Quality Assurance for autonomous AI agents - AI Trainer

Mindrift · Barcelona

Nivel de experiencia---
Tipo de contratoConsultor
PublicadaHace 11h

At Mindrift, innovation meets opportunity. We believe in using the power of collective human intelligence to ethically shape the future of AI.

The Mindrift platform, launched and powered by Toloka, connects domain experts with cutting-edge AI projects from innovative tech clients. Our mission is to unlock the potential of GenAI by tapping into real-world expertise from across the globe.

Who we're looking for:

We're looking for curious and intellectually proactive contributors, the kind of person who double-checks assumptions and plays devil's advocate.

Are you comfortable with ambiguity and complexity? Does an async, remote, flexible opportunity sound exciting? Would you like to learn how modern AI systems are tested and evaluated?

This is a flexible, project-based opportunity well-suited for:

Analysts, researchers, or consultants with strong critical thinking skills
Students (senior undergrads / grad students) looking for an intellectually interesting gig
People open to a part-time and non-permanent opportunity

About the project:

We're on the hunt for QAs for autonomous AI agents for a new project focused on validating and improving complex task structures, policy logic, and agent evaluation frameworks. Throughout the project, you'll have to balance quality assurance, research, and logical problem-solving. This project opportunity is ideal for people who enjoy looking at systems holistically and thinking through scenarios, implications, and edge cases.

You do not need a coding background, but you must be curious, intellectually rigorous, and capable of evaluating the soundness and consistency of complex setups. If you've ever excelled in things like consulting, CHGK, Olympiads, case solving, or systems thinking — you might be a great fit.

What you'll be doing:

Reviewing evaluation tasks and scenarios for logic, completeness, and realism
Identifying inconsistencies, missing assumptions, or unclear decision points
Helping define clear expected behaviors (gold standards) for AI agents
Annotating cause-effect relationships, reasoning paths, and plausible alternatives
Thinking through complex systems and policies as a human would to ensure agents are tested properly
Working closely with QA, writers, or developers to suggest refinements or edge case coverage

How to get started:

Apply to this post, qualify, and get the chance to contribute to a project aligned with your skills, on your own schedule. Shape the future of AI while building tools that benefit everyone.

Requirements

Excellent analytical thinking: Can reason about complex systems, scenarios, and logical implications
Strong attention to detail: Can spot contradictions, ambiguities, and vague requirements
Familiarity with structured data formats: Can read, not necessarily write JSON/YAML
Can assess scenarios holistically: What's missing, what's unrealistic, what might break?
Good communication and clear writing (in English) to document your findings.

We also value applicants who have:

Experience with policy evaluation, logic puzzles, case studies, or structured scenario design
Background in consulting, academia, olympiads (e.g. logic/math/informatics), or research
Exposure to LLMs, prompt engineering, or AI-generated content
Familiarity with QA or test-case thinking (edge cases, failure modes, "what could go wrong")
Some understanding of how scoring or evaluation works in agent testing (precision, coverage, etc.)

Benefits

Get paid for your expertise, with rates that can go up to $29/hour depending on your skills, experience, and project needs
Take part in a flexible, remote, freelance project that fits around your primary professional or academic commitments
Participate in an advanced AI project and gain valuable experience to enhance your portfolio
Influence how future AI models understand and communicate in your field of expertise

¡No te pierdas nada!

Únete a la comunidad de wijobs y recibe por email las mejores ofertas de empleo

Nunca compartiremos tu email con nadie y no te vamos a enviar spam

Suscríbete Ahora

Últimas ofertas de empleo de Derecho y Legal en Barcelona

Gerocultor/a Residencia geriátrica Barcelona

Nueva

Accent Social

Barcelona, ES

ID Oferta: 60146 Ubicación: Barcelona ¡Únete a Accent Social y marca la diferencia! Accent Social es una empresa catalana...

Estudiante en prácticas dpto. Digital - BARCELONA

Nueva

APPLE TREE

Barcelona, ES

¿Quieres comenzar tu experiencia en el mundo laboral en la agencia de comunicación más innovadora y creativa? Te...

. Office PowerPoint

Legal Counsel

Nueva

Family Office

We are seeking a highly motivated and versatile Legal Counsel to join the dedicated team of a prominent Ultra High Net...

. Office Teletrabajo

Head of Alliance - Latam

Nueva

Talent Search People

¿Dónde trabajarás? Trabajarás en una destacada compañía farmacéutica italiana con presencia global y una sólida...

. Teletrabajo

Visual Creative Lead

Nueva

FEEDBACK MARKETING RE-PRODUCTIONS

Barcelona, ES

En Feedback ayudamos a grandes marcas como Caixabank, FC Barcelona, Brico Depôt, Grupo VW o Pepsico a crecer con un...

Undergraduate Student - Computational Earth Sciences (R0)

Nueva

Barcelona Supercomputing Center

Barcelona, ES

Job Reference 584_25_ES_CES_R0 Position Undergraduate Student - Computational Earth Sciences (R0) Closing Date Friday, 17...

. Python LESS

Remote Pilot, Barcelona

Nueva

Skyports Drone Services

Job Title: Remote Pilot - English Speaking Location: Barcelona Department: Drone Services Type: Permanent, Full time...

. Teletrabajo

Assurance | Associate / Senior Associate GRC(Governance, Risk, Compliance) IT Tools

22 sept.

PwC España

Barcelona, ES

Job Description & Summary PwC es una firma líder de servicios profesionales en España y a nivel mundial, referente en el...

. Power BI

Ver más ofertas

Tipo	Nombre	Finalidad	Duración
Sesión	ASP.NET_SessionId	Administra la sesión del usuario en el sitio web	Durante la sesión del usuario
Sesión	wj_uuid	Identifca al usuario ente distintas sesiones	1 año
Anti falsificación	.AspNetCore.Antiforgery.*	Proporciona protección contra ataques de falsificación de solicitudes entre sitios	Durante la sesión del usuario
Autentificación	.AspNetCore.Cookies	Almacena datos encriptados del usuario que se requieren para acceder o mostrar datos en el sitio	Durante la sesión del usuario
RGPD	.AspNet.Consent wj_con_pe wj_con_ad wj_con_an	Almacenan información relativa a las preferencias del usuario sobre el Reglamento General de Protección de Datos o RGPD	1 año

Tipo	Nombre	Finalidad	Duración
Idioma	.AspNetCore.Culture	Almacena información relativa a tu preferencia de idioma	1 año
Búsqueda	wj_loc wj_search wj_tags wj_tags_loc	Almacena información para recordar tus preferencias de búsqueda	1 año
Favoritos	wj_bookmarks wj_likes	Almacena información relativa a tu contenido favorito	1 año
Alertas por email	wj_e_sub	Indica si el usuario esta o no suscrito a las alerta por email	1 año
Alertas por email	wj_e_sub_v	Indica si el usuario ha verificado o no su suscripción por email	1 año
Alertas por email	wj_e_sub_a	Indica si el usuario tiene o no activas las alertas por email	1 año
Alertas con OneSignal	__cfduid	Puedes conocer cómo OneSignal usa la información de sitios o aplicaciones que usan sus servicios visitando su sitio web	1 mes
Sesión	wj_tv	Indica si el usuario es recurrente	1 año

Tipo	Nombre	Finalidad y duración
Google Analytics	_ga _gat _gid AMP_TOKEN _gac_* _lc.visitor_id.*	Puedes conocer cómo Google usa la información de sitios o aplicaciones que usan sus servicios visitando su sitio web
Hotjar	_hjClosedSurveyInvites _hjDonePolls _hjMinimizedPolls _hjShownFeedbackMessage _hjid _hjRecordingLastActivity _hjTLDTest _hjUserAttributesHash _hjLocalStorageTest _hjIncludedInPageviewSample _hjIncludedInSessionSample _hjAbsoluteSessionInProgress	Puedes conocer cómo Hotjat usa la información de sitios o aplicaciones que usan sus servicios visitando su sitio web

Quality Assurance for autonomous AI agents - AI Trainer

Mindrift · Barcelona

¡No te pierdas nada!

Últimas ofertas de empleo de Derecho y Legal en Barcelona

Gerocultor/a Residencia geriátrica Barcelona

Accent Social

Estudiante en prácticas dpto. Digital - BARCELONA

APPLE TREE

Legal Counsel

Family Office

Head of Alliance - Latam

Talent Search People

Visual Creative Lead

FEEDBACK MARKETING RE-PRODUCTIONS

Undergraduate Student - Computational Earth Sciences (R0)

Barcelona Supercomputing Center

Remote Pilot, Barcelona

Skyports Drone Services

Assurance | Associate / Senior Associate GRC(Governance, Risk, Compliance) IT Tools

PwC España

Financial Advisor - Audit

LHH

PSYCHOPEDAGOGUE

LCI Education

¡No te pierdas nada!

Top Zonas

Top Empleos