TERMIUM Plus®
From: Translation Bureau
On social media
Consult the Government of Canada’s terminology data bank.
APPRENTISSAGE RENFORCEMENT PARTIR RETROACTION HUMAINE [1 record]
Record 1 - internal organization data 2025-09-25
Record 1, English
Record 1, Subject field(s)
- Artificial Intelligence
Record 1, Main entry term, English
- reinforcement learning from human feedback
1, record 1, English, reinforcement%20learning%20from%20human%20feedback
correct, noun
Record 1, Abbreviations, English
- RLHF 1, record 1, English, RLHF
correct, noun
Record 1, Synonyms, English
- RL from human feedback 2, record 1, English, RL%20from%20human%20feedback
correct, noun
- RLHF 2, record 1, English, RLHF
correct, noun
- RLHF 2, record 1, English, RLHF
- reinforcement learning with human feedback 3, record 1, English, reinforcement%20learning%20with%20human%20feedback
correct, noun
- RLHF 3, record 1, English, RLHF
correct, noun
- RLHF 3, record 1, English, RLHF
- RL with human feedback 4, record 1, English, RL%20with%20human%20feedback
correct, noun
- RLHF 4, record 1, English, RLHF
correct, noun
- RLHF 4, record 1, English, RLHF
Record 1, Textual support, English
Record number: 1, Textual support number: 1 OBS
Reinforcement learning from human feedback is used to align a pre-trained machine learning model with a specific task or behaviour. For this purpose, it relies on evaluations of the model's output by humans. The results of these evaluations are often provided to the model in the form of rewards and penalties. 5, record 1, English, - reinforcement%20learning%20from%20human%20feedback
Record 1, French
Record 1, Domaine(s)
- Intelligence artificielle
Record 1, Main entry term, French
- apprentissage par renforcement à partir de la rétroaction humaine
1, record 1, French, apprentissage%20par%20renforcement%20%C3%A0%20partir%20de%20la%20r%C3%A9troaction%20humaine
correct, masculine noun
Record 1, Abbreviations, French
Record 1, Synonyms, French
- apprentissage par renforcement avec rétroaction humaine 2, record 1, French, apprentissage%20par%20renforcement%20avec%20r%C3%A9troaction%20humaine
correct, masculine noun
Record 1, Textual support, French
Record number: 1, Textual support number: 1 OBS
L'apprentissage par renforcement à partir de la rétroaction humaine permet d'adapter un modèle d'apprentissage automatique préentraîné à une tâche ou à un comportement déterminé. Pour ce faire, cette technique se fonde sur les évaluations réalisées par des humains quant aux sorties produites par le modèle. Les résultats de ces évaluations sont souvent fournis au modèle sous forme de récompenses et de pénalités. 3, record 1, French, - apprentissage%20par%20renforcement%20%C3%A0%20partir%20de%20la%20r%C3%A9troaction%20humaine
Record 1, Spanish
Record 1, Textual support, Spanish
Copyright notice for the TERMIUM Plus® data bank
© Public Services and Procurement Canada, 2026
TERMIUM Plus®, the Government of Canada's terminology and linguistic data bank
A product of the Translation Bureau
Features
GCtranslate (available on the Government of Canada network only)
Use this artificial intelligence prototype to translate Government of Canada content up to and including Protected B. Available to employees of selected departments and agencies only.
Writing tools
The Language Portal’s writing tools have a new look! Easy to consult, they give you access to a wealth of information that will help you write better in English and French.
Glossaries and vocabularies
Access Translation Bureau glossaries and vocabularies.
- Date Modified:


