Large Language Models
Sep 19, 2019

GPT-2 Model Enhanced with Human Feedback for Improved Task Performance

AI Summary

The 774M-parameter GPT-2 language model has been fine-tuned using human feedback on several tasks so that its outputs better match the preferences of human labelers. On summarization, the labelers tended to prefer sentences copied from the input, so the model learned to copy. The result shows how directly labeler preferences shape what a model learns when machines communicate with humans.

  • The 774M-parameter GPT-2 language model was fine-tuned with human feedback to improve performance on several tasks.
  • Summarization required 60,000 human labels, while simpler tasks that continue text in various styles required only 5,000.
  • The work aims to bring safety techniques closer to the general task of machines communicating with humans, which is seen as important for extracting information about human values.