Viblo
  • Posts
  • Questions
  • Discussions
Announcements
No announcement yet.
All Announcements
en
  • Tiếng Việt
  • English
  • Viblo
  • Viblo Code
  • Viblo CTF
  • Viblo CV
  • Viblo Learning
  • Viblo Partner
  • Viblo Battle
  • new
    Viblo Interview
new
Direct Preference Optimization

Direct Preference Optimization

  • Posts
  • Series
  • Questions
  • Followers
Sort by: Newest posts
  • Newest posts
  • Most bookmarked
  • Most viewed
  • Most voted
Avatar
Phuc Phan
thg 12 13, 2023 11:51 SA 23 min read

RLHF & DPO: Kỹ thuật mới đơn giản hơn, tăng cường khả năng Fine-tuning cho Large language models

ChatGPT Reinforcement learning RLHF Direct Preference Optimization trending
3.2K 4 0
6

Direct Preference Optimization


1Posts
0Questions
0Followers

Resources

  • Posts
  • Organizations
  • Questions
  • Tags
  • Videos
  • Authors
  • Discussions
  • Recommend System
  • Tools
  • Machine Learning
  • System Status

Services

  • Viblo Viblo
  • Viblo Code Viblo Code
  • Viblo CTF Viblo CTF
  • Viblo CV Viblo CV
  • Viblo Learning Viblo Learning
  • Viblo Partner Viblo Partner
  • Viblo Battle Viblo Battle
  • Viblo Interview Viblo Interview

Mobile App

Get it on Google Play Download on the App Store
QR code

Links

  • Atom Icon

© 2026 Viblo. All rights reserved.

  • About Us
  • Feedback
  • Help
  • FAQs
  • RSS
  • Terms
  • DMCA.com Protection Status
Viblo
Let's register a Viblo Account to get more interesting posts.
Register