Posts
Questions
Discussions
Announcements
No announcement yet.
en
Tiếng Việt
English
Viblo
Viblo Code
Viblo CTF
Viblo CV
Viblo Learning
Viblo Partner
Viblo Battle
new
Viblo Interview
new
Sign In/Sign up
RLHF
Follow
Posts
Series
Questions
Followers
Sort by:
Newest posts
Newest posts
Most bookmarked
Most viewed
Most voted
Phuc Phan
Dec 13th, 2023 11:51 a.m.
23 min read
ChatGPT series 4: RLHF & DPO: Kỹ thuật mới đơn giản hơn, tăng cường khả năng Fine-tuning cho Large language models
ChatGPT
Reinforcement learning
RLHF
Direct Preference Optimization
trending
RLHF
1
Posts
0
Questions
0
Followers
Let's register a Viblo Account to get more interesting posts.
Login
Register