Reinforcement learning
Dương Xuân Bách
May 23rd, 2025 4:15 p.m.
8 min read
[Papers Notes] RL IN NAME ONLY? ANALYZING THE STRUCTURAL ASSUMPTIONS IN RL POST-TRAINING FOR LLMS
MayFest2025
Reinforcement learning
LLM
Hoàng Minh An
in
Sun* AI Research Team
We're the AI Research Team of the R&D Lab @ Sun Asterisk Inc.
Followed by 1904 people.
May 17th, 2025 10:14 a.m.
24 min read
[Advanced-LLM] Reasoning LLMs and the Interesting Things You May Already Know, Part 2
MayFest2025
ContentCreator
RLVR
Policy Optimize
Reasoning LLM
Reinforcement learning
Trần Đăng An
Aug 9th, 2024 6:36 a.m.
12 min read
Introduction to Reinforcement Learning: Tabular Methods
Machine Learning
mathematics
Reinforcement learning
Trần Đăng An
Jul 30th, 2024 4:48 p.m.
11 min read
Introduction to Reinforcement Learning: Applications, What You Need to Know, and Basic Theory
AI
deeplearning
Machine Learning
mathematics
Reinforcement learning
Phuc Phan
Dec 13th, 2023 11:51 a.m.
23 min read
RLHF & DPO: A Simpler New Technique That Improves Fine-tuning for Large Language Models
ChatGPT
Reinforcement learning
RLHF
Direct Preference Optimization
trending
Phuc Phan
Apr 16th, 2023 6:19 p.m.
24 min read
How Does ChatGPT Fundamentally Work?
ChatGPT
NLP
Ai Conversation
Reinforcement learning
PPO
Lộc Đinh
Feb 17th, 2023 8:21 a.m.
12 min read
RLHF and How ChatGPT Works
KhaiButDauXuan
Deep Learning
Reinforcement learning
ChatGPT
Nguyen Tu Xuan Cong
in
Sun* AI Research Team
May 31st, 2022 4:57 p.m.
5 min read
Hello World with Reinforcement Learning
MayFest2022
Reconnection
Reinforcement learning
Nguyen Tu Xuan Cong
in
Sun* AI Research Team
May 31st, 2022 5:34 a.m.
11 min read
A Few Basics of Reinforcement Learning
MayFest2022
Reconnection
Reinforcement learning
Phạm Văn Toàn
in
Sun* AI Research Team
Sep 28th, 2021 2:38 a.m.
27 min read
A Small Application of Genetic Algorithms in Reinforcement Learning - Generating Similar Strings
@GeneticAlgorithm
Reinforcement learning
Long Lại Phi
Jun 11th, 2021 9:48 a.m.
9 min read
Reinforcement Learning: Q-Learning
Reinforcement learning
Machine Learning
Nguyen Viet Anh
in
Sun* AI Research Team
Oct 17th, 2020 6:20 p.m.
14 min read
What Makes AlphaGo Zero a Go-Playing Super AI?
Reinforcement learning
AlphaGo Zero
Monte Carlo Tree Search
AI
Self-play
hosjiu
Sep 22nd, 2019 10:24 a.m.
9 min read
Introduction to Reinforcement Learning (RL)
Reinforcement learning
Nguyen Viet Anh
in
Sun* AI Research Team
Jul 21st, 2019 5:09 a.m.
13 min read
Trending Jun 6th, 2022 10:27 p.m.
Introduction to Reinforcement Learning and Applying Deep Q-Learning to Play CartPole
Machine Learning
Reinforcement learning
Reinforcement learning: 14 posts, 1 question, 6 followers