Posts
Questions
Discussions
Announcements
No announcement yet.
en
Tiếng Việt
English
Viblo
Viblo Code
Viblo CTF
Viblo CV
Viblo Learning
Viblo Partner
Viblo Battle
new
Viblo Interview
new
Sign In/Sign up
Reinforcement learning
Follow
Posts
Series
Questions
Followers
Sort by:
Newest posts
Newest posts
Most bookmarked
Most viewed
Most voted
Long Hoàng
Mar 30th, 2:12 a.m.
5 min read
Tôi không train model. Nhưng tôi bắt đầu kiểm soát được hành vi của nó (Nắng AI v47)
Agent
AgenticAi
LLM
Adversarial machine learning
Reinforcement learning
Dương Xuân Bách
May 23rd, 2025 4:15 p.m.
8 min read
[Papers Notes] RL IN NAME ONLY? ANALYZING THE STRUCTURAL ASSUMPTIONS IN RL POST-TRAINING FOR LLMS
MayFest2025
Reinforcement learning
LLM
Hoàng Minh An
in
Sun* AI Research Team
We're AI Research Team of R&D Lab @Sun Asterisk .Inc
Followed by
1929
people.
Follow
Sun* AI Research Team
May 17th, 2025 10:14 a.m.
24 min read
[Advanced-LLM] Reasoning LLM và Những Điều Thú Vị Mà Có Thể Bạn Đã Biết Phần 2.
MayFest2025
ContentCreator
RLVR
Policy Optimize
Reasoning LLM
Reinforcement learning
Trần Đăng An
Aug 9th, 2024 6:36 a.m.
12 min read
Nhập môn Reinforcement Learning: Tabular Methods.
Machine Leaning
mathematics
Reinforcement learning
Trần Đăng An
Jul 30th, 2024 4:48 p.m.
11 min read
Nhập môn Reinforcement Learning: Ứng dụng ,những điều cần biết và những lý thuyết cơ bản.
AI
deeplearning
Machine Leaning
mathematics
Reinforcement learning
Phuc Phan
Dec 13th, 2023 11:51 a.m.
23 min read
RLHF & DPO: Kỹ thuật mới đơn giản hơn, tăng cường khả năng Fine-tuning cho Large language models
ChatGPT
Reinforcement learning
RLHF
Direct Preference Optimization
trending
Phuc Phan
Apr 16th, 2023 6:19 p.m.
24 min read
Bản chất ChatGPT hoạt động như thế nào?
ChatGPT
NLP
Ai Conversation
Reinforcement learning
PPO
Lộc Đinh
Feb 17th, 2023 8:21 a.m.
12 min read
RLHF và cách ChatGPT hoạt động
KhaiButDauXuan
Deep Learning
Reinforcement learning
ChatGPT
Nguyen Tu Xuan Cong
in
Sun* AI Research Team
We're AI Research Team of R&D Lab @Sun Asterisk .Inc
Followed by
1929
people.
Follow
Sun* AI Research Team
May 31st, 2022 4:57 p.m.
5 min read
Hello world với Reinforcement Learning
MayFest2022
Reconnection
Reinforcement learning
Nguyen Tu Xuan Cong
in
Sun* AI Research Team
We're AI Research Team of R&D Lab @Sun Asterisk .Inc
Followed by
1929
people.
Follow
Sun* AI Research Team
May 31st, 2022 5:34 a.m.
11 min read
Đôi điều cơ bản về học tăng cường
MayFest2022
Reconnection
Reinforcement learning
Phạm Văn Toàn
in
Sun* AI Research Team
We're AI Research Team of R&D Lab @Sun Asterisk .Inc
Followed by
1929
people.
Follow
Sun* AI Research Team
Sep 28th, 2021 2:38 a.m.
27 min read
Một ứng dụng nho nhỏ của giải thuật di truyền trong Reinforcement Learning - Sinh chuỗi tương tự
@GeneticAlgorithm
Reinforcement learning
Long Lại Phi
Jun 11th, 2021 9:48 a.m.
9 min read
Reinforcement Learning: Q-Learning
Reinforcement learning
Machine Learning
Nguyen Viet Anh
in
Sun* AI Research Team
We're AI Research Team of R&D Lab @Sun Asterisk .Inc
Followed by
1929
people.
Follow
Sun* AI Research Team
Oct 17th, 2020 6:20 p.m.
14 min read
Điều gì tạo nên siêu AI cờ vây AlphaGo Zero?
Reinforcement learning
AlphaGo Zero
Monte Carlo Tree Search
AI
Self-play
hosjiu
Sep 22nd, 2019 10:24 a.m.
9 min read
Giới thiệu về Reinforcement Learning (RL)
Reinforcement learning
Nguyen Viet Anh
in
Sun* AI Research Team
We're AI Research Team of R&D Lab @Sun Asterisk .Inc
Followed by
1929
people.
Follow
Sun* AI Research Team
Jul 21st, 2019 5:09 a.m.
13 min read
Trending Jun 6th, 2022 10:27 p.m.
Giới thiệu về học tăng cường và ứng dụng Deep Q-Learning chơi game CartPole
Machine Learning
Reinforcement learning
Reinforcement learning
15
Posts
1
Questions
8
Followers
Let's register a Viblo Account to get more interesting posts.
Login
Register