The Lemmy Club
  • Communities
  • Create Post
  • Create Community
  • heart
    Support Lemmy
  • search
    Search
  • Login
  • Sign Up
howrar@lemmy.caM to Reinforcement Learning@lemmy.caEnglish · 3 months ago

A Little Bit of Reinforcement Learning from Human Feedback -- Nathan Lambert

rlhfbook.com

external-link
message-square
0
link
fedilink
1
external-link

A Little Bit of Reinforcement Learning from Human Feedback -- Nathan Lambert

rlhfbook.com

howrar@lemmy.caM to Reinforcement Learning@lemmy.caEnglish · 3 months ago
message-square
0
link
fedilink

https://bsky.app/profile/natolambert.bsky.social/post/3lh5jih226k2k

Anyone interested in learning about RLHF? This text isn’t complete yet, but looks to be a pretty useful resource as is already.

alert-triangle
You must log in or register to comment.

Reinforcement Learning@lemmy.ca

reinforcement_learning@lemmy.ca

Subscribe from Remote Instance

Create a post
You are not logged in. However you can subscribe from another Fediverse account, for example Lemmy or Mastodon. To do this, paste the following into the search field of your instance: [email protected]

A community dedicated to discussions on reinforcement learning, a subdiscipline of machine learning that tackles sequential decision making problems.

Visibility: Public
globe

This community can be federated to other instances and be posted/commented in by their users.

  • 1 user / day
  • 1 user / week
  • 1 user / month
  • 2 users / 6 months
  • 1 local subscriber
  • 53 subscribers
  • 8 Posts
  • 0 Comments
  • Modlog
  • mods:
  • howrar@lemmy.ca
  • BE: 0.19.11
  • Modlog
  • Legal
  • Instances
  • Docs
  • Code
  • join-lemmy.org