An Open Source Conversation Response Path Exploration System using Monte Carlo Tree Search

☆ Yσɠƚԋσʂ ☆@lemmygrad.ml · 22 hours ago

Owl [he/him]@hexbear.net · 19 hours ago

Damnit, I saw MCTS and thought it’d be something neat, but then it’s LLMs because of course every piece of tech news is LLMs.

Skye [she/her, they/them]@hexbear.net · 19 hours ago

Making the LLM use an LLM to figure out what to say almost feels like a pretty good tech news shitpost

Skye [she/her, they/them]@hexbear.net · 19 hours ago

overthinking how someone might react to what it could say in multiple branches with growing resource usage

This is it, if they can get it to reliably decide to just not say anything at all in the end then I have been fully replaced

TraschcanOfIdeology [they/them, comrade/them]@hexbear.net · 19 hours ago

Lol same, what I was thinking while reading the features was “wow, they found a way to simulate masking!”

DefinitelyNotAPhone [he/him]@hexbear.net · 18 hours ago

They have finally done it: they’ve figured out a way to make LLMs even heavier computation-wise

☆ Yσɠƚԋσʂ ☆@lemmygrad.ml · 18 hours ago

I mean, LLMs have gotten orders of magnitude more efficient in just the past year, but also, using these types of approaches might make it possible to use much smaller models and iterate on the result.

FrankLaskey@lemmy.ml · 20 hours ago

Interesting. I’m not sophisticated enough to judge this particular implementation, but the concept of generating entire conversation trees to judge the quality of an output intrigues me for sure, and I’d be interested in reading more about it and any research around it. Got any good links for further reading?

☆ Yσɠƚԋσʂ ☆@lemmygrad.ml · edit-2 18 hours ago

I think that’s an interesting approach as well. There are a bunch of research papers on using MCTS with LLMs, a few examples here:

https://arxiv.org/abs/2503.19309

https://arxiv.org/abs/2505.23229

https://arxiv.org/abs/2504.02426

https://arxiv.org/abs/2504.11009

https://arxiv.org/abs/2502.13428
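
For a rough idea of what MCTS over conversation branches looks like, here is a minimal sketch. It is not the code from the linked repo or from any of the papers above. `propose` and `score` are hypothetical stand-ins for the LLM calls (one to generate possible follow-up turns, one to rate how a branch went), stubbed out here with plain functions so the thing runs on its own.

```python
import math
import random


class Node:
    """One turn in a hypothetical conversation tree."""

    def __init__(self, reply, parent=None):
        self.reply = reply
        self.parent = parent
        self.children = []
        self.visits = 0
        self.total = 0.0


def ucb1(node, c=1.4):
    # Unvisited nodes are tried first; otherwise balance average reward
    # against how rarely the node has been explored.
    if node.visits == 0:
        return float("inf")
    explore = math.sqrt(math.log(node.parent.visits) / node.visits)
    return node.total / node.visits + c * explore


def search(candidates, propose, score, iterations=50, depth=2, seed=0):
    """Return the candidate reply whose subtree was visited most often.

    propose(path) -> list of follow-up turns (assumed to be an LLM call)
    score(path)   -> float reward for a branch (assumed to be an LLM judge)
    """
    rng = random.Random(seed)
    root = Node(None)
    root.children = [Node(r, root) for r in candidates]
    for _ in range(iterations):
        node, path = root, []
        # Selection: walk down by UCB1 until reaching a leaf.
        while node.children:
            node = max(node.children, key=ucb1)
            path.append(node.reply)
        # Expansion: grow a leaf that has already been evaluated once.
        if node.visits > 0 and len(path) < depth:
            node.children = [Node(r, node) for r in propose(path)]
            if node.children:
                node = rng.choice(node.children)
                path.append(node.reply)
        # Evaluation: score the branch, then backpropagate to the root.
        reward = score(path)
        while node is not None:
            node.visits += 1
            node.total += reward
            node = node.parent
    return max(root.children, key=lambda n: n.visits).reply


# Stub "models" so the sketch runs without any LLM.
def fake_propose(path):
    return ["user is satisfied", "user is annoyed"]


def fake_score(path):
    return 1.0 if path[0] == "apologize" else 0.0


best = search(["apologize", "deflect"], fake_propose, fake_score)
print(best)
```

Every call to `propose` or `score` would be a model query in a real system, so the query count grows with the number of iterations and the branching factor.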

gay_king_prince_charles [she/her, he/him]@hexbear.net · 19 hours ago

This seems interesting, but 28 queries per response (in the demo shown) is a whole lot of compute


GitHub - MVPandey/CAE: A fully functional LLM chat backend with FastAPI and Async operations, with a built in MCTS conversation analyzer