☆ Yσɠƚԋσʂ ☆@lemmygrad.ml to technology@hexbear.netEnglish · 3 months ago

Tree of Thoughts: Deliberate Problem Solving with Large Language Models

arxiv.org

  • cross-posted to:
  • ai_@lemmy.world
  • technology@lemmy.ml
Language models are increasingly being deployed for general problem solving across a wide range of tasks, but are still confined to token-level, left-to-right decision-making processes during inference. This means they can fall short in tasks that require exploration, strategic lookahead, or where initial decisions play a pivotal role. To surmount these challenges, we introduce a new framework for language model inference, Tree of Thoughts (ToT), which generalizes over the popular Chain of Thought approach to prompting language models, and enables exploration over coherent units of text (thoughts) that serve as intermediate steps toward problem solving. ToT allows LMs to perform deliberate decision making by considering multiple different reasoning paths and self-evaluating choices to decide the next course of action, as well as looking ahead or backtracking when necessary to make global choices. Our experiments show that ToT significantly enhances language models' problem-solving abilities on three novel tasks requiring non-trivial planning or search: Game of 24, Creative Writing, and Mini Crosswords. For instance, in Game of 24, while GPT-4 with chain-of-thought prompting only solved 4% of tasks, our method achieved a success rate of 74%. Code repo with all prompts: https://github.com/princeton-nlp/tree-of-thought-llm.
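To make the propose / self-evaluate / prune loop described in the abstract concrete, here is a minimal sketch on Game of 24. It is not the authors' code, and it calls no language model. The abstract does not say which search strategy is used, so this assumes a breadth-first search that keeps a fixed number of states per level. The names `propose`, `evaluate`, `tot_bfs`, and `beam_width` are hypothetical. In the real framework both the thought generator and the evaluator would be LM prompts; here the generator enumerates arithmetic steps and the evaluator is an exact lookahead standing in for the model's self-evaluation.

```python
from fractions import Fraction
from functools import lru_cache
from itertools import combinations

TARGET = Fraction(24)

def propose(state):
    """Stand-in for the LM thought generator (assumption): each 'thought'
    combines two of the remaining numbers with one arithmetic operation."""
    nums, steps = state
    successors = []
    for i, j in combinations(range(len(nums)), 2):
        a, b = nums[i], nums[j]
        rest = [n for k, n in enumerate(nums) if k not in (i, j)]
        options = [(a + b, f"{a} + {b}"), (a * b, f"{a} * {b}"),
                   (a - b, f"{a} - {b}"), (b - a, f"{b} - {a}")]
        if b != 0:
            options.append((a / b, f"{a} / {b}"))
        if a != 0:
            options.append((b / a, f"{b} / {a}"))
        for value, text in options:
            new_nums = tuple(sorted(rest + [value]))
            successors.append((new_nums, steps + (f"{text} = {value}",)))
    return successors

@lru_cache(maxsize=None)
def reachable(nums):
    """Exact lookahead: can the remaining numbers still reach the target?"""
    if len(nums) == 1:
        return nums[0] == TARGET
    return any(reachable(nxt) for nxt, _ in propose((nums, ())))

def evaluate(state):
    """Stand-in for LM self-evaluation (assumption): 1.0 if the state can
    still reach 24, else 0.0. A real evaluator would be a noisy LM judgment."""
    return 1.0 if reachable(state[0]) else 0.0

def tot_bfs(numbers, beam_width=5):
    """Breadth-first search over thoughts, keeping the best states per level."""
    frontier = [(tuple(sorted(Fraction(n) for n in numbers)), ())]
    for _ in range(len(numbers) - 1):
        candidates = [c for state in frontier for c in propose(state)]
        candidates.sort(key=evaluate, reverse=True)  # best-rated thoughts first
        frontier = candidates[:beam_width]           # prune the rest
    for nums, steps in frontier:
        if nums == (TARGET,):
            return list(steps)
    return None

if __name__ == "__main__":
    print(tot_bfs([4, 9, 10, 13]))
```

Because the stand-in evaluator is exact, this toy version always finds a solution when one exists. With an LM as evaluator the ratings would be approximate, which is where keeping several candidates per level instead of committing to a single chain starts to matter.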