LLM Reasoning Playbook

LLM Reasoning Playbook

A master reference for selecting, engineering, and deploying advanced text-based LLM reasoning architectures.

The Reasoning Framework Playbook

12 core frameworks across 7 axes — taxonomy, choice matrix, boilerplate templates, and a selection flowchart.

Core reference

Worked Stack Examples

Three end-to-end examples of composing frameworks across axes, plus a stack-selection cheatsheet.

Applied

Output Eval Rubrics

How to judge each framework's output: universal dimensions, per-framework signature checks, and a cost/latency budget model.

Evaluation

Decision Log & Template

A per-task record of which stack you chose and why, plus the workflow for keeping the playbook current.

Practice

Anti-Pattern Gallery

Concrete before/after failures mapped to rubric violations. A living gallery you extend from real lessons.

Living

Quick reference — the 7 axes & 12 core frameworks

AxisFrameworks (core)
A · TopologyCoT · Thread of Thought · Tree of Thoughts · Graph of Thoughts
B · Sampling/AggregationSelf-Consistency
C · Grounding/ActionReAct · PAL / Program-of-Thoughts
D · Decomposition/PlanningLeast-to-Most
E · AbstractionStep-Back
F · Self-Evaluation/MemoryMetacognitive / Self-Refine / RaR · Reflexion
G · SteeringDirectional Stimulus Prompting

Tier-2 variants: Skeleton-of-Thought · Universal Self-Consistency · Chain-of-Verification · Plan-and-Solve · Self-Ask · Analogical.

Fast selection (4 gates)

  1. External info / tools / exact computation? → PAL (compute) · ReAct (tools) · CoVe (verify)
  2. Large / messy input context? → Thread of Thought
  3. Explore alternatives or merge sub-solutions?ToT (search) · GoT (merge) · Least-to-Most (ordered)
  4. Dominant secondary need? → Self-Consistency (reliability) · Step-Back (principle) · Reflexion/Self-Refine (retry) · RaR (ambiguity) · DSP (steering) · CoT (default)

Axes are orthogonal — layer one framework per axis for high-value stacks (e.g., ReAct × Self-Consistency × Step-Back).