Metacognitive Capabilities of LLMs: An Exploration in Mathematical Problem Solving

Citation

Authors: Aniket Didolkar et al. Year: 2024 Venue: URL:

Abstract

This paper explores whether LLMs possess metacognitive knowledge (knowledge about their own reasoning processes) and whether this can be leveraged to improve performance through skill-based in-context learning.

Summary

The paper develops a prompt-guided procedure that elicits skill labels from the LLM itself, builds a skill exemplar repository from those labels, and selects in-context examples by skill to improve performance.

Key Contributions

Evidence that LLMs have metacognitive knowledge about their skills
Two-stage skill discovery method (fine-grained → coarse clustering)
Skill Exemplar Repository for in-context learning
Cross-model skill transfer (GPT-4 skills improve weaker models)

Core Concepts & Definitions

Metacognitive Knowledge

The learner’s accumulated knowledge about their own cognitive processes and learning-relevant properties of data.

Skill Exemplar Repository

$\text{Repository} = \{(s_{0}, q_{0}^{T}, a_{0}^{T}), (s_{1}, q_{1}^{T}, a_{1}^{T}), \dots, (s_{n}, q_{n}^{T}, a_{n}^{T})\}$, where $s_{i}$ is a skill label and $(q_{i}^{T}, a_{i}^{T})$ is a question-answer pair.
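To make the repository definition concrete, here is a minimal sketch of one way to store the triples and index them by skill. The class and method names (`Exemplar`, `SkillExemplarRepository`, `add`, `lookup`) are assumptions made for these notes, not names from the paper, and the example question is invented for illustration.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class Exemplar:
    skill: str      # skill label s_i
    question: str   # question q_i
    answer: str     # answer a_i


class SkillExemplarRepository:
    """Stores (skill, question, answer) triples, indexed by skill label."""

    def __init__(self):
        self._by_skill = defaultdict(list)

    def add(self, skill, question, answer):
        self._by_skill[skill].append(Exemplar(skill, question, answer))

    def lookup(self, skill, k=1):
        """Return up to k exemplars stored under the given skill label."""
        return self._by_skill.get(skill, [])[:k]


# Hypothetical entry, using the skill label format quoted later in these notes.
repo = SkillExemplarRepository()
repo.add(
    "circle_properties_area_calculation",
    "What is the area of a circle with radius 3?",
    "Area = pi * r^2 = 9 * pi.",
)
```

Indexing by skill keeps lookup cheap, which matters if the repository is later queried once per test question.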

Two-Stage Skill Discovery

Stage 1: LLM assigns fine-grained skill labels (~5000 for MATH dataset)
Stage 2: LLM performs semantic clustering → coarse skill families (~117 for MATH)
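The two stages can be sketched as a small pipeline. This is only an outline under stated assumptions: `label_fn` and `cluster_fn` stand in for the LLM prompts (they are hypothetical callables, not the paper's prompts), and the toy implementations below use keyword matching purely so the example runs without a model.

```python
from typing import Callable, Dict, List


def discover_skills(
    questions: List[str],
    label_fn: Callable[[str], str],
    cluster_fn: Callable[[List[str]], Dict[str, str]],
) -> Dict[str, str]:
    """Map each question to a coarse skill family in two stages."""
    # Stage 1: one fine-grained skill label per question.
    fine = {q: label_fn(q) for q in questions}
    # Stage 2: cluster the distinct fine labels into coarse families.
    unique_labels = sorted(set(fine.values()))
    fine_to_coarse = cluster_fn(unique_labels)
    # Relabel each question with its family; keep the fine label if unmapped.
    return {q: fine_to_coarse.get(label, label) for q, label in fine.items()}


# Toy stand-ins for the LLM calls (assumptions for illustration only).
def toy_label(question: str) -> str:
    return "circle_area" if "circle" in question else "linear_equation_solving"


def toy_cluster(labels: List[str]) -> Dict[str, str]:
    return {l: ("geometry" if "circle" in l else "algebra") for l in labels}


assignments = discover_skills(
    ["Find the area of a circle with radius 2.", "Solve x + 1 = 2."],
    toy_label,
    toy_cluster,
)
```

Passing the two stages in as callables keeps the pipeline testable offline; real LLM calls can be swapped in later without changing the control flow.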

Main Results

Skill-based ICL exemplar selection improves accuracy on GSM8K and MATH
Skills discovered by strong LLMs (GPT-4) improve weaker LLMs
Skill exemplar repository transfers across datasets
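As a rough illustration of skill-based exemplar selection, the sketch below builds a few-shot prompt from repository entries that share the test question's skill. The function name `build_prompt`, the `Q:`/`A:` prompt layout, and the sample entries are assumptions for these notes; the paper's exact prompt format is not reproduced here.

```python
def build_prompt(test_question, test_skill, repository, k=2):
    """Build a few-shot prompt from exemplars matching the test question's skill.

    `repository` is a list of (skill, question, answer) triples.
    """
    # Keep at most k exemplars whose skill label matches the test skill.
    matches = [(q, a) for s, q, a in repository if s == test_skill][:k]
    shots = [f"Q: {q}\nA: {a}" for q, a in matches]
    # The test question goes last, with the answer left open for the model.
    shots.append(f"Q: {test_question}\nA:")
    return "\n\n".join(shots)


# Hypothetical repository contents.
repository = [
    ("geometry", "Area of a circle with radius 1?", "pi"),
    ("algebra", "Solve x + 1 = 2.", "x = 1"),
]
prompt = build_prompt("Area of a circle with radius 2?", "geometry", repository)
```

In this sketch the skill of the test question is given as an input; in practice it would come from asking the model to label the question first.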

Relevance to Project

Medium-High — Practical skill extraction methodology:

Four-word underscore-separated skill format (e.g., “circle_properties_area_calculation”)
Two-stage discovery could populate our $S_{0}$
Metacognitive framing relates to our metaskill concept
Repository structure relevant for our fitness function ground truth
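If we adopt the four-word underscore-separated format for our own labels, a small check like the following could catch malformed ones before they enter $S_{0}$. The lowercase-letters-only rule is an assumption on our side; the notes above only establish "four words, underscore-separated".

```python
import re

# Four lowercase alphabetic words joined by single underscores (assumed convention).
SKILL_LABEL = re.compile(r"[a-z]+(?:_[a-z]+){3}")


def is_valid_skill_label(label: str) -> bool:
    """Return True if the label has exactly four underscore-separated words."""
    return SKILL_LABEL.fullmatch(label) is not None
```

For example, `is_valid_skill_label("circle_properties_area_calculation")` returns `True`, while a three-word label such as `"circle_area_calculation"` is rejected.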

Questions & Notes

Can we use their extraction method to bootstrap our skill ontology?
How do their ~117 skill families map to our algebraic primitives?
Their skill labels are domain-specific (math) — how to generalize?
