ChemGraph Leaderboard

ChemGraph Leaderboard provides a reproducible evaluation of agentic AI frameworks and large language models (LLMs) for computational chemistry and materials science.

Models are evaluated daily on 40 chemistry queries grouped into 12 task categories:

Category          Queries   Description
SMILES Lookup     4         Convert molecule names to SMILES strings
Opt (Name)        4         Geometry optimization from molecule name
Opt (SMILES)      2         Geometry optimization from SMILES
Vib (Name)        2         Vibrational frequency from molecule name
Vib (SMILES)      2         Vibrational frequency from SMILES
Thermo (Name)     4         Thermochemistry from molecule name
Thermo (SMILES)   2         Thermochemistry from SMILES
Dipole (Name)     2         Dipole moment from molecule name
Dipole (SMILES)   2         Dipole moment from SMILES
Energy (Name)     4         Single-point energy from molecule name
Energy (SMILES)   2         Single-point energy from SMILES
Reaction Energy   10        Reaction Gibbs free energy calculation
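
The per-category percentages and the overall average in the results appear to be linked by the query counts in the category table: weighting each category by its number of queries reproduces the reported averages for the rows shown on this page. The sketch below illustrates that relationship. The weighting rule is an inference from the published numbers, not a documented formula, and the names `QUERIES` and `weighted_average` are hypothetical.

```python
# Query counts per category, copied from the category table.
QUERIES = {
    "SMILES Lookup": 4,
    "Opt (Name)": 4,
    "Opt (SMILES)": 2,
    "Vib (Name)": 2,
    "Vib (SMILES)": 2,
    "Thermo (Name)": 4,
    "Thermo (SMILES)": 2,
    "Dipole (Name)": 2,
    "Dipole (SMILES)": 2,
    "Energy (Name)": 4,
    "Energy (SMILES)": 2,
    "Reaction Energy": 10,
}


def weighted_average(category_scores):
    """Combine per-category percentages into one overall percentage.

    Assumption: each category contributes in proportion to its query
    count, so the result equals (correct queries / total queries) * 100.
    """
    total = sum(QUERIES.values())
    correct = sum(category_scores[name] / 100 * n for name, n in QUERIES.items())
    return 100 * correct / total


# Per-category scores of the top-ranked row: 100 everywhere except
# 50 on Vib (SMILES), i.e. one of its two queries judged incorrect.
scores = {name: 100.0 for name in QUERIES}
scores["Vib (SMILES)"] = 50.0

print(weighted_average(scores))  # 97.5
```

With 39 of 40 queries correct, the result matches the 97.5 reported for that row.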

Each model's score reflects its ability to follow structured tool protocols, generate physically meaningful results, and reason across chemistry-specific contexts. A structured judge scores each result from the model's JSON output using binary accuracy (correct/incorrect), with a 5% relative tolerance for numerical values.
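
As a rough illustration of the numerical check, a binary verdict with a 5% relative tolerance could look like the sketch below. This is not the leaderboard's judge: the function name `is_correct`, the choice to measure the tolerance against the reference value, and the handling of a zero reference are all assumptions, and the numbers in the usage lines are made up.

```python
def is_correct(predicted, reference, rel_tol=0.05):
    """Return True when predicted lies within rel_tol of reference.

    Assumption: the tolerance is relative to the reference value.
    Assumption: a zero reference only accepts an exact zero, because a
    relative tolerance is undefined there.
    """
    if reference == 0:
        return predicted == 0
    return abs(predicted - reference) <= rel_tol * abs(reference)


# Hypothetical values, for illustration only.
print(is_correct(1.85, 1.80))  # True: off by about 2.8%
print(is_correct(1.00, 1.10))  # False: off by about 9.1%
```

Each query then contributes either 1 (correct) or 0 (incorrect) to its category's accuracy.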

Models are evaluated under two workflow types:

  • Single-Agent: one agent handles all tool calls and reasoning independently.
  • Multi-Agent: multiple specialised agents collaborate to solve queries.

Use this leaderboard to explore how different models and agents perform across core chemistry tasks, from small-molecule modeling to multi-step reaction workflows.

{
  • "headers": [
    • "T",
    • "Model",
    • "Average ⬆️",
    • "SMILES Lookup",
    • "Opt (Name)",
    • "Opt (SMILES)",
    • "Vib (Name)",
    • "Vib (SMILES)",
    • "Thermo (Name)",
    • "Thermo (SMILES)",
    • "Dipole (Name)",
    • "Dipole (SMILES)",
    • "Energy (Name)",
    • "Energy (SMILES)",
    • "Reaction Energy",
    • "1-Day",
    • "3-Day Avg",
    • "7-Day Avg",
    • "Type",
    • "Architecture",
    • "Precision",
    • "Hub License",
    • "#Params (B)",
    • "Hub ❤️",
    • "Available on the hub",
    • "Model sha"
    ],
  • "data": [
    • [
      • 1,
      • "<a target="_blank" href="https://huggingface.co/anthropic/claude-opus-4.6" style="color: var(--link-text-color); text-decoration: underline;text-decoration-style: dotted;">anthropic/claude-opus-4.6</a>",
      • 97.5,
      • 100,
      • 100,
      • 100,
      • 100,
      • 50,
      • 100,
      • 100,
      • 100,
      • 100,
      • 100,
      • 100,
      • 100,
      • 97.5,
      • 96.25,
      • 96,
      • "",
      • "?",
      • "float16",
      • "?",
      • 0,
      • 0,
      • true,
      • "main"
      ],
    • [
      • 2,
      • "<a target="_blank" href="https://huggingface.co/openai/gpt-4o" style="color: var(--link-text-color); text-decoration: underline;text-decoration-style: dotted;">openai/gpt-4o</a>",
      • 95,
      • 100,
      • 100,
      • 100,
      • 50,
      • 50,
      • 100,
      • 100,
      • 100,
      • 100,
      • 100,
      • 100,
      • 100,
      • 88.75,
      • 86.67,
      • 90.42,
      • "",
      • "?",
      • "float16",
      • "?",
      • 0,
      • 0,
      • true,
      • "main"
      ],
    • [
      • 3,
      • "<a target="_blank" href="https://huggingface.co/openai/gpt-5.1" style="color: var(--link-text-color); text-decoration: underline;text-decoration-style: dotted;">openai/gpt-5.1</a>",
      • 92.5,
      • 100,
      • 100,
      • 100,
      • 50,
      • 50,
      • 100,
      • 100,
      • 100,
      • 50,
      • 100,
      • 100,
      • 100,
      • 92.5,
      • 93.75,
      • 95,
      • "",
      • "?",
      • "float16",
      • "?",
      • 0,
      • 0,
      • true,
      • "main"
      ],
    • [
      • 4,
      • "<a target="_blank" href="https://huggingface.co/openai/gpt-5.2" style="color: var(--link-text-color); text-decoration: underline;text-decoration-style: dotted;">openai/gpt-5.2</a>",
      • 92.5,
      • 100,
      • 100,
      • 100,
      • 100,
      • 50,
      • 100,
      • 100,
      • 0,
      • 100,
      • 100,
      • 100,
      • 100,
      • 92.5,
      • 91.25,
      • 91.5,
      • "",
      • "?",
      • "float16",
      • "?",
      • 0,
      • 0,
      • true,
      • "main"
      ],
    • [
      • 5,
      • "<a target="_blank" href="https://huggingface.co/anthropic/claude-opus-4" style="color: var(--link-text-color); text-decoration: underline;text-decoration-style: dotted;">anthropic/claude-opus-4</a>",
      • 90,
      • 100,
      • 100,
      • 100,
      • 100,
      • 50,
      • 100,
      • 100,
      • 50,
      • 100,
      • 100,
      • 100,
      • 80,
      • 90,
      • 88.75,
      • 89.5,
      • "",
      • "?",
      • "float16",
      • "?",
      • 0,
      • 0,
      • true,
      • "main"
      ],
    • [
      • 6,
      • "<a target="_blank" href="https://huggingface.co/anthropic/claude-opus-4.5" style="color: var(--link-text-color); text-decoration: underline;text-decoration-style: dotted;">anthropic/claude-opus-4.5</a>",
      • 90,
      • 100,
      • 100,
      • 100,
      • 0,
      • 50,
      • 100,
      • 100,
      • 50,
      • 100,
      • 100,
      • 100,
      • 100,
      • 90,
      • 93.75,
      • 92.5,
      • "",
      • "?",
      • "float16",
      • "?",
      • 0,
      • 0,
      • true,
      • "main"
      ],
    • [
      • 7,
      • "<a target="_blank" href="https://huggingface.co/anthropic/claude-opus-4.1" style="color: var(--link-text-color); text-decoration: underline;text-decoration-style: dotted;">anthropic/claude-opus-4.1</a>",
      • 90,
      • 100,
      • 100,
      • 100,
      • 100,
      • 50,
      • 100,
      • 100,
      • 50,
      • 50,
      • 100,
      • 100,
      • 90,
      • 90,
      • 88.75,
      • 88,
      • "",
      • "?",
      • "float16",
      • "?",
      • 0,
      • 0,
      • true,
      • "main"
      ],
    • [
      • 8,
      • "<a target="_blank" href="https://huggingface.co/openai/o3" style="color: var(--link-text-color); text-decoration: underline;text-decoration-style: dotted;">openai/o3</a>",
      • 87.5,
      • 100,
      • 100,
      • 100,
      • 100,
      • 50,
      • 100,
      • 100,
      • 0,
      • 0,
      • 100,
      • 100,
      • 100,
      • 87.5,
      • 87.5,
      • 86,
      • "",
      • "?",
      • "float16",
      • "?",
      • 0,
      • 0,
      • true,
      • "main"
      ],
    • [
      • 9,
      • "<a target="_blank" href="https://huggingface.co/openai/gpt-5.4" style="color: var(--link-text-color); text-decoration: underline;text-decoration-style: dotted;">openai/gpt-5.4</a>",
      • 87.5,
      • 100,
      • 100,
      • 100,
      • 100,
      • 50,
      • 100,
      • 0,
      • 100,
      • 50,
      • 100,
      • 50,
      • 100,
      • 87.5,
      • 81.25,
      • 81,
      • "",
      • "?",
      • "float16",
      • "?",
      • 0,
      • 0,
      • true,
      • "main"
      ],
    • [
      • 10,
      • "<a target="_blank" href="https://huggingface.co/anthropic/claude-4.5-haiku" style="color: var(--link-text-color); text-decoration: underline;text-decoration-style: dotted;">anthropic/claude-4.5-haiku</a>",
      • 82.5,
      • 100,
      • 100,
      • 100,
      • 0,
      • 0,
      • 100,
      • 100,
      • 50,
      • 0,
      • 100,
      • 100,
      • 100,
      • 82.5,
      • 86.25,
      • 86,
      • "",
      • "?",
      • "float16",
      • "?",
      • 0,
      • 0,
      • true,
      • "main"
      ],
    • [
      • 11,
      • "<a target="_blank" href="https://huggingface.co/anthropic/claude-sonnet-4.5" style="color: var(--link-text-color); text-decoration: underline;text-decoration-style: dotted;">anthropic/claude-sonnet-4.5</a>",
      • 82.5,
      • 100,
      • 75,
      • 50,
      • 100,
      • 50,
      • 100,
      • 100,
      • 0,
      • 0,
      • 100,
      • 100,
      • 100,
      • 82.5,
      • 85,
      • 75.5,
      • "",
      • "?",
      • "float16",
      • "?",
      • 0,
      • 0,
      • true,
      • "main"
      ],
    • [
      • 12,
      • "<a target="_blank" href="https://huggingface.co/openai/o4-mini" style="color: var(--link-text-color); text-decoration: underline;text-decoration-style: dotted;">openai/o4-mini</a>",
      • 82.5,
      • 100,
      • 100,
      • 100,
      • 50,
      • 50,
      • 100,
      • 100,
      • 0,
      • 0,
      • 100,
      • 100,
      • 90,
      • 82.5,
      • 81.25,
      • 80,
      • "",
      • "?",
      • "float16",
      • "?",
      • 0,
      • 0,
      • true,
      • "main"
      ],
    • [
      • 13,
      • "<a target="_blank" href="https://huggingface.co/anthropic/claude-sonnet-4" style="color: var(--link-text-color); text-decoration: underline;text-decoration-style: dotted;">anthropic/claude-sonnet-4</a>",
      • 82.5,
      • 100,
      • 75,
      • 0,
      • 100,
      • 50,
      • 100,
      • 100,
      • 50,
      • 0,
      • 100,
      • 100,
      • 100,
      • 82.5,
      • 83.75,
      • 85,
      • "",
      • "?",
      • "float16",
      • "?",
      • 0,
      • 0,
      • true,
      • "main"
      ],
    • [
      • 14,
      • "<a target="_blank" href="https://huggingface.co/openai/gpt-4o-latest" style="color: var(--link-text-color); text-decoration: underline;text-decoration-style: dotted;">openai/gpt-4o-latest</a>",
      • 82.5,
      • 100,
      • 100,
      • 100,
      • 100,
      • 50,
      • 100,
      • 100,
      • 0,
      • 0,
      • 100,
      • 100,
      • 80,
      • 82.5,
      • 85,
      • 83.5,
      • "",
      • "?",
      • "float16",
      • "?",
      • 0,
      • 0,
      • true,
      • "main"
      ],
    • [
      • 15,
      • "<a target="_blank" href="https://huggingface.co/openai/gpt-4.1" style="color: var(--link-text-color); text-decoration: underline;text-decoration-style: dotted;">openai/gpt-4.1</a>",
      • 82.5,
      • 100,
      • 100,
      • 100,
      • 100,
      • 50,
      • 100,
      • 100,
      • 0,
      • 0,
      • 100,
      • 100,
      • 80,
      • 82.5,
      • 75,
      • 74.5,
      • "",
      • "?",
      • "float16",
      • "?",
      • 0,
      • 0,
      • true,
      • "main"
      ],
    • [
      • 16,
      • "<a target="_blank" href="https://huggingface.co/openai/o1" style="color: var(--link-text-color); text-decoration: underline;text-decoration-style: dotted;">openai/o1</a>",
      • 82.5,
      • 100,
      • 100,
      • 100,
      • 100,
      • 0,
      • 100,
      • 100,
      • 0,
      • 50,
      • 100,
      • 100,
      • 80,
      • 82.5,
      • 82.5,
      • 78,
      • "",
      • "?",
      • "float16",
      • "?",
      • 0,
      • 0,
      • true,
      • "main"
      ],
    • [
      • 17,
      • "<a target="_blank" href="https://huggingface.co/openai/gpt-4.1-mini" style="color: var(--link-text-color); text-decoration: underline;text-decoration-style: dotted;">openai/gpt-4.1-mini</a>",
      • 77.5,
      • 100,
      • 100,
      • 100,
      • 100,
      • 50,
      • 100,
      • 100,
      • 0,
      • 0,
      • 100,
      • 100,
      • 60,
      • 73.75,
      • 73.33,
      • 77.5,
      • "",
      • "?",
      • "float16",
      • "?",
      • 0,
      • 0,
      • true,
      • "main"
      ],
    • [
      • 18,
      • "<a target="_blank" href="https://huggingface.co/anthropic/claude-3.5-haiku" style="color: var(--link-text-color); text-decoration: underline;text-decoration-style: dotted;">anthropic/claude-3.5-haiku</a>",
      • 67.5,
      • 100,
      • 25,
      • 0,
      • 50,
      • 0,
      • 100,
      • 100,
      • 0,
      • 50,
      • 100,
      • 100,
      • 80,
      • 67.5,
      • 65,
      • 68,
      • "",
      • "?",
      • "float16",
      • "?",
      • 0,
      • 0,
      • true,
      • "main"
      ],
    • [
      • 19,
      • "<a target="_blank" href="https://huggingface.co/openai/o3-mini" style="color: var(--link-text-color); text-decoration: underline;text-decoration-style: dotted;">openai/o3-mini</a>",
      • 65,
      • 100,
      • 100,
      • 100,
      • 100,
      • 50,
      • 100,
      • 100,
      • 0,
      • 0,
      • 100,
      • 100,
      • 10,
      • 65,
      • 67.5,
      • 68,
      • "",
      • "?",
      • "float16",
      • "?",
      • 0,
      • 0,
      • true,
      • "main"
      ],
    • [
      • 20,
      • "<a target="_blank" href="https://huggingface.co/openai/gpt-4.1-nano" style="color: var(--link-text-color); text-decoration: underline;text-decoration-style: dotted;">openai/gpt-4.1-nano</a>",
      • 12.5,
      • 100,
      • 0,
      • 0,
      • 0,
      • 0,
      • 0,
      • 0,
      • 0,
      • 0,
      • 25,
      • 0,
      • 0,
      • 12.5,
      • 12.5,
      • 12,
      • "",
      • "?",
      • "float16",
      • "?",
      • 0,
      • 0,
      • true,
      • "main"
      ],
    • [
      • 21,
      • "<a target="_blank" href="https://huggingface.co/openai/gpt-5-mini" style="color: var(--link-text-color); text-decoration: underline;text-decoration-style: dotted;">openai/gpt-5-mini</a>",
      • 0,
      • 0,
      • 0,
      • 0,
      • 0,
      • 0,
      • 0,
      • 0,
      • 0,
      • 0,
      • 0,
      • 0,
      • 0,
      • null,
      • null,
      • null,
      • "",
      • "?",
      • "float16",
      • "?",
      • 0,
      • 0,
      • true,
      • "main"
      ],
    • [
      • 22,
      • "<a target="_blank" href="https://huggingface.co/openai/gpt-5-nano" style="color: var(--link-text-color); text-decoration: underline;text-decoration-style: dotted;">openai/gpt-5-nano</a>",
      • 0,
      • 0,
      • 0,
      • 0,
      • 0,
      • 0,
      • 0,
      • 0,
      • 0,
      • 0,
      • 0,
      • 0,
      • 0,
      • null,
      • null,
      • null,
      • "",
      • "?",
      • "float16",
      • "?",
      • 0,
      • 0,
      • true,
      • "main"
      ],
    • [
      • 23,
      • "<a target="_blank" href="https://huggingface.co/openai/gpt-5" style="color: var(--link-text-color); text-decoration: underline;text-decoration-style: dotted;">openai/gpt-5</a>",
      • 0,
      • 0,
      • 0,
      • 0,
      • 0,
      • 0,
      • 0,
      • 0,
      • 0,
      • 0,
      • 0,
      • 0,
      • 0,
      • null,
      • null,
      • null,
      • "",
      • "?",
      • "float16",
      • "?",
      • 0,
      • 0,
      • true,
      • "main"
      ]
    ],
  • "metadata": null
}
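
In the payload above, each "Model" cell wraps the model ID in an HTML anchor, and every row is a positional list aligned with "headers". A minimal sketch of turning one row into a labelled record might look like this. The helper names `strip_tags` and `row_to_record` are hypothetical, and the header list and row are shortened to three columns (with the anchor's style attribute omitted) for brevity.

```python
import re

# Shortened to the first three columns for illustration.
HEADERS = ["T", "Model", "Average"]
ROW = [
    1,
    '<a target="_blank" href="https://huggingface.co/anthropic/claude-opus-4.6">'
    "anthropic/claude-opus-4.6</a>",
    97.5,
]


def strip_tags(cell):
    """Remove HTML tags from string cells; leave other values untouched."""
    if isinstance(cell, str):
        return re.sub(r"<[^>]+>", "", cell)
    return cell


def row_to_record(headers, row):
    """Pair each header with the value at the same position in the row."""
    return {header: strip_tags(value) for header, value in zip(headers, row)}


record = row_to_record(HEADERS, ROW)
print(record["Model"])    # anthropic/claude-opus-4.6
print(record["Average"])  # 97.5
```

The regular expression is enough for the simple anchors shown here; arbitrary HTML would call for a real parser such as the standard library's `html.parser`.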