ChemGraph Leaderboard
ChemGraph Leaderboard provides a reproducible evaluation of agentic AI frameworks and large language models (LLMs) for computational chemistry and materials science.
Models are evaluated daily on 40 chemistry queries grouped into 12 task categories:
| Category | Queries | Description |
|---|---|---|
| SMILES Lookup | 4 | Convert molecule names to SMILES strings |
| Opt (Name) | 4 | Geometry optimization from molecule name |
| Opt (SMILES) | 2 | Geometry optimization from SMILES |
| Vib (Name) | 2 | Vibrational frequency from molecule name |
| Vib (SMILES) | 2 | Vibrational frequency from SMILES |
| Thermo (Name) | 4 | Thermochemistry from molecule name |
| Thermo (SMILES) | 2 | Thermochemistry from SMILES |
| Dipole (Name) | 2 | Dipole moment from molecule name |
| Dipole (SMILES) | 2 | Dipole moment from SMILES |
| Energy (Name) | 4 | Single-point energy from molecule name |
| Energy (SMILES) | 2 | Single-point energy from SMILES |
| Reaction Energy | 10 | Reaction Gibbs free energy calculation |
Each model's score reflects its ability to follow structured tool protocols, generate physically meaningful results, and reason across chemistry-specific contexts. Results are scored by a structured judge via JSON output for evaluation with binary accuracy (correct/incorrect) and 5% relative tolerance for numerical values.
Use this leaderboard to explore how different models and agents perform across core chemistry tasks, from small-molecule modeling to multi-step reaction workflows.
{
- "headers": [
- "T",
- "Model",
- "Average โฌ๏ธ",
- "SMILES Lookup",
- "Opt (Name)",
- "Opt (SMILES)",
- "Vib (Name)",
- "Vib (SMILES)",
- "Thermo (Name)",
- "Thermo (SMILES)",
- "Dipole (Name)",
- "Dipole (SMILES)",
- "Energy (Name)",
- "Energy (SMILES)",
- "Reaction Energy",
- "1-Day",
- "3-Day Avg",
- "7-Day Avg",
- "Type",
- "Architecture",
- "Precision",
- "Hub License",
- "#Params (B)",
- "Hub โค๏ธ",
- "Available on the hub",
- "Model sha"
- "data": [
- [
- 1,
- "<a target="_blank" href="https://huggingface.co/anthropic/claude-opus-4.5" style="color: var(--link-text-color); text-decoration: underline;text-decoration-style: dotted;">anthropic/claude-opus-4.5</a>",
- 97.5,
- 100,
- 100,
- 100,
- 100,
- 50,
- 100,
- 100,
- 100,
- 100,
- 100,
- 100,
- 100,
- null,
- 96.25,
- 96.25,
- "",
- "?",
- "float16",
- "?",
- 0,
- 0,
- true,
- "main"
- [
- 2,
- "<a target="_blank" href="https://huggingface.co/anthropic/claude-opus-4.6" style="color: var(--link-text-color); text-decoration: underline;text-decoration-style: dotted;">anthropic/claude-opus-4.6</a>",
- 97.5,
- 100,
- 100,
- 100,
- 100,
- 50,
- 100,
- 100,
- 100,
- 100,
- 100,
- 100,
- 100,
- null,
- 97.5,
- 97.5,
- "",
- "?",
- "float16",
- "?",
- 0,
- 0,
- true,
- "main"
- [
- 3,
- "<a target="_blank" href="https://huggingface.co/anthropic/claude-4.5-haiku" style="color: var(--link-text-color); text-decoration: underline;text-decoration-style: dotted;">anthropic/claude-4.5-haiku</a>",
- 92.5,
- 100,
- 100,
- 100,
- 100,
- 0,
- 100,
- 100,
- 50,
- 100,
- 100,
- 100,
- 100,
- null,
- 88.75,
- 88.75,
- "",
- "?",
- "float16",
- "?",
- 0,
- 0,
- true,
- "main"
- [
- 4,
- "<a target="_blank" href="https://huggingface.co/openai/o3" style="color: var(--link-text-color); text-decoration: underline;text-decoration-style: dotted;">openai/o3</a>",
- 90,
- 100,
- 100,
- 100,
- 100,
- 50,
- 100,
- 100,
- 50,
- 0,
- 100,
- 100,
- 100,
- null,
- 85,
- 85,
- "",
- "?",
- "float16",
- "?",
- 0,
- 0,
- true,
- "main"
- [
- 5,
- "<a target="_blank" href="https://huggingface.co/anthropic/claude-opus-4" style="color: var(--link-text-color); text-decoration: underline;text-decoration-style: dotted;">anthropic/claude-opus-4</a>",
- 87.5,
- 100,
- 100,
- 100,
- 50,
- 50,
- 100,
- 100,
- 50,
- 100,
- 100,
- 100,
- 80,
- null,
- 90,
- 90,
- "",
- "?",
- "float16",
- "?",
- 0,
- 0,
- true,
- "main"
- [
- 6,
- "<a target="_blank" href="https://huggingface.co/openai/gpt-4o-latest" style="color: var(--link-text-color); text-decoration: underline;text-decoration-style: dotted;">openai/gpt-4o-latest</a>",
- 87.5,
- 100,
- 100,
- 100,
- 100,
- 50,
- 100,
- 100,
- 0,
- 0,
- 100,
- 100,
- 100,
- null,
- 86.25,
- 86.25,
- "",
- "?",
- "float16",
- "?",
- 0,
- 0,
- true,
- "main"
- [
- 7,
- "<a target="_blank" href="https://huggingface.co/anthropic/claude-opus-4.1" style="color: var(--link-text-color); text-decoration: underline;text-decoration-style: dotted;">anthropic/claude-opus-4.1</a>",
- 87.5,
- 100,
- 100,
- 100,
- 100,
- 50,
- 100,
- 100,
- 100,
- 100,
- 100,
- 100,
- 60,
- null,
- 88.75,
- 88.75,
- "",
- "?",
- "float16",
- "?",
- 0,
- 0,
- true,
- "main"
- [
- 8,
- "<a target="_blank" href="https://huggingface.co/openai/gpt-5.1" style="color: var(--link-text-color); text-decoration: underline;text-decoration-style: dotted;">openai/gpt-5.1</a>",
- 87.5,
- 100,
- 100,
- 100,
- 100,
- 50,
- 75,
- 100,
- 100,
- 50,
- 75,
- 100,
- 90,
- null,
- 92.5,
- 92.5,
- "",
- "?",
- "float16",
- "?",
- 0,
- 0,
- true,
- "main"
- [
- 9,
- "<a target="_blank" href="https://huggingface.co/openai/gpt-4o" style="color: var(--link-text-color); text-decoration: underline;text-decoration-style: dotted;">openai/gpt-4o</a>",
- 85,
- 100,
- 75,
- 100,
- 50,
- 50,
- 100,
- 100,
- 100,
- 50,
- 75,
- 100,
- 90,
- null,
- 90,
- 90,
- "",
- "?",
- "float16",
- "?",
- 0,
- 0,
- true,
- "main"
- [
- 10,
- "<a target="_blank" href="https://huggingface.co/anthropic/claude-sonnet-4.5" style="color: var(--link-text-color); text-decoration: underline;text-decoration-style: dotted;">anthropic/claude-sonnet-4.5</a>",
- 82.5,
- 100,
- 100,
- 0,
- 100,
- 50,
- 100,
- 100,
- 0,
- 0,
- 100,
- 100,
- 100,
- null,
- 86.25,
- 86.25,
- "",
- "?",
- "float16",
- "?",
- 0,
- 0,
- true,
- "main"
- [
- 11,
- "<a target="_blank" href="https://huggingface.co/anthropic/claude-sonnet-4" style="color: var(--link-text-color); text-decoration: underline;text-decoration-style: dotted;">anthropic/claude-sonnet-4</a>",
- 77.5,
- 100,
- 75,
- 0,
- 50,
- 50,
- 100,
- 100,
- 0,
- 0,
- 100,
- 100,
- 100,
- null,
- 82.5,
- 82.5,
- "",
- "?",
- "float16",
- "?",
- 0,
- 0,
- true,
- "main"
- [
- 12,
- "<a target="_blank" href="https://huggingface.co/openai/gpt-4.1" style="color: var(--link-text-color); text-decoration: underline;text-decoration-style: dotted;">openai/gpt-4.1</a>",
- 72.5,
- 100,
- 100,
- 100,
- 100,
- 50,
- 75,
- 100,
- 0,
- 0,
- 100,
- 100,
- 50,
- null,
- 72.5,
- 72.5,
- "",
- "?",
- "float16",
- "?",
- 0,
- 0,
- true,
- "main"
- [
- 13,
- "<a target="_blank" href="https://huggingface.co/openai/gpt-5.4" style="color: var(--link-text-color); text-decoration: underline;text-decoration-style: dotted;">openai/gpt-5.4</a>",
- 67.5,
- 100,
- 100,
- 100,
- 50,
- 0,
- 100,
- 0,
- 50,
- 0,
- 100,
- 0,
- 70,
- null,
- 75,
- 75,
- "",
- "?",
- "float16",
- "?",
- 0,
- 0,
- true,
- "main"
- [
- 14,
- "<a target="_blank" href="https://huggingface.co/openai/o3-mini" style="color: var(--link-text-color); text-decoration: underline;text-decoration-style: dotted;">openai/o3-mini</a>",
- 65,
- 100,
- 100,
- 100,
- 100,
- 50,
- 100,
- 100,
- 0,
- 0,
- 100,
- 100,
- 10,
- null,
- 68.75,
- 68.75,
- "",
- "?",
- "float16",
- "?",
- 0,
- 0,
- true,
- "main"
- [
- 15,
- "<a target="_blank" href="https://huggingface.co/openai/gpt-5.2" style="color: var(--link-text-color); text-decoration: underline;text-decoration-style: dotted;">openai/gpt-5.2</a>",
- 65,
- 100,
- 75,
- 50,
- 100,
- 50,
- 75,
- 100,
- 50,
- 0,
- 100,
- 0,
- 50,
- null,
- 81.25,
- 81.25,
- "",
- "?",
- "float16",
- "?",
- 0,
- 0,
- true,
- "main"
- [
- 16,
- "<a target="_blank" href="https://huggingface.co/openai/o1" style="color: var(--link-text-color); text-decoration: underline;text-decoration-style: dotted;">openai/o1</a>",
- 65,
- 100,
- 100,
- 100,
- 100,
- 0,
- 75,
- 100,
- 0,
- 0,
- 100,
- 100,
- 30,
- null,
- 72.5,
- 72.5,
- "",
- "?",
- "float16",
- "?",
- 0,
- 0,
- true,
- "main"
- [
- 17,
- "<a target="_blank" href="https://huggingface.co/anthropic/claude-3.5-haiku" style="color: var(--link-text-color); text-decoration: underline;text-decoration-style: dotted;">anthropic/claude-3.5-haiku</a>",
- 60,
- 100,
- 25,
- 50,
- 100,
- 0,
- 75,
- 50,
- 0,
- 0,
- 75,
- 100,
- 70,
- null,
- 62.5,
- 62.5,
- "",
- "?",
- "float16",
- "?",
- 0,
- 0,
- true,
- "main"
- [
- 18,
- "<a target="_blank" href="https://huggingface.co/openai/gpt-4.1-mini" style="color: var(--link-text-color); text-decoration: underline;text-decoration-style: dotted;">openai/gpt-4.1-mini</a>",
- 60,
- 100,
- 50,
- 100,
- 100,
- 50,
- 75,
- 50,
- 0,
- 0,
- 100,
- 50,
- 40,
- null,
- 67.5,
- 67.5,
- "",
- "?",
- "float16",
- "?",
- 0,
- 0,
- true,
- "main"
- [
- 19,
- "<a target="_blank" href="https://huggingface.co/openai/o4-mini" style="color: var(--link-text-color); text-decoration: underline;text-decoration-style: dotted;">openai/o4-mini</a>",
- 50,
- 100,
- 50,
- 0,
- 50,
- 50,
- 75,
- 50,
- 0,
- 0,
- 100,
- 100,
- 20,
- null,
- 67.5,
- 67.5,
- "",
- "?",
- "float16",
- "?",
- 0,
- 0,
- true,
- "main"
- [
- 20,
- "<a target="_blank" href="https://huggingface.co/openai/gpt-4.1-nano" style="color: var(--link-text-color); text-decoration: underline;text-decoration-style: dotted;">openai/gpt-4.1-nano</a>",
- 10,
- 75,
- 25,
- 0,
- 0,
- 0,
- 0,
- 0,
- 0,
- 0,
- 0,
- 0,
- 0,
- null,
- 10,
- 10,
- "",
- "?",
- "float16",
- "?",
- 0,
- 0,
- true,
- "main"
- [
- 21,
- "<a target="_blank" href="https://huggingface.co/openai/gpt-5-mini" style="color: var(--link-text-color); text-decoration: underline;text-decoration-style: dotted;">openai/gpt-5-mini</a>",
- 0,
- 0,
- 0,
- 0,
- 0,
- 0,
- 0,
- 0,
- 0,
- 0,
- 0,
- 0,
- 0,
- null,
- 0,
- 0,
- "",
- "?",
- "float16",
- "?",
- 0,
- 0,
- true,
- "main"
- [
- 22,
- "<a target="_blank" href="https://huggingface.co/openai/gpt-5-nano" style="color: var(--link-text-color); text-decoration: underline;text-decoration-style: dotted;">openai/gpt-5-nano</a>",
- 0,
- 0,
- 0,
- 0,
- 0,
- 0,
- 0,
- 0,
- 0,
- 0,
- 0,
- 0,
- 0,
- null,
- 0,
- 0,
- "",
- "?",
- "float16",
- "?",
- 0,
- 0,
- true,
- "main"
- [
- 23,
- "<a target="_blank" href="https://huggingface.co/openai/gpt-5" style="color: var(--link-text-color); text-decoration: underline;text-decoration-style: dotted;">openai/gpt-5</a>",
- 0,
- 0,
- 0,
- 0,
- 0,
- 0,
- 0,
- 0,
- 0,
- 0,
- 0,
- 0,
- 0,
- null,
- 0,
- 0,
- "",
- "?",
- "float16",
- "?",
- 0,
- 0,
- true,
- "main"
- [
- "metadata": null