Dataset¶
LitXBench ships with the LitXAlloy dataset: 19 papers on high-entropy alloys with expert-annotated ground-truth extractions, comprising 1426 total measurements and 101 target materials with 68 unique compositions.
Benchmark Quality¶
A single annotator performed the extraction for all papers to ensure consistency. The extracted values were compared with the MPEA dataset as a safeguard against missing values. LLMs were also employed to double-check and catch extraction mistakes – an estimated 1.1 billion tokens were spent using Claude to catch errors. All LLM-suggested corrections were heavily scrutinized by humans before LitXAlloy was updated.
Compared to the MPEA dataset for the 18 overlapping papers, LitXAlloy has significantly higher data density: an average of 74.8 extracted measurements per paper versus 33.4 in MPEA, with an additional 745 values total.
Loading the Dataset¶
from litxbench.litxalloy import papers
# papers is a dict mapping DOI strings to list[Experiment]
print(f"Number of papers: {len(papers)}")
for doi, experiments in papers.items():
n_materials = sum(len(e.output_materials) for e in experiments)
print(f" {doi}: {len(experiments)} experiment(s), {n_materials} material(s)")
Included Papers¶
Eighteen papers are sourced from the MPEA dataset, with an additional open-access paper on Ni-based superalloys selected for its complex synthesis process and unique experimental measurements.
Papers from MPEA
DOI Key |
DOI Link |
|---|---|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Non-MPEA Papers
DOI Key |
DOI Link |
|---|---|
|
Key Statistics¶
101 target materials across 19 papers
68 unique compositions (8 papers contain duplicate compositions)
26 materials derived from other materials in the dataset (across 6 papers)
1426 total measurements
Only experimental and experimentally-derived measurements are included; computational measurements (e.g. Thermo-Calc predictions) are excluded
Data Structure¶
Each paper maps to a list of Experiment objects. A typical paper has 1-3 experiments,
each containing multiple output materials with their measurements.
from litxbench.litxalloy import papers
doi = "doi_10_3390__e21020122"
for exp in papers[doi]:
print(f"Raw materials: {list(exp.raw_materials.keys())}")
print(f"Synthesis groups: {list(exp.synthesis_group_map.keys())}")
for mat in exp.output_materials:
print(f" Material: {mat.process}")
print(f" Measurements: {len(mat.measurements)}")