A schema-guided framework that transforms fragmented international large-scale assessment literature into a searchable evidence infrastructure.
International Large-Scale Assessments have generated thousands of studies across PISA, TIMSS, PIRLS, TALIS, PIAAC, ICILS, and ICCS. Yet methodological knowledge remains fragmented across publications, making systematic evidence synthesis, methodological comparison, and cumulative knowledge building increasingly difficult.
Thousands of ILSA studies contain valuable methodological and analytical evidence, yet this knowledge remains scattered across publications, countries, assessment cycles, and research traditions. Researchers often spend weeks reviewing literature before beginning a new analysis.
A multi-stage AI pipeline extracts methodological evidence, standardizes terminology, and links findings into a structured knowledge base, transforming heterogeneous ILSA literature into a searchable research infrastructure.
11,862 structured records across 1,266 documents — enabling cumulative evidence synthesis across fragmented ILSA literature. The framework provides evidence-informed analytical guidance for researchers beginning new ILSA analyses.
Core contribution: This project establishes a reusable knowledge infrastructure for International Large-Scale Assessment research, transforming 1,266 studies into 11,862 structured records that support evidence synthesis, methodological discovery, and AI-assisted research at scale.
Corpus Construction — Literature was collected from four major sources and deduplicated using DOI matching and metadata normalization, yielding 1,266 unique ILSA-related publications.
Structured Extraction — A schema-constrained extraction framework transformed the corpus into 11,862 structured methodological records suitable for large-scale evidence synthesis.
Equivalent analytical methods and variable definitions were aligned across studies using domain-informed terminology mapping. A domain expert adjudicates ambiguous cases and validates semantic consistency across the full knowledge base.
Researchers can query prior ILSA evidence, identify relevant variables, discover methodological precedents, generate evidence-grounded hypotheses, and obtain citation-supported analytical recommendations — all grounded in the structured knowledge base.
To evaluate extraction quality, a stratified sample of 130 publications (~10% of the corpus) was manually audited against the original source documents. Validation focused on methodological accuracy, semantic consistency, source attribution, and fabrication detection.
11,862 structured records from 1,266 studies across PISA, TIMSS, PIRLS, TALIS, PIAAC, ICILS, and ICCS — released under CC BY 4.0 and hosted on HuggingFace in Parquet format. The full extraction pipeline is available on GitHub.
The RAG agent retrieves evidence from 1,266 documents (11,862 structured records) and generates evidence-grounded analytical suggestions. Select an example query to see a representative response with source attribution.
BIFIEsurvey or survey R packages implement this correctly.
If you use this dataset or pipeline in your research, please cite:
Open access — everything is freely available for research, teaching, and reuse.