Evaluating Generative AI in Historical Research: A Comparative Study on Identifying Primary Source Evidence in Ancient History

Raymond Solga; Mohammed Sarwar

doi:10.64946/aiantiquity.v2i1.003

Authors

Raymond S. Solga, The College of Westchester / New York University, https://orcid.org/0000-0003-0689-0633
Mohammed J. Sarwar, The City University of New York, https://orcid.org/0009-0000-2405-4263

DOI:

https://doi.org/10.64946/aiantiquity.v2i1.003

Keywords:

Primary Sources, Ancient History, Historical Methodology, Generative AI, Humanities

Abstract

This study compares traditional historical methods and generative AI tools in the identification, interpretation, and validation of primary sources in ancient history. Drawing on a dual case study approach, with four case studies conducted by human historians and four by AI tools (GPT-4, Claude 2, Gemini, Perplexity), we evaluate the epistemological strengths and limitations of each method. Using qualitative document analysis, historiographical criteria, and expert review, the study assesses source criticism, genre classification, provenance transparency, and evidentiary value. Results indicate that generative AI excels at broad content discovery and thematic synthesis but struggles with historical genre boundaries, source verification, and manuscript-based scholarship. Human researchers consistently outperform the AI tools in contextual interpretation, critical chronology, and the adjudication of textual authority. We propose a human-in-the-loop framework that combines digital speed with scholarly rigor, advocating for model pluralism, temporal prompting, and provenance-first protocols. This integrated methodology is intended to ensure that AI contributes meaningfully to digital historiography without compromising historical standards.

Published

2026-02-27

Issue

Volume 2, Issue 1 (2026)

Section

Articles

License

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.

Authors retain copyright and grant AI & Antiquity the right of first publication. Articles are published under the terms of the Creative Commons Attribution-NonCommercial 4.0 International License (CC BY-NC 4.0), which allows others to share and adapt the material for non-commercial purposes, provided that appropriate credit is given to the original author(s) and source. Any commercial use requires the explicit permission of the author(s).
