Exploring AI-Driven Written English Assessment: Toward Improved Assessment Quality and Learner Outcomes

Ahmed Alshehri; Saddah Aldossary; Mohammad Jamshed; Mohammad Rezaul Karim

doi:10.17507/jltr.1702.26

Authors

Ahmed Alshehri University of Bisha
Saddah Aldossary Shaqra University
Mohammad Jamshed Prince Sattam Bin Abdulaziz University
Mohammad Rezaul Karim Prince Sattam Bin Abdulaziz University

DOI:

https://doi.org/10.17507/jltr.1702.26

Keywords:

contextual sensitivity, discourse competence, technological efficiency, pedagogical integrity, validity and fairness

Abstract

The use of AI-powered tools for assessing written English is revolutionizing conventional evaluation methods by improving accuracy, consistency, and teaching efficiency. However, there is limited research on how AI assessment tools affect the validity and fairness of language evaluation. This review synthesizes current research on the technological foundations, pedagogical impact, and ethical considerations of AI in this area. This review assesses the role of AI in education, highlighting AI-driven technologies such as automated writing evaluation and natural language processing that provide timely feedback, support personalized learning, and reduce instructor workload. The study employed the PRISMA method to investigate the impact of AI on written English assessment by analyzing empirical studies from Scopus, Web of Science, and Google Scholar. The findings revealed that AI tools significantly enhanced the evaluation of written English by providing reliable assessments, immediate feedback, fostering learner autonomy, and producing assessments consistent with those of human raters, thereby promoting self-regulated writing and increasing access to quality feedback. However, specific concerns about algorithmic bias, feedback clarity, and the diminishing role of human judgment in assessing creativity and discourse were also found. It was also found that educational automation negatively impacts linguistic diversity, critical thinking, data privacy, transparency, and equitable access to AI tools. The review suggests integrating AI capabilities with human experience to balance technological efficiency and pedagogical integrity in the use of AI tools. It recommends developing hybrid assessment frameworks that integrate AI analytics with human evaluation to enhance fairness, accuracy, and comprehension in written English assessment.

Author Biographies

Ahmed Alshehri, University of Bisha

Department of English Language & Literature, College of Arts and Letters

Saddah Aldossary, Shaqra University

English Department, College of Science and Humanities

Mohammad Jamshed, Prince Sattam Bin Abdulaziz University

English Department, College of Science & Humanities

Mohammad Rezaul Karim, Prince Sattam Bin Abdulaziz University

English Department, College of Science & Humanities

References

Alharbi, W. (2023). AI in the Foreign Language Classroom: A Pedagogical Overview of Automated Writing Assistance Tools. Education Research International, 2023, 1–15. https://doi.org/10.1155/2023/4253331

Al-Obaydi, L. H., & Pikhart, M. (2025). Correction: AI partner versus human partner: comparing AI-based peer assessment with human-generated peer assessment in examining writing skills. Language Testing in Asia, 15(1). https://doi.org/10.1186/s40468-025-00383-8

Bachman, L. F. (2024). What Is the Construct? The Dialectic of Abilities and Contexts in Defining Constructs in Language Assessment 1. In The Writings of Lyle F. Bachman (pp. 220-250). Routledge.

Biju, N., Abdelrasheed, N. S. G., Bakiyeva, K., Prasad, K. D. V., & Jember, B. (2024). Which one? AI-assisted language assessment or paper format: an exploration of the impacts on foreign language anxiety, learning attitudes, motivation, and writing performance. Language Testing in Asia, 14(1). https://doi.org/10.1186/s40468-024-00322-z

Black, P., & Wiliam, D. (2009). Developing the theory of formative assessment. Educational Assessment, Evaluation and Accountability (formerly: Journal of personnel evaluation in education), 21(1), 5-31. https://doi.org/10.1007/s11092-008-9068-5

Carless, D., & Boud, D. (2018). The development of student feedback literacy: Enabling uptake of feedback. Assessment & Evaluation in Higher Education, 43(8), 1315–1325.

Chan, C. K. Y., & Hu, W. (2023). Students’ voices on generative AI: perceptions, benefits, and challenges in higher education. International Journal of Educational Technology in Higher Education, 20(1). https://doi.org/10.1186/s41239-023-00411-8

Chan, M., & Luckin, R. (2020). Preparing university assessment for a world with AI: Tasks for human intelligence. In Re-imagining university assessment in a digital world (pp. 49-63). Cham: Springer International Publishing.

Chen, A., Zhang, Y., Jia, J., Liang, M., Cha, Y., & Lim, C. P. (2024). A systematic review and meta‐analysis of AI‐enabled assessment in language learning: Design, implementation, and effectiveness. Journal of Computer Assisted Learning, 41(1). https://doi.org/10.1111/jcal.13064

Cheng, Y. (2025). Transforming English language education with AI-driven deep learning models for scalable adaptive and inclusive assessment. International Journal of Information and Communication Technology, 26(12), 15–31. https://doi.org/10.1504/ijict.2025.146163

Deepshikha, D. (2025). A comprehensive review of AI-powered grading and tailored feedback in universities. Discover Artificial Intelligence, 5(1). https://doi.org/10.1007/s44163-025-00517-0

Floridi, L., Cowls, J., Beltrametti, M., Chatila, R., Chazerand, P., Dignum, V., ... & Vayena, E. (2018). AI4People—An ethical framework for a good AI society: Opportunities, risks, principles, and recommendations. Minds and machines, 28(4), 689-707.

Ghaithi, A. A., & Behforouz, B. (2025). Effects of interaction with AI-assisted writing evaluation on EFL students’ writing performance. Knowledge Management & E-Learning, 206–224. https://doi.org/10.34105/j.kmel.2025.17.009

Hooda, M., Rana, C., Dahiya, O., Rizwan, A., & Hossain, M. S. (2022). Artificial intelligence for assessment and feedback to enhance student success in higher education. Mathematical Problems in Engineering, 2022, 1–19. https://doi.org/10.1155/2022/5215722

Jamshed, M., Fatimah Albedah, Mohd Sajid Ansari, & Sameena Banu. (2025). Assessing the Efficacy of AI-Driven Corrective Feedback via WhatsApp Application to Improve ESL Learners’ Writing Skills: An Experimental Study. International Journal of Interactive Mobile Technologies (iJIM), 19(07), 190–205. https://doi.org/10.3991/ijim.v19i07.52709

Jamshed, M., Manjur Ahmed, A. S. M., Sarfaraj, M., & Warda, W. U. (2024). The Impact of ChatGPT on English Language Learners' Writing Skills: An Assessment of AI Feedback on Mobile. International Journal of Interactive Mobile Technologies, 18(19), 18-36. https://doi.org/10.3991/ijim.v18i19.50361

Jiang, Z., Xu, Z., Pan, Z., He, J., & Xie, K. (2023). Exploring the role of artificial intelligence in facilitating assessment of writing performance in second language learning. Languages, 8(4), 247.

Jin, Y., & Fan, J. (2023). Test-Taker Engagement in AI Technology-Mediated Language Assessment. Language Assessment Quarterly, 20(4–5), 488–500. https://doi.org/10.1080/15434303.2023.2291731

Kanchana, S., & Saha, P. R. (2025). Evolving Student Assessment: AI-Driven Rubrics for Personalized and Equitable English Language Learning. Journal of Engineering Education Transformations, 38(2), 584–590. https://doi.org/10.16920/jeet/2025/v38is2/25072

Khan, M. A., Kurbonova, O., Abdullaev, D., Radie, A. H., & Basim, N. (2024). Is AI-assisted assessment liable to evaluate young learners? Parents support, teacher support, immunity, and resilience are in focus in testing vocabulary learning. Language Testing in Asia, 14(1). https://doi.org/10.1186/s40468-024-00324-x

Khasawneh, D. M. A. S. K. (2024). Improving the learning of language proficiency at tertiary education level through AI-driven assessment models and automated feedback systems. Migration Letters, 21(2), 712-726.

Lee, O. (2024). Examining AI-Based Accuracy Assessment in L2 learners’ writing. Journal of Pan-Pacific Association of Applied Linguistics, 28(2), 39–55. https://doi.org/10.25256/paal.28.2.3

Li, W. Y., Kau, K., & Shiung, Y. J. (2023). Pedagogic exploration into adapting automated writing evaluation and peer review integrated feedback into large-sized university writing classes. SAGE Open, 13(4), 1-18. https://doi.org/10.1177/215824402312090

Liao, X., Zhang, X., Wang, Z., & Luo, H. (2024). Design and implementation of an AI‐enabled visual report tool as formative assessment to promote learning achievement and self‐regulated learning: An experimental study. British Journal of Educational Technology, 55(3), 1253-1276.

Mahdi, H. S., & Alkhateeb, A. (2025). Revolutionising essay evaluation: a cutting-edge rubric for AI-assisted writing. International Journal of Computer-Assisted Language Learning and Teaching (IJCALLT), 15(1), 1-19.

Messick, S. (1995). Validity of psychological assessment: Validation of inferences from persons’ responses and performances as scientific inquiry into score meaning. American Psychologist, 50(9), 741–749. https://doi.org/10.1037/0003-066x.50.9.741

Poonpon, K., Manorom, P., & Chansanam, W. (2023). Exploring effective methods for automated essay scoring of non-native speakers. Contemporary Educational Technology, 15(4), ep475. https://doi.org/10.30935/cedtech/13740

Rahman, N. a. A., Zulkornain, L. H., Mat, A. C., & Kustati, M. (2023). Assessing Writing Abilities using AI-Powered Writing Evaluations. Journal of ASIAN Behavioural Studies, 8(24), 1–17. https://doi.org/10.21834/jabs.v8i24.420

Ranalli, J. (2018). Automated written corrective feedback: how well can students make use of it? Computer Assisted Language Learning, 31(7), 653–674. https://doi.org/10.1080/09588221.2018.1428994

Shi, H., & Aryadoust, V. (2024). A systematic review of AI-based automated written feedback research. ReCALL, 36(2), 187–209. https://doi.org/10.1017/s0958344023000265

Sweller, J. (2011). Cognitive load theory. In Psychology of learning and motivation (Vol. 55, pp. 37-76). Academic Press.

Tam, A. C. F. (2024). Interacting with ChatGPT for internal feedback and factors affecting feedback quality. Assessment & Evaluation in Higher Education, 50(2), 219–235. https://doi.org/10.1080/02602938.2024.2374485

Tapalova, O., & Zhiyenbayeva, N. (2022). Artificial Intelligence in Education: AIED for Personalised Learning Pathways. The Electronic Journal of e-Learning, 20(5), 639–653. https://doi.org/10.34190/ejel.20.5.2597

Usher, M. (2025). Generative AI vs. instructor vs. peer assessments: a comparison of grading and feedback in higher education. Assessment & Evaluation in Higher Education, 50(6), 912–927. https://doi.org/10.1080/02602938.2025.2487495

Vygotsky, L. S. (1978). Mind in society: The development of higher psychological processes (Vol. 86). Harvard University Press.

Wang, Z. (2020). Computer-assisted EFL writing and evaluations based on artificial intelligence: a case from a college reading and writing course. Library Hi Tech, 40(1), 80–97. https://doi.org/10.1108/lht-05-2020-0113

Weigle, S. C. (2002). Assessing writing. Cambridge University Press.

Zhao, R., Zhuang, Y., Zou, D., Xie, Q., & Yu, P. L. H. (2022). AI-assisted automated scoring of picture-cued writing tasks for language assessment. Education and Information Technologies, 28(6), 7031–7063. https://doi.org/10.1007/s10639-022-11473-y

Zimmerman, B. J. (2002). Becoming a Self-Regulated Learner: An Overview. Theory into Practice, 41(2), 64–70. https://doi.org/10.1207/s15430421tip4102_2