DATA-DRIVEN LEARNING IN PRE-TERTIARY EFL EDUCATION: A CRITICAL REVIEW OF APPROACHES, OUTCOMES, AND UNRESOLVED ISSUES

Authors

  • Muhammad Andrie Bagia Universitas Muhammadiyah Gresik, Indonesia
  • Dwita Laksmita Rachmawati Universitas Muhammadiyah Gresik, Indonesia

DOI:

https://doi.org/10.51878/educator.v6i2.10856

Keywords:

Pembelajaran Berbasis Data, Pedagogi Berbasis Korpus, EFL Pra-Perguruan Tinggi, Kosakata dan Kolokasi, Mediasi Guru

Abstract

ABSTRACT

English as a Foreign Language (EFL) learning in pre-tertiary education is still largely dominated by textbook-based and teacher-centered instruction, limiting learners’ exposure to authentic language use. As an alternative, Data-Driven Learning (DDL) offers a corpus-informed approach that enables learners to explore language patterns through authentic language data. This study employed a qualitative analytical literature review approach by conducting a systematic search and selection of articles through Google Scholar and related academic sources based on PRISMA principles. The data were analyzed through thematic synthesis to identify major patterns related to instructional approaches, learning outcomes, and implementation constraints. The findings reveal that DDL is commonly implemented through teacher-mediated and guided discovery approaches that balance learner autonomy with teacher scaffolding. DDL consistently supports vocabulary development, collocational awareness, and language noticing, although affective outcomes such as learner motivation remain inconsistent across contexts. Major implementation challenges include limited teacher corpus literacy, technological constraints, and difficulties integrating corpus-based activities into regular EFL instruction. Therefore, effective DDL implementation requires flexible and context-sensitive instructional strategies that are adaptable to local educational conditions.

ABSTRAK

Pembelajaran bahasa Inggris sebagai bahasa asing (EFL) pada tingkat pra-perguruan tinggi masih didominasi oleh pendekatan berbasis buku teks dan pembelajaran yang berpusat pada guru, sehingga membatasi akses peserta didik terhadap penggunaan bahasa autentik. Sebagai alternatif, Data-Driven Learning (DDL) menawarkan pendekatan berbasis korpus yang memungkinkan peserta didik mengeksplorasi pola bahasa melalui data autentik. Penelitian ini bertujuan untuk mensintesis pendekatan pembelajaran DDL, mengidentifikasi hasil belajar yang dilaporkan, serta menganalisis tantangan implementasi DDL dalam konteks EFL pra-perguruan tinggi. Penelitian ini menggunakan pendekatan tinjauan literatur analitis kualitatif dengan melakukan pencarian dan seleksi artikel secara sistematis melalui Google Scholar dan sumber akademik terkait berdasarkan prinsip PRISMA. Data dianalisis menggunakan sintesis tematik untuk mengidentifikasi pola utama terkait pendekatan pembelajaran, hasil belajar, dan kendala implementasi. Hasil penelitian menunjukkan bahwa DDL umumnya diterapkan melalui pendekatan teacher-mediated dan guided discovery yang menyeimbangkan kemandirian belajar dengan scaffolding guru. DDL secara konsisten mendukung perkembangan kosakata, kesadaran kolokasi, dan language noticing, meskipun hasil afektif seperti motivasi belajar masih bervariasi antar konteks. Tantangan utama implementasi meliputi keterbatasan literasi korpus guru, infrastruktur teknologi, dan integrasi pembelajaran berbasis korpus dalam kelas EFL. Oleh karena itu, implementasi DDL memerlukan strategi pembelajaran yang kontekstual, fleksibel, dan sesuai dengan kondisi pendidikan lokal.

Downloads

Download data is not yet available.

References

Boulton, A., & Vyatkina, N. (2021). Thirty Years Of Data-Driven Learning: Taking Stock And Charting New Directions Over Time. Language Learning & Technology, 25(3), 66–89. https://www.lltjournal.org/item/10125-73450/

O’Keeffe, A., & Mark, G. (2022). Principled Pattern Curation To Guide Data-Driven Learning Design. Applied Corpus Linguistics, 2(3), 100028.

https://doi.org/10.1016/j.acorp.2022.100028

Cahyawijaya, S., Winata, G. I., Wilie, B., Vincentio, K., Li, X., Kuncoro, A., ... & Fung, P. (2021). IndoNLG: Benchmark and resources for evaluating Indonesian natural language generation. In Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing (pp. 8875-8898).

https://doi.org/10.48550/arXiv.2104.08200

Crosthwaite, P., & Meunier, F. (2025). When DDL Goes Wrong: The Importance Of Constructive Alignment In Secondary Education. The EuroCALL Review, 32(1), 45–63.

https://doi.org/10.4995/eurocall.2025.23844

Crosthwaite, P., Wijaya, D., & Luciana. (2021). Exploring Language Teachers’ Lesson Planning For Corpus-Based Language Teaching: A Focus On Developing TPACK For Corpora And DDL. Computer Assisted Language Learning, 36(7), 1392–1420.

https://doi.org/10.1080/09588221.2021.1995001

Drajati, N. A., Tan, L., Haryati, S., Rochsantiningsih, D., & Zainnuri, H. (2021). Investigating English language teachers in developing TPACK and multimodal literacy. Indonesian Journal of Applied Linguistics, 10(3), 575–585. https://doi.org/10.17509/ijal.v7i3.9806

Emir, G., & Yang?n Ek?i, G. (2023). Corpus Used As A Data-Driven Learning Tool In L2 Academic Writing: Evidence From Turkish Contexts. TEFLIN Journal, 34(2), 209–225.

https://doi.org/10.15639/teflinjournal.v34i2/209-225

Kalyaniwala, C., & Ciekanski, M. (2021). Autonomy Calling: A Systematic Review Of 22 Years Of Publications In Learner Autonomy And CALL. Language Learning & Technology, 25(3), 106–131. https://www.lltjournal.org/item/10125-73452/

Larsson, T., Staples, S., & Egbert, J. (2022). Teaching, Learning, And Researching With Corpora. Applied Corpus Linguistics, 2(3), 100025.

https://doi.org/10.1016/j.acorp.2022.100025

Lin, M. H. (2021). Effects Of Data-Driven Learning On College Students Of Different Grammar Proficiencies: A Preliminary Empirical Assessment In EFL Classes. SAGE Open, 11(3). https://doi.org/10.1177/21582440211029936

Liu, J., & Ma, Q. (2025). Examining Corpus-Based Language Pedagogy (CBLP) Practices In Data-Driven Learning (DDL) For Low-Proficiency L2 English Learners: A Meta-Analysis. Educational Technology & Society, 28(2), 53–76.

https://doi.org/10.30191/ETS.202504_28(2).RP04

Ludwig, C., & Tassinari, M. G. (2021). Foreign Language Learner Autonomy In Online Learning Environments: The Teachers’ Perspectives. Innovation in Language Learning and Teaching, 17(2), 217–234. https://doi.org/10.1080/17501229.2021.2012476

Luo, H. (2025). Data-Driven Learning In Second Language Writing: A Systematic Review Of Efficacy, Challenges, And Future Directions. International Journal of Education and Humanities. https://doi.org/10.54097/t1wp5032

Lusta, A., Demirel, Ö., & Mohammadzadeh, B. (2023). Language Corpus And Data-Driven Learning (DDL) In Language Classrooms: A Systematic Review. Heliyon, 9(12), e22731. https://doi.org/10.1016/j.heliyon.2023.e22731

Oktavianti, I. N., Eriani, E., Rolyna, I., & Prayogi, I. (2023). Investigating The Use Of Corpus-Informed Grammar Materials In Indonesian EFL Classrooms. Indonesian Journal of English Language Teaching and Applied Linguistics, 7(2), 417–438.

https://doi.org/10.21093/ijeltal.v7i2.1415

Pérez-Paredes, P. (2022). A Systematic Review Of The Uses And Spread Of Corpora And Data-Driven Learning In CALL Research During 2011–2015. Computer Assisted Language Learning, 35(1–2), 36–61. https://doi.org/10.1080/09588221.2019.1667832

Sari, Y. R., Drajati, N. A., So, H.-J., & Sumardi, S. (2021). Enhancing EFL Teachers’ Technological Pedagogical Content Knowledge (TPACK) Competence Through Reflective Practice. TEFLIN Journal, 32(1), 117–133.

https://doi.org/10.15639/teflinjournal.v32i1/117-133

Satake, Y. (2022). The Effects Of Corpus Use On L2 Collocation Learning. The JALT CALL Journal, 18(1), 34–53. https://doi.org/10.29140/jaltcall.v18n1.520

Tosun, S., & Sofu, H. (2023). The Effectiveness Of Data-Driven Vocabulary Learning: Hands-On Concordancing Through A Pedagogical Corpus. Journal of Language and Education, 9(3), 176–190. https://doi.org/10.17323/jle.2023.12426

Xue, L., & Liu, F. (2021). Using Data-Driven Learning Activities To Improve Lexical Awareness In Intermediate EFL Learners. Cogent Education, 8(1), 1996867.

https://doi.org/10.1080/2331186X.2021.1996867

Yoon, K.-H., & Lee, D. J. (2024). A Meta-Analysis Of Data-Driven Learning (DDL) In EFL/ESL Settings. Brain, Digital, & Learning, 14(2), 283–304.

https://doi.org/10.31216/BDL.20240017

Downloads

Published

2026-05-18

How to Cite

Bagia, M. A., & Rachmawati, D. L. (2026). DATA-DRIVEN LEARNING IN PRE-TERTIARY EFL EDUCATION: A CRITICAL REVIEW OF APPROACHES, OUTCOMES, AND UNRESOLVED ISSUES. EDUCATOR : Jurnal Inovasi Tenaga Pendidik Dan Kependidikan, 6(2), 57–70. https://doi.org/10.51878/educator.v6i2.10856

Issue

Section

Articles

Similar Articles

1 2 3 4 5 6 7 8 9 10 > >> 

You may also start an advanced similarity search for this article.