DATA-DRIVEN LEARNING IN PRE-TERTIARY EFL EDUCATION: A CRITICAL REVIEW OF APPROACHES, OUTCOMES, AND UNRESOLVED ISSUES
DOI:
https://doi.org/10.51878/educator.v6i2.10856Keywords:
Pembelajaran Berbasis Data, Pedagogi Berbasis Korpus, EFL Pra-Perguruan Tinggi, Kosakata dan Kolokasi, Mediasi GuruAbstract
ABSTRACT
English as a Foreign Language (EFL) learning in pre-tertiary education is still largely dominated by textbook-based and teacher-centered instruction, limiting learners’ exposure to authentic language use. As an alternative, Data-Driven Learning (DDL) offers a corpus-informed approach that enables learners to explore language patterns through authentic language data. This study employed a qualitative analytical literature review approach by conducting a systematic search and selection of articles through Google Scholar and related academic sources based on PRISMA principles. The data were analyzed through thematic synthesis to identify major patterns related to instructional approaches, learning outcomes, and implementation constraints. The findings reveal that DDL is commonly implemented through teacher-mediated and guided discovery approaches that balance learner autonomy with teacher scaffolding. DDL consistently supports vocabulary development, collocational awareness, and language noticing, although affective outcomes such as learner motivation remain inconsistent across contexts. Major implementation challenges include limited teacher corpus literacy, technological constraints, and difficulties integrating corpus-based activities into regular EFL instruction. Therefore, effective DDL implementation requires flexible and context-sensitive instructional strategies that are adaptable to local educational conditions.
ABSTRAK
Pembelajaran bahasa Inggris sebagai bahasa asing (EFL) pada tingkat pra-perguruan tinggi masih didominasi oleh pendekatan berbasis buku teks dan pembelajaran yang berpusat pada guru, sehingga membatasi akses peserta didik terhadap penggunaan bahasa autentik. Sebagai alternatif, Data-Driven Learning (DDL) menawarkan pendekatan berbasis korpus yang memungkinkan peserta didik mengeksplorasi pola bahasa melalui data autentik. Penelitian ini bertujuan untuk mensintesis pendekatan pembelajaran DDL, mengidentifikasi hasil belajar yang dilaporkan, serta menganalisis tantangan implementasi DDL dalam konteks EFL pra-perguruan tinggi. Penelitian ini menggunakan pendekatan tinjauan literatur analitis kualitatif dengan melakukan pencarian dan seleksi artikel secara sistematis melalui Google Scholar dan sumber akademik terkait berdasarkan prinsip PRISMA. Data dianalisis menggunakan sintesis tematik untuk mengidentifikasi pola utama terkait pendekatan pembelajaran, hasil belajar, dan kendala implementasi. Hasil penelitian menunjukkan bahwa DDL umumnya diterapkan melalui pendekatan teacher-mediated dan guided discovery yang menyeimbangkan kemandirian belajar dengan scaffolding guru. DDL secara konsisten mendukung perkembangan kosakata, kesadaran kolokasi, dan language noticing, meskipun hasil afektif seperti motivasi belajar masih bervariasi antar konteks. Tantangan utama implementasi meliputi keterbatasan literasi korpus guru, infrastruktur teknologi, dan integrasi pembelajaran berbasis korpus dalam kelas EFL. Oleh karena itu, implementasi DDL memerlukan strategi pembelajaran yang kontekstual, fleksibel, dan sesuai dengan kondisi pendidikan lokal.
Downloads
References
Boulton, A., & Vyatkina, N. (2021). Thirty Years Of Data-Driven Learning: Taking Stock And Charting New Directions Over Time. Language Learning & Technology, 25(3), 66–89. https://www.lltjournal.org/item/10125-73450/
O’Keeffe, A., & Mark, G. (2022). Principled Pattern Curation To Guide Data-Driven Learning Design. Applied Corpus Linguistics, 2(3), 100028.
https://doi.org/10.1016/j.acorp.2022.100028
Cahyawijaya, S., Winata, G. I., Wilie, B., Vincentio, K., Li, X., Kuncoro, A., ... & Fung, P. (2021). IndoNLG: Benchmark and resources for evaluating Indonesian natural language generation. In Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing (pp. 8875-8898).
https://doi.org/10.48550/arXiv.2104.08200
Crosthwaite, P., & Meunier, F. (2025). When DDL Goes Wrong: The Importance Of Constructive Alignment In Secondary Education. The EuroCALL Review, 32(1), 45–63.
https://doi.org/10.4995/eurocall.2025.23844
Crosthwaite, P., Wijaya, D., & Luciana. (2021). Exploring Language Teachers’ Lesson Planning For Corpus-Based Language Teaching: A Focus On Developing TPACK For Corpora And DDL. Computer Assisted Language Learning, 36(7), 1392–1420.
https://doi.org/10.1080/09588221.2021.1995001
Drajati, N. A., Tan, L., Haryati, S., Rochsantiningsih, D., & Zainnuri, H. (2021). Investigating English language teachers in developing TPACK and multimodal literacy. Indonesian Journal of Applied Linguistics, 10(3), 575–585. https://doi.org/10.17509/ijal.v7i3.9806
Emir, G., & Yang?n Ek?i, G. (2023). Corpus Used As A Data-Driven Learning Tool In L2 Academic Writing: Evidence From Turkish Contexts. TEFLIN Journal, 34(2), 209–225.
https://doi.org/10.15639/teflinjournal.v34i2/209-225
Kalyaniwala, C., & Ciekanski, M. (2021). Autonomy Calling: A Systematic Review Of 22 Years Of Publications In Learner Autonomy And CALL. Language Learning & Technology, 25(3), 106–131. https://www.lltjournal.org/item/10125-73452/
Larsson, T., Staples, S., & Egbert, J. (2022). Teaching, Learning, And Researching With Corpora. Applied Corpus Linguistics, 2(3), 100025.
https://doi.org/10.1016/j.acorp.2022.100025
Lin, M. H. (2021). Effects Of Data-Driven Learning On College Students Of Different Grammar Proficiencies: A Preliminary Empirical Assessment In EFL Classes. SAGE Open, 11(3). https://doi.org/10.1177/21582440211029936
Liu, J., & Ma, Q. (2025). Examining Corpus-Based Language Pedagogy (CBLP) Practices In Data-Driven Learning (DDL) For Low-Proficiency L2 English Learners: A Meta-Analysis. Educational Technology & Society, 28(2), 53–76.
https://doi.org/10.30191/ETS.202504_28(2).RP04
Ludwig, C., & Tassinari, M. G. (2021). Foreign Language Learner Autonomy In Online Learning Environments: The Teachers’ Perspectives. Innovation in Language Learning and Teaching, 17(2), 217–234. https://doi.org/10.1080/17501229.2021.2012476
Luo, H. (2025). Data-Driven Learning In Second Language Writing: A Systematic Review Of Efficacy, Challenges, And Future Directions. International Journal of Education and Humanities. https://doi.org/10.54097/t1wp5032
Lusta, A., Demirel, Ö., & Mohammadzadeh, B. (2023). Language Corpus And Data-Driven Learning (DDL) In Language Classrooms: A Systematic Review. Heliyon, 9(12), e22731. https://doi.org/10.1016/j.heliyon.2023.e22731
Oktavianti, I. N., Eriani, E., Rolyna, I., & Prayogi, I. (2023). Investigating The Use Of Corpus-Informed Grammar Materials In Indonesian EFL Classrooms. Indonesian Journal of English Language Teaching and Applied Linguistics, 7(2), 417–438.
https://doi.org/10.21093/ijeltal.v7i2.1415
Pérez-Paredes, P. (2022). A Systematic Review Of The Uses And Spread Of Corpora And Data-Driven Learning In CALL Research During 2011–2015. Computer Assisted Language Learning, 35(1–2), 36–61. https://doi.org/10.1080/09588221.2019.1667832
Sari, Y. R., Drajati, N. A., So, H.-J., & Sumardi, S. (2021). Enhancing EFL Teachers’ Technological Pedagogical Content Knowledge (TPACK) Competence Through Reflective Practice. TEFLIN Journal, 32(1), 117–133.
https://doi.org/10.15639/teflinjournal.v32i1/117-133
Satake, Y. (2022). The Effects Of Corpus Use On L2 Collocation Learning. The JALT CALL Journal, 18(1), 34–53. https://doi.org/10.29140/jaltcall.v18n1.520
Tosun, S., & Sofu, H. (2023). The Effectiveness Of Data-Driven Vocabulary Learning: Hands-On Concordancing Through A Pedagogical Corpus. Journal of Language and Education, 9(3), 176–190. https://doi.org/10.17323/jle.2023.12426
Xue, L., & Liu, F. (2021). Using Data-Driven Learning Activities To Improve Lexical Awareness In Intermediate EFL Learners. Cogent Education, 8(1), 1996867.
https://doi.org/10.1080/2331186X.2021.1996867
Yoon, K.-H., & Lee, D. J. (2024). A Meta-Analysis Of Data-Driven Learning (DDL) In EFL/ESL Settings. Brain, Digital, & Learning, 14(2), 283–304.
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2026 EDUCATOR : Jurnal Inovasi Tenaga Pendidik dan Kependidikan

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.












