INTRODUCING HIGH SCHOOL STATISTICS TEACHERS TO PREDICTIVE MODELLING AND APIs USING CODE-DRIVEN TOOLS

Authors

  • ANNA FERGUSSON University of Auckland
  • MAXINE PFANNKUCH University of Auckland

DOI:

https://doi.org/10.52041/serj.v21i2.49

Keywords:

Statisitcs education research, Data science education, Predictive modeling, Integrating statistical and computational thinking, Task design, High school teachers, APIs

Abstract

Tasks for teaching predictive modelling and APIs often require learners to use code-driven tools. Minimal research, however, exists about the design of tasks that support the introduction of high school students and teachers to these new statistical and computational methods. Using a design-based research approach, a web-based task was developed. The task was constructed using our design framework and implemented within a face-to-face professional development workshop involving six high school statistics teachers. The teachers were guided through the process of developing a prediction model using: an informal approach; visual prediction intervals; data about movie ratings from an API; and R code that ran in the browser. Our findings from this exploratory study indicate that the web-based task supported the development of new statistical and computational ideas related to predictive modelling and APIs.

References

Allaire, J., Xie, Y., McPherson, J., Luraschi, J., Ushey, K., Atkins, A., Wickham, H., Cheng, J., Chang, W., & Iannone, R. (2021). rmarkdown: Dynamic documents for R. RStudio. https://rmarkdown.rstudio.com

Anderson, T., & Shattuck, J. (2012). Design-based research: A decade of progress in education research?. Educational Researcher, 41(1), 16–25.

https://doi.org/10.3102%2F0013189X11428813

Bakker, A. (2018). Design research in education: A practical guide for early career researchers. Routledge. https://doi.org/10.4324/9780203701010

Bakker, A., & van Eerde, D. (2015). An introduction to design-based research with an example from statistics education. In A. Bikner-Ahsbahs, C. Knipping, & N. Presmeg (Eds.), Approaches to qualitative research in mathematics education (pp. 429–466). Springer. https://doi.org/10.1007/978-94-017-9181-6_16

Bargagliotti, A., Franklin, C., Arnold, P., Gould, R., Johnson, S., Perez, L., & Spangler, D. (2020). Pre-K–12 Guidelines for Assessment and Instruction in Statistics Education (GAISE) report II. American Statistical Association.

Ben-Zvi, D. (2000). Toward understanding the role of technological tools in statistical learning. Mathematical Thinking and Learning, 2(1-2), 127–155. https://doi.org/10.1207/S15327833MTL0202_6

Biehler, R. (2018). Design principles, realizations and uses of software supporting the learning and the doing of statistics: A reflection on developments since the late 1990s. In M. A. Sorto, A. White, & L. Guyot (Eds.), Looking back, looking forward. Proceedings of the Tenth International Conference on Teaching Statistics (ICOTS10), Kyoto, Japan, July 8–13. International Statistical Institute. https://iase-web.org/icots/10/proceedings/pdfs/ICOTS10_1B1.pdf

Biehler, R., & Schulte, C. (2017). Perspectives for an interdisciplinary data science curriculum at German secondary schools. In R. Biehler, L. Budde, D. Frischemeier, B. Heinemann, S. Podworny, C. Schulte, & T. Wassong (Eds.), Paderborn Symposium on Data Science Education at School Level 2017: The Collected Extended Abstracts (pp. 2–14). Universitätsbibliothek Paderborn.

Burr, W., Chevalier, F., Collins, C., Gibbs, A. L., Ng, R., & Wild, C. J. (2021). Computational skills by stealth in introductory data science teaching. Teaching Statistics, 43, S34–S51. https://doi.org/10.1111/test.12277

Casey, S. A., & Wasserman, N. H. (2015). Teachers’ knowledge about informal line of best fit. Statistics Education Research Journal, 14(1), 8–35. https://doi.org/10.52041/serj.v14i1.267

Cetinkaya-Rundel, M., & Rundel, C. (2018). Infrastructure and tools for teaching computing throughout the statistical curriculum. The American Statistician, 72(1), 58–65. https://doi.org/10.1080/00031305.2017.1397549

De Veaux, R. D., Agarwal, M., Averett, M., Baumer, B. S., Bray, A., Bressoud, T. C., Bryant, L., Cheng, L. Z., Francis, A., Gould, R., Kim, A. Y., Kretchmar, M., Lu, Q., Moskol, A., Nolan, D., Pelayo, R., Raleigh, S., Sethi, R. J., Sondjaja, M., … Ye, P. (2017). Curriculum guidelines for undergraduate programs in data science. Annual Review of Statistics and Its Application, 4, 15–30. https://doi.org/10.1146/annurev-statistics-060116-053930

Edelson, D. C. (2002). Design research: What we learn when we engage in design. The Journal of the Learning sciences, 11(1), 105–121. https://doi.org/10.1207/S15327809JLS1101_4

Engel, J. (2017). Statistical Literacy for Active Citizenship: A Call for Data Science Education. Statistics Education Research Journal, 16 (1), 44–49. https://doi.org/10.52041/serj.v16i1.213

Erickson, T. (2020). The BART Data Portal. An Introduction to Data Science with CODAP. http://codap.xyz/awash/bart-chapter.html

Fergusson, A., & Pfannkuch, M. (2020). Development of an informal test for the fit of a probability distribution model for teaching. Journal of Statistics Education, 28(3), 344–357. https://doi.org/10.1080/10691898.2020.1837039

Fergusson, A., & Pfannkuch, M. (2021). Introducing teachers who use GUI-driven tools for the randomization test to code-driven tools. Mathematical Thinking and Learning. https://doi.org/10.1080/10986065.2021.1922856

Fergusson, A., & Wild, C. J. (2021). On traversing the data landscape: Introducing APIs to data-science students. Teaching Statistics, 43, S71–S83.

https://doi.org/10.1111/test.12266

Finzer, W. (2013). The data science education dilemma. Technology Innovations in Statistics Education, 7(2). https://doi.org/10.5070/T572013891

Gould, R. (2010). Statistics and the modern student. International Statistical Review, 78(2), 297–315. https://doi.org/10.1111/j.1751-5823.2010.00117.x

Gould, R. (2017). Data literacy is statistical literacy. Statistics Education Research Journal, 16(1), 22–25. https://doi.org/10.52041/serj.v16i1.209

Gould, R. (2021). Toward data-scientific thinking. Teaching Statistics, 43, S11–S22. https://doi.org/10.1111/test.12267

Hardin, J. (2018). Dynamic data in the statistics classroom. Technology Innovations in Statistics Education, 11(1). https://doi.org/10.5070/T5111031079

Kaplan, D. (2007). Computing and introductory statistics. Technology Innovations in Statistics Education, 1(1). https://doi.org/10.5070/T511000030

Konold, C., & Miller, C. (2015). TinkerPlots™ Version 2.3 [Computer Software]. Learn Troop. http://www.tinkerplots.com/

Magana, A. J., Vasileska, D., & Ahmed, S. (2011). Work in progress—a transparency and scaffolding framework for computational simulation tools. 2011 Frontiers in Education Conference (FIE), (pp. S4G–1). IEEE.

https://doi.org/10.1109/FIE.2011.6142803

Makar, K., & Rubin, A. (2018). Learning about statistical inference. In D. Ben-Zvi, K. Makar, & J. Garfield (Eds.), International handbook of research in statistics education (pp. 261–294). Springer. https://doi.org/10.1007/978-3-319-66195-7_8

McKenney, S., & Reeves, T. C. (2018). Conducting educational design research. Routledge. https://doi.org/10.4324/9781315105642

National Academies of Sciences, Engineering, and Medicine. (2018). Data science for undergraduates: Opportunities and options. The National Academies of Sciences Engineering Medicine. https://doi.org/10.17226/25104

Nolan, D., & Temple Lang, D. (2010). Computing in the statistics curricula. The American Statistician, 64(2), 97–107. https://doi.org/10.1198/tast.2010.09132

New Zealand Qualifications Authority. (2019). Annotated exemplar Level 3 AS91581. Author. https://www.nzqa.govt.nz/ncea/subjects/mathematics/exemplars/level-3-as91581/

Pfannkuch, M. (2011). The role of context in developing informal statistical inferential reasoning: A classroom study. Mathematical Thinking and Learning, 13(1–2), 27–46. https://doi.org/10.1080/10986065.2011.538302

Pruim, R., Kaplan, D. T., & Horton, N. J. (2017). The mosaic package: Helping students to ‘think with data’ using R. The R Journal, 9(1), 77–102.

R Core Team. (2020). R: A language and environment for statistical computing. https://www.R-project.org/

Reeves, T. C. (2007). Design-based research from a technology perspective. In J. Van den Akker, K. Gravemeijer, S. McKenney & N. Nieveen (Eds.), Educational design research, (pp. 52–56). Routledge.

Ridgway, J. (2016). Implications of the data revolution for statistics education. International Statistical Review, 84(3), 528–549. https://doi.org/10.1111/insr.12110

Sentance, S., Waite, J., & Kallia, M. (2019). Teaching computer programming with PRIMM: a sociocultural perspective. Computer Science Education, 29(2-3), 136–176. https://doi.org/10.1080/08993408.2019.1608781

Shaughnessy, J. M. (1997). Missed opportunities in research on the teaching and learning of data and chance. In F. Biddulph & K. Carr (Eds.), People in mathematics education. Proceedings of the Twentieth Annual Conference of the Mathematics Research Group of Australasia (MERGA-20, July, 1990), Rotorua, New Zealand (Vol. 1, pp. 6–22). MERGA.

Schloerke, B., Allaire, J., & Borges, B. (2018). Learnr: Interactive tutorials for R. CRAN. https://CRAN.R-project.org/package=learnr

Son, J. Y., Blake, A. B., Fries, L., & Stigler, J. W. (2021). Modeling first: Applying learning science to the teaching of introductory statistics. Journal of Statistics and Data Science Education, 29(1), 4–21. https://doi.org/10.1080/10691898.2020.1844106

Sweller, J., van Merriënboer, J. J. G., & Paas, F. G. W. (1998). Cognitive architecture and instructional design. Educational Psychology Review, 10(3), 251–296. https://doi.org/10.1023/A:1022193728205

Van den Akker, J. (1999). Principles and methods of development research. In J. Van den Akker, R. M. Branch, K. Gustafson, N. Nieveen & T. Plomp (Eds.), Design approaches and tools in education and training, (pp. 1–14). Springer. https://doi.org/10.1007/978-94-011-4255-7_1

Van Someren, M. W., Barnard, Y. F., & Sandberg, J. A. C. (1994). The think aloud method: A practical approach to modelling cognitive processes. AcademicPress.

Weiland, T. (2017). The importance of context in task selection. Teaching Statistics, 39(1), 20–25. https://doi.org/10.1111/test.12116

Wickham, H. (2016). ggplot2: Elegant graphics for data analysis. Springer-Verlag.

Wickham, H. (2017). Tidyverse: Easily install and load the “tidyverse”. CRAN. https://CRAN.R-project.org/package=tidyverse

Wiedemann, K., Chao, J., Galluzzo, B., & Simoneau, E. (2020). Mathematical modeling with R: Embedding computational thinking into high school math classes. ACM Inroads, 11(1), 33–42. https://doi.org/10.1145/3380956

Wild, C. J., & Pfannkuch, M. (1999). Statistical thinking in empirical enquiry. International Statistical Review, 67(3), 223–248. https://doi.org/10.1111/j.1751-5823.1999.tb00442.x

Wild, C. J., Pfannkuch, M., Regan, M., & Parsonage, R. (2017). Accessible conceptions of statistical inference: Pulling ourselves up by the bootstraps. International Statistical Review, 85(1), 84–107. https://doi.org/10.1111/insr.12117

Wouters, P., Paas, F., & van Merriënboer, J. J. (2008). How to optimize learning from animated models: A review of guidelines based on cognitive load. Review of Educational Research, 78(3), 645–675.

https://doi.org/10.3102%2F0034654308320320

Zieffler, A., Justice, N., delMas, R., & Huberty, M. D. (2021). The use of algorithmic models to develop secondary teachers’ understanding of the statistical modeling process. Journal of Statistics and Data Science Education, 29(1), 131–147. https://doi.org/10.1080/26939169.2021.1900759

Downloads

Published

2022-07-04