Proposal view
| Proposal Type: | Symposium |
|---|---|
| Domain: | Learning and Cognitive Science |
| SIG: | Individual Differences in Learning and Instruction |
| Scheduling category: | Assessment methods |
| Type | Submitted Symposium |
| Title | Measuring learning strategies: What are we measuring? |
| Abstract | The ongoing focus on learning strategies and self-regulation in educational research and practice is resulting in various measurement methods. However, different methods may be aimed at assessing different facets of learning. It is important to execute multi-method research to evaluate disparate measurement methods in relation to each other. Within this symposium, all contributions are meeting this call. Muis and Winne compare three popular self-report inventories. Bråten, Hagen, and Strømsø administer a task-specific inventory which they compare to measures that involve topic knowledge, comprehension and traced strategy use. In searching for an alternative to self-reports, Cromley and Azevedo construct a multiple-choice strategy use measure that require the students to enact strategies while reading. This measure is compared with a number of more standard comprehension-related measures. Muis and Winne report low evidence of convergent validity between the three inventories, whereas Bråten and colleagues and Cromley and Azevedo report promising validation data between the different measures. In addition to the three empiric studies, Schellings and Van Hout-Wolters present, after an overview of methods for measuring learning strategies, some specific explanations why different measurement methods lead to different measurement results. In two contributions of this symposium, the advantages of inventories are observed: they are easy to administer in large-scale testing and do not require expert scoring (Bråten et al.; Schellings & Van Hout-Wolters). The instrument of Cromley and Azevedo is suited for large-scale testing, as well. However, these researchers also refer to the richness of collecting think-alouds and other process data in smaller-scale studies. In both the contributions of Muis and Winne, and of Schellings and van Hout-Wolters, the conclusion is drawn that researchers should be cautious en selective in the method they use to assess learning strategies. |
| Equipment |
Overhead projector Computer and data projector / beamer |
| Keywords | Assessment methods Cognitive Skills Learning theory |
| Chairperson list | |||||
|---|---|---|---|---|---|
| First Name | Last Name/Surname | Institution | Country | EARLI Number | |
| Gonny | Schellings | University of Amsterdam | Netherlands | g.l.m.schellings@uva.nl | |
| Organiser list | |||||
|---|---|---|---|---|---|
| First Name | Last Name/Surname | Institution | Country | EARLI Number | |
| Gonny | Schellings | University of Amsterdam | Netherlands | g.l.m.schellings@uva.nl | |
| Bernadette | Van Hout-Wolters | University of Amsterdam | Netherlands | b.h.a.m.vanhout-wolters@uva.nl | |
| Discussant list | |||||
|---|---|---|---|---|---|
| First Name | Last Name/Surname | Institution | Country | EARLI Number | |
| Danielle | McNamara | The University of Memphis | United States | d.mcnamara@mail.psyc.memphis.edu | |
| Paper Details |
|---|
| Paper type | Empirical |
|---|---|
| Title | Measuring Self-Regulated Learning: Assessing the Validity of Three Popular Self-Report Inventories |
| Abstract | Construct validity research is necessary to clarify research on self-regulation and provide a stronger basis for future research. We used multitrait-multimethod (MTMM) analysis to assess convergent and discriminant validity of three self-report self-regulation measures: the Learning and Study Strategies Inventory (LASSI; Weinstein, 1987), the Motivated Strategies for Learning Questionnaire (MSLQ; Pintrich, Smith, Garcia, & McKeachie, 1993), and the Metacognitive Awareness Inventory (MAI; Schraw & Dennison, 1994). Method bias across all inventories also was examined. Three hundred eighteen undergraduate students were recruited to participate in research on perceptions about studying and study methods. Participants spent 30-60 minutes completing all three inventories. Evidence for convergent validity was found at the matrix level but was attenuated at the individual parameter level. Evidence for discriminant validity among traits was modest. Common method bias was evident across all measures. Results revealed these inventories yielded different results. Accordingly, researchers should be selective in the inventory they use to assess self-regulated learning. |
| Summary | Aims Research investigating self-regulated learning is prominent in educational psychology. Self-regulated learners are more strategic and perform better than less self-regulated learners (Zimmerman, 1990). Theoretically, SRL can be defined as an aptitude (e.g., Snow, 1996) and as an event (e.g., Winne, 2001). As an aptitude, SRL is considered to be relatively trait-like. From this view, learners have relatively stable and contextualized approaches to selecting learning strategies. For SRL as an event, learners regulate learning “on-the-fly” (Winne, Jamieson-Noel, & Muis, 2002) by metacognitively monitoring features of their engagement and adjusting learning strategies as work on a task unfolds. Generally, SRL is measured as an aptitude using self-report inventories (Winne & Perry, 2000) wherein students rate statements about various learning strategies. Three widely used self-report measures are the Learning and Study Strategies Inventory (LASSI) (Weinstein, 1987), the Metacognitive Awareness Inventory (MAI; Schraw & Dennison, 1994), and the Motivated Strategies for Learning Questionnaire (MSLQ) (Pintrich, Smith, Garcia, & McKeachie, 1993). In light of widespread use of these inventories, it is important to inform the field about their psychometric qualities. Because operational definitions and theoretical frameworks that generate measures of SRL can substantially affect its relations with other constructs, Winne et al. (2002) questioned: What influences do response formats or other methodological features such as instructions have on measuring SRL? They called for multitrait-multimethod (MTMM) studies to contribute to answering these questions (Winne et al., 2002). We respond to this call by comparing these three frequently used inventories within the MTMM tradition. Prior to analyzing data, we examined each instrument to identify similarities in theoretically defined constructs. Based on our analysis, we chose four subscales on each instrument that represented the same theoretical constructs. Accordingly, our MTMM model includes four traits and three methods. Methodology Undergraduate students (N = 318) participated. Participants spent 60 minutes completing the LASSI, MAI, and MSLQ for a typical course. Results We applied three methods to examine convergent validity, discriminant validity, and method bias: Campbell and Fiske’s (1959) traditional correlation matrix method, matrix-level CFAs, and individual parameter estimates. Due to space limits, we report the first MTMM analysis only. Our MTMM model has four factors—Main Ideas/Organization, Elaboration, Self-Regulation, and Evaluation—and three methods: a 5-point Likert LASSI scale, a 7-point Likert MSLQ scale, and 100-point MAI scale. Convergent validity is supported when correlations across instruments for similar constructs are moderate to high; discriminant validity is supported when correlations between different constructs within an instrument are low. Using Campbell and Fiske’s (1959) criteria, evidence of convergent validity among corresponding subscales across the LASSI, MSLQ and the MAI was modest. Correlations among the Main Ideas/Organization subscales across the inventories were low, .29 to .37. Correlations across the inventories for Elaboration were moderate, .54 to .60. Self-Regulation-related subscale correlations across the inventories were low to moderate, .27 to .55. Finally, subscales for Evaluation had moderate correlations, .39 to .50. Evidence of discriminant validity within inventories was weak to modest. The MAI evidenced the highest correlations across the traits, .51 to .70. Correlations across traits in the MSLQ were lower, .20 to .52. Finally, correlations across the LASSI were lowest, .04 to .44. The LASSI provided greatest evidence of discriminant validity. To test method bias, correlations are averaged over traits within instruments. High values signal method bias. Averages for the three measures were .29 for the LASSI, .43 for the MSLQ, and .63 for the MAI. The LASSI exhibits little method bias. Discussion Researchers must wrestle with variations in methods used to measure SRL when they synthesize results of research and then design studies to advance theory about SRL. Among studies that use self-reports about SRL, the MAI, MSLQ, and LASSI instruments are prevalent. Each inventory, however, was developed from modestly different theoretical standpoints. The developer(s) of these inventories also operationalized different instructions and differing response formats to elicit respondents’ self-reports about elements of SRL. Our multitrait-multimethod study informs researchers about the comparability of data generated when using different inventories. Based on our analyses, we argue it is imprudent to assume convergent validity within facets we identified as common across these inventories. Lack of strong convergence is not due to variation among the inventories’ methods, i.e., the instructions and response formats that are unique to each instrument. Instead, data generated when learners answer items on these inventories are undifferentiated by the variations across the inventories’ instructions and response formats. Our results raise questions about the dependability of prior reviews of research wherein convergence of constructs across these inventories was assumed. On a positive note, this calls for more research that can illuminate the convergence and divergence among subscales on these inventories and across other operational definitions of SRL, such as traces of actual learning activities. This aligns with recommendations made by Winne et al. (2002) that, over time, will advance the field. References Campbell, D., & Fiske, D. (1959). Convergent and discriminant validation by the multitrait-multimethod matrix. Psychological Bulletin, 56, 81-105. Pintrich, P., Smith, D., Garcia, T., & McKeachie, W. (1993). A manual for the use of the motivated strategies for learning questionnaire. MI: U. Michigan. Schraw, G., & Dennison, R. (1994). Assessing metacognitive awareness. Contemporary Educational Psychology, 19, 460-475. Snow, R. (1996). Self-regulation as meta-conation? Learning and Individual Differences, 8, 261-267. Weinstein, C. (1987). LASSI User’s Manual. FL: H&H. Winne, P. (2001). Self-regulated learning viewed from models of information processing. In B. Zimmerman and D. Schunk (Eds.), Self-regulated learning and academic achievement: Theoretical perspectives (2nd ed, pp. 153-189). Hillsdale: LEA. Winne, P., Jamieson-Noel, D, & Muis, K. (2002). Methodological issues and advances in researching tactics, strategies, and self-regulated learning. In P. R. Pintrich & M. L. Maehr (Eds.), Advances in motivation and achievement: New directions in measures and methods (Vol. 12, pp. 121-155). CT: JAI Press. Winne, P. H, & Perry, N. (2000). Measuring self-regulated learning. In M. Boekaerts, P. Pintrich, & M. Zeidner (Eds.), Handbook of self-regulation (pp. 531-566). FL: Academic Press. Zimmerman. B. (1990). Self-regulated learning: An overview. Educational Psychologist, 25, 3-18. |
| Keywords | Assessment methods Cognitive Skills Self regulation |
| Appendices | |
| Authors | ||||||
|---|---|---|---|---|---|---|
| Name | Surname | Institution | Country | EARLI Number | Presenting | |
| Krista | Muis | McGill University | Canada | Krista.Muis@McGill.ca | * | |
| Philip | Winne | Simon Fraser University | Canada | philip_winne@sfu.ca | ||
| Paper type | Empirical |
|---|---|
| Title | Measuring strategic processing when students read multiple texts |
| Abstract | In this study, 216 education undergraduates were administered a 15-item task-specific strategy-use inventory immediately following their reading of seven texts representing partly conflicting views about climate change. The inventory included items focusing on memorizing as much information as possible from the texts and items focusing on cross-text elaboration in the form of integrating, comparing, and contrasting information across texts. Maximum likelihood exploratory factor analysis with oblique rotation using this sample identified two factors, with all items written to assess cross-text elaboration loading on the first factor and all items written to assess memorization loading on the second factor. In a sample of 68 education undergraduates, who were also administered a measure of topic knowledge about climate change before reading and a measure of intertextual comprehension after completing the strategy inventory, we found that topic knowledge was positively correlated with self-reported elaboration and negatively correlated with self-reported memorization. Moreover, self-reported elaboration was positively and self-reported memorization negatively correlated with intertextual comprehension. Hierarchical regression analysis showed that scores on the self-report strategy measures contributed to intertextual comprehension over and above topic knowledge, with self-reported elaboration being a positive and self-reported memorization a negative predictor of comprehension. Finally, in a sample of 19 undergraduates who also took notes while studying the texts, self-reported cross-text elaboration was positively correlated with traced cross-text elaboration and negatively correlated with traced paraphrasing. In contrast, self-reported memorization was negatively correlated with traced cross-text elaboration and positively correlated with traced paraphrasing. Given that our knowledge of the strategic competence required to build integrated representations of controversial topics through the reading of multiple information sources is rather limited and mainly based on very labor-intensive think-aloud methodologies, less suitable for large scale investigations, our preliminary validation data are promising and speak for continued work on this instrument. |
| Summary | Aim The aim was to design and validate a task-specific self-report measure of strategic processing during the reading of multiple texts. Theoretical framework Much research documents that the use of deeper-level strategies is linked to better comprehension when students read single texts (National Reading Panel, 2000). Presumably, more rather than less strategic effort is required when students try to build an integrated understanding by reading multiple texts on a particular topic. In a pioneer think-aloud study, where expert historians and high-school students read multiple texts on a historical event, Wineburg (1991) found that the strategies of sourcing, corroboration, and contextualization were involved in multiple-text comprehension. Later, Wolfe and Goldman (2005), in another think-aloud study using history texts, demonstrated the value of cross-text elaboration, specifically, causal and comparative cross-text self-explanations, when students read multiple texts. As reviewed by Afflerbach and Cho (in press), most of our knowledge about strategic processing of multiple texts is based on think-aloud studies, and, at least to our knowledge, no prior study tried to capture cross-text strategic processing through a self-report inventory. In the current research, we therefore tried to design a task-specific questionnaire targeting superficial as well as deeper-level cross-text strategic processing and examined whether scores on this questionnaire predicted students’ multiple-text comprehension when their topic knowledge was controlled for. Methodology For the purpose of factor analysis, we used data from 216 education undergraduates who were administered a 15-item strategy-use inventory immediately following their reading of seven texts representing partly conflicting views on the topic of climate change. The inventory included five items focusing on memorizing as much information as possible from the seven texts (sample item: I tried to remember as much as possible from all the texts that I read) and 10 items focusing on cross-text elaboration in the form of integrating, comparing, and contrasting information across texts (sample item: I tried to note disagreements between the texts). Each item of the inventory referred to the recently completed reading task as a frame of reference (cf. Bråten & Samuelstuen, 2007). In a sample of 68 education undergraduates, who were also administered a measure of topic knowledge about climate change before reading and a measure of the ability to draw inferences across texts (i.e., intertextual comprehension) after completing the strategy inventory, we examined bivariate relationships as well as the predictability of strategy-scale scores for intertextual comprehension when prior knowledge was controlled for. Finally, in a subsample of these 68 participants, consisting of 19 students who also produced notes while studying the texts, we coded those notes into categories of paraphrases and cross-text elaborations and correlated students’ traced strategy use with their self-reports. Results In the sample of 216 undergraduates, maximum likelihood exploratory factor analysis with oblique rotation identified two factors with high loadings (>.44) and no overlap for any item, having eigenvalues of 5.08 and 2.74, respectively, and explaining 52.1 % of the total sample variation. All 10 items written to assess cross-text elaboration loaded on the first factor and all five items written to assess memorization loaded on the second factor. The reliability estimates (Cronbach’s alpha) were .88 for the first factor and .82 for the second. In the sample of 68 undergraduates, self-reported elaboration and memorization were uncorrelated (r = .03). Moreover, topic knowledge was positively correlated with elaboration (r = .17) and negatively correlated with memorization (r = -.11). With respect to intertextual comprehension, self-reported elaboration was positively correlated (r = .25) and self-reported memorization was negatively correlated (r = -.28) with comprehension performance. In the same sample, we performed a hierarchical regression analysis with topic knowledge entered in the first step and the two strategy measures in the second. Dependent variable was the measure of intertextual comprehension. In the first step, topic knowledge was a positive predictor (β = .23, p = .059). Entering the strategy measures in the second step resulted in a 12 % increment in the explained variance, with self-reported elaboration being a positive predictor (β = .23, p = .053) and self-reported memorization being a negative predictor (β = -.27, p = .023) in this step. Finally, in the sample of 19 undergraduates, correlations between self-reported and traced strategy use showed that self-reported cross-text elaboration was positively correlated with traced cross-text elaboration (r = .60) and negatively correlated with traced paraphrasing (r = -.25). In contrast, self-reported memorization was negatively correlated with traced cross-text elaboration (r = -.20) and positively correlated with traced paraphrasing (r = .16). Conclusions and implications The importance of building integrated representations of controversial topics and events through the reading of multiple information sources is greater than ever, both in and out of school (Rouet, 2006). However, our knowledge of the strategic competence required to do so is thus far rather limited and mainly based upon very labor-intensive think-aloud methodologies. In this study, we presented preliminary results from a first attempt to gauge students’ strategic processing of multiple texts by means of a task-specific self-report inventory that is easily administered and scored, suitable for individual assessment as well as for large-scale investigations. Our preliminary validation data, as promising as they are, speak for continued work on this instrument. References Afflerbach, P., & Cho, B. (in press). Identifying and describing constructively responsive comprehension strategies in new and traditional forms of reading. In S. Israel & G. Duffy (Eds.), Handbook of research on reading comprehension. Mahwah, NJ: Erlbaum. Bråten, I., & Samuelstuen, M.S. (2007). Measuring strategic processing: Comparing task-specific self-reports to traces. Metacognition and Learning, 2, 1-20. Rouet, J.F. (2006). The skills of document use: From text comprehension to Web-based learning. Mahwah, NJ: Erlbaum. Wineburg, S. (1991). Historical problem solving: A study of the cognitive processes used in the evaluation of documentary and pictorial evidence. Journal of Educational Psychology, 83, 73-87. Wolfe, M.B.W., & Goldman, S.R. (2005). Relations between adolescents’ text processing and reasoning. Cognition and Instruction, 23, 467-502. |
| Keywords | Assessment methods Cognitive Skills Comprehension of Text and Graphics |
| Appendices | |
| Authors | ||||||
|---|---|---|---|---|---|---|
| Name | Surname | Institution | Country | EARLI Number | Presenting | |
| Ivar | Braten | University of Oslo | Norway | ivar.braten@ped.uio.no | * | |
| Aste | Hagen | University of Oslo | Norway | a.m.m.hagen@uv.uio.no | ||
| Helge | Stromso | University of Oslo | Norway | h.i.stromso@ped.uio.no | ||
| Paper type | Empirical | ||||||||||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Title | Measuring strategy use in context with multiple-choice items | ||||||||||||||||||||||||
| Abstract | Most studies of reading comprehension strategy use have employed self-report questionnaires. Several alternatives to self-report are available, all of which require the student to actually enact strategies while reading (Kozminsky & Kozminsky, 2001). We present data from three studies in which we developed domain-general and domain-specific multiple-choice strategy use measures and administered them, along with a number of other comprehension-related measures, to high school (N = 177) and undergraduate (N = 185 and 737) students. We report strong concurrent validity data for these measures across all three samples, with the strongest correlations for high school students, and few differences in correlations between the domain-general and domain-specific measures. We close by discussing the advantages of paper-and-pencil strategy use measures for large-scale group testing. |
||||||||||||||||||||||||
| Summary | Introduction Most studies of reading comprehension strategy use have employed self-report questionnaires. Typically, students answer on a Likert-type scale how frequently they use each strategy (e.g., Mokhtari & Reichard, 2002). In addition to self-reporting frequency of use, some questionnaires present scenarios and ask students which strategy they would use (e.g., Schmitt, 1990). However, a number of authors have challenged the validity of these types of measures Baker & Cerro, 2000). Data suggest that students are not accurate at reporting the strategies they “usually use” or “just used,” but are relatively accurate at reporting the strategies they are currently using. Several alternatives are available, all of which require the student to actually enact strategies while reading. In addition to error detection, collecting computer traces or paper-and-pencil artifacts, think-aloud protocols, or eye tracking data, another approach is to ask students to enact a strategy, and then measure whether they have done this accurately by posing multiple-choice questions (Kozminsky & Kozminsky, 2001). Method In the three studies presented below, students answered paper-and-pencil, multiple choice measures during a single session of approximately 90 min. Study 1 In study 1 (Cromley & Azevedo, 2007), we created a domain-general strategy use measure, as well as measures of inference and word reading fluency, and administered these together with standardized reading comprehension and vocabulary items. Participants. Participants were 177 ninth-grade students, selected to be at a wide range of reading comprehension proficiency. [Details omitted for reasons of space.] Measures. Measures were identified or developed for each of the five predictor variables and for reading comprehension. 1) The background knowledge measure was a 13-item multiple-choice measure developed by the researchers based on the content of the Gates-MacGinitie comprehension subtest (MacGinitie, MacGinitie, Marie & Dreyer, 2001). 2) The strategy measure was a 10-item multiple-choice measure developed by the researchers based on Kozminsky and Kozminsky (2001). We chose neighboring passages from original material used in the Gates-MacGinitie and constructed questions that asked students to apply the strategies of hypothesizing, inference, prior knowledge activation, self-questioning, summarizing, and text search to the passages. The vocabulary measure consisted of the 23 odd-numbered questions from the Gates-MacGinitie 7/9 Form S vocabulary subtest. The reading comprehension measure was the comprehension subtest of the Gates-MacGinitie, consisting of 48 items. The inference measure was a 10-item multiple-choice measure developed by the researchers based on the content of the Gates comprehension subtest, using new paragraphs from the same texts as the Gates. The word reading measure was a principal components composite of Woodcock Letter Word Identification and Word Attack (Woodcock, 1997) and a one-minute timed curriculum-based passage reading fluency task. Study 2 In study 2 (Cromley & Snyder, 2007), we created an analogous set of domain-general measures. Participants. Participants were 185 students from a large public university in a large mid-Atlantic US city, recruited from the College of Education. [Details omitted for reasons of space.] Materials. Reading comprehension and component measures included background knowledge, vocabulary, reading comprehension, inference, strategies, and word reading. [Details omitted for reasons of space.] Strategy use measure. A researcher-developed 10-item, test was developed based on Kozminsky and Kozminsky (2001) following the same approach as in Study 1 and to measure the same strategies. Study 3 In study 3 (Cromley, Snyder, Luciw, & Tanaka, 2008), we created an analogous set of domain-specific measures. Participants. The 737 participants were current or prospective biology majors enrolled in a first-semester introductory biology course at a large, moderately-selective urban university in the Mid-Atlantic region of the US. [Details omitted for reasons of space.] Measures. We developed six measures: one for each of the five predictor variables and one for reading comprehension. [Details omitted for reasons of space.] The strategy use measure was a 13-item multiple-choice measure based on Kozminsky and Kozminsky (2001) following the same approach as in Studies 1 and 2 and to measure the same strategies. Results For reasons of space, we report in Table 1 only correlations between the strategy use measure and each of the other measures. Table 1 Correlations of Strategy Use with Comprehension and Component Measures
Note: all Sig at p < .05; different superscripts represent significant differences across studies. The correlations suggest good concurrent validity for the measures. The pattern across age groups is stronger correlations of strategy use to the other variables for the high school sample, and weaker correlations for the two undergraduate samples. With regard to domain-specificity, correlations with inference, vocabulary, and word reading are parallel for the samples answering domain-general questions, and the patterns in these correlations diverge for the biology students. Discussion We created a series of new measures of strategy use based on work by Kozminsky and Kozminsky (2001), measures which show better concurrent validity than the most-frequently-used self-report instruments. They are relatively easy to administer, in that they do not require individual administration nor do they require expert scoring. Theoretically, these results suggest the importance of all of the components for comprehension—accurate strategy use is related to comprehension, but it is also related to vocabulary, inference, and background knowledge; these also have strong relationships to comprehension. The trends suggest that strategy use is more strongly related to the component measures for high school students than for undergraduate students with either type of text. However, this could be due to differences between the student groups or the measures: first, undergraduate students self-select to attend university and have high achievement. Second, we have not worked with large enough samples to equate measures or create scale scores. We believe that for large-scale administration requiring efficiencies of group administration and multiple-choice format, this approach has many advantages. Multiple-choice measures do not capture the richness of comprehension processes in real time, so we are also continuing to collect think-aloud and other process data for smaller-scale studies. |
||||||||||||||||||||||||
| Keywords | Assessment methods Cognitive Skills Comprehension of Text and Graphics |
||||||||||||||||||||||||
| Appendices | |||||||||||||||||||||||||
| Authors | ||||||
|---|---|---|---|---|---|---|
| Name | Surname | Institution | Country | EARLI Number | Presenting | |
| Jennifer | Cromley | Temple University | United States | jcromley@temple.edu | * | |
| Roger | Azevedo | University of Memphis | United States | razevedo@memphis.edu | ||
| Paper type | Theoretical |
|---|---|
| Title | Methods for measuring learning strategies can lead to different results: some explanations |
| Abstract | Both in educational practice and in educational research all kinds of methods are used to record students’ learning strategies. These methods can roughly be divided into methods measuring learning strategies apart from the learning task (off-line methods, such as learning strategy questionnaires) and methods doing this during the execution of the learning task (on-line methods, such as the thinking-aloud method). This paper presents an overview of off-line and on-line measurement methods. To the off-line methods belong, for example, learning strategy questionnaires, interviews, stimulated recall and portfolios. To the on-line methods belong the thinking-aloud method, eye movement measurement, computer logfile method, observation of behavior, trace analysis, and performance assessment. In multi-method research, correlations or other relation measures are calculated between the data gathered with two measurement methods. Low correlations are frequently found between the results of off-line and on-line methods. These low correlations are often exclusively explained by the fact that off-line methods offer less grip on the learning activities actually performed. We consider alternative explanations. A first explanation points to the content of the measurement methods. The methods that are compared may be aimed at measuring different learning strategies. A second explanation regards the learning task. Measurement methods may differ in task-specificity. We emphasize that before choosing an appropriate measurement method, it is important to define beforehand which specific learning strategies one wants to measure and related to which learning task. |
| Summary | Background For many years both educational research and practice has paid attention to students’ learning strategies and their improvement. We mean by ‘learning strategies’ certain combinations of goal-oriented cognitive, metacognitive and affective activities to improve learning. In addition to other overviews, Van Hout-Wolters, Simons and Volet (2000) present a list of forty-five learning strategies. This list illustrates that learning strategies differ very much among one another. Therefore, if one wants to measure learning strategies, it is important to clearly know which learning strategies one wants to measure. Measurement methods A large amount of measurement methods has been developed. These methods differ in many ways. A distinction can be made between two groups of measurement methods: on-line and off-line methods. This is a practical distinction indicating whether the measurement takes place during the student’s learning (on line) or apart from it (off line, that is, when the student is not learning). We present a list of frequently used methods. Off-line methods. Learning strategies questionnaires constitute the most common measurement method. They measure strategies by means of written verbal reports from the students. Oral interviews can be used to ask students about their activities during learning. Questionnaires and interviews can be especially attuned to a certain subject or a certain learning task, but generalquestionnaires and interviews are meant to record how the student goes about in general when studying. To jog the student’s memory, video-recordings of the task execution can be shown during the interviews: the stimulated recall method. In recent years portfolios have been used more often. Portfolios are meant to make visible learning activities by means of collecting student’s personal products. On-line methods. On-line methods record the student’s verbal statements and/or externally observable behavior during task execution. From these records conclusions are drawn about the student’s learning strategies. A popular on-line method is the thinking-aloud method. Students are asked to think aloud continually during studying. Sometimes they are asked to think aloud only at certain marked points. Registration of eye movements can be used to find out which parts of a learning task are studied longer and more often than other parts, and in which order they are being studied. Computer logfiles register duration and sequence in which students watched different screens. By means of observation information can be obtained about external aspects of studying, such as direction of looking, sitting position and facial expression. Trace analysis interprets the traces left by students, that is, external process products such as underlinings, markings and notes. During performance assessment the students demonstrate their activities by means of the execution of a concrete task. Comparing off-line and on-line methods Educational practice and research make much use of off-line methods, especially questionnaires are used. The main reason for this is that on-line methods are more difficult to use in groups of students and that the gathering and processing of the data is more labor-intensive. However, the question remains whether these off-line methods can say anything about the learning strategies actually applied; in other words, do students really perform the strategies they say they perform? In recent years multi-method research has been carried out. In this research, measurement methods are compared by relating data gathered for the same student and the same learning task (cf. Veenman, 2005). The following picture emerges: 1. Comparing on-line methods generally renders highly correlating results. 2. General questionnaires display low correlations with thinking-aloud measurements. 3. Task-specific questionnaires display a variable picture when compared to thinking-aloud measurements. The high correlations between the on-line methods not only indicate the strength of the thinking-aloud method, but they also point to the possibility to replace the thinking-aloud method by for example the logfile method. The fact that the correlations between the questionnaires and thinking-aloud methods are lower is often explained by the fact that off-line methods offer less grip on the learning activities the student actually performed. Students may not to be able to verbally report about their learning activities. However, there are other explanations for these low correlations. The first explanation is that the methods to be compared may be aimed at measuring learning strategies that are different in content. The questionnaire, for instance, can be aimed more at ‘orienting’, ‘structuring’ and ‘evaluating’, whereas the thinking-aloud protocols are analyzed for ‘planning’, ‘monitoring’, and ‘reflecting’. The second explanation regards the learning task. The thinking-aloud method measures the concrete approach to a specific learning task, whereas frequently-used questionnaires examine the student’s approach of learning tasks in general. Correlations between thinking-aloud methods and questionnaires may be higher if the methods to be compared are specified at the same learning task. Task-specific measuring connects to ideas and research from which appears that students’ learning strategies differ for every type of learning tasks or subject (cf. Winne & Hadwin, 1998). Theoretical/educational significance. When selecting a measurement method it is important to question which learning strategies exactly have to be measured. Further, it must be emphasized that it is important for the selection of a measurement method to know at which specific learning task it will be aimed. The EARLI-paper will pay attention to the question in how far the individual measurement methods render a clear and accurate picture of students’ learning strategies. Further, we will also go into the methodological and practical considerations in selecting an appropriate measurement method. References Van Hout-Wolters, B.H.A.M., Simons, P.R.J., & Volet, S. (2000). Active learning: Self-directed learning and independent work. In P.R.J. Simons, J.L. van der Linden, & T.M. Duffy (Eds.), New Learning (pp. 21-37). Dordrecht: Kluwer. Veenman, M.V.J. (2005). The assessment of metacognitive skills: What can be learned from multimethod designs? In C. Artelt, & B. Moschner (Eds), Lernstrategien und Metakognition: Implikationen für Forschung und Praxis (pp. 75–97). Berlin: Waxmann. Winne, P.H., & Hadwin, A.F. (1998). Studying as self-regulated learning. In: D.J. Hacker, J. Dunlosky & A.C. Graesser (Eds.), Metacognition in educational theory and practice (pp. 277-304). Hillsdale, NJ: Erlbaum. |
| Keywords | Assessment methods Cognitive Skills Meta-cognition |
| Appendices | |
| Authors | ||||||
|---|---|---|---|---|---|---|
| Name | Surname | Institution | Country | EARLI Number | Presenting | |
| Gonny | Schellings | University of Amsterdam | Netherlands | g.l.m.schellings@uva.nl | * | |
| Bernadette | Van Hout-Wolters | University of Amsterdam | Netherlands | b.h.a.m.vanhout-wolters@uva.nl | ||

