We’ve updated our privacy policy so that we are compliant with changing global privacy regulations and to provide you with insight into the limited ways in which we use your data. Show You can read the details below. By accepting, you agree to the updated privacy policy. Thank you! View updated privacy policy We've encountered a problem, please try again. Home > Tests Constructing Good Items Why Focus on Multiple-choice Items? The focus of this assessment guide is on the construction of tests using selected-response items. (See Tasks to read about the differences between selected-response tests and other types of assessments.) One type of selected-response item, the True-False question, provides a greater risk of guessing (50%) and, thus, does not typically discriminate among those who know the material and those who do not as effectively as multiple-choice items. Thus, the construction of T/F items will not be addressed in this chapter. Similarly, I will not address fill-in-the-blank items because they are less common, and because they are extremely difficult to construct so that only one possible answer could complete the blank. Instead, the following section will primarily address the construction of the most common selected-response item, the multiple-choice question. Terminology for Multiple-choice Items Before discussing the construction of such items, let's review the terminology commonly used to describe the parts of multiple-choice questions. The diagram below labels the specific components of a multiple-choice item.
Guidelines for Constructing Good Items: Eliminate Rival Explanations In the previous section on what the test should assess, I identified the first step in test construction: reviewing the standards to be addressed. The items on the test must effectively capture a representative sample of the concepts and skills laid out in the standards to generate valid inferences from student performance. So, make sure the items that you construct align with your standards. Validity will also be affected by how closely the selection of a correct answer on a test reflects mastery of the material contained in the standards. If a student selects the correct answer to a multiple-choice question, you want to be able to conclude with some confidence that the student understood the concept. However, there are a myriad of other reasons (rival explanations) the student might choose the correct alternative. For example, she might have closed her eyes and picked an answer at random. She might have been able to rule out the distractors because they were implausible or because other clues pointed her to the right answer without requiring her to understand the concept. In these cases the student selected the correct answer without understanding the concept. You want to be able to eliminate these rival explanations so that you can discriminate students who understand the concept from those who do not understand it. Obviously, you cannot eliminate the first rival explanation mentioned - guessing. However, most other rival explanations can be eliminated or reduced with careful construction of the test items. What follows are some strategies to eliminate as many rival explanations as possible. The guidelines can be understood as either
Cognitive load theory (and other related theories) recommends avoiding elements of instruction or assessment that will overload students' capacity to consciously process the immediate task on which they are working. A test is a task that requires considerable conscious attention. So, it is important to remove any elements of a test item that might distract or unnecessarily increase the cognitive load a student encounters. Cognitive load theory (e.g., Sweller, 1988; 1994) emphasizes the importance of the processes and limitations of working memory, the level of memory that is consciously processing information involved in immediate tasks. A considerable amount of research has found that much of our information processing occurs outside of our conscious awareness. That seems necessary because the conscious resources we are able to employ to attend to or make sense of information are quite limited. Thus, it does not take much to distract or interfere with our ability to consciously process information and, thus, overload our working memory.
1. Keep the stem simple, only including relevant information. Example:
*an asterisk indicates the correct answer.
Example:
Example:
Example:
If you are going to use NOT or EXCEPT, the word should be highlighted in some manner so that students recognize a negative is being used. 5. Include the same number of alternatives for each item. The more consistent and predictable a test is the less cognitive load that is required by the student to process it. Consequently, the student can focus on the questions themselves without distractions. Additionally, if students must transpose their answers onto a score sheet of some kind, there is less likelihood of error in the transposition if the number of alternatives for each item is always the same.
It is easy to inadvertently include clues in your test items that point to the correct answer, help rule out incorrect alternatives or narrow the choices. Any such clue would decrease your ability to distinguish students who know the material from those who do not, thus, providing rival explanations. Below are some common clues students use to increase their chance of guessing and some advice on how to avoid such clues. (I bet you remember using some of these yourself!)
Example:
7. Avoid including an alternative that is significantly longer than the rest. Example:
Students often recognize that a significantly longer, more complex alternative is commonly the correct answer. Even if the longer alternative is not the correct answer, some students who might otherwise answer the question correctly could be misled by this common clue and select the wrong answer. So, to be safe and avoid a rival explanation, keep the alternatives similar in length.
Example:
Another example of implausible distractors:
Example:
Example:
Of course, it is not usually obvious what knowledge students possess prior to your instruction. So, it may be helpful in certain courses to give a brief pre-test at the beginning of the course to determine the level of the students' background knowledge. That information will assist you in designing your instruction and your assessments.
It is sometimes easier for students to narrow the number of possible alternatives on such questions without fully understanding the concepts tested. For example, when all of the above is an alternative, all a student needs to do is recognize that one of the other alternatives is not true to also be able to rule out "all of the above." Thus, an item with four possible alternatives has now been reduced to just two, increasing the chances of guessing correctly. Similarly, if a student recognizes that two of the four alternatives are true, the student knows that the answer is all of the above without having to know whether the remaining alternative is true or not. Such guessing requires some knowledge of the material, but not as extensive understanding as if they had to consider all four of the alternatives. Additionally, all of the above and none of the above have been misused as alternatives on some tests because students have learned that all of the above or none of the above is almost always the right answer when it is used on those tests. So, if you use all of the above or none of the above, do not always make it the right or wrong answer. Generally, research has found more problems with the use of "all of the above" than with "none of the above," but the common recommendation for both is to limit their use (Haladyna, Downing, & Rodriguez, 2002).
Even if students have not yet learned that the world is black and white, they have learned that alternatives on tests that include terms such as always or never are almost always a wrong answer. Thus, students are able to eliminate an alternative without understanding the material.
Example:
Because mercredi appears in three of the four alternatives in the first example and terms such as maison only appear in one of the alternatives, students will often correctly conclude that mercredi should be included in the correct answer. Thus, students might eliminate d. as an alternative and increase the likelihood of guessing correctly. The solution is to evenly distribute the different terms as much as possible, as in the second example above.
Students can certainly memorize phrases or sentences without comprehending them. So, if you use wording in an item that too closely resembles the wording in the text, it is possible that students can answer a question correctly without understanding it. More commonly, students may recognize certain language or terms that they saw in a text and select the alternative that includes that language without comprehending the concepts. The obvious solution is to paraphrase the main ideas you are testing.
Example:
Obviously, if you only have two alternatives then the chance for guessing increases significantly as there will be a 50% chance of getting the item correct just by guessing. If you include five or more alternatives the item becomes increasingly confusing or requires too much processing or cognitive load. Additionally, as the number of distractors increases, the likelihood of including a bad distractor significantly increases. Thus, research finds that providing three or four alternatives leads to the greatest ability to distinguish between those test-takers who understand the material and those who do not (Haladyna, Downing, & Rodriguez, 2002; Taylor, 2005).
In other words, if students find a pattern in which answers are the correct ones (e.g., "c" is usually the right answer or "d" is never the right answer) then they can increase their chances of correctly guessing, providing another rival explanation.
If you include items on the test that are intentionally so easy that virtually everyone will answer them correctly, then you have reduced the discriminability of the test. Was the purpose to be amusing? Find another way to do so. Yes, one giveaway question on a 50-item test will not make that much difference, but when you consider all the different little things mentioned above that could affect the test's discriminability it is best to avoid all of them. Moreover, you have missed one more opportunity to assess learning. 19. Avoid providing clues for one item in the wording of another item on the test. Example:
Summary list of guidelines To summarize:
1. Keep the stem simple, only including relevant information. 2. Keep the alternatives simple by adding any common words to the stem rather than including them in each alternative. 3. Put alternatives in a logical order. 4. Limit the use of negatives (e.g., NOT, EXCEPT). 5. Include the same number of alternatives for each item.
6. Keep the grammar consistent between stem and alternatives. 7. Avoid including an alternative that is significantly longer than the rest. 8. Make all distractors plausible. 9. Avoid giving too many clues in your alternatives. 10. Do not test students on material that is already well-learned prior to your instruction. 11. Limit the use of "all of the above" or "none of the above." 12. Limit the use of always, never or similar terms. 13. If item alternatives include multiple terms or series of concepts, avoid over-representing or under-representing certain terms or concepts. 14. Avoid direct quotations from a text in an item. 15. Avoid alternatives that are opposites if one of the two must be true. 16. Include three or four alternatives for multiple-choice items. 17. Distribute correct answers fairly evenly among the "letters." 18. Avoid "giveaway" items. 19. Avoid providing clues for one item in the wording of another item on the test. 20. WORTH REPEATING: Make sure your items actually measure what they are intended to measure. Note: Some of the above examples are courtesy of Lockport Township High School, Lockport, Illinois. What are the factors to be considered in constructing the various types of tests?The five factors to consider when selecting test procedures. Selection Methods. ... . Factors to Consider for Selection. ... . Objectivity. ... . Validity. ... . Reliability. ... . Economy.. What is the most important criteria in the test construction?Validity – is the degree to which the test measures what is intended to measure. It is the usefulness of the test for a given purpose. It is the most important criterion of a good examination.
What are the different guidelines in constructing multiple choice type of test?14 RULES FOR WRITING MULTIPLE-CHOICE QUESTIONS.. Use Plausible Distractors (wrong-response options) ... . Use a Question Format. ... . Emphasize Higher-Level Thinking. ... . Emphasize Higher-Level Thinking (continued) ... . Keep Option Lengths Similar. ... . Balance the Placement of the Correct Answer. ... . Be Grammatically Correct.. What guidelines should be followed in arranging the items in a test?Arrange items systematically. If the test contains several types of items, group similar items (such as all multiple choice items) together. Provide a clear set of directions for each new group of items. Place the most difficult questions near the end of the test so that students have time to answer more questions.
|