
True-false tests often get little to no respect. One measurement text author (references for citations are listed in the "Testing References" file in this server) writes that "...the teacher who is interested in reliable, valid, and fair assessment of classroom learning will avoid using [true-false tests] when attempting to measure this learning, for they usually serve little or no measurement purpose..." (Storey, 1970). Ebel and Frisbie (1991), who provide perhaps the most thorough as well as the most positive discussion of the true-false item form, give contrary opinion by summarizing the argument for the value of such items:
True-false items do have a number of advantages. Their apparent simplicity, however, may lead the unwary teacher to construct true-false tests that justify Storey's characterization of them.
True-false items share most of the advantages of "objective" or selected-response item forms:
In comparison with other selected-response items, e.g., multiple-choice, true-false items have several additional advantages:
The wide range of items that can be used in a true-false test minimize the effects of guessing and lower item discrimination; therefore true-false tests are equal in quality to multiple-choice tests and can be constructed for administration in a given testing time. The key is to write sound test items for either form.
TopWrite items that test significant material, not trivial details. The correct answer should require the specialized knowledge being tested, not common sense.
Focus each item on a single idea so that students do not have to deal with the possible truth or falsity of two or more propositions at once. Compound statements can sometimes be divided into two separate items.
Express each item in simple and clear language.
Provide sufficient information in the item to allow its truth to be judged. Include attribution of opinion or other context where appropriate.
Word items precisely so that they can be unequivocally determined to be true or false. Use quantitative rather than qualitative language or make comparative rather than absolute statements. The keyed response should be clearly defensible and agreed on by experts in the field.
Reformulate principles or use examples, rather than use the same language as the text or reference materials, especially stereotyped phrases, to avoid encouraging reliance on rote memorization.
Avoid negatively worded statements in general and particularly double negatives. If the statement cannot be formulated positively, be sure to emphasize negative terms with underlining or bolding.
Do not use "specific determiners," such as "always", "never", "every", just to make a proposition false; conversely, do not use qualifiers such as "usually", "often", and "seldom" only in true statements. Such terms can serve as extraneous cues to test wise students unless the pattern of their use is varied.
If a proposition expresses a relationship, such as cause and effect or premise and conclusion, present the correct part of the statement first and vary the truth or falsity of the second part.
Use somewhat more false than true statements because false statements discriminate better between high-ability and low-ability students. (Students tend to mark true more often than false when guessing blindly.)
Make true and false items of approximately equal average length throughout the test.
Randomize the sequence of true and false statements.
Make use of popular misconceptions or beliefs as false statements.
Try to word items so that the incorrect response is more plausible or attractive to those without the specialized knowledge being tested.
Make use, where appropriate, of multiple true-false items, in which a single stem is followed by several statements or phrases based on the stem, each of which requires a true-false response. (Note that this guideline is the complement of the guideline to avoid writing multiple-choice items in which the options are just a set of true-false statements. The latter are inefficient because they provide just one scored response per item, whereas the multiple true-false item provides as many scored responses as there are parts to the item.)
Consider the use of introductory material to base true false items such as maps, graphs or written passages.