Currently Skimming:

Test 2: Principles of Learning and Teaching Test: K-6
Pages 371-392

The Chapter Skim interface presents what we've algorithmically identified as the most significant single chunk of text within every page in the chapter.


From page 371...
... Comment: Stating the purpose of the test publicly and having it available for potential test takers are appropriate and consistent with good measurement practice.
From page 372...
... After the External Review Panel reviewed and modified the draft inventory (all by telephone interview), the revised inventory was reviewed by a nine-member Advisory/Test Development Committee (five practicing teachers and four teacher educators with the same qualifications as the External Review Panel)
From page 373...
... Although representation was diverse geographically and by sex and job classification, a larger and more ethnically diverse membership on the External Review Panel would have been preferred. The subsequent review by the Advisory/Test Development Committee helped ensure an adequate list of skills.
From page 374...
... · Procedures used to develop items and tasks (including qualifications of personnel): ETS has provided only a generic description of the test development procedures for all of its licensure tests.
From page 375...
... The ETS standards and the document Establishing the Validity of Praxis Test Score Interpretations Through Evidence Based on Test Content both require that congruence studies be undertaken. As part of the job analysis, the various committee members and the final survey respondents respond to such questions as: 1.
From page 376...
... The interrater reliability estimates for the six constructed-response items combined (calculated appropriately using a multistep process described in the materials provided) for the four test administrations all were greater than .9, suggesting a high degree of consistency in ratings across the six constructed-response items.
From page 377...
... Summary comparative data for all four test administrations are in a single-page document, Principles of Learning and Teaching Grades K-6 Comparative Summary Statistics.) Comment: The interrater reliability estimates are excellent.
From page 378...
... sets its own unique passing score; thus, each state could have a different pass/fail decision point. The statistical report of the October 1995 test administration provides conditional standard errors of measurement at a variety of score points, many of which represent the passing scores that have been set by the 12 different state users.
From page 379...
... and the item score that is hypothesized to have an underlying continuous distribution that is dichotomized as right or wrong. Biserial correlations are computed only when the percent correct is between 5 and 95 and more than half the analysis sample reaches the item.
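The biserial computation described in this excerpt can be sketched as follows. This is a minimal illustration using the standard textbook formula r_b = ((M1 - M0) / s) * (p * q / y), where M1 and M0 are the mean total scores of examinees who got the item right and wrong, s is the standard deviation of total scores, p is the proportion correct, q = 1 - p, and y is the standard normal density at the point that cuts off proportion p. The function name, the use of None for "did not reach the item", the inclusive 5 and 95 percent bounds, and the use of the population standard deviation are all assumptions, not details taken from the ETS materials.

```python
from statistics import NormalDist, fmean, pstdev


def biserial(item_scores, total_scores):
    """Return the biserial correlation between one item and the total score.

    item_scores: 1 (right), 0 (wrong), or None (examinee did not reach the item).
    total_scores: total test score for each examinee, in the same order.
    Returns None when the screening rules from the text are not met.
    """
    # Keep only examinees who reached the item.
    reached = [(i, t) for i, t in zip(item_scores, total_scores) if i is not None]

    # Rule from the text: more than half the analysis sample must reach the item.
    if len(reached) <= len(item_scores) / 2:
        return None

    # Rule from the text: percent correct must lie between 5 and 95
    # (inclusive bounds are an assumption).
    p = sum(i for i, _ in reached) / len(reached)
    if not 0.05 <= p <= 0.95:
        return None

    right = [t for i, t in reached if i == 1]
    wrong = [t for i, t in reached if i == 0]
    s = pstdev([t for _, t in reached])
    if s == 0:
        return None  # no score variance, correlation undefined

    # Ordinate of the standard normal density at the cut point for proportion p.
    std_normal = NormalDist()
    y = std_normal.pdf(std_normal.inv_cdf(p))

    return (fmean(right) - fmean(wrong)) / s * p * (1 - p) / y
```

Because the biserial assumes a normally distributed trait underlying the right/wrong score, it is always at least as large in magnitude as the point-biserial for the same data and can exceed 1 in small or non-normal samples.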
From page 380...
... Although the item discrimination has improved since the base form, continued efforts to eliminate items that have biserial correlations lower than .20
From page 381...
... (Recall that one of the questions external reviewers answered in the discussion of the match of the items to the test specifications was a fairness question.) This summary was developed from ETS's Overview: ETS Fairness Review.
From page 382...
... The recommended cut score is the average cut score for the entire group of panelists. For the constructed-response items, the same panelists who perform the modified Angoff method use one of two methods for setting a passing score on the constructed-response items.
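The averaging step for the multiple-choice portion can be sketched as follows. This assumes the common form of the modified Angoff method, in which each panelist estimates, for every item, the probability that a minimally qualified candidate answers correctly, and a panelist's cut score is the sum of those estimates. Only the final averaging across panelists is stated in the excerpt; the function name and data layout are hypothetical.

```python
from statistics import fmean


def angoff_cut_score(ratings):
    """Return the recommended cut score from a panel's Angoff ratings.

    ratings: one list per panelist, holding that panelist's estimated
    probability (0 to 1) of a correct answer for each item.
    """
    if not ratings:
        raise ValueError("at least one panelist is required")

    # Each panelist's cut score is the expected number of items correct
    # for a minimally qualified candidate (assumed modified Angoff form).
    panelist_cuts = [sum(item_probs) for item_probs in ratings]

    # The recommended cut score is the average across the whole panel.
    return fmean(panelist_cuts)
```

For example, two panelists rating a two-item test as [0.5, 0.5] and [1.0, 0.5] have individual cut scores of 1.0 and 1.5, so the recommended cut score is 1.25 raw-score points.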
From page 383...
... If ETS uses the procedures described for setting a recommended cut score in each of the states that use this test, the process reflects what most experts in standard setting consider to be sound measurement practice. There is some controversy over the use of the Angoff method, but it remains the most often used method for setting cut scores for multiple-choice licensure examinations.
From page 384...
... In some cases both validity and standard-setting studies may be conducted concurrently by the same panels. Comment: The procedures described by ETS for collecting content validity evidence are consistent with sound measurement practice.
From page 385...
... · Examinees have comparable questions/tasks (e.g., equating, scaling, calibration): The ETS standards and other materials provided suggest that substantial efforts are made to ensure that items on this test are consistent with the test specifications derived from the job analysis.
From page 386...
... · Test security: Procedures for test security at administration sites are provided in ETS's 1999-2000 Supervisor's Manual and the 1999-2000 Supervisor's Manual for Nonstandard Test Administrations. These manuals indicate the need for test security and describe how the security procedures should be undertaken.
From page 387...
... The risk of contamination may be moderated somewhat by the use of multiple-choice items to provide measures of all dimensions within the test specifications and by the extensive item review process to which all such tests are subject if the ETS standard test development procedures are followed. No studies on the coachability of this test were provided.
From page 388...
... The 1999-2000 supervisor's manuals describe the space and other requirements (e.g., making sure left-handed test takers can be comfortable) for both standard and nonstandard administrations.
From page 389...
... The manuals describe what procedures are to be followed to collect the test materials and to ensure that all materials are accounted for. The ETS standards also speak to issues associated with the appropriate administration of tests to ensure fairness and uniformity of administration.
From page 390...
... ETS may also cancel a test score if it finds that a discrepancy in the process has occurred. Score reports to institutions and states are described as containing information about the status of the examinee with respect to the passing score appropriate to that recipient only (e.g., if an examinee requests that scores be sent to three different states, each state will receive pass/fail status only for itself)
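The recipient-specific reporting rule in this excerpt can be sketched as follows. This is an illustration only: the function name, the state labels, and the passing scores are hypothetical, and the assumption that a score equal to the passing score counts as a pass is not stated in the source.

```python
def score_reports(score, passing_scores, recipients):
    """Build one report per requested recipient state.

    score: the examinee's test score.
    passing_scores: mapping of state -> that state's own passing score.
    recipients: states the examinee asked to receive the score.
    Each state sees pass/fail status against its own passing score only.
    """
    return {
        state: "pass" if score >= passing_scores[state] else "fail"
        for state in recipients
    }


# Hypothetical passing scores for three hypothetical states.
example_cuts = {"State A": 160, "State B": 170, "State C": 165}
reports = score_reports(165, example_cuts, ["State A", "State B"])
```

Because each state sets its own passing score, the same examinee score can yield a pass in one state and a fail in another, and a state that was not requested as a recipient receives no report at all.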
From page 391...
... COMMENTS: This test seems to be well constructed and has moderate-to-good psychometric qualities. The procedures reportedly used for test development, standard setting, and validation are all consistent with sound measurement practices.
From page 392...
... arcane with workshops on classroom management and instructional strategies. Teachers with Pre.


This material may be derived from roughly machine-read images, and so is provided only to facilitate research.