VALIDITY

TEST CONSTRUCTION PROCESS

 

Reliability

·       Reliability is the degree to which the scores are consistent, dependable, or repeatable, that is, the degree to which they are free of errors of measurement (AERA, APA, & NCME, 1985, p.93).

 

Types of reliability:

·       Stability

·       Internal consistency

·       Decision consistency (criterion)

         

 

Validity

 

·        Validity is defined as the degree to which a certain inference from a test is appropriate or meaningful and validation as the process of the investigation by which the degree of validity of a proposed test interpretation can be evaluated. (AERA, APA, & NCME, 1985, p.94).

 

« Validity is inferred not measured »

 

C        Type of validity

·        Content validity (substantive stage)    

·        Construct validity (structural stage)

·        Criterion related validity (external stage)

·        Concurrent & predictive validity

 

 


Process of test construction

Simplified process

Substantive stage

Theory-based (including previous research and observations)

Generate theoretical and empirical definitions

Gather content-related evidence

Consider construct’s under-representation and relevancy

Step 1

Identify the behavior(s)/construct(s) you want to measure (Theory-based construct or observation)

Step 2

Identify the behaviors to represent the construct(s)

Step 3

Define the behavior(s) and identify the proportion of items needed to measure the construct(s).  Generate theoretical and empirical definitions and develop a set of test specifications.

Step 4

Generate an item pool

Step 5

Determine guidelines for scoring and interpretation

Step 6

Conduct an expert evaluation (cognitive interviews, expert review, item-content matching, . . .)

Step 7

Consider the inclusion of validation items “social desirability”

Step 8

Preliminary item try out and revise as necessary (wordings, understanding, . . .)

Structural stage (internal relations among the questions)

Item/subscale intercorrelations (item analysis-alpha/ reliability)

Generalizability theory (reliability analysis)

Item response theory (item analysis - reliability)

Exploratory factor analysis (not hypothesis driven)

Confirmatory factor analysis (hypothesis driven)

External stage (relations with other constructs)      

Correlations with other tests (partial correlations, regressions, ...)

Multitrait-multimethod matrix

Group differentiation (Anova, Ancova, Manova, Mancova, Discriminant analysis, ...)

existing or know groups

experimental manipulations

Structural equation modeling

 

Step 9

Develop a validation plan (sample size, sample characteristics, stratifications, . . .)

Step 10

Validation:  Include more than one method in each of the group of methods presented above.

Step 11

Assess the behavioral stability of the test (short and long term)

Step 12

Determine the preliminary psychometric properties of the test, revise and redo some of these steps as needed.

  Next