A few weeks ago, I was asked about a certain test vendor. Being a good researcher, I visited their site and was absolutely astounded by the misinformation contained therein. “Wow,” I thought, “this could be the basis of an entire article about what NOT to believe about testing!” Here are some of the areas where I foresaw major problems for employers who chose to use their tests.
Legality
What they state: Their tests have passed a review to ensure a fully legal program. They can ensure users that they will “fully comply” with the new EEOC rules. A personal letter from an attorney states the test does not violate the law. The test complies with all EEOC regulations. What users should know: According to the Uniform Guidelines on Employee Selection Procedures, under no circumstances will the general reputation of a test or other selection procedures, its author, or its publisher, or casual reports of its validity, be accepted in lieu of evidence of validity. Specifically ruled out are:
Validation Procedures
What they state: This test vendor states that “some people” believe in construct validity and others prefer criterion validity as the best approach. They further state that “common sense, good business sense, and concurrent validity on a local level” is the best approach. They do this by benchmarking the top one-third and bottom one-third of producers on a local level. What users should know: This is absolute nonsense. Here is a partial list why:
Validity of the Advertised Test
What they state: In a national survey conducted on salespeople and commissions, “All of the above values show that there is at best a weak correlation between profile scores and commissions earned. Scores obtained on a national level would be unsatisfactory predictors of commissions earned.” What users should know: What kind of “spin statement” is that? Sales commissions are the number one indicator of sales performance, and the vendor publicly admits its test scores are an unsatisfactory predictor on a national level? It takes five key sales skills to make a good salesperson (i.e., relating, communication, questioning, presenting, and managing the relationship). If these cannot be measured nationally, doesn’t it seem like this test is looking at the wrong things?
Accuracy of the Test
What they state: Both managers and test-takers generally agree with the results of the test. What users should know: Well, duh! The test-taker just checked a dozen items stating he is determined. Should there be any surprise that his or her test scores were high in “determination”? Besides, are we measuring agreement with the test results or are we measuring future job performance?
Theory of the Test
What they state: The test was developed based on Hippocrates’ 2,400-year-old theory of temperaments. What users should know: Hippocrates also believed that black bile, yellow bile, phlegm, and blood were responsible for all physical and mental health. Furthermore, we have learned a few things about testing and hiring in the last 2,400 years.
Test Reports
What they state: The test uses 60 items to generate a two to fifteen page report. What users should know: Any reasonable person should suspect that 60 questions cannot possibly generate an accurate portrait of human personality. Any test promising that much data about a candidate should be examined for abundant filler, boilerplate information, and marketing general nonsense.
Moving Forward
Although this article focused on one single test site, I suggest the comments apply to many applications that ignore best test practices. There is a good reason why there are entire graduate-level courses devoted to doing a job analysis, developing decent tests, legal testing issues, statistics, and experimental design. This field is complex, and filled with opportunity to 1) hire the wrong people, 2) turn away the right ones, or 3) get your company sued. Want to learn more about how to build and use a test? Go to http://www.apa.org/science/standards.html. Here is an outline of its contents: Part I: Test Construction, Evaluation, and Documentation
Part II: Fairness in Testing
Part III: Testing Applications