Nevertheless, certain myths persist in the writings and actions of many professional psychologists who are either unaware of this research or choose to ignore it. As part of the childs evaluation, the parent is asked to complete a rating form designed to assess the behavioral-emotional functioning of the child. The egalitarian fallacy contends that all human populations are in fact identical on all mental traits or abilities. We must practice intelligent testing (Kaufman, 1994). However, much of what has been learned is summarized in book chapters and entire books easily accessible to mainstream psychologists and some of the empirical and methodological research has appeared in the most widely subscribed journals of the American Psychological Association. Factors such as proper seating, adequate lighting, strictly controlled time limits, and test proctor responsibilities can be adequately controlled with minimal effort within a test setting, but for nationally based tests that occur (e.g., the SAT, the National Council Licensure Examination for nurses, etc. 34, pp. After identifying the eight most racially discriminating and eight least racially discriminating items on the Wonderlic Personnel Test, Jensen (1976) asked panels of five Black psychologists and five Caucasian psychologists to sort out the eight most and eight least discriminating items when only these 16 items were presented to them. Such models specify various parameters that describe the behavior of the item within the model; most IRT models include one, two, or three parameters, which may be graphically represented in an item characteristic curve (ICC). Many studies now demonstrate this stereotype effect, but some incorporate controversial statistical procedures that might confound the results by equating the two groups (i.e., erasing the group differences) on the basis of variables irrelevant to the effect of the stereotype threat. Some research suggests these gaps are narrowing (Dickens & Flynn, 2006; Neisser et al., 1996; Nisbett, 2009), while other research disputes the narrowing of gaps (Rushton & Jensen, 2005, 2010). With regard to bias in prediction, the empirical evidence suggests conclusions similar to those regarding bias in test content and other internal characteristics. Although the resulting debate has generated a number of models from which to examine bias, these models usually focus on the decision-making system and not on the test itself. The many definitions of test bias. In J. R. Graham, J. . This method essentially holds the total score constant across the groups, and the resulting differences may be used to identify problematic items if it is significant and particularly if meaningful. on the Manage Your Content and Devices page of your Amazon account. A test or test item can be culturally loaded without being culturally biased. We must remember that it is the purpose of the assessment process to beat the prediction made by the test, to provide insight into hypotheses for environmental interventions that prevent the predicted failure or subvert the occurrence of future maladaptive behavior. The CTBH represents the contention that any gender, ethnic, racial, or other nominally determined groups who perform differently on mental tests are due to inherent, artifactual biases produced within the tests through flawed psychometric methodology. Lower test scores for minorities, then, may reflect only this intimidation and difficulty in the communication process, not lower ability.
New guidance on race and ethnicity for psychologists ), Handbook of psychology: Assessment psychology (pp. Similarly, Helms (1992) proposed a cognitive-difference model that emphasizes differences in European-centered and African-centered values and beliefs. Reynolds, C. R. (1987). within the practice of psychological assessment and/or evaluation. Reynolds, C. R. (1982). American Psychologist, 30, 1541 This is the report of a group appointed by the APAs Board of Scientific Affairs to study the use of psychological and educational tests with disadvantaged students--An early and influential article. In this method, the partial correlation between item score and the nominal variable of interest (e.g., sex) is calculated, partialling the correlation between total test score and the nominal variable. When the objections were first raised, very little data existed to answer these charges. If these methods turn out to be culturally biased when used with native-born American ethnic minorities, what about other alternative methods of making these decisions that are inherently more subjective, e.g., interviews, observation, review of references, performance, or portfolio assessments? Figure 15.1 demonstrates an ICC that uses a one-parameter (also known as the Rasch Model after its originator) unidimensional model which is widely used in aptitude testing. Reynolds, C., Willson, V., & Chatman, S. (1984). Stereotype threat research then goes on to argue, as one example, that if one takes a test of mental ability, but the examinee is told it is not for evaluating the test taker, but to examine the test itself and no racial identifier is requested, then racial group differences in performance on the test will disappear. This would in turn create major upheavals in professional psychology, because the foundations of clinical, counseling, educational, industrial, and school psychology are all strongly tied to the basic academic field of individual differences. Group differences are believed then to stem from characteristics of the tests and to be unrelated to any actual differences in the psychological trait, skill, or ability in question. However, we must hold to the data even if we do not like them. The three parameters in the three-parameter (3P) model are (a) discrimination power of the item, or slope of the ICC, (b) item difficulty, located at the point on the difficulty level of the latent trait at which the examinee has a 50% chance of correctly answering the item, and (c) guessing parameter. Test bias in the assessment of intelligence and personality. Because most psychologists are White and speak only standard English, they may intimidate Black and other ethnic minorities and so examiner and language bias result. The most frequently stated problems fall into one of the following categories (Reynolds, 2000; Reynolds et al., 1999; Reynolds & Ramsay, 2003). Such attempts at solutions are difficult. In J. R. Graham & J. Implications Of Cultural Bias In Psychological Research: Carrying our culturally bias research in Psychology can lead to a number of negative side effects which calls in question the practice of Psychology. The exceptional minority child: Issues and some answers. While differences in subgroups scores should increase scrutiny that possible test bias exists, group differences alone do not indicate that a testing application is biased or unfair. Much effort has been expended to determine why group differences occur but we do not know for certain why they exist. Notes on a bit of psychological nonsense: Race differences in intelligence. The controversy over test bias is also not about blatantly inappropriate administration and usage of mental tests. He prohibited (or enjoined) the use of IQ tests with Black children in the California public schools. Behavioral and Brain Sciences, 3, 352. Halpern, D. F. (1997). Steele, C. M., & Aronson, J. Hostname: page-component-7494cb8fc9-hzz9v The belief of many people in the mean differences as bias definition is quite likely related to the nature-nurture controversy at some level. ), Comprehensive clinical psychology (pp. Psychology, Public Policy, and Law, 6, 144150. It is imperative that the evaluation of bias in testing be undertaken from the standpoint of scholarly inquiry and debate. Contrary to the situation decades ago when the current controversy began, research now exists that examines many of these concerns and does so in great detail. Embretson, S., & Reise, S. (2000). Much of the rancor in psychology and education regarding proper definitions of test bias is due to the divergent uses of this term in general but especially by professionals in the same and related academic fields. Figure 15.2 demonstrates this DIF gap across two hypothetical groups. Steele, C. M., Spencer, S. J., & Aronson, J. Psychological tests measure traits that are not directly observable, subject to differences in definition, and measurable only on a relative scale. (1993). The first two sections discuss best practice-related issues in assessment in relation to (1) clinical diagnosis and (2) psychological testing and assessment. Standardized tests intend to measure intelligence and general knowledge, but they are normed based on the knowledge and values of the majority groups, which can create bias against minority groups, levels of performance on cognitive tasks between two groups historically (and mistakenly) are believed to constitute test bias by a number of writers (e.g., Alley & Foster, 1978; Chinn, 1979; Hilliard, 1979). Ethnic group differences in test performance also occur and are the most controversial and polemic of all group differences. The cultural shaping of depression: Somatic symptoms in China, psychological symptoms in North America? The partial r and subsequently its effect size stabilize at smaller sample sizes compared to the IRT approach. No one has been able to identify those characteristics of an item that cause the item to be biased. There is no single Almost without exception, those studies have produced results that can be adequately depicted by Fig. We find it anathema that ethnic differences in aptitude or ability might be real; we simply do not want it to be so. Psychologists, in order to make consistent interpretations of test score data, must be certain that the test(s) measures the same variable across populations. A total of 45 WISC-R items was presented to each judge; these items included the 15 most difficult items for Blacks as compared to Whites, the 15 most difficult items for Hispanics as compared to Whites, and the 15 items showing the most nearly identical difficulty indexes for minority and White children. (1975). ), International handbook of personality and intelligence (pp. Information concerning the home, community, and school environment must all be evaluated in individual decisions. We want everyone to be created equal not just in the sense of worth as a human being as acknowledged in our Constitution, but in the sense of level of aptitude or ability. Since the late 1960s, a substantial body of content and methodological research on bias has been conducted.
Cultural Bias in Psychological Testing - ResearchGate On the other hand, carrying out Psychological research can make us more aware of cultural variations and differences. Note: Equal intercepts and differing slopes result in nonparallel regression lines, with the degree of bias depending on the distance of the individuals score from the origin, Differing Slopes and Intercepts. If this is the first time you use this feature, you will be asked to authorise Cambridge Core to connect with your account. Steele and Aronson in 1995 posited a unique explanation for group differences on mental test scores. Factor analysis allows one to determine patterns of interrelationships of performance among groups of individuals. is added to your Approved Personal Document E-mail List under your Personal Document Settings In this article, we outline the theoretical framework that cultural validity should be the foundation for building validity evidence of an assessment program.
Do You Fire Someone Before Or After Their Shift,
Why Are Olive Trees Banned In Arizona,
Articles C