Abstract
The University of Tehran English Proficiency Test (UTEPT) is a high-stakes entrance examination taken by more than 10,000 master’s degree holders annually. The examinees’ scores have a significant influence on the final decisions concerning admission to the University of Tehran Ph.D. programs. As a test validation investigation, the present study, which is a bias detection research in nature, utilized multistep logistic regression (LR) procedure to examine the presence of gender differential item functioning (DIF) in the UTEPT with a sample of 6,555 examinees who took the test in November 2006. Specifically, the LR DIF two-degree of freedom Chi-squared test of significance was employed to test the significance of DIF. Following what has been recently suggested in the literature, the test of significance for DIF was accompanied by a measure of magnitude, namely the R-squared effect size for LR DIF, for the interpretation of which the two widely accepted classification schemes were used comparatively. The results reveal that 39 of the 100 items in the test display significant gender differences. However, these group differences are viewed as “negligible” based on both of the schemes. Accordingly, it could be argued that the UTEPT is a gender DIF-free test, though the Reading Comprehension section of the test remains in need of further analysis as it seems that the general trend of DIF indices at the item level may hint at an inclination towards males.