Ability Estimation Under Different Item Parameterization and Scoring Models (open access)

Ability Estimation Under Different Item Parameterization and Scoring Models

A Monte Carlo simulation study investigated the effect of scoring format, item parameterization, threshold configuration, and prior ability distribution on the accuracy of ability estimation given various IRT models. Item response data on 30 items from 1,000 examinees was simulated using known item parameters and ability estimates. The item response data sets were submitted to seven dichotomous or polytomous IRT models with different item parameterization to estimate examinee ability. The accuracy of the ability estimation for a given IRT model was assessed by the recovery rate and the root mean square errors. The results indicated that polytomous models produced more accurate ability estimates than the dichotomous models, under all combinations of research conditions, as indicated by higher recovery rates and lower root mean square errors. For the item parameterization models, the one-parameter model out-performed the two-parameter and three-parameter models under all research conditions. Among the polytomous models, the partial credit model had more accurate ability estimation than the other three polytomous models. The nominal categories model performed better than the general partial credit model and the multiple-choice model with the multiple-choice model the least accurate. The results further indicated that certain prior ability distributions had an effect on the accuracy …
Date: May 2002
Creator: Si, Ching-Fung B.
System: The UNT Digital Library
Adult Learner Satisfaction with Web-Based Non-Credit Workforce Training. (open access)

Adult Learner Satisfaction with Web-Based Non-Credit Workforce Training.

Web-based training has become a billion dollar industry in the United States. Electronically aided learning is viewed by many companies as a cost-effective way to deliver the up-to-date, up-gradable job-related training that the industry is demanding. This study sought to examine the relationship between learners’ satisfaction with online training as it relates to learner readiness, online features, and course relevance. The population for this study was adults seeking non-credit workforce training, specifically library professionals who were involved in web-based training through the Lifelong Education @ Desktop (LE@D) program at the University of North Texas, Denton. Online methods of training are used most extensively in the area of mandatory or compliance training, in which 35 % of training is conducted mostly or completely online. The total potential library population using LE@D product to date is approximately 4,000 unique enrollments nationwide. Participants were selected from a complete list of unique LE@D users over a 90-day period. A survey instrument was sent via e-mail to 514 enrollees who had completed a recent LE@D online training course. In total, 254 participants responded to the survey. Bivariate analysis of the variables using the Pearson product-moment correlation was used to determine the occurrence and strength of …
Date: August 2007
Creator: Morgan, Pamela Cope
System: The UNT Digital Library
An Analysis of How Interest Groups Influence the Policy-making Process for the Individuals With Disabilities Education Act of 1997 (open access)

An Analysis of How Interest Groups Influence the Policy-making Process for the Individuals With Disabilities Education Act of 1997

This study examined the policy letters and verbal testimony transcripts submitted by interest groups to the United States Department of Education (USDE) in response to the proposed regulations pertaining to the implementation of the 1997 reauthorization of P. L. 105-17, Individuals with Disabilities Education Act (IDEA). Specifically, this study analyzed the emerging themes in the area of discipline. Responses were received from the following interest groups: (a) school administrators, (b) parents, (c) teachers, (d) state educational agencies (SEAs), (e) national educational organizations, and (f) members of the United States Congress. In addition to analyzing the emerging themes, the study compared these themes to ones found in the current literature and court cases.
Date: December 1998
Creator: Price, Laura Black
System: The UNT Digital Library
Analysis of Leadership Perceptions Using Multirater Feedback. (open access)

Analysis of Leadership Perceptions Using Multirater Feedback.

Performance improvement intervention begins with assessment. How that assessment is interpreted can mean the difference between success and failure. Previous research of 360-degree feedback instruments has tried to reconcile the differences between multiple rater groups. Rather than searching for agreement, this research proposes to understand the meaning of the differences using multirater feedback. Individuals determine ratings based upon their own perspective and building upon the understanding of rater perspective may result in improved assessments. Data from an existing data set was processed using a second-order CFA in structural equation modeling. Covariance between the second-order factors and rater groups determined the difference in how each rater group perceived the leader.
Date: May 2004
Creator: Bradley, Thomas P.
System: The UNT Digital Library
Analysis of Perceptional Differences Among Department Chairs, Faculty, and Instructors Toward the Barrier to Using Multiple Teaching Strategies in Two-Year Technical and Community College Electronics Courses (open access)

Analysis of Perceptional Differences Among Department Chairs, Faculty, and Instructors Toward the Barrier to Using Multiple Teaching Strategies in Two-Year Technical and Community College Electronics Courses

The purpose of this study was to identify and analyze perceptional differences among department chairs, faculty, and instructors toward the barrier to using multiple teaching strategies in two-year technical and community college electronics courses. The literature review focused on defining multiple teaching strategies and identifying and discussing four major perceived barriers to implementing them in the electronics classroom: student, resources, classroom environmental, and teacher training/teaching technology. The targeted population consisted of 150 out of 231 electronics teaching technical and community college department chairs, faculty, and instructors throughout the state of Texas. In actuality, the targeted population's breakdown consisted of 36 full-time electronics teaching department chairs, 96 full-time electronics teaching faculty and instructors, and 18 part-time electronics teaching faculty and instructors who were actively involved in the delivery of instruction in their respective schools. Analysis of the data revealed that: (1) there are no significant differences among the perceptions of department chair people, faculty, and instructors toward the four perceived barriers to implementing multiple teaching strategies in a post-secondary electronics program; and (2) there are no significant differences in the perceptions electronics faculty members categorized by years teaching experience toward each of the four perceived barrier categories to implementing multiple teaching …
Date: May 2004
Creator: Hutyra, Jerry Emil
System: The UNT Digital Library
An Analysis of the Characteristics of Female Juvenile Offenders as Predictors of Resocialization or Recidivism. (open access)

An Analysis of the Characteristics of Female Juvenile Offenders as Predictors of Resocialization or Recidivism.

Because there has been a paucity of research on the educational needs of females with academic, behavioral, and emotional problems involved with the juvenile justice system, this study has been an attempt to classify and compare specific characteristics of this population. In particular, it examined their demographics, disability prevalence rates, along with academic, behavioral, and emotional functioning levels, in order to further understand their relationship to the resocialization or recidivism of the different groups of female juveniles incarcerated in the state of Texas, and contribute to the research for further developing successful prevention and intervention programs. Various demographic factors of the female juveniles in this study were examined: (a) offender type, (b) county of commitment, (c) race/ethnicity, (d) age at first referral, and (e) English language proficiency. Prevalence rates of special education disabilities were determined. Academic functioning was measured by (a) IQ; (b) last school grade completed; (c) Test of Adult Basic Education (TABE) reading gain score; and (d) TABE math gain score. Behavioral functioning was indicated through (a) offense history, (b) documented behavior incidents, and (c) total risk score. Emotional functioning included DSM-IV diagnoses and treatment needs. Due to the design of the research being a descriptive exploration, the …
Date: May 2007
Creator: Aiello, Jan Elizabeth
System: The UNT Digital Library
Applying Cognitive Load Theory to the Design of Online Learning. (open access)

Applying Cognitive Load Theory to the Design of Online Learning.

The purpose of the study was to investigate the application of cognitive load theory to the design of online instruction. Students in three different courses (N = 146) were measured on both learning performance and perceptions of mental effort to see if there were any statistically significant differences. The study utilized a quasi-experimental posttest-only control group design contrasting modified and unmodified instructional lessons. Both groups were given a posttest to measure knowledge gained from the lesson (cognitive domain of learning) and perceptions of mental effort involved. Independent samples t-tests were used to compare the mean performance scores of the treatment groups (i.e. the sections using redesigned materials) versus the control groups for all three courses. Cohen's d was also computed to determine effect size. Mental effort scores were similarly compared for each group on the overall cognitive load score, for a total of six data points in the study. Of the four hypotheses examined, three (H1, H2, H4) found no statistically significant difference between the experimental and control groups. Negative significance was found between the experimental and control group on the effect of modality (H3). On measures of cognitive load, no statistically significant differences were found.
Date: May 2007
Creator: Burkes, Kate M. Erland
System: The UNT Digital Library
Assessing Allied Health and Nursing Post-Secondary Career and Technical Education Teacher Attitudes and Beliefs About Reading (open access)

Assessing Allied Health and Nursing Post-Secondary Career and Technical Education Teacher Attitudes and Beliefs About Reading

This study examined allied health and nursing career and technical education (CTE) teacher beliefs and attitudes about reading. Since beliefs and attitudes influence the way teachers teach, it is important to understand what those beliefs and attitudes are, especially in relationship to reading in subject matter classrooms. One hundred twelve individuals responded to a written survey concerning their attitudes and beliefs about reading. A four-factor solution was achieved with a principal components factor analysis. A significant number of variables were associated with the factor labeled Reading Apathy, which appears to be indicative of the condition known as aliteracy among faculty who participated in the study. Professional development activities grounded in novice-to-expert theory are suggested as a way of overcoming the phenomenon. Recommendations for future research involve a more detailed study to further characterize the condition of aliteracy and its impact on student learning.
Date: May 2005
Creator: Moore, Bridgit R.
System: The UNT Digital Library
Assessing the Efficacy of Learning Communities at Four North Texas Community Colleges. (open access)

Assessing the Efficacy of Learning Communities at Four North Texas Community Colleges.

This observational study involving intact groups and convenient sampling examined learning communities at four North Texas Community Colleges. The purpose of this study was to determine if there was a significant difference in cathectic learning climate, inimical ambiance, academic rigor, affiliation and structure among students in learning communities and freestanding classes. Learning communities are gaining nationwide popularity as instruments of reform in Higher Education. Recent studies have discussed the benefits of learning communities to student, faculty and institutions. As learning communities are gaining popularity, especially at the community college level, there is a need to determine if the learning communities are significantly different than freestanding classes. The College Classroom Environment Scales, developed by Winston, Vahala, Nichols, Gillis, Wintrow, and Rome (1989), was used as the survey instrument for this study. Using SPSS 10.1, a multivariate analysis of variance, (Hotelling's T2) was performed on five dependent variables: cathectic learning climate (CLC), inimical ambiance (IA), academic rigor (AR), affiliation (AF), and structure (ST), which yielded a significant difference. The independent variable was learning community compared to freestanding classes (group). Follow-up independent t tests were also conducted to evaluate the differences in the means between the two groups and to explore which dependent …
Date: August 2002
Creator: Dodd, Patricia M.
System: The UNT Digital Library
An Assessment of Technology Learning Styles, Skills, and Perceptions Among Teachers of Grades Pre-Kindergarten Through Four. (open access)

An Assessment of Technology Learning Styles, Skills, and Perceptions Among Teachers of Grades Pre-Kindergarten Through Four.

This study investigated whether a relationship exists between learning style and the self-reported technology-related needs, beliefs, stages of adoption, software expertise, and technology competencies of teachers in a large suburban school district. The Gregorc Style Delineator was used to identify dominant learning style, and the Snapshot Survey was used to measure technology-related needs, beliefs, stages of adoption, and software expertise. Technology competencies were measured using the Technology in Education Competency Survey. Data collected from 499 participants was included in data analysis. The study was conducted at each of the 12 elementary schools of a large suburban district in the Dallas-Fort Worth Metroplex. The findings suggest that there is a significant relationship between learning style and the technology-related needs, stages of adoption, software expertise, and competencies of teachers. The relationship between learning style and technology-related needs was significant at the p < .01 level. The relationships between learning style and technology-related stages of adoption, software expertise, and technology competencies were significant at the p < .05 level. Members of the abstract sequential [AS] learning style group reported having significantly fewer needs and significantly higher stages of adoption, software expertise, and competency than members of one or more of the other learning …
Date: December 2004
Creator: Brubaker, Douglas D.
System: The UNT Digital Library
Assessment of the Perceived Competencies Possessed by Women Administrators in Vocational Education at Community Colleges in Texas (open access)

Assessment of the Perceived Competencies Possessed by Women Administrators in Vocational Education at Community Colleges in Texas

The need for a high-quality workforce to meet increased competition in the world economy has increased the need for competent vocational administrators in public 2-year postsecondary institutions. Researchers have agreed that vocational education is in a state of metamorphosis and must change to meet its challenges in the coming century. At the same time, more women are seeking and obtaining vocational administrative positions. Several studies have been done to identify the competencies needed by vocational administrators to perform their duties, but there has been little research on the actual ability to perform the administrative tasks identified by these studies. Two main purposes of this study are: (a) to determine the perceived level of administrative competencies possessed by women administrators in vocational education at the community college level in Texas; (b) to determine the adequacy of the preservice training received by these administrators to perform their administrative functions. Of the 175 women administrators randomly selected to participate in the study, 71% completed the Administrator Task Inventory. In addition to the descriptive statistics, two multiple regression analyses were tested. First, principal component analysis was used to reduce the number of dependent variables from 11 to 2, after which two multiple regression analyses …
Date: May 1997
Creator: Chiawa, Chioma B. (Chioma Bernadette)
System: The UNT Digital Library
The Awareness and Perception of Distance Education by the Leadership in the Texas State Technical College System (open access)

The Awareness and Perception of Distance Education by the Leadership in the Texas State Technical College System

The purpose of this study was to determine whether there were differences in the levels of awareness and perception concerning distance education among the leadership at the seven campuses of the Texas State Technical College (TSTC) System.
Date: May 1998
Creator: Knue, John Raymond
System: The UNT Digital Library
Behavior Management Techniques Used by Teachers of Emotionally/behaviorally Disordered Students in Various Educational Settings (open access)

Behavior Management Techniques Used by Teachers of Emotionally/behaviorally Disordered Students in Various Educational Settings

The purpose of this study was to delineate the differences between the types of behavioral management techniques used by teachers of students with emotional/behavioral disorders.
Date: December 1998
Creator: Elizondo, Leigh A.
System: The UNT Digital Library
Bias and Precision of the Squared Canonical Correlation Coefficient under Nonnormal Data Conditions (open access)

Bias and Precision of the Squared Canonical Correlation Coefficient under Nonnormal Data Conditions

This dissertation: (a) investigated the degree to which the squared canonical correlation coefficient is biased in multivariate nonnormal distributions and (b) identified formulae that adjust the squared canonical correlation coefficient (Rc2) such that it most closely approximates the true population effect under normal and nonnormal data conditions. Five conditions were manipulated in a fully-crossed design to determine the degree of bias associated with Rc2: distribution shape, variable sets, sample size to variable ratios, and within- and between-set correlations. Very few of the condition combinations produced acceptable amounts of bias in Rc2, but those that did were all found with first function results. The sample size to variable ratio (n:v)was determined to have the greatest impact on the bias associated with the Rc2 for the first, second, and third functions. The variable set condition also affected the accuracy of Rc2, but for the second and third functions only. The kurtosis levels of the marginal distributions (b2), and the between- and within-set correlations demonstrated little or no impact on the bias associated with Rc2. Therefore, it is recommended that researchers use n:v ratios of at least 10:1 in canonical analyses, although greater n:v ratios have the potential to produce even less bias. …
Date: August 2006
Creator: Leach, Lesley Ann Freeny
System: The UNT Digital Library
Comparing outcome measures derived from four research designs incorporating the retrospective pretest. (open access)

Comparing outcome measures derived from four research designs incorporating the retrospective pretest.

Over the last 5 decades, the retrospective pretest has been used in behavioral science research to battle key threats to the internal validity of posttest-only control-group and pretest-posttest only designs. The purpose of this study was to compare outcome measures resulting from four research design implementations incorporating the retrospective pretest: (a) pre-post-then, (b) pre-post/then, (c) post-then, and (d) post/then. The study analyzed the interaction effect of pretest sensitization and post-intervention survey order on two subjective measures: (a) a control measure not related to the intervention and (b) an experimental measure consistent with the intervention. Validity of subjective measurement outcomes were assessed by correlating resulting to objective performance measurement outcomes. A Situational Leadership® II (SLII) training workshop served as the intervention. The Work Involvement Scale of the self version of the Survey of Management Practices Survey served as the subjective control measure. The Clarification of Goals and Objectives Scale of the self version of the Survey of Management Practices Survey served as the subjective experimental measure. The Effectiveness Scale of the self version of the Leader Behavior Analysis II® served as the objective performance measure. This study detected differences in measurement outcomes from SLII participant responses to an experimental and a …
Date: August 2007
Creator: Nimon, Kim F.
System: The UNT Digital Library
A Comparison of a Computer-Administered Test and a Paper and Pencil Test Using Normally Achieving and Mathematically Disabled Young Children (open access)

A Comparison of a Computer-Administered Test and a Paper and Pencil Test Using Normally Achieving and Mathematically Disabled Young Children

This study investigated whether a computer-administered mathematics test can provide equivalent results for normal and mathematically disabled students while retaining similar psychometric characteristics of an equivalent paper and pencil version of the test. The overall purpose of the study was twofold. First, the viability of using computer administered assessment with elementary school children was examined. Second, by investigating items on the computer administered mathematics test for potential bias between normally achieving and mathematically disabled populations, it was possible to determine whether certain mathematical concepts consistently distinguish between the two ability groups.
Date: May 1997
Creator: Swain, Colleen R. (Colleen Ruth)
System: The UNT Digital Library
Comparison of Computer Testing versus Traditional Paper and Pencil Testing (open access)

Comparison of Computer Testing versus Traditional Paper and Pencil Testing

This study evaluated 227 students attending 12 classes of the Apprentice Medical Services Specialist Resident Course. Six classes containing a total of 109 students took the Block One Tests in the traditional paper and pencil form. Another six classes containing a total of 118 students took the same Block One Tests on computers. A confidence level of .99 and level of signifi­cance of .01 was established. An independent samples t-test was conducted on the sample. Additionally, a one-way analysis of variance was performed between the classes administered the Block One Tests on computers. Several other frequencies and comparisons of Block One Test scores and other variables were accomplished. The variables examined included test versions, shifts, student age, student source, and education levels. The study found no significant difference between test administration modes. This study concluded that computer-administering tests identical to those typically administered in the traditional paper and pencil manner had no significant effect on achievement. It is important to note, however, that the conclusion may only be valid if the computer-administered test contains exactly the same test items, in the same order and format, with the same layout, structure, and choices as the traditional paper and pencil test. In …
Date: August 2000
Creator: Millsap, Claudette M.
System: The UNT Digital Library
Comparison of Evangelical Christian Children's God-Concepts and Logical Thinking Ability. (open access)

Comparison of Evangelical Christian Children's God-Concepts and Logical Thinking Ability.

God-concepts of 24 third to sixth grade evangelical Christian children were compared with the children‘s logical thinking abilities in a mixed-method study. Measurements included the Children‘s Interview and the Group Assessment of Logical Thinking (GALT). God-concepts among the children were Biblical, comforter, communicates, creator, empowering, protector, provider, purposeful, human characteristics, lives in heaven, male, counselor, God is Jesus, all-knowing, loving, perfect, powerful, real, and parental. The majority of concrete thinkers conceptualized God as a gracious guide. The majority of transitional thinkers viewed God also as a gracious guide as well as a distant divinity. Implications were given for religious educators to develop a model for age-appropriate instruction and curriculum and to equip parents to promote spiritual development with children at home.
Date: May 2007
Creator: Penick, Starrla
System: The UNT Digital Library
A Comparison of Knowledge/Skills Statements Needed by Teachers of Students with Emotional and Behavioral Disorders and Teachers in Juvenile Correctional Special Education Settings (open access)

A Comparison of Knowledge/Skills Statements Needed by Teachers of Students with Emotional and Behavioral Disorders and Teachers in Juvenile Correctional Special Education Settings

This study had a two-fold purpose. The first purpose was to compare the rankings of a set of knowledge/skills statements as reported by teachers of students with emotional behavioral disorders and teachers in juvenile correctional special education settings. A survey instrument designed to measure the importance, proficiency, and frequency of use of clusters of knowledge/skills statements was administered to 123 teachers in juvenile correctional special education settings in state institutions. Mann Whitney U analyses were calculated to compare the mean rankings of the two groups of teachers. The findings indicated that teachers in juvenile correctional special education settings and teachers of students with emotional and behavioral disorders were very similar as to which knowledge/skills clusters were important to their job performance, which clusters they were most proficient at using, and which clusters they utilized most frequently. The second purpose was to compare the teachers in juvenile correctional special education settings and to determine whether their mean rankings of the knowledge/skills clusters varied when analyzed by differing categories of age, type of certification held, years of teaching experience, and level of the teachers' education. Analysis of variance revealed no significant difference in the mean rankings in any of the comparison groups. …
Date: December 1994
Creator: McArthur, Patrick L. (Patrick Lee)
System: The UNT Digital Library
A Comparison of Multivariate Normal and Elliptical Estimation Methods in Structural Equation Models (open access)

A Comparison of Multivariate Normal and Elliptical Estimation Methods in Structural Equation Models

In the present study, parameter estimates, standard errors and chi-square statistics were compared using normal and elliptical estimation methods given three research conditions: population data contamination (10%, 20%, and 30%), sample size (100, 400, and 1000), and kurtosis (kappa =1,10, 20).
Date: August 1999
Creator: Cheevatanarak, Suchittra
System: The UNT Digital Library
A Comparison of Three Criteria Employed in the Selection of Regression Models Using Simulated and Real Data (open access)

A Comparison of Three Criteria Employed in the Selection of Regression Models Using Simulated and Real Data

Researchers who make predictions from educational data are interested in choosing the best regression model possible. Many criteria have been devised for choosing a full or restricted model, and also for selecting the best subset from an all-possible-subsets regression. The relative practical usefulness of three of the criteria used in selecting a regression model was compared in this study: (a) Mallows' C_p, (b) Amemiya's prediction criterion, and (c) Hagerty and Srinivasan's method involving predictive power. Target correlation matrices with 10,000 cases were simulated so that the matrices had varying degrees of effect sizes. The amount of power for each matrix was calculated after one or two predictors was dropped from the full regression model, for sample sizes ranging from n = 25 to n = 150. Also, the null case, when one predictor was uncorrelated with the other predictors, was considered. In addition, comparisons for regression models selected using C_p and prediction criterion were performed using data from the National Educational Longitudinal Study of 1988.
Date: December 1994
Creator: Graham, D. Scott
System: The UNT Digital Library
A comparison of traditional and IRT factor analysis. (open access)

A comparison of traditional and IRT factor analysis.

This study investigated the item parameter recovery of two methods of factor analysis. The methods researched were a traditional factor analysis of tetrachoric correlation coefficients and an IRT approach to factor analysis which utilizes marginal maximum likelihood estimation using an EM algorithm (MMLE-EM). Dichotomous item response data was generated under the 2-parameter normal ogive model (2PNOM) using PARDSIM software. Examinee abilities were sampled from both the standard normal and uniform distributions. True item discrimination, a, was normal with a mean of .75 and a standard deviation of .10. True b, item difficulty, was specified as uniform [-2, 2]. The two distributions of abilities were completely crossed with three test lengths (n= 30, 60, and 100) and three sample sizes (N = 50, 500, and 1000). Each of the 18 conditions was replicated 5 times, resulting in 90 datasets. PRELIS software was used to conduct a traditional factor analysis on the tetrachoric correlations. The IRT approach to factor analysis was conducted using BILOG 3 software. Parameter recovery was evaluated in terms of root mean square error, average signed bias, and Pearson correlations between estimated and true item parameters. ANOVAs were conducted to identify systematic differences in error indices. Based on many …
Date: December 2004
Creator: Kay, Cheryl Ann
System: The UNT Digital Library
A Comparison of Trainee and Supervisor Perceptions of Transfer Climate in a Union-Based Training Program. (open access)

A Comparison of Trainee and Supervisor Perceptions of Transfer Climate in a Union-Based Training Program.

A supportive work climate is critical for successful transfer of learning. Influences in the work environment affect the trainee's ability to apply new skills to the job. The supervisor can be a significant figure in the trainee's perception of a supportive transfer climate. Little is known of the effect of supervisor participation in the training on transfer climate. The purpose of this study was to identify differences in trainee and supervisor self-perceptions of the factors affecting transfer climate. Additionally, this study examined the effects of supervisor participation in the training program on perceptions of transfer climate. The participants in this study were trainees in a union-sponsored instructor training program and their supervisors. The study found perception gaps between the overall perception of transfer climate and supervisor support. The level of supervisor participation in the training program was not to be a factor in the differences between the trainee and supervisor perceptions. No statistically significant difference exists in the perception of other transfer climate factors: supervisor sanctions, peer support, resistance/openness to change, and feedback/performance coaching. In addition, the study found that supervisor participation in the training made little difference in the perceptions of transfer climate by supervisors and trainees. Studies comparing …
Date: December 2004
Creator: Dodson, Gayle J.
System: The UNT Digital Library
A Comparison of Two Differential Item Functioning Detection Methods: Logistic Regression and an Analysis of Variance Approach Using Rasch Estimation (open access)

A Comparison of Two Differential Item Functioning Detection Methods: Logistic Regression and an Analysis of Variance Approach Using Rasch Estimation

Differential item functioning (DIF) detection rates were examined for the logistic regression and analysis of variance (ANOVA) DIF detection methods. The methods were applied to simulated data sets of varying test length (20, 40, and 60 items) and sample size (200, 400, and 600 examinees) for both equal and unequal underlying ability between groups as well as for both fixed and varying item discrimination parameters. Each test contained 5% uniform DIF items, 5% non-uniform DIF items, and 5% combination DIF (simultaneous uniform and non-uniform DIF) items. The factors were completely crossed, and each experiment was replicated 100 times. For both methods and all DIF types, a test length of 20 was sufficient for satisfactory DIF detection. The detection rate increased significantly with sample size for each method. With the ANOVA DIF method and uniform DIF, there was a difference in detection rates between discrimination parameter types, which favored varying discrimination and decreased with increased sample size. The detection rate of non-uniform DIF using the ANOVA DIF method was higher with fixed discrimination parameters than with varying discrimination parameters when relative underlying ability was unequal. In the combination DIF case, there was a three-way interaction among the experimental factors discrimination type, …
Date: August 1995
Creator: Whitmore, Marjorie Lee Threet
System: The UNT Digital Library