Eurasian Journal of Educational Research 75 () Eurasian Journal of Educational Research monash.pw As a Potential Source of Error, Measuring the Tendency of University Students to Copy the Answers: A Scale Development Study Ergul DEMIR 1 ARTICLE INFO ABSTRACT Article History: Purpose: The answer-copying tendency has the Received: 07 Jan. potential to detect suspicious answer patterns for Received in revised form: 22 Mar. prior distributions of statistical detection Accepted: 28 Apr. techniques. The aim of this study is to develop a DOI: /ejer valid and reliable measurement tool as a scale in order to observe the tendency of university Keywords students’ copying of answers. Also, it is aimed to answer-copying dishonesty provide evidence with more comprehensive scale development validity and reliability studies than the copy-detection. previously available researches. Research Methods: This is a scale development study. The “Answer-Copying Tendency Scale in University Students (ACTS)” was developed under the “Classical Test Theory”. Other theories were also considered, especially “Item Response Theory”. After preliminary studies and item writing, a trial application with students and main applications with students was conducted. Structural validity, item and test descriptive statistics, item discriminations, inconsistency and test–retest reliability, classification accuracy, and item bias with differential item functioning were examined. Findings: The ACTS composed of 2 factor and 20 items. Total scores and item scores distributions are normal. Item discriminations are very high and over α inconsistency coefficients are over and test–retest reliability coefficient is It provides highly correct classifications according to the students’ answer-copying positions. There is no significant and serious DIF on items. Implications for Research and Practice: Unlike similar examples, it was studied on the large groups and used more comprehensive techniques to obtain evidences. Results show that the validity and reliability levels of the ACTS are very high. The ACTS can be used to understand the nature of the answer-copying. Also, and more importantly, it is thought that the ACTS can be used to detect suspicious answer patterns for prior distributions. © Ani Publishing Ltd. All rights reserved 1 Ankara University, Educational Science Faculty, TURKEY, e-mail:[email protected], ORCID: monash.pw 38 Ergul DEMIR / Eurasian Journal of Educational Research 75 () Introduction Academic integrity, dishonesty, cheating, or, in a more limited use, answer- copying has become an increasingly important problem in schools. Academic integrity is a more comprehensive definition including answer-copying, having proxies, plagiarism, academic misconduct, falsifying, etc. (Mullens, ). Some research shows that answer-copying has been observed at very serious proportions (Burke, ; Genereux & McLeod, ; May & Loyd, ; McCabe, ). Dwyer and Hecht () stated that such dishonest behaviors by students have been increasing with increasing class sizes and reduced instructional resources. Similarly, McCabe and Trevino () reported that the percentage of cheating behaviors among college students rose slightly. The inability to take control of this tendency leads to the injured reputation of institutions and hurts honest students, their families, and all of society (Aaron & Georgia, ). Statistically, answer-copying is a source of systematic error and bias on items or tests. It may lead to unfair results. Unfortunately, it is very hard to detect answer- copying because initial response patterns are not easy to determine accurately for the statistical models. Improving the statistical techniques to detect answer-copying has been studied since the s (Angoff, ; Frary, Tideman, & Watts, ; Holland, ; Maeda & Zhang, ; van der Linden & Sotaridona, ; Wesolowsky, ; Wollack, ). According to Dwyer and Hact (), some probabilistic techniques to detect cheaters have been used in American higher education since the s. Generally, these techniques depend on matching responses between the copier and the source with a complex standardization process. Thus, the copiers and the sources have to be defined accurately. Also, there are some other challenges to using these techniques. As stated by van der Linden & Sotaridona (), these common techniques are limited by the null distribution of the set of items on which the statistics is defined. It is based on population-based statistics, and it is possible to arrive at unfair results. Alternatively, performances of IRT-based techniques also depend on the accurate estimation of copier ability, and it is obvious that the responses of copiers may be contaminated. For these techniques, especially, suspicious answer patterns should be defined accurately for prior and posterior distributions. It is understood that although there are some statistical techniques to detect answer-copying, these are limited or show doubt, a potential source of error, or just a probability. Another way to understand the nature of copying is to consider related factors (Gerdeman, ; Hughes & McCabe, ). Some research reported that dishonest behaviors, mostly including answer-copying, were related to individual characteristics like GPA, age, gender, etc. (Crown & Spiller, ; McCabe & Trevino, ; Selçuk, ; Whitley, ). Students with lower GPAs, younger students, and males are most likely to cheat. Also, these groups show more tolerance for cheating behaviors. Ergul DEMIR / Eurasian Journal of Educational Research 75 () 39 Although they are less likely to engage in dishonesty or copying, dishonest behaviors seem to be related to particular educators and institutional policies (Aaron, ; Genereux & McLeod, ; McCabe & Trevino, ). Other important factors are attitudes, perceptions, and tendencies toward copying (Genereux & McLeod, ; Hughes & McCabe, ; McCabe & Trevino, ; Öztürk & Yeşilyaprak, ). The students with high work ethic, self-esteem, and lower test anxiety are less likely to cheat. On the other hand, the prevalence of cheating and the perception of cheating as acceptable increases cheating behaviors. In many studies, questionnaires or self-reporting have been used to observe students’ perceptions or tendencies on dishonest behavior such as copying (Bolin, ; McCabe & Travino, ; Selçuk, ). There are a few measurement tools available developed in these contexts (Eminoğlu & Nartgün, ; Gardner & Miller, ; LaGrange, ). Unfortunately, these tools are very limited in use and mostly unavailable. Generally, these tools were developed on a small and limited group and have comprehensive context. Most of them provide less proof about validity and reliability. Most of them were developed a long time ago. So, although cheating has been studied for a long time, it is obvious that valid and reliable tools are still needed. Research Objectives The aim of this study is to develop a valid and reliable measurement tool as a scale in order to observe the tendency of university students to copy answers. Also, it is aimed to provide evidence with comprehensive validity and reliability studies. With this aim, psychometric studies have been carried out for the “Answer-Copying Tendency Scale in University Students (ACTS)”: (1) Structural validity, (2) item and test descriptive statistics, (3) item discriminations, (4) inconsistency and test–retest reliability, (5) classification accuracy, and (6) item bias with differential item functioning. Statistically, the detection of answer-copying is a challenge. On the other hand, research shows that attitudes, perceptions, and tendencies are related to answer- copying behavior. In order to understand the nature of answer-copying and then detect it, we need a valid, reliable, and, most importantly, available tool to be used in this context. Also, the answer-copying tendency has the potential to make suspicious answer patterns detectable for prior/initial distributions of statistical detection techniques. Method This is a scale development study. The “Answer-Copying Tendency Scale in University Students (ACTS)” was developed under the “True Score Theory” or “Classical Test Theory”. In this process, other theories were also considered, especially 40 Ergul DEMIR / Eurasian Journal of Educational Research 75 () “Item Response Theory”. The scale development steps were followed by considering DeVellis (). Preliminary Studies and Item Writing Developing the ACTS began with the preliminary studies. First, the trait and the aim were defined. The aim of the scale was defined as to measure the tendency of university students to copy answers. After that, the observable behaviors of students’ copying tendencies was tried to be linked with the related literature. Simultaneously, around 80 university students were asked to write an essay on their perceptions and views about answer-copying. These documents were analyzed, and a total of draft expressions were prepared for review by experts. The prepared draft form was sent to six specialists and academics working in the field of educational sciences. According to their opinions, 35 items were cancelled, 14 items were reorganized, and 79 items were accepted as they were. Of the 93 items, there were 34 items with negative direction. At the same time, the experts were asked about which type of scale would be used most effectively. Some experts recommended the Likert-type scale with five categories, while others recommended more than five categories. In response, and also by considering the related literature, it was decided that two differently numbered scoring categories would be used in the trial application. The first would be the classic Likert-type scale with five categories, and the second would be to score between 0 and 10 points. At the end of the preliminary studies, the trial application form was organized with 93 items. Trial and Main Application Trial application was executed with undergraduate students from 16 universities and 18 faculties in May and June of in Turkey as a paper-pencil test. The main application was executed with undergraduate students from 29 universities and 30 faculties in November and December of in Turkey as a web based application. Some demographic characteristics of the students whom participated in trial and main applications are given at Table 1. Data Analysis Defining the factor structure, exploratory and confirmatory studies were executed. “Principal component analysis” and “multidimensional scaling with ALSCAL method” was used with the trial application data. “Confirmatory factor analysis” was used with the main application data. As the test and item statistics, it was analyzed the distribution of the scores and calculated the descriptive statistics. As the item discrimination, item-total score correlations were calculated with Pearson’ Product Moments correlation coefficient. For reliability, α inconsistency coefficient and test- retest reliability were calculated. For classification accuracy, discriminant function Ergul DEMIR / Eurasian Journal of Educational Research 75 () 41 analysis was used with four classification model. Finally, item bias was examined with the “lordif” techniques based on Item Response Theory. Before all analyses, main assumptions were checked in detail. The softwares SPSS and Lisrel and R were used for these analyses. Table 1 Students’ Demographic Characteristics for Trial and Main Application Samples Trial Application n % Main Application n % Female Female Male Gender Male Gender Missing 28 Total Ankara Total University Ankara Trakya University University Hacettepe Gazi 77 University University TED University Hacettepe 29 69 University University University Konya Selçuk 23 Other University Other 22 Missing 2 Missing 29 Total Total Education Educational Educational Sciences Sciences Science & Education Literature Faculty Language Theology 55 & History- 88 Faculty Geography Science 47 Others Other Total Preparation Missing 28 and 1 Total 2 1 3 2 4 3 Class 5 and 50 Graduate Class 4 Total Graduate 34 Missing 29 Total 42 Ergul DEMIR / Eurasian Journal of Educational Research 75 () Results Structural Validity: Exploratory and Confirmatory Studies Exploratory studies for the ACTS were executed on the trial application data, and confirmatory analysis was executed on the main application data. As an exploratory analysis, “principal component analysis (PCA)” and “multidimensional scaling with ALSCAL model” were used respectively. PCA was executed with both the classic five categories of Likert-type scores and the scores between 0 and 10 separately. Before analysis, some items were recoded in order to equalize the way of all items. Missing values were checked, and there was no serious missing data problem. Nor were there any outliers (max. Mahalanobis distance < χ2kritik=25). For the five-category Likert-type scale data, sampling is adequate (KMO=), and multiple correlations among variables are statistically significant (for Bartlett’s test of sphericity, χ2=, df= and p<). After the data reduction, 20 items with one factor could be identified as a structure. Communalities of each item are between and The eigenvalue of one factor is , and the total variance explained is 57%. Factor loadings are between and Similarly, for scoring 0 to 10, sampling is adequate (KMO=), and multiple correlations among variables are statistically significant (for Bartlett’s test of sphericity, χ2=, df= and p<). After the data reduction with varimax and oblique rotation, 20 items with two factors could be identified as a structure. The communalities of each item are between and The eigenvalue of the first factor is and second is Total variance explained for the first factor is %, the second is %, and the total is %. The correlation between factors is statistically significant an, shows a negative and moderate relationship (r= and p<). These items and factor loadings are shown in the Table 2. As seen at Table 2, 12 of 20 items are in the first factor and eight items are in the second factor. These eight items are the items with negative directions. The first factor was named “negative perception of exam and grade (NEGALGI)”, and the second factor was named “ethical value (ETIK)”. These two factors have negative correlations to each other (r= and p<). According to these, after standardization, students who have high ethical values also have lower levels of negative perception about exams and grades. Ergul DEMIR / Eurasian Journal of Educational Research 75 () 43 Table 2 Factor Loadings for ACTS Items No. Item F1 F2 12 I'll copy answers when I get the chance. Having friends I know that get higher scores by copying answers, I 18 also have the ambition to copy answers. I think about copying answers for the examinations that I have not 20 prepared enough for. 38 Copying answers can be enough to pass the exam. 40 I feel compelled to copy answers myself in some lessons. It makes sense for me to copy answers in lessons that I will forget and 58 not use in the future. Even if I do not do it, it gives me confidence to know that I can copy 61 answers at the exam. It makes sense for me to copy answers to the questions I do not 64 know. 70 I will copy answers if I know I will not be punished. It makes more sense to copy answers when I memorize so much 78 information that will not work in my own life. 81 Anxiety about earning high grades pushes me to copy answers. 91 If I am not afraid of getting caught, I'll copy answers. 8 Copying answers makes a fool of a person. 16 I see copying answers as an unfair advantage. 19 I'm absolutely against copying answers. 33 Copying answers is disrespectful to the teacher's endeavors. 53 Copying answers is not my achievement but my deceit. 71 Defending copying answers is completely nonsense. I would like to take real deterrent measures to prevent students from 72 copying answers. 87 I believe that the copying answers is immoral behavior. Eigenvalue Total Variance Explained (%) After PCA, the multidimensional scaling ALSCAL model provides visual and supportive evidence of the structure. According to the Euclidean distances, item locations are shown in Graphic 1. 44 Ergul DEMIR / Eurasian Journal of Educational Research 75 () Graphic 1. Item Distribution According to Multidimensional Scaling (ALSCAL) Euclidean Distances As seen in Graphic 1, items are clearly separated by two factors. In the right-hand graphic, eight items are on one factor and twelve items are on the other factor. Furthermore, in the left-hand graphics, these two factors show a linear relationship. Also, outputs of MDA-ALSCAL show that the model provides good-fit with high variance accounted for (Stress= and RSQ=). After the exploratory studies, “confirmatory factor analysis (CFA)” was conducted on the main application data. Graphic 2 shows the standardized values for each path. Graphic 2. Standardized Path Coefficients According to the Results of the CFA Ergul DEMIR / Eurasian Journal of Educational Research 75 () 45 In Graphic 2, all paths are statistically significant (ti> and pi<). Standardized solutions of the errors are under Similar to the exploratory studies’ results, there are moderate and negative correlations between the factors. Also, goodness of fit statistics show at least acceptable model-data fit (χ2/sd=, RMSEA=, SRMR=, GFI=, NFI=, CFI=, ECVI Free model = and Saturated model=, AIC Free model= and Saturated Model=). Total Score Distributions and Descriptive Statistics The ACTS has 20 items, which are each scored between 0 and The available total scores are between 0 and In Table 3, the total score distributions and descriptive statistics are given for both trial and main applications. Table 3 ACTS Total Score Distributions and Descriptive Statistics Statistics Value Histogram Normal Q-Q Plot Mean 5% monash.pw Trial Application SE (Mean) Median 91 monash.pw Minimum 0 Maximum Skewness Kurtosis Mean 5% monash.pw Main Application SE (Mean) Median 74 monash.pw Minimum 0 Maximum Skewness Kurtosis As seen in Table 3, total scores distribute normally. The mean, trimmed mean, and median are close to each other. Skewness and kurtosis are between (-1, +1). Also, the graphics supporting the normality. 46 Ergul DEMIR / Eurasian Journal of Educational Research 75 () Item Statistics and Item-Total Score Correlations The item statistics and item-total score correlations of the ACTS items were calculated for both trial and main application data. The results are given in Table 4. Table 4 Item Statistics and Item-Total Score Correlations of the ACTS Items Trial Application Main Application Item Mean Med. Std.D. Skew. Kurt. r Mean Med. Std.D. Skew. Kurt. r 12 5 ** 3 ** 18 4 ** 2 ** 20 5 ** 4 ** 38 5 ** 3 ** 40 5 ** 3 ** 58 5 ** 5 ** 61 5 ** 3 ** 64 5 ** 4 ** 70 5 ** 4 ** 78 5 ** 3 ** 81 5 ** 4 ** 91 5 ** 3 ** 8* 7 ** 8 ** 16* 8 ** 9 ** 19* 5 ** 6 ** 33* 7 ** 7 ** 53* 7 ** 8 ** 71* 5 ** 6 ** 72* 6 ** 6 ** 87* 6 ** 7 ** * Second factor items were recoded before the estimations. **p< As seen in Table 4, trial and main application results are similar. Item means and medians are close to each other. Skewness values are between (-1, +1). Kurtosis values are mostly under -1, but between (, +). Tabachnick and Fidel (, p) stated that non-strong skewness and kurtosis violations cannot lead the difference for the statistics in large samples. As a result, item scores show reasonable normality. Also, all item-total score correlations are significant at the level of This is strong evidence of the discriminative validity of the items and the test. Ergul DEMIR / Eurasian Journal of Educational Research 75 () 47 Reliability Studies For the reliability of the ACTS, first α inconsistency coefficients were calculated for both trial and main applications. If there is no violation of normality, α is an available estimation. The results are given in Table 5. Table 5 α Coefficients for the ACTS Trial Application Main Application Factors k n α n α Negative perception of exams and grades (NEGALGI) 12 Ethical value (ETIK) 8 Total 20 As seen in Table 5, the inconsistency of the ACTS is very high for both the factor level and the total test. Besides the inconsistency estimates, the test–retest reliability was considered. Test–retest application was conducted with 95 students in December of after two weeks from the main application. There was no normality violation, and the Pearson product-moment correlation coefficient was used to estimate. Results show that the ACTS has a high level of test–retest reliability (r= ve p<). Classification Accuracy In the trial and main applications of the ACTS, participants were asked whether they copied answers, gave answers for others to copy, and/or witnessed answer- copying. Similar results were obtained from both the trial and main applications. So, just the main application data was considered for further analysis. Table 6 Taking, Giving, and Witnessing the Answer-Copying among Undergraduates I did I gave I witnessed n % n % n % Yes No Total As seen in Table 6, most of the students stated that they did not copy answers (%) or did not gave answers for others to copy (%). On the other hand, the vast majority stated that they witnessed answer-copying (%). The proportion of copying answers (%) was more than giving answers for others to copy (%). As validity evidence, it is expected that the ACTS can classify students according to their answer-copying positions. For this purpose, “discriminant function analysis (DFA)” was used. A total of four discriminant models were identified depending on the 48 Ergul DEMIR / Eurasian Journal of Educational Research 75 () answer-copying positions and ACTS factors. Before the analyses, the main assumptions were checked. As mentioned before, there are no violations about missignes, outliers, and normalities. Furthermore, the homogeneity of the variance- covariance matrices was checked by using Box’s M statistics. These values are not significant at the level of , and there is no violation (Hair, Black, Babin, & Anderson, , p). Descriptive statistics for each model are given in Table 7. And the test result for group differences and canonical discriminant functions are given in Table 8. Table 7 Descriptive Statistics for Classification Models Model Predictor(s) Grouping Variable n Mean monash.pw 1a ACTS_Total Answer-Copying Yes No Total 2b ACTS_NEGALGI Answer-Copying Yes No Total ACTS _ETIK Answer-Copying Yes No Total 3c ACTS _Total Giving answer-copy Yes No Total 4d ACTS _NEGALGI Giving answer-copy Yes No Total ACTS _ETIK Giving answer-copy Yes No Total aBox’ M=, F=, df1=1, df2= and p= bBox’ M=, F=, df1=3, df2= and p= cBox’ M=, F=, df1=1, df2= and p= dBox’ M=, F=, df1=1, df2= and p= Table 8 Test Results for Group Difference and Canonical Discriminant Functions Equality of group means Canonical Discriminant Functions Model Predictors Wilks Wilks Eigen Cann. F df1 df2 p χ2 df p ’λ ’λ value Corr. 1 ACTS_Total 1 1 ACTS_NEGALGI 1 2 2 ACTS_ETIK 1 3 ACTS _Total 1 1 ACTS _NEGALGI 1 4 2 ACTS _ETIK 1 Ergul DEMIR / Eurasian Journal of Educational Research 75 () 49 As seen in Table 8, each model shows significant group differences. Also, each model is statistically significant. After that, the maximum chance criteria were calculated according to the recommendation of Tabachnick and Fidel (, p) for unequal group sizes. The proportions of the correct classifications and related criteria are given in Table 9. Table 9 Proportions of the Correct Classifications and Maximum Chance Criteria Model Correct Classification Maximum Chance (%) (%) 1 2 3 4 As seen in Table 9, each model provides correct classifications beyond chance. For 1st and 2nd models, this ratio is higher. It is expected that the correct classification values should be higher, at least over 10% of maximum chance values (Hair et al.., , p). According to this criteria, all models have high power for classifications. As a result, the ACTS can predict significantly both answer-copying and giving the answers to copy. Differential Item Functioning Studies DIF studies for the ACTS were conducted on the main application data. Gender sub-groups and faculties are considered the group variable. For gender, females were defined as the reference group (n=, %), and males were defined as the focal group (n=, %). For faculties, education faculties were defined as the reference group (n=, %), and other faculties were defined as the focal group (n=, %). “Logistic Ordinal Regression Differential Item Functioning using IRT (lordif)” (Choi, Gibbons, & Crane, ) was used for the estimations. This technique was developed for polythomous items and based on IRT with GPCM or GRM models. Both uniform and nonuniform DIF can be detected. DIF results for the ACTS are given in Table 10 and Table 50 Ergul DEMIR / Eurasian Journal of Educational Research 75 () Table 10 DIF Results for Gender Sub-Groups Obtained with lordif* Probability Cox & Snell Nagelkerke McFadden β12 χ χ χ R R R R R R R R R 1 0e+00 2 4e 3 0e+00 4 1e 5 0e+00 6 0e+00 7 1e 8 2e 9 1e 10 7e 11 2e 12 0e+00 13 2e 14 5e 15 5e 16 0e+00 17 0e+00 18 4e 19 1e 20 1e *Replication=, α=, ΔR2 =, Δβ= As seen in Table 10, except the 15th item, there is no significant DIF. The 15th item shows DIF with all χ2 , R2 , and β values. These differences are observed between the 1st-2nd and 1st-3rd models. So, it is possible that DIF should be uniform. The 15th item of the ACTS is “defending copying answers is completely nonsense”. Distributions of females’ and males’ responses with their tendency levels (θ) are given in Graphic 3. Graphic 3. Females’ (Reference) and Males’ (Focal) Responses with their Tendency Levels (θ) in the 15th Item of the ACTS Ergul DEMIR / Eurasian Journal of Educational Research 75 () 51 As seen in Graphic 3, at each θ level, females are located in the middle, whereas males have different locations. At the lower θ levels, males show more admittance to the 15th item. At the higher θ levels, the opposite is the case. This can show real differences between the gender sub-groups. Indeed, some research reported that males are more prone to cheating and copying (Crown & Spiller, ; Genereux & McLeod, ; Selçuk, ; Whitley, ). If there is a real difference in the 15th item, it can be stated that the ACTS has no DIF between gender sub-groups. Table 11 DIF Results for Faculty Sub-Groups Obtained with lordif* Probability Cox & Snell Nagelkerke McFadden β12 χ χ χ R R R R R R R R R 1 7e 2 1e 3 0e+00 4 0e+00 5 1e 6 0e+00 7 1e 8 0e+00 9 8e 10 5e 11 3e 12 3e 13 2e 14 0e+00 15 0e+00 16 0e+00 17 0e+00 18 0e+00 19 6e 20 1e *Replication=, α=, ΔR2 =, Δβ= As seen in Table 11, there is no DIF on any items except the 5th item. The 5th item shows DIF according to the χ2 values. It is known that χ2 is affected by sample size (Tabachnick & Fidel, ; Hair et al., ). On the other hand, the R2 and β values are very small and close to 0. This shows that there are no significant DIF on the items of the ACTS. Discussion and Conclusions In this study, it was thought that the answer-copying tendency would be one of the indicators of the answer-copying behaviors. It is highly possible that the students with higher levels of tendency will be answer-copying. It is obvious that the statistical detection techniques couldn’t provide exact solutions. These known techniques need initial response patterns for defining the suspicious focal group of answer-copying. 52 Ergul DEMIR / Eurasian Journal of Educational Research 75 () Therefore, we need such indirect solutions at least in order to understand the nature of answer-copying. The “Answer-Copying Tendency Scale in University Students (ACTS)” was developed within this context. Unlike similar examples, it was studied on large groups and used more comprehensive techniques to obtain psychometric evidence. Results show that the validity and reliability levels of the ACTS are very high. The ACTS can be used to understand the nature of answer copying. Also, and more importantly, it is thought that the ACTS can be used to define suspicious answer patterns for prior distributions. References Aaron, R.M. (). Student academic dishonesty: Are collegiate institutions addressing the issue? NASPA Journal, 29(2), Aaron, R.M., & Georgia, R.T. (). Administrator perceptions of student academic dishonesty in collegiate institutions. NASPA Journal, 31(2), Angoff, W.H. (). The development of statistical indices for detecting cheaters. Journal of the American Statistical Association, 69, 44– Bolin, A.U. (). Self-control, perceived opportunity, and attitudes as predictors of academic dishonesty. The Journal of Psychology, (2), Burke, J. L. (). Faculty perceptions of and attitudes toward academic dishonesty at a two year college. (Unpublished doctoral dissertation). University of Georgia. Choi, S.W., Gibbons, L.E., & Crane, P.K. (). Lordif: An R package for detecting differential item functioning using iterative hybrid ordinal logistic regression/item response theory and Monte Carlo simulations. Journal of Statistical Software, 39(8), Crown, D.F., & Spiller, M.S. (). Learning from the literature on collegiate cheating: A review of empirical research. Journal of Business Ethics, 17, DeVellis, R.F. (). Scale development, theory and applications (Second edition). Sage Publication, Inc. Dwyer, D.J., & Hecht, J.B. (). Cheating detection: Statistical, legal and policy implications. Illinois State University. Eminoglu, E., & Nartgun, Z. (). Universite ogrencilerinin akademik sahtekarlık egilimlerinin olculmesine yonelik bir olcek gelistirme calısması [A scale development study to measure academic dishonesty tendency of university students]. Uluslararası Insan Bilimleri Dergisi, 6(1), Ergul DEMIR / Eurasian Journal of Educational Research 75 () 53 Frary, R.B., Tideman, T.N., &Watts, T.M. (). Indices of cheating on multiple-choice tests. Journal of Educational Statistics, 6, – Gardner, W.M. & Kenneth, B.M. (). A scale for measuring attitude toward cheating. Bulletin of the Psychonomic Society, 26(5), Genereux, R.L., & McLeod, B.A. (). Circumstances surrounding cheating: A questionnaire study of college students. Research in Higher Education, 36(6), Gerdeman, R.D. (). Academic dishonesty and the community college. ERIC Digest, ED Hair Jr., J.F., Black, W.C., Babin, B.J., & Anderson, R.E. (). Multivariate data analysis (seventh edition). Edinburg Gate, England: Pearson Education Limited. Holland, P.W. (). Assessing unusual agreement between the incorrect answers of two examinees using the K-index: Statistical theory and empirical support (Research Report RR ). Princeton, NJ: Educational Testing Service. Hughes, J.M.C., & McCabe, D.L. (). Understanding Academic Misconduct. Canadian Journal of Higher Education, 36(1), 49 – Maeda, H., & Zhang, B. (). An algorithm to improve test answer copying detection using the omega statistic. International Journal of Testing, 17(1), May, K.M., & Loyd, B.H. (). Academic dishonesty: The honor system and students' attitudes. Journal of College Student Development, 34, McCabe, D.L. (). Academic integrity: What the latest research shows. Synthesis: Law and Policy in Higher Education, 5(1), McCabe, D.L., & Trevino, L.K. (). What we know about cheating in college: Longitudinal trends and recent developments. Change, 28(1), McCabe, D.L., & Trevino, L.K. (). Individual and contextual influences on academic dishonesty: A multi-campus investigation. Research in Higher Education, 38(3), Mullens, A. (). Cheating to win. University Affairs, 41(10), Ozturk, B. & Yesilyaprak, B. (). Universite ogrencilerinin kopya cekme olayına iliskin tutumları [Attitudes of the university students toward copying]. Egitim Yonetimi, 3(2), Selcuk, Z. (). Bir egitim ve rehberlik sorunu: Okullarda kopya cekme [An education and consultansy problem: Copying in schools]. Kuram ve Uygulamada Egitim Yonetimi, 1(3), 54 Ergul DEMIR / Eurasian Journal of Educational Research 75 () Tabachnick, B.G., & Fidell, L.S. (). Using multivariate statistics (sixth edition). New Jersey, USA: Pearson Education. van der Linden, W.J., & Sotaridona, L.S. (). Detecting answer copying when the regular response process follows a known response model. Journal of Educational and Behavioral Statistics, 31(3), – Wesolowsky, G. (). Detecting excessive similarity in answers on multiple choice exams. Journal of Applied Statistics, 27(7), Whitley Jr, B.E. (). Factors associated with cheating among college students: A review. Research in Higher Education, 39(3), Wollack, J.A. (). A nominal response model approach to detect answer copying. Applied Psychological Measurement, 21, – Appendix A. Kopya Çekme Eğilimleri Ölçeği (KÇE) [Answer-Copying Tendency Scale in University Students (ACTS)] Aşağıdaki ifadelere katılma düzeylerinize 0 (hiç katılmıyorum) ile 10 (tamamen katılıyorum) arasında puan veriniz. [Point your participation levels below between 0 (I do not agree) and 10 (I fully agree).] Katılma Düzeyiniz [Participation Level] Kopya çekmek, insanın kendisini kandırmasıdır. [Copying answers makes a 1* fool of a person.] Fırsatını yakaladığım durumlarda kopya çekerim. [I'll copy answers when I 2 get the chance.] Kopya çekmeyi haksız bir kazanç olarak görüyorum. [I see copying answers 3* as an unfair advantage.] Kopya çekerek yüksek puanlar aldığını bildiğim arkadaşların olması bende 4 de kopya çekme hırsı uyandırıyor. [Having friends I know that get higher scores by copying answers, I also have the ambition to copy answers.] Kopya çekilmesine kesinlikle karşıyım. [I'm absolutely against copying 5* answers.] Yeterince hazırlanamadığım sınavlarda kopya çekmeyi düşünürüm. [I 6 think about copying answers for the examinations that I have not prepared enough for.] Kopya çekmek, öğretmenin emeğine saygısızlıktır. [Copying answers is 7* disrespectful to the teacher's endeavors.] Dersi geçmeye yetecek kadar kopya çekilebilir. [Copying answers can be 8 enough to pass the exam.] Bazı derslerde kendimi kopya çekmeye mecbur hissediyorum. [I feel 9 compelled to copy answers myself in some lessons.] Ergul DEMIR / Eurasian Journal of Educational Research 75 () 55 Kopya çekmem benim başarımı değil hilekârlığımı gösterir. [Copying 10* answers is not my achievement but my deceit.] Zaten unutacağım ve ileride kullanmayacağım konuları içeren derslerde 11 kopya çekmek bana mantıklı geliyor. [It makes sense for me to copy answers in lessons that I will forget and not use in the future.] Çekmesem bile sınavda kopya çekebilecek olduğumu bilmek bana güven 12 verir. [Even if I do not do it, it gives me confidence to know that I can copy answers at the exam.] Bilmediğim konulardan gelen sorularda kopya çekmek bana makul geliyor. 13 [It makes sense for me to copy answers to the questions I do not know.] Herhangi bir ceza almayacağımı bilsem kopya çekerim. [I will copy answers 14 if I know I will not be punished.] Kopya çekmenin savunulması tamamen saçmalıktır. [Defending copying 15* answers is completely nonsense.] Kopya çekilmesini önlemeye yönelik gerçekten caydırıcı önlemler alınmasını 16* istiyorum. [I would like to take real deterrent measures to prevent students from copying answers.] İşime yaramayacak onca bilgiyi ezberleyeceğime kopya çekmek daha 17 mantıklı geliyor. [It makes more sense to copy answers when I memorize so much information that will not work in my own life.] Not kaygısı beni kopya çekmeye itiyor. [Anxiety about earning high grades 18 pushes me to copy answers.] Kopya çekmenin ahlak dışı bir davranış olduğuna inanıyorum. [I believe that 19* the copying answers is immoral behavior.] Yakalanma korkum olmasa kopya çekerim. [If I am not afraid of getting 20 caught, I'll copy answers.] *İkinci faktöre (ETIK) ait maddelerdir. Ölçek toplam puanı için ters kodlanması gerekir. [Belongs to the second factor (ETIK). Should be recode for total score.] Potansiyel Bir Hata Kaynağı Olarak Üniversite Öğrencilerinde Kopya Çekme Eğilimlerinin Ölçülmesi: Bir Ölçek Geliştirme Çalışması Atıf: Demir, E. (). As a potential source of error, measuring the tendency of university students to copy the answers: A scale development study. Eurasian Journal of Educational Research, 75 (), , DOI: /ejer Özet Problem Durumu: Kopya çekmeyi de içeren akademik sahtekarlık okullarda giderek daha önemli bir sorun haline gelmektedir. Bu sorun, sınıf büyüklüklerinin artırılması ve öğretim kaynaklarının azaltılması ile artmaktadır. Kopya çekmenin tespit edilmesi 56 Ergul DEMIR / Eurasian Journal of Educational Research 75 () için bazı istatistiksel teknikler olmasına rağmen, bunlar sınırlıdır ve ancak bir hata, olası bir hata kaynağı veya bir olasılık gösterebilmektedir. Kopya çekmenin yapısını ve doğasını anlamak için bir başka yol, tutumlar ya da eğilimler gibi ilişkili diğer faktörleri dikkate almaktır. Çalışma ahlakı yüksek olan, sınav kaygısı düşük olan, öz- benlik saygısı yüksek olan vb. öğrencilerin daha az kopya çektiği gözlenmektedir. Diğer taraftan kopya çekmenin yaygınlaşması, normal bir davranış olarak görülmesi ve kabul edilmesi durumlarında, kopya çekme davranışları daha sık görülmektedir. Pek çok çalışmada öğrencilerin sahtekarlık ya da kopya çekmeye yönelik görüş ve önerileri, ancak anketler aracılığıyla gözlenmiştir. Ölçek düzeyinde olan yani toplam puan alınabilen çok az sayıda araç vardır. Bu araçların kullanımı ve erişimi oldukça sınırlıdır. Ayrıca bu araçların genellikle, küçük ve sınırlı gruplarda geliştirildiği görülmektedir. Kapsamları geniştir. Çoğunluğu, geçerlik ve güvenirlik düzeylerine yönelik az sayıda kanıt içermektedir. Çoğunlukla bu araçlar geliştireli çok zaman geçmiştir. Anlaşıldığı üzere, uzun süredir konu üzerinde çalışılıyor olmasına ragmen, geçerli ve güvenilir araçlara olan ihtiyaç devam etmektedir. Araştırmanın Amacı: Bu çalışmanın amacı, üniversite öğrencilerinin kopya çekme eğilimlerini gözlemlemek için ölçek olarak geçerli ve güvenilir bir ölçme aracı geliştirmektir. Ayrıca, daha kapsamlı geçerlilik ve güvenilirlik çalışmaları ile kanıt sağlamayı amaçlamaktadır. Araştırmanın Yöntemi: Bu bir ölçek geliştirme çalışmasıdır. “Klasik Test Teorisi” altında “Üniversite Öğrencilerinde Kopya Çekme Eğilimi Ölçeği (KÇE)” geliştirilmiştir. Ayrıca diğer teoriler, özellikle “Madde Tepki Kuramı” da dikkate alınmıştır. Ön çalışmalar ve madde yazıma çalışmalarında, 80 civarı öğrenciden kopya çekmeye yönelik görüşlerini kompozisyon biçiminde yazmaları istenmiş, bu dokumanlar analiz edilerek maddeden oluşan bir taslak form oluşturulmuştur. Bu form eğitim bilimleri alanında çalışan 6 akademisyenin görüşlerine sunulmuştur. Görüşler doğrultusunda gerekli düzenlemeler yapılarak 93 maddelik deneme formu oluşturulmuştur. Bu çalışmalardan sonra öğrenci ile deneme uygulaması ve öğrenci ile ana uygulamalar yapılmıştır. Elde edilen veriler kullanılarak, yapı geçerliği, madde ve test betimsel istatistikleri, madde ayırıcılıkları, içtutarlılık ve test-tekrar test güvenirliği, sınıflandırma doğruluğu ve değişen madde fonksiyonu ile madde yanlılığı incelenmiştir. Araştırmanın Bulguları: Deneme ve esas uygulama verileri üzerinde ayrı ayrı yürütülen “Temel Bileşenler Analizi (TBA)” sonuçları birbirini destekler niteliktedir. Bu sonuçlara göre KÇE, 2 faktör ve 20 maddeden oluşmaktadır. İlk faktör 12 maddeden oluşmaktadır ve “negatif sınav ve not algısı (NEGALGI)” olarak tanımlanmıştır. İkinci faktör 8 maddeden oluşmaktadır ve “etik değerler (ETIK)” olarak tanımlanmıştır. Açıklanan varyans yüzdeleri, birinci faktör için %, ikinci faktör için % ve toplamda %’dir. Bu faktörler arasında negatif yönlü ve orta düzey bir ilişki Ergul DEMIR / Eurasian Journal of Educational Research 75 () 57 bulunmaktadır (r= ve p<). TBA’nın yanı sıra hem destekleyici kanıt sağlamak hem görsel sunum sağlamak için ALSCAL tekniği ile Öklit uzaklıklarına dayalı çok boyutlu ölçekleme çalışması da yapılmıştır. Elde edilen saçılma grafikleri de yapının belirgin bir şekilde 2 boyuttan oluştuğunu göstermiştir. KÇE’de her bir madde 0 ile 10 arasında puanlanmaktadır. Toplam puanlar 0 ile arasında değişmektedir. Toplam puanlar ve madde puanlarına yönelik dağılımlar, normaldir. Ayrıca madde- toplam puan korelasyonlarına yönelik olarak madde ayırıcılıkları, çok yüksek ve 0,40'ın üzerinde gözlenmiştir. Güvenirlik çalışmaları olarak iki ayrı yöntem kullanımıştır. Öncelikle hem deneme uygulaması hem esas uygulama verileri üzerinde, hem test geneli hem faktörler düzeyinde α içtutarlılık katsayıları hesaplanmıştır. Bu değerler; deneme uygulaması sonuçlarına göre birinci faktör için , ikinci faktör için ve ölçek geneli için ; esas uygulama sonuçlarına göre ise sırasıyla , ve ’tir. Bu değerler çok yüksek güvenirlik düzeylerine işaret etmektedir. Ayrıca esas uygulamaya katılan 95 öğrenciye tekrar test uygulaması yapılarak test-tekrar test güvenirliği hesaplanmıştır. Bu değer de 'tür ve yüksek düzeyde güvenirliğe işaret etmektedir. Bir diğer inceleme olarak KÇE’nin öğrencileri kopya çekme ve kopya verme durumlarına göre ne düzeyde doğru sınıflandırabildiği “Diskriminant Fonksiyon Analizi” ile analiz edilmiştir. Bu kapsamda test edilen dört ayrı modelde de yüksek sınıflama doğrlukları elde edilmiştir. Madde yanlılığı çalışmaları her bir madde düzeyinde değişen madde fonksiyonlaşması incelenerek yapılmıştır. Bu amaçla, çok kategorili maddelere yönelik olarak MTK’ya dayalı bir şekilde geliştirilmiş “lordif” tekniği kullanılmıştır. Analiz sonuçlarına göre maddeler üzerinde manidar ve ciddi düzeyde bir DIF gözlenmemiştir. Araştırmanın Sonuçları ve Önerileri: Bu çalışmada ana hipotez, kopya çekme eğiliminin kopya çekme davranışlarının bir göstergesi olacağı yönündeydi. Daha yüksek düzeyde bir eğilime sahip olan öğrencilerin, kopya çekme davranışı göstermeleri olasılığı yüksektir. İstatistiksel tespit teknikleri kesin çözümler sağlayamadığından, kopya çekmenin doğasını anlamak için en azından bu tür dolaylı çözümlere ihtiyacımız var. Bu kapsamda bu çalışmada “Üniversite Öğrencilerinde Kopya Çekme Eğilim Ölçeği (KÇE)” geliştirilmiştir. Benzer örneklerden farklı olarak, bu çalışmada büyük gruplar üzerinde çalışılmış ve kanıt elde etmek için daha kapsamlı teknikler kullanılmıştır. Sonuçlar, KÇE'nin geçerlilik ve güvenilirlik düzeylerinin çok yüksek olduğunu göstermektedir. KÇE, kopya çekmenin doğasını anlamak için kullanılabilir. Ayrıca ve daha önemlisi, KÇE'nin kopya belirleme tekniklerinin öncül dağılımları için şüpheli cevap örüntülerini tespit etmek amacıyla da kullanılabileceği düşünülmektedir. Anahtar Sözcükler: kopya çekme eğilimi, sahtekarlık, ölçek geliştirme, kopya-belirleme