مطالب مرتبط با کلیدواژه
۲۱.
۲۲.
۲۳.
۲۴.
۲۵.
۲۶.
۲۷.
۲۸.
۲۹.
۳۰.
۳۱.
۳۲.
۳۳.
۳۴.
Validity
حوزههای تخصصی:
The objective of this study was to validate a bilingual Spanish-English version of the Vocabulary Size Test (VST) considering its potential use as a discriminator between learners in terms of language competence. This version was designed based on the two forms available on one of the creators’ websites as well as considering practices recommended regarding the elimination of cognates and loans. A one-way ANOVA test was used to confirm the test’s capacity to discriminate among learners of different linguistic competence. Additionally, Principal Axis Factoring (PAF) was conducted to revise the existence of only one underlying variable. As a result of this study, a VST version for Spanish speakers consisting of 9 vocabulary frequency levels is shared. This version is in line with validation standards put forward in previous research. It is expected that this instrument will help future studies that seek to measure Spanish speakers’ competence in English as a foreign or second language without having to deal with the interference of other intervening factors.
Psychometric Evaluation of Cloze Tests with the Rasch Model
منبع:
International Journal of Language Testing, Volume ۱۲, Issue ۲, Summer and Autumn ۲۰۲۲
95 - 106
حوزههای تخصصی:
Cloze tests are gap-filling tests designed to measure overall language ability and reading comprehension in a second language. Due to their ease of construction and scoring, cloze tests are widely used in the context of second and foreign language testing. Previous research over the past decades has shown the reliability and validity of cloze tests in different contexts. However, due to the interdependent structure of cloze test items, item response theory models have not been applied to analyze cloze tests. In this research, we apply a method to circumvent the problem of local dependence for analyzing cloze tests with the Rasch model. Using this method, we applied the Rasch model to a cloze test composed of eight passages each containing 8-15 gaps. Findings showed that the Rasch model fits the data and thus it is possible to scale persons and cloze passages on an interval unidimensional scale. The test had a high reliability and was well-targeted to the examinees. Implications of the study are discussed.
Reliability and Validity of Self-Assessments among Iranian EFL University Students
منبع:
International Journal of Language Testing, Volume ۱۳, Issue ۱, Winter and Spring ۲۰۲۳
225 - 235
حوزههای تخصصی:
Modern teaching practices emphasize learner autonomy and learner-centered approaches to language learning. Such teaching methods require corresponding assessment approaches. Self-assessment is viewed as an assessment mode which matches modern learner-centered teaching methodologies. However, the validity and reliability of self-assessments are not yet conclusively established. This study aimed to provide validity and reliability evidence for self-assessments among Iranian EFL university learners. The Common European Framework of Reference (CEFR) Self-Assessment Grid was translated into Persian and was given to a sample of Iranian undergraduate students of English. A C-Test battery containing four passages was used as a criterion for concurrent validation. Self-assessments of university EFL learners were examined for internal consistency and test-retest reliability. Findings showed that while self-assessments are highly reliable they lack validity as evidenced with low correlations between components of self-assessment grid and the C-Test. The implications of the study for the application of self-assessments in foreign language education are discussed.
Validity and reliability of the Iranian force plate(مقاله علمی وزارت علوم)
منبع:
Sport Sciences and Health Research, Volume ۱۴, Issue ۱,۲۰۲۲
109 - 114
حوزههای تخصصی:
Background: Force plates are widely used in biomechanics and sports sciences to measure various aspects of human movement. The accuracy and reliability of force plate measurements are critical for valid data interpretation. Aim: The purpose of the study was to evaluate the validity and reliability of the Iranian force plate in the vertical, anterior-posterior, and medial-lateral directions using two manual dynamometers and a load cell. Materials and Methods: In this study, the force plate device utilized had a frequency of 1200 Hz and was manufactured by the Danesh Salar Iranian Company. Additionally, to determine the device's validity, we used Lafayette hand-held dynamometers manufactured in the United States and a load cell by Zemik. Pearson's correlation coefficient was employed to determine the validity of the force plate, while the internal consistency coefficient (ICC) was used to assess the force plate's reliability. Results: The study findings indicated a significant and high level of reliability between the maximum force obtained from the force plate device and manual dynamometer devices and load cell. Additionally, the internal consistency coefficient was found to be excellent (very high) for 20 trials in the three directions of vertical (0.98), anterior-posterior (0.96), and medial-lateral (0.97). Conclusion: The study demonstrated that the Iranian force plate is a reliable device for measuring maximum force in the three directions of vertical, anterior-posterior, and medial-lateral, with very high validity.
Investigating Psychometric properties of the Scale of Emotional Experience towards the Spouse(مقاله علمی وزارت علوم)
حوزههای تخصصی:
The objective of the present study was to establish and assess the psychometric properties of the scale of emotional experience towards the spouse in 2018-19. For this purpose, all the married women in the city of Isfahan were considered as the statistical population from which 300 married women were selected as the statistical sample using convenience sampling. The research instruments included the scale of emotional experience towards the spouse, extroversion and introversion subscales of NEO Personality Inventory (Costa & McCrae, 1992), and triangulation (Dehghan and Yousefi, 2019). The data were analyzed using descriptive statistics, mean and standard deviation) and inferential statistics (correlation analysis, exploratory factor analysis, and norm determination. Convergent validity and divergent validity results revealed that the subscale of negative emotional experience towards the spouse was significantly positively related to neuroticism and triangulation (convergent validity), but negatively related to extroversion (divergent validity). The subscale of positive emotional experience towards the spouse, on the other hand, had a positive relationship to extroversion (convergent validity) and a significantly negative relationship to neuroticism and triangulation (divergent validity). Exploratory factor analysis showed two basic factors called positive emotional experience towards the spouse and negative emotional experience towards the spouse. Test-retest coefficients, at a three-week interval, confirmed test-retest reliability. Thus, based on what the results revealed, this test can be used to assess the scale of emotional experience towards the spouse in the married women both in research and psychotherapy.
A systematic review of validity and reliability assessment of measuring Spasticity Evaluation Tool and Wheelchair Skills Tests at the level of international classification of functioning, disability and health (ICF) in people with spinal cord injury(مقاله علمی وزارت علوم)
منبع:
Sport Sciences and Health Research, Volume ۱۵, Issue ۲, ۲۰۲۳
203 - 217
حوزههای تخصصی:
Background: Assessment of spasticity and wheelchair skills performance is important in both clinical practice and research.Aim: The present study aimed to systematically review the psychometric properties (reliability and validity) of outcome measures used to assess spasticity and wheelchair skill tests in people with spinal cord injury.Materials and Methods: A search was conducted using terms through PubMed, Embase, Scopus, and Web of Science databases. Related articles included measures of spinal cord injury patients published in English from 2010 to 2021.To determine the publication quality of studies COSMIN checklist was used.Results: A total of 2150 potentially eligible studies were retrieved from four databases. The remaining 20 full-text studies were retrieved for complete review. Finally, 12 studies involving a total of 658 participants were included in the systematic review.Conclusion: Ethical, safety, and psychological issues were considered during the test for people with disabilities. According to previous studies, the Spasticity Evaluation Tool has been suggested as a reliable tool for assessing spasticity in SCI subjects. However, due to the variety of tests and the elimination of selected tools, wheelchair skills tests cannot be recommended.
Validation of C-Test among Iraqi EFL University Students
حوزههای تخصصی:
This study aims to assess the performance of university students through the C-Test and to analyze the extent to which this test is valid in measuring language ability. A standardized C-Test has been created with four brief passages, each containing 20 gaps. The length of each passage varied from 95 to 109 words. Throughout each passage, only the first and last sentences were not changed. The test was taken by 100 students; 39 were male and 61 were female at Al-Nisour University/Department of English in Baghdad, Iraq. The sample consists of two groups. Both groups come from the same school and would receive similar educational input in both cases based on their grade level. The validity and reliability of the C-Test were investigated using various techniques. The study analyzed the performance of Stage 4 and Stage 3 students on the Common Language Proficiency Test in Iraq. The results showed that the test discriminates well between high-ability and low-ability examinees, with no significant difference between the two groups. The Rasch model separation reliability was relatively high, and the data were one-dimensional. The students faced difficulties in guessing the most appropriate words due to their limited English proficiency. The results suggest that developing and implementing this test could significantly improve students' academic achievements in basic foreign language classes in Iraq.
Normative Study and Psychometric Properties of the Digital Quotient Test in Children and Adolescents Aged 8-18 in the Iranian Community
حوزههای تخصصی:
Objective: Digital Quotient (DQ) refers to a comprehensive set of digital competencies derived from universal ethical values that aim to enhance human interaction with, control, and create technology. The present study aimed to establish norms and examine the psychometric properties of the Digital Quotient Test in children and adolescents aged 8-18 in the Iranian community.Methods: This study's statistical population included students of the First and Second Elementary Schools and the First and Second Secondary Schools of Tehran in the academic year 2020-2021. A total of 521 students (277 girls and 244 boys) were examined using a convenience sampling method. To analyze the data obtained from the test, inferential statistics to determine construct validity, Pearson correlation matrix, and test-retest reliability using SPSS software version 26.Results: The results indicated that the construct validity of the Digital Quotient Test, using the internal consistency between its eight domains and the total score as evidence for this validity, was found to be appropriate (P < 0.05). Using the test-retest method with a coefficient of 0.872, the test reliability was estimated to be appropriate (P < 0.01).Conclusion: The Digital Quotient Test has appropriate validity and reliability in children and adolescents aged 8-18 years in Iranian community.
Validity of the Persian translation of the COVID-19 Attitudes and Behaviors (ACAB)(مقاله علمی وزارت علوم)
حوزههای تخصصی:
Introduction: Of particular global concern is the coronavirus disease of 2019 (COVID-19) outbreak. All Persian versions of COVID-19 measures assess the intrapsychic aspects of it, and there is a crucial need to measure the intergroup aspects of this pandemic. Aim: The current study aims to validate the Persian version of COVID-19 attitudes and behaviors in the Iranian sample. Method: The participants included 250 people from all over Iran in cyberspace who were selected availability (177 men and 73 women). They voluntarily participated in the study by filling out questionnaires that were made available through Google Forms and then disseminated online. Results: The ACAB scale had satisfactory reliability and validity according to content, face, and construct validity tests except for the first subscale (social distancing adjustment). Consequently, confirmatory factor analysis supported the ACAB with 12-item and three subscales. Therefore, three subscales remained, including self-prioritization, prosocial behaviors, and belief in conspiracies, and social distancing adjustment was eliminated because the factor loading values of its items were less than 0.4. Conclusion: Results indicated that the ACAB is a reliable and helpful tool in research, especially for governmental surveys to understand why people do not cooperate in vaccination or prosocial behaviors.
The validity and reliability of the Persian version of passions athlete adults
حوزههای تخصصی:
The passion scale mainly focuses on the passion for achievement or becoming good in some area/theme/skill. This study aimed to translate the passion scale and assess reliability and content and construct validity for the passion scale in athletic adults in Tehran city. A cross-cultural translation was used to generate a Persian-English version of the passion scale. A total of 200 athletes adults with age 26/79 ± 5/01 completed Persian version of passion scale (PS), enabling us to investigate its feasibility, content validity, internal consistency, construct validity and test-retest reliability. 30 athletes adults stated that all the questionnaire items were simple, clear, and related to the objectives. The overall pattern of results suggests that the scale for passion presented here is applicable for the age studied. The calculated CVI and CVR were 0.94 and 0.91, respectively. All individual item scores correlated positively with the total score, with correlations ranging from 0.67 to 0.81. The Cronbach's alpha value for the standardized items was 0.88. Pearson correlations coefficient between total score passion scale and Grit-S scale were 0.53 for athletic adults. Intra class correlation coefficients (ICCs) between test and retest scores for the total score was 0.92. The results of this study showed that this Persian version of passion scale in athletes adults has a good validity and reliability and can be used in investigating passion of athletes adults.
Examining Indicators of Validity in Online Formative Assessment: Insights from Iranian EFL Teachers(مقاله پژوهشی دانشگاه آزاد)
منبع:
The Journal of English Language Pedagogy and Practice, Vol.۱۶, No.۳۳, Fall & Winter ۲۰۲۳
201 - 223
حوزههای تخصصی:
Valid online formative assessments are crucial for accurate measurement of students' progress and effective pedagogical decision-making in digital learning. This quantitative-based study followed two primary aims. First, it aimed to investigate the extent to which Iranian EFL teachers working in universities and language institutes apply indicators of online formative assessment validity. Twenty-one online classrooms were observed in three sessions using a checklist. The second aim of this study was to determine the effect of EFL teachers’ place of living on the validity of online formative assessment. To this end, 316 Iranian EFL teachers from diverse EFL settings, including public schools, private schools, language institutes, and universities were asked to fill out online formative assessment validity scale developed by Maleki et al. (2023). The sample included both male and female teachers with varying age group ranges and academic degrees. The findings of the study indicated that Iranian EFL teachers in universities and language institutes tend to overlook indicators associated with the learner-centered aspects of online formative assessment validity. Furthermore, it was revealed that EFL teachers’ place of living could impact the validity of online formative assessment. This study has several implications for online EFL teachers and policymakers. The findings of this study emphasize the context-bound nature of validity in online formative assessment. Besides, it helps Iranian EFL teachers identify specific areas that need more attention and improvement in order to enhance the validity of their online formative assessments.
Reliability and Concurrent Validity of a Clinometer+ Bubble Smartphone Application to Assess Hamstring Muscle Length
حوزههای تخصصی:
Purpose: Muscle flexibility is a component of physical fitness. Using traditional tools in muscle length evaluation tests creates challenges. Therefore, the use of smartphones and health-related software as an alternative method has become widespread. This study aimed to investigate smartphones' intra- and inter-rater reliability and validity for measuring hamstring muscle length. Method: In a blinded study design, two researchers measured hamstring flexibility through four types of tests on each of the 22 asymptomatic participants with a total of 44 lower limbs. The measurements were compared between the traditional goniometer method and the practical smartphone application method. The intraclass correlation coefficient (ICC) was used to evaluate the reliability of each smartphone measurement, and Bland-Altman analysis was used to check the measurement errors. The validity of the two methods was also investigated. Results: Intra- and inter-rater reliability (ICC≥0.8) were good to almost perfect. In intra-rater reliability, PSLR angle showed consistent imprecision; other tests were free of systematic error and measurement error. The inter-rater reliability revealed a constant error in the right leg's PKE angle. A good to excellent correlation (r = 0.817–0.699) was observed in all the measured values, indicating the two methods' validity. Conclusion: These findings support from intra- and inter-rater reliability and validity of both instruments when measuring hamstring muscle length.
Cross-cultural adaptation of the Short forms of Wisconsin Schizotypy Scales: Psychometric evaluation in Iran(مقاله علمی وزارت علوم)
منبع:
مطالعات روان شناسی بالینی سال ۱۴ بهار ۱۴۰۳ شماره ۵۴
1 - 10
حوزههای تخصصی:
Introduction and Objective: Since their development the Wisconsin Schizotypy Scales have been extensively used to assess schizotypy in clinical and nonclinical samples. The purpose of this study was to examine the psychometric properties of the short forms of the Wisconsin Schizotypy Scales (SWSS) in an Iranian population.
Research Methodology: This research was a correlational design. Three hundred and twelve participants from universities students along with thirty-four schizophrenic patient, and fifty theirs first degree relatives were included in this study. Participants answered to Persian version of SWSS, Short form Oxford-Liverpool Inventory of Feelings and Experiences (SO-LIFE) Questionnaire and Coleridge’s STA scale.
Findings: The results showed internal consistency in terms of Cronbach's α, and test–retest reliability. The results of the two methods showed a good reliability for the shortened WSS. Concurrent validity was tested by comparing with SO-LIFE and STA which showed acceptable relationship. Differential validity was tested by comparing SWSS scores between schizophrenic patients, theirs first degree relatives and normal people which was acceptable.
Conclusion: Moreover, the factor structure of Persian version of SWSS showed two factor solutions, and resembles that seen in previous related studies, providing further cross-cultural empirical evidence of the two factorial structures of the WSS. The present results provide the further demonstration of the validity of the shortened WSS and support their use in the study of schizotypy particularly among Iranian population.
Computer based Stroop Animal Size Test in Children: Construction, Validation and Psychometric Properties(مقاله علمی وزارت علوم)
منبع:
مطالعات روان شناسی بالینی سال ۱۴ بهار ۱۴۰۳ شماره ۵۴
33 - 44
حوزههای تخصصی:
Objective: An important cognitive control process is the ability to inhibit, that mature at different rates during childhood to adolescence. The aim of our research was to construct and validate computer-based Stroop animal size test in children for measuring selective attention and response inhibition.
Research Methodology: In this survey study, we prepared the test, after the approval of experts and software development, at first test-retest reliability with a two week was evaluated in a separate sample (N=50, 5-12 years old children), then data was collected from 92 children 5-12 years old (46 girls and 46 boys) included 22 children with attention deficit/hyperactivity disorder (ADHD) studied in the academic year 2022-2023 in Alborz province. Participants decided the real size of animals by pressing response keys on computer. ANOVA, Multivariate analysis, Pearson correlation coefficients and Cronbach α were used to assess reliability and validity (p<0.05).
Findings: The findings showed test-retest reliability in significant range (p<0.01). The correlation was high for Stroopnum, (r=0.83), Inconsistent answers, (r=0.72), Wrong answers, (r=0.89) but lower for Consistent answers, (r=0.28) and Reaction time, (r=0.41). To assess the internal consistency, Cronbach Alpha 0.91 computed. ANOVA analysis for comparing children's function in different age groups was (p<0.000).
Conclusion: Multivariate analysis was used in comparing children with ADHD to the control group, results showed significant difference between groups in Stroop components, (p<0.007). This computerized Stroop animal size test had satisfactory reliability and validity, that can measure cognitive functions such as selective attention and inhibition in children.