Date of Award


Document Type


Degree Name

Master of Science (MS)


Educational Foundations & Research


The purpose of this study was to design an instrument that would measure the tumbling and apparatus skill proficiency of male physical education majors of the University of North Dakota.

Two groups were used in the study. An experimental group of fifteen subjects, which was taking the required tumbling and apparatus course five times weekly; and a control group of ten subjects that had taken the same class the semester before, were utilized in this study.

The two groups were given the initial test for purposes of determining item validity. The experimental group was also given a retest to determine test item reliability.

Two statistical comparisons were made: (1) a within group comparison between the initial test and retest means given to the experimental group, and (2) a comparison between the means on the initial test of both groups in the areas tested. The null hypothesis was assumed in analyzing the significance of the difference between the means, of the within group comparison for reliability, at the .05 level. The .10 level of significance was used to determine item validity in the between group comparison.

The results of the within group comparison showed a significant difference in eight of the twenty-two items tested, and therefore these items were rejected. The between group comparison indicated six items to be significant.

It was concluded that the six items found statistically significant in both between group comparisons were reliable and valid test items for measuring tumbling and apparatus skill proficiency, fore, these test items made up the final test battery.