Multidimensional computerized adaptive scholastic aptitude test program used for grade 9 students under different reviewing test conditions

The aim of this research is to study the accurate prediction of comparing test information and evaluation result by multidimensional computerized adaptive scholastic aptitude test program used for grade 9 students under different reviewing test conditions. Grade 9 students of the Secondary Educational Service Area Office in the Northeast of Thailand, in 2014 academic year were the sample used in this research. The research materials were two test programs: the test program that allows reviewing of answers and the one that does not allow reviewing of answers. The manual of the test program and evaluation form of the multidimensional computerized adaptive scholastic aptitude test program used for grade 9 students in different reviewing test conditions showed that: 1) The test program is an accurate predictor of the students’ achievement, as verbal factor correlates with the students’ achievement in five core subjects. The aptitude tests on number and reasoning correlate with the students’ achievement in Mathematics, science and social science. The aptitude test on space correlates with the students’ achievement in Mathematics and science. 2) The test program has statistical and significant difference at .05 level. 3) The evaluation result of the two test programs has statistical significance at .05 level.


INTRODUCTION
The work studies grade 9 adolescent students, who have many problems to deal with.Most students choose to study what their friends study, regardless of their aptitude.This corresponds with many research works that have discovered that aptitude influences one's decision making on what to study in future.Pha-on (2010) studied the factors that affected the exciting learning of grade 9 students under Primary Education Service Area Office in Saraburi.In the study, it was found out that the highest average was obtained by the students who chose to study based on their aptitude.As a result, education management must provide content and learning activities *Corresponding author.E-mail: dear.dear888@gmail.com.
Authors agree that this article remain permanently open access under the terms of the Creative Commons Attribution License 4.0 International License that are in line with the aptitude and differences of the individuals in order to make the students perfect all way round, be able to find a job and live happily with other people.Each school must provide flexible learning process for students to choose according to their aptitude and interests (Ministry of Education, 2010).However, aptitude is an innate ability that cannot be directly realized, so testing theory is used as a tool to scale aptitude.
Scholastic aptitude test, mostly made and developed from Multi-Factor Theory by Thurstone, is used to measure students' academic ability.This results from the processing of knowledge and experience gained by students which in turn will enable them to have success in their choice field of study as well as success in their future occupation.Moreover, the test is also used in qualifying examinations, classifying students, diagnosing their capabilities, measuring the development of learners, predicting success, comparing intelligence, evaluating school-record and research.The research showed that scholastic aptitude correlated with students' academic achievement and the ability to predict scholastic achievements, which were mostly verbal, number, reasoning and space factors (Loard and Nicely, 1997;Morton, 2004).The test where students mostly write on answer sheet is called paper and pencil test or conventional test.It had several weak points, and so theorists developed and reformed it from conventional test to modern test theory.
Currently, computerized adaptive test is based on item response theory.Multidimensional computerized adaptive test is implemented in each dimension separately, and each dimension has a high relevance, that is, using multidimensional item response theory.Segall suggested that this theory should be used for selection of limited number for each content.This is consistent with the teaching of today, which focuses on integrating more knowledge.So, measurement is intended to change in part with a focus on measurements of performance that are more complex (Junpeng, 2007).Computerized adaptive test still has issues that need to be discussed.After the test is completed, the examinees should be given the opportunity to review answers.Vispoel (1998), Olea et al. (2000) Vispole et al. (2000) and Revuelta et al. (2003) found out that most examinees like to review answers in order to reduce anxiety.After reviewing the answers, the number of correct answers will increase; but researchers, like Wainer et al. (1993) and Green et al. (1984) have proposed it, stressing that opportunities should not be given for reviewing answers due to limited time of the test process.Previously in the country, there were only a few studies on unidimensional computerized adaptive test.One of such was that of Pimsiri (2006) who found that there was no difference between the examinees who had high level of ability and those with average ability under conditions of not reviewing and reviewing answers.But there was statistical significant difference (.05 level) between examinees with medium and low level ability and examinees with average ability under the conditions of not reviewing and reviewing answers.
The researcher is interested in developing the multidimensional computerized adaptive scholastic aptitude test program under different reviewing test conditions, focusing on verbal, number, reasoning and space factors.In addition, this program will guide further study of both general and vocational education consistent with testing the ability of individual examinees.It will be used as information for further study and as a guide for students to realize their aptitude.It will also be useful for students who are going to graduate from grade 9 and as a method for developing multidimensional computerized adaptive test in details.

Research questions
1. How are the test program that allows reviewing of answers, and the one that does not allow reviewing of answers accurate predictors ? 2. Are they any differences between information of the test program that allows reviewing of answers and the one that does not allow reviewing of answers? 3. Do the examinees have different opinions about the test program that allows reviewing of answers and the one that does not allow reviewing of answers?
The purpose of the study 1.To study the accuracy of Multidimensional Computerized Adaptive Scholastic Aptitude Test Program to predict 2. To compare the test information of multidimensional computerized adaptive scholastic aptitude test program used for grade 9 students under different reviewing test conditions.3. To compare the evaluation of multidimensional computerized adaptive scholastic aptitude test program used for grade 9 students under different review test conditions.

METHODOLOGY Materials
The materials used include 1, multidimensional computerized adaptive scholastic aptitude test program for grade 9 students; it has 2 programs: program that allows reviewing of answers and program that does not allow reviewing of answers.2. Manual of multidimensional computerized adaptive scholastic aptitude test program for grade 9 students under different reviewing test conditions.3. Evaluation form of multidimensional computerized adaptive scholastic aptitude test program for grade 9 students under varying reviewing test conditions.

Development of materials used for data collection
Multidimensional computerized adaptive scholastic aptitude test program for grade 9 students: this program selects item based on the ability of each examinees; that is, each examinee is given different items.The examinee gets 1 score if the answer is right and 0 score if the answer is wrong.See the procedures below: 1. Manual of program design includes information about the program, objective and utilization of the program, preliminary agreement, installations, running of the program, and definitions of specific terms.2. Preparing the manual of the program 3. Experts verify the quality of the manual 4. The recommendation of the experts was improved and the complete manual was developed.
Evaluation form of multidimensional computerized adaptive scholastic aptitude test program for grade 9 students under different reviewing test condition: The procedures are as follows: 1. Learning and synthesizing how to create evaluation form related documents.2. Building evaluation form using psycho-social criteria given by Sympson; they consist of 4 aspects : 1) the statement and how to perform the exam, 2) Attraction of test program, 3) The anxiety in the test and 4) The general opinion about test using computer program 3. Experts verify the quality of evaluation form 4. The recommendation of the experts was improved and a complete evaluation form was developed.Data collection is divided into 4 steps as follows: Step 1: Developing the test bank of scholastic aptitude (Figure 1).
Step 3: Using the multidimensional computerized adaptive scholastic aptitude test program (Figure 3).
Step 4: Evaluating the multidimensional computerized adaptive scholastic aptitude test program (Figure 4).

Data analysis
To study the predictive accuracy of the test programme, Pearson Product moment correlation was used to analyze the relationship between the mean of the examinees' total ability from the test program and mean of the examinees' average ability from school record.Independent sample t-test was used to compare and analyze the test information; its formula is given below (Hambleton and Cook. 1977

RESULTS
Results on the predictive accuracy of multidimensional computerized adaptive scholastic aptitude test program for grade 9 students under different reviewing test conditions showed that the test program was accurate in predicting the students' achievement (Table 1).Verbal factor was related with the students' achievement in 5 core subjects.Number factor and reasoning factor had relation with the students' achievement in Mathematics, Science and Social Science but had no relation with the students' achievement in Thai and English.Space factor was connected to the students' achievement in Mathematics and Science but had no relation with the students' achievement in Thai, Social and English subjects.
The results of comparison in test information form from multidimensional computerized adaptive scholastic aptitude test program for grade 9 students under different reviewing test conditions: The researcher did a random sampling of the two research programs, which have the same amount.The result showed that multidimensional computerized adaptive scholastic aptitude test program for grade 9 students under different reviewing test conditions had different test information form with statistical significance at .05 level.The test program that allows reviewing test condition has average test information higher than the test program that does not allow reviewing test condition (Table 2).
Results of comparing evaluation of multidimensional computerized adaptive scholastic aptitude test program for grade 9 students under different review conditions: the result of the two programs was statistically significant at .05 level.When analyzing the issue of evaluation in 4 parts, the result of explanation and test operation in terms of general opinion about using computer program for test was statistically significant at .05 level.In three sections, the test program that allows reviewing of answers has high average greater than the test program that does not allow reviewing of answers.For assurance of the test program, there is no difference in the evaluation as shown in Tables 3-4.

DISCUSSION
The test program was an accurate predictor of the students' achievement.The computerized adaptive scholastic aptitude test was appropriate for testing the examinees' ability.So, this result is consistent with the reality that verbal factor affects learning of general communication, understanding the meaning of conversation, listening to explanation and reading of the main idea for comprehension of each subject.Chinese aptitude is related with 5 core subjects, while quantity aptitude is related to number, calculation and reasoning factors, which are the basic characteristics necessary for the students.Therefore, these two aptitude factors correlate with Mathematics, Science and Social Science.Space factor is the basic factor that affects imagination and vision, which are connected to Mathematics and Science.Moreover, the sample in this research had moderate to high ability and cooperated in the test as well.So, item response showed a careful reflection.The result of this study is in line with the study of Philatun and On-uma (2008: 77) who found that the good predictor of students' Mathematics achievement involved the students' aptitude such as number, space, reasoning and verbal factors.
Multidimensional computerized adaptive scholastic aptitude test program for Grade 9 students under different reviewing test conditions has different test information, which is statistically significant at .05 level.The computerized adaptive scholastic aptitude test program that allows reviewing of answers made the mean ability of the examinees higher than test program that does not allow reviewing of answers.In calculating the information test, the mean ability of the examinees were calculated.The test program that allows reviewing of answers and the test program that does not allow reviewing of answers had different information.This is connected to the study of Tienoraset (2006) that compared the average examinees' mean ability under the condition of not allowing reviewing of answers and allowing reviewing of answers.The examinees with medium and low ability and those with average ability under the condition of not allowing reviewing of answers and allowing reviewing of answers had statistical and significant difference at .05 level.However, there was no difference between the examinees with high ability and those with average ability in the condition of not allowing reviewing of answers and allowing reviewing of answers.From the study of Vispoel et al. (2000) on computerized adaptive testing which allows reviewing of answers, it is found that the mean ability of the examinees slightly increased after reviewing of answers.Therefore, in this research , the examinees from allowing reviewing of answers test program had higher mean than the program that does not allow reviewing of answers; so, allowing of reviewing of answers test program has higher information test mean than the test program that does not allow reviewing of answers.
The result of the two evaluation programs is statistically significant at .05 level.The test program that allows reviewing of answers is greater than the test program that does not allow reviewing of answers.The examinees may likely want to review answers in order to be confident in the test; so, the opinions about the test program that allows reviewing of answers and the one that does not allow reviewing of answers differ.The research sample tested with the program that allows reviewing of answers has more satisfaction than the ones tested with the test program that does not allow reviewing of answers.However, for test program, it is necessary to use interesting and fashionable program as well as computer, which is not related to reviewing or not reviewing of answers.Therefore, in the research result, there is no difference between these two programs with the use of computer.Vispoel (1998) found that mostly of the examinees need to review answers and insisted that the examinees had positive opinion and satisfaction towards the test program that allow reviewing of answers more than the one that does not allow reviewing of answers.

Figure 1 Figure 1 .
Figure 1 Shows the development of test bank of scholastic aptitude

Figure 2 .
Figure 2. Development of multidimensional computerized adaptive scholastic aptitude test program.

Figure 3 .
Figure 3. Using the multidimensional computerized adaptive scholastic aptitude test program.

Figure 4 .
Figure 4. Evaluating the multidimensional computerized adaptive scholastic aptitude test program.

Table 1 .
Results of the correlation between mean of the examinees' total ability from test program with mean of the examinees' ability from average school record.

Table 2 .
The comparison result of information test form of multidimensional computerized adaptive scholastic aptitude testing program under varying review test conditions.
* statistical significance at .05 level

Table 3 .
The results using one-way MANOVA for comparison of evaluation of the test program *statistical significance at .05 level

Table 4 .
The results of using one-way MANOVA to make comparison in evaluation test program.