Statistics of KIT Intelligence Test

© April 2006 Paul Cooijmans

Because of the unusual nature of this test, this report first considers its standard error of measurement. As the KIT can be taken multiple times, the standard error is directly observed in the standard deviation of the scores of repeat KIT takers: this standard deviation is equal to the standard error of measurement.

For better understanding, one must know that the KIT has an internal method to estimate the standard error (SE) of each reported score. In its initial period of use, until early March 2006, scores were consistently reported with an estimated SE of 2.5 GT points or less, that is, .25 standard deviation on the high-range scale. From early March 2006 on, scores are reported more tightly, with an estimated SE of 1.5 GT or less, to narrow the range of scores achieved by any individual. This is done to reduce the chance that someone taking the KIT many times will at some point "hit the jackpot" (get a score far above his real intelligence level).
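The effect of the tighter reporting on the "jackpot" risk can be illustrated numerically. The following is a minimal sketch and not part of the KIT's actual procedure: it assumes that one person's scores are independent and normally distributed around the real level with the given SE, and the function name, the 5-point margin and the 10 attempts are arbitrary choices for illustration.

```python
from statistics import NormalDist


def jackpot_probability(se_gt: float, margin_gt: float, attempts: int) -> float:
    """Chance that at least one of `attempts` scores lies `margin_gt` or more
    above the real level, ASSUMING independent, normally distributed errors."""
    # Probability that a single attempt stays below real level + margin.
    p_below = NormalDist(mu=0.0, sigma=se_gt).cdf(margin_gt)
    # Complement of "every attempt stays below the margin".
    return 1.0 - p_below ** attempts


if __name__ == "__main__":
    # Compare the initial (2.5 GT) and the tighter (1.5 GT) reporting convention.
    for se in (2.5, 1.5):
        p = jackpot_probability(se_gt=se, margin_gt=5.0, attempts=10)
        print(f"SE {se} GT: P(at least one score 5+ GT too high) = {p:.4f}")
```

Under these assumptions the risk grows with the number of attempts and shrinks sharply when the SE is reduced, which is the motivation given above for the tighter reporting.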

Note that such an event (a jackpot score) is theoretically possible on any IQ test if one keeps taking test after test. But with the KIT this risk is greater because one can take it many times. This report deals with KIT scores from the initial period, with the estimated SE at 2.5. Here are the scores of the repeat takers:

Score series of repetitive KIT takers

So the actual SE, computed as the weighted average of these standard deviations (SD), is 2.08 GT, or .208 SD on the high-range scale. That would be quite acceptable for a normal IQ test. But because the KIT can be taken many times, one would want it to be smaller. We may later see whether the current convention of reporting scores with an estimated SE of 1.5 indeed results in a smaller actual SE. To avoid confusion: a smaller SE does NOT mean LOWER scores on average, but a narrower range of scores for any individual.
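The computation described above can be sketched as follows. This is an illustration under assumptions, not the exact procedure used for this report: the score series are HYPOTHETICAL (the real series are not reproduced here), each person's SD is weighted by that person's number of attempts, and the population form of the SD is used. The function name is invented for the example.

```python
from statistics import pstdev

GT_SD = 10.0  # SD of the GT scale in GT points (2.5 GT = .25 SD)

# HYPOTHETICAL score series, for illustration only.
score_series = {
    "taker A": [52, 55, 53, 56],
    "taker B": [47, 49],
    "taker C": [60, 58, 61],
}


def weighted_mean_sd(series: dict[str, list[float]]) -> float:
    """Average the per-person SDs, weighting each by its number of attempts
    (the choice of weights is an ASSUMPTION of this sketch)."""
    total_weight = sum(len(scores) for scores in series.values())
    weighted_sum = sum(len(scores) * pstdev(scores) for scores in series.values())
    return weighted_sum / total_weight


se_gt = weighted_mean_sd(score_series)
# Express the SE both in GT points and in SD units of the high-range scale.
print(f"SE = {se_gt:.2f} GT = {se_gt / GT_SD:.3f} SD")
```

The last line performs the same unit conversion as in the text: dividing a value in GT points by 10 gives the value in SD units, so 2.08 GT corresponds to .208 SD.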

The usual statistics based on first attempts only

Scores so far

# testees: 13
Mean: 53.62
SD: 4.91
Male mean: 53.62
Female mean: NaN (0 persons)

44 *
48 *
49 *
50 *
53 **
54 *
55 ***
58 *
60 *
63 *

Note: there are a few more scores that are not considered here because they are "or lower" scores of people whose real level is far below the "or lower" score (because they chose items that were far too hard), so including them would corrupt the statistics.
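The summary figures above can be reproduced from the tally of first-attempt scores. The short check below uses the population form of the standard deviation (dividing by n); that choice is inferred from the reported value rather than stated in the report.

```python
from statistics import mean, pstdev

# Tally from the report: score -> number of first-attempt testees.
tally = {44: 1, 48: 1, 49: 1, 50: 1, 53: 2, 54: 1, 55: 3, 58: 1, 60: 1, 63: 1}

# Expand the tally into one entry per testee.
scores = [score for score, count in tally.items() for _ in range(count)]

n = len(scores)
avg = mean(scores)
sd = pstdev(scores)  # population SD (divides by n, not n - 1)

print(f"# testees: {n}")
print(f"Mean: {avg:.2f}")
print(f"SD: {sd:.2f}")
```

This yields 13 testees, a mean of 53.62 and an SD of 4.91, matching the reported figures. The sample form (dividing by n - 1) would give about 5.11 instead.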

Correlation with other tests

Test                                               n      r
Long Test For Genius                               3   0.98
Qoymans Multiple-Choice #3                         5   0.98
Analogies of LTFG                                  3   0.97
Spatial Insight Test                               3   0.97
Final Test                                         3   0.86
Nemesis Test                                       3   0.83
Cartoons of Shock                                  4   0.78
Space, Time & Hyperspace                           4   0.77
Genius Association Test                            7   0.73
Test To End All Tests                              5   0.72
Bonsai Test                                        6   0.67
Qoymans Multiple-Choice #4                         7   0.56
Test of Shock and Awe                              4   0.46
Spatial section of Test For Genius Revision 2004   5   0.43
Test For Genius - Revision 2004                    5   0.30
Verbal section of Test For Genius Revision 2004    5  -0.02
Association Subtest of LTFG                        4  -0.29
Cattell Culture Fair                               3  -0.65

Weighted average of correlations: 0.52

Estimated minimum g loading: 0.72

The ranking in the above table is based on the unrounded correlations.
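The weighted average of correlations and the estimated minimum g loading reported above are numerically consistent with the g loading being the square root of the weighted average. That relation is inferred from the two numbers; it is not stated in this report, so the check below should be read as an observation, not as a description of the method.

```python
import math

weighted_average_r = 0.52  # weighted average of correlations, as reported

# ASSUMPTION: g loading estimated as the square root of the weighted average.
estimated_g_loading = math.sqrt(weighted_average_r)

print(f"{estimated_g_loading:.2f}")
```

Rounded to two decimals this gives 0.72, the reported estimated minimum g loading.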

National averages

Country    # scores    Average score

Correlation with personal details

Personal detail                    n      r
PSIA True                          8   0.75
PSIA Ethics                       10   0.59
Educational level                 13   0.53
PSIA Extreme                       8   0.50
PSIA Rare                          8   0.41
PSIA Rational                      8   0.33
Father's educational level        12   0.24
PSIA Aspergoid                     8   0.22
PSIA Deviance                     10   0.21
PSIA Introverted                   8   0.16
PSIA Orderly                       8   0.15
PSIA Neurotic                      8  -0.09
Mother's educational level        13  -0.15
Disorders (parents and siblings)  13  -0.19
PSIA Antisocial                    8  -0.41
PSIA Cold                          8  -0.44
Year of birth                     13  -0.45
PSIA Cruel                         8  -0.48
Disorders (own)                   13  -0.66
PSIA Just                          8  -0.82

Correlation with national IQs

Correlation of this test with national average IQs published by Lynn and Vanhanen:

Checking the height of the KIT scores

To see how the KIT's norming method works, the reported GT scores are equated to IQs using scores on other tests. The GT scale has a high-range mean and SD of 50 and 10, and the KIT's GTs are computed using a method discovered by me, described in the article The Golden Standard of Intelligence. This method does NOT use scores on other tests. So it is interesting to see how its results compare to hypothetical norms based on scores on other tests.

The IQs below are not meant as norms, but serve to check the working of the new method. Used are scores from other tests correlating .43 and higher with the KIT. The normal method is used (equating ranks of KIT GTs and IQs on other tests) [and the IQs were converted to protonorms using the initial protonorm formula in March 2008].
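The rank-equating step mentioned above can be sketched as follows. This is a simplified illustration and not the procedure actually used for this report: the data are HYPOTHETICAL, the function name is invented, both lists are assumed to be equally long, and tied GTs are given the mean of the IQs at their ranks. Each KIT GT is paired with the IQ that holds the same rank among the scores on other tests.

```python
def equate_by_rank(kit_gts: list[float], other_iqs: list[float]) -> dict[float, float]:
    """Map each GT to the mean of the IQs occupying the same rank(s)."""
    if len(kit_gts) != len(other_iqs):
        raise ValueError("this sketch assumes equally long score lists")
    by_gt: dict[float, list[float]] = {}
    # Sort both lists so that equal positions correspond to equal ranks.
    for gt, iq in zip(sorted(kit_gts), sorted(other_iqs)):
        by_gt.setdefault(gt, []).append(iq)
    # Tied GTs receive the mean of the IQs at their ranks.
    return {gt: sum(iqs) / len(iqs) for gt, iqs in by_gt.items()}


# HYPOTHETICAL data for illustration only.
kit_gts = [48, 53, 53, 55, 60]
other_iqs = [131, 140, 136, 150, 144]

for gt, iq in equate_by_rank(kit_gts, other_iqs).items():
    print(f"KIT GT {gt} -> IQ {iq:.1f}")
```

Comparing IQs obtained this way with the IQs that the GT scale itself implies is the kind of check this section describes.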


Although this is based on only 13 scores, it is worrying that the IQs, especially below KIT 55, are much lower than the IQs that ACTUALLY correspond to those GTs when the GTs are computed as explained at the table with IQs, centiles etcetera. For the moment I conclude that this (the KIT's) norming method does not suffice when applied in this manner. To avoid confusion: the height of the norms is unrelated to the standard error of measurement discussed at the top of this report.