The Nemesis Test - Statistics

© Feb. 2014 Paul Cooijmans


Scores on The Nemesis Test

Contents type: Verbal, numerical, spatial, logical.   Period: 1997-present

0 ********
1 ***
2 ****
3 ****
4 **
4.5 *
6 *
7 *
9 *
10.5 *
12 *
14 *
16 **
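
The tally above (raw score, then one asterisk per candidate) can be summarised with a few lines of Python. This is only a reader's sketch: the dictionary is transcribed by hand from the list, and the variable names are illustrative, not part of the original report.

```python
from statistics import median, multimode

# Tally transcribed from the score list above: raw score -> number of candidates.
tally = {
    0: 8, 1: 3, 2: 4, 3: 4, 4: 2, 4.5: 1, 6: 1,
    7: 1, 9: 1, 10.5: 1, 12: 1, 14: 1, 16: 2,
}

# Expand the tally into one entry per candidate.
scores = [score for score, count in tally.items() for _ in range(count)]

n = len(scores)           # number of scores in the tally
mode = multimode(scores)  # most frequent raw score(s)
med = median(scores)      # middle of the sorted scores

print(n, mode, med)  # prints: 30 [0] 2.5
```

The mode of zero agrees with the remark further down that a raw score mode of zero is typical for a test of this nature.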

Correlation of The Nemesis Test with other tests by Paul Cooijmans

(Test index) Test name   n   r
(48) Narcissus' last stand   4   0.94
(85) Cooijmans Intelligence Test - Form 1   4   0.93
(35) Intelligence Quantifier by assessment   16   0.93
(56) Short Test For Genius   5   0.89
(79) Association subtest of Long Test For Genius   6   0.86
(63) Long Test For Genius   6   0.86
(69) Odds   4   0.86
(57) Space, Time, and Hyperspace   8   0.84
(54) Test of Shock and Awe   5   0.83
(4) A Paranoiac's Torture: Intelligence Test Utilizing Diabolic Exactitude   8   0.82
(36) Reflections In Peroxide   4   0.81
(3) Qoymans Multiple-Choice #5   8   0.79
(77) Analogies #1   4   0.79
(42) The Marathon Test   7   0.78
(26) Verbal section of Test For Genius - Revision 2004   13   0.77
(21) Psychometric Qrosswords   5   0.77
(45) Numerical and spatial sections of The Marathon Test   7   0.77
(87) Cooijmans Intelligence Test - Form 2   9   0.76
(0) Test of the Beheaded Man   10   0.74
(31) Numerical section of The Marathon Test   7   0.74
(66) Test For Genius - Revision 2004   11   0.73
(25) The Sargasso Test   10   0.72
(11) Isis Test   14   0.70
(75) Analogies of Long Test For Genius   6   0.69
(10) Genius Association Test   8   0.69
(30) Verbal section of The Marathon Test   8   0.68
(7) The Final Test   16   0.68
(40) Reason Behind Multiple-Choice - Revision 2008   8   0.68
(16) Lieshout International Mesospheric Intelligence Test   9   0.67
(32) Spatial section of The Marathon Test   7   0.67
(2) Cooijmans Intelligence Test - Form 3   7   0.66
(68) Numbers   7   0.66
(1) Cartoons of Shock   11   0.64
(28) The Test To End All Tests   15   0.63
(27) Spatial section of Test For Genius - Revision 2004   12   0.61
(82) Reason   6   0.61
(24) Reason - Revision 2008   8   0.47
(44) Associative LIMIT   6   0.45
(80) Qoymans Multiple-Choice #4   9   0.40
(5) Daedalus Test   5   0.39
(84) Bonsai Test   4   0.33
(62) Reason Behind Multiple-Choice   6   0.32
(29) Words   4   -0.41

Weighted average of correlations: 0.690

Conservatively estimated minimum g loading: 0.83

Ranking in the above table is based on the unrounded correlations. All available data is present in this table; no tests are left out except for those with fewer than 4 score pairs. All known pairs are used to obtain the true, honest statistics; correlations have not been artificially inflated by leaving out ceiling scores, outliers or other anomalies.

Correlation of The Nemesis Test with tests by others

(Test index) Test name   n   r
(239) Titan Test   4   0.99
(231) Mysterium Entrance Exam   4   0.82
(220) Cattell Culture Fair   4   0.70
(240) Strict Logic Spatial Exam 48   4   0.66
(211) Culture Fair Numerical Spatial Examination - Final version   5   0.63
(225) Logima Strictica 36   7   0.24
(243) Scholastic Aptitude Test (old)   4   0.05

Weighted average of correlations: 0.553

Ranking in the above table is based on the unrounded correlations. All available data is present in this table; no tests are left out except for those with fewer than 4 score pairs. All known pairs are used to obtain the true, honest statistics; correlations have not been artificially inflated by leaving out ceiling scores, outliers or other anomalies.

Please be aware that correlations with these external tests are in most cases affected (depressed, typically) by one or more of the following: (1) Little overlap with the object test because of the much lower ceilings and inherent ceiling effects of the tests used in regular psychology; (2) Candidates reporting scores selectively, for instance only the higher ones while withholding lower ones; (3) Candidates reporting, or having been reported by psychometricians, incorrect scores.
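
As a cross-check, the two weighted averages reported above (0.690 and 0.553) can be approximately reproduced from the published n and r values. The sketch below rests on two assumptions that this page does not state: that each correlation is weighted by its number of score pairs n, and that the "conservatively estimated minimum g loading" might be the square root of the weighted average. The function name `weighted_mean_r` is illustrative only.

```python
from math import sqrt

def weighted_mean_r(pairs):
    """Average the correlations r, weighting each by its number of score pairs n.

    ASSUMPTION: weighting by n is not stated in the source.
    """
    total_n = sum(n for n, _ in pairs)
    return sum(n * r for n, r in pairs) / total_n

# (n, r) pairs transcribed in table order: tests by Paul Cooijmans.
own_tests = [
    (4, 0.94), (4, 0.93), (16, 0.93), (5, 0.89), (6, 0.86), (6, 0.86),
    (4, 0.86), (8, 0.84), (5, 0.83), (8, 0.82), (4, 0.81), (8, 0.79),
    (4, 0.79), (7, 0.78), (13, 0.77), (5, 0.77), (7, 0.77), (9, 0.76),
    (10, 0.74), (7, 0.74), (11, 0.73), (10, 0.72), (14, 0.70), (6, 0.69),
    (8, 0.69), (8, 0.68), (16, 0.68), (8, 0.68), (9, 0.67), (7, 0.67),
    (7, 0.66), (7, 0.66), (11, 0.64), (15, 0.63), (12, 0.61), (6, 0.61),
    (8, 0.47), (6, 0.45), (9, 0.40), (5, 0.39), (4, 0.33), (6, 0.32),
    (4, -0.41),
]

# (n, r) pairs transcribed in table order: tests by others.
external_tests = [
    (4, 0.99), (4, 0.82), (4, 0.70), (4, 0.66), (5, 0.63), (7, 0.24), (4, 0.05),
]

own_n = sum(n for n, _ in own_tests)
own_avg = weighted_mean_r(own_tests)
ext_avg = weighted_mean_r(external_tests)

print(own_n)                                 # prints: 337
print(round(own_avg, 3), round(ext_avg, 3))  # prints: 0.689 0.553

# ASSUMPTION: square root of the reported weighted average as a g loading estimate.
print(round(sqrt(0.690), 2))                 # prints: 0.83
```

The small gap between 0.689 and the reported 0.690 would be expected if the report averages unrounded correlations, whereas the table shows them to two decimals. The total of 337 score pairs matches the n given for the full-range g loading further down.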

Estimated loadings of The Nemesis Test on particular item types

These are estimated g factor loadings, but against homogeneous tests containing only particular item types, as opposed to non-compound heterogeneous tests. Although tending to surprise the lay person, it is not uncommon for tests to have high loadings on item types they do not actually contain themselves. Such loadings reflect the empirical fact that most tests for mental abilities measure primarily g, regardless of their contents; that the major part of test score variance is caused by g, and only a minor part by factors germane to particular item types. It is of key importance to understand that this is a fact of nature, a natural phenomenon, and not something that was built into the tests by the test constructors.

Type   g loading of The Nemesis Test on that type

Compound tests have been left out of this table to avoid overlap.

Balanced g loading = 0.81

National medians for The Nemesis Test

Country n median score

For reasons of privacy, only countries with 2 or more candidates are included in this table. Ranking is by median, then alphabetical.

Correlation of The Nemesis Test with personal details

Personalia   n   r
P.S.I.A. System factor   5   0.88
Observed associative horizon   7   0.83
P.S.I.A. Cruel   10   0.63
P.S.I.A. Antisocial   10   0.61
Observed behaviour   9   0.57
Educational level   25   0.39
P.S.I.A. Cold   10   0.38
P.S.I.A. Rare   10   0.29
P.S.I.A. Neurotic   10   0.18
P.S.I.A. Rational   10   0.18
P.S.I.A. Aspergoid   10   0.16
P.S.I.A. Introverted   10   0.13
P.S.I.A. Just   10   0.11
Mother's educational level   24   0.10
P.S.I.A. Deviance factor   12   0.03
Father's educational level   24   -0.01
Year of birth   30   -0.14
Disorders (parents and siblings)   24   -0.37
Gifted Adult's Inventory of Aspergerisms   6   -0.37
P.S.I.A. Orderly   10   -0.38
P.S.I.A. Extreme   10   -0.43
Disorders (own)   24   -0.46
P.S.I.A. True   10   -0.46
P.S.I.A. Ethics factor   12   -0.56

Correlation with national I.Q.'s of The Nemesis Test

Correlation of this test with national average I.Q.'s published by Lynn and Vanhanen:

Estimated g factor loadings upward and downward of particular scores

The number of score pairs on which each estimated g factor loading is based is given in parentheses. The goal is to test the hypothesis that g becomes less important, that is, accounts for a smaller proportion of the variance, at higher I.Q. levels. Restricting the range in this way by itself also depresses the g loading compared with computing it over the test's full range, so it would be normal for both values to be lower than the test's full-range g loading.

Raw score   Upward g (n)   Downward g (n)
0   0.83 (337)   NaN (0)
2.5   0.79 (168)   0.67 (43)
5.5   0.74 (25)   0.75 (223)
22   NaN (0)   0.83 (337)
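
The point made above, that restricting the range by itself lowers a correlation, can be illustrated with simulated data. The sketch below is not the computation behind the table: it uses made-up, normally distributed pairs with a hypothetical full-range correlation of 0.8 (`RHO`), splits them at the middle of one variable, and compares the correlation in each half with the full-range value.

```python
import random
from math import sqrt

def pearson_r(xs, ys):
    """Plain Pearson correlation of two equally long sequences."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy / sqrt(sxx * syy)

random.seed(0)  # fixed seed so the run is repeatable
RHO = 0.8       # HYPOTHETICAL full-range correlation, not a value from this report

# Simulated score pairs (x, y) whose population correlation is RHO.
pairs = []
for _ in range(20000):
    x = random.gauss(0, 1)
    y = RHO * x + sqrt(1 - RHO ** 2) * random.gauss(0, 1)
    pairs.append((x, y))

upper = [(x, y) for x, y in pairs if x >= 0]  # "upward" of the split point
lower = [(x, y) for x, y in pairs if x < 0]   # "downward" of the split point

r_full = pearson_r(*zip(*pairs))
r_up = pearson_r(*zip(*upper))
r_down = pearson_r(*zip(*lower))

print(round(r_full, 2), round(r_up, 2), round(r_down, 2))
```

In this simulation both half-range correlations come out clearly below the full-range one, even though nothing about the relation between the two variables differs between the halves. Any further difference between the upward and downward values in the real table would have to be judged against that baseline.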


Remark: For a test with only 22 items, these reliabilities are very high. This is because, at the level of item analysis, almost all of the items turn out to be hard to very hard and to have high to very high correlations with total score. This illustrates that it is possible for a test of about this size to have sufficient reliability for use as a stand-alone test, but only if the items are of exceptional quality. A raw score mode of zero is typical for a test of this nature.


Scores by age

Age class   n   median score
50 to 54   2   0.5
45 to 49   3   4.0
40 to 44   5   6.0
35 to 39   4   6.3
30 to 34   3   3.0
25 to 29   5   2.0
22 to 24   5   1.0
20 or 21   1   2.0
18 or 19   1   1.0

Scores by year taken

Year taken n median score

r (year taken × median score) = -0.25 (n = 30)

Remark: The high scores of 2007 are probably related to the test's being used for the Prize of the Beheaded Man then.

Robustness and overall test quality

Item analysis

Item statistics are not published as that would help future candidates. To detect bad items, answers and comments from candidates are studied, as well as, for each problem, the correlation with total score and the proportion of candidates getting it wrong (hardness of the item). Possible bad items are removed or revised, resulting in a revised version of the test.
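
The two per-item statistics mentioned above, correlation with total score and proportion of candidates getting the item wrong, can be sketched as follows. The response matrix is invented for illustration (the real item data are not published), the item-total correlation is the simple uncorrected one, and the flagging threshold `MIN_R` is a hypothetical choice rather than the author's criterion.

```python
from math import sqrt

def pearson_r(xs, ys):
    """Plain Pearson correlation; returns NaN if either sequence is constant."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    if sxx == 0 or syy == 0:
        return float("nan")
    return sxy / sqrt(sxx * syy)

# INVENTED data: one row per candidate, one column per item, 1 = right, 0 = wrong.
responses = [
    [1, 1, 1, 0, 0],
    [1, 1, 0, 0, 0],
    [1, 0, 0, 0, 1],
    [1, 1, 1, 1, 0],
    [0, 0, 0, 0, 1],
    [1, 0, 1, 0, 0],
]

totals = [sum(row) for row in responses]    # total score per candidate
columns = [list(col) for col in zip(*responses)]

# Hardness: proportion of candidates getting the item wrong.
hardness = [col.count(0) / len(col) for col in columns]

# Correlation of each item with total score (uncorrected: the item is part of the total).
item_total_r = [pearson_r(col, totals) for col in columns]

MIN_R = 0.2  # HYPOTHETICAL cut-off for a suspicious item
flagged = [i + 1 for i, r in enumerate(item_total_r) if r < MIN_R]

for i, (h, r) in enumerate(zip(hardness, item_total_r), start=1):
    print(f"item {i}: hardness {h:.2f}, r with total {r:.2f}")
print("possible bad items:", flagged)  # prints: possible bad items: [5]
```

In this toy data the fifth item is solved mainly by the lowest scorers, so it correlates negatively with total score and is the only one flagged. As the text notes, such a statistical flag is only a starting point; candidates' answers and comments are studied as well before an item is removed or revised.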