The Nemesis Test - Statistics

Scores on The Nemesis Test as of 31 October 2022

Contents type: Verbal, numerical, spatial, logical. Period: 1997-present

n: 64
Median: 2.0
Quartile deviation: 1.5
Range: 17
Maximum possible: 22
Male median: 2.0 (n = 60)
Female median: 0.5 (n = 4)
Resolution: 0.25

0	****************
1	**************
1.5	*
2	**********
3	*******
4	*****
4.5	*
6	**
7	*
8	*
9	*
10.5	*
12	*
14	*
16	**

Correlation of The Nemesis Test with other mental ability tests

(Test index) Test name	n	r
(239) Titan Test (Ronald K. Hoeflin)	4	0.99
(35) Only idiots	5	0.94
(85) Cooijmans Intelligence Test - Form 1	4	0.93
(56) Short Test For Genius	5	0.89
(258) Tests by Mislav Predavec (aggregate)	4	0.87
(79) Association subtest of Long Test For Genius	6	0.86
(63) Long Test For Genius	6	0.86
(69) Odds	4	0.86
(113) The Piper's Test	10	0.86
(46) Labyrinthine LIMIT	6	0.85
(114) Dicing with death	8	0.85
(57) Space, Time, and Hyperspace	8	0.84
(107) The Alchemist Test	10	0.80
(201) Wechsler Adult Intelligence Scales	5	0.79
(111) Test For Genius - Revision 2016	17	0.78
(117) The Hammer Of Test-Hungry - Revision 2013	7	0.78
(118) Divine Psychometry	6	0.78
(108) Verbal section of Test For Genius - Revision 2016	17	0.77
(77) Analogies #1	5	0.77
(104) The Final Test - Revision 2013	8	0.77
(87) Cooijmans Intelligence Test - Form 2	9	0.76
(26) Verbal section of Test For Genius - Revision 2004	17	0.76
(48) Narcissus' last stand	14	0.73
(66) Test For Genius - Revision 2004	11	0.73
(54) Test of Shock and Awe	6	0.73
(19) Numerical section of Test For Genius - Revision 2010	25	0.73
(39) Combined Numerical and Spatial sections of Test For Genius - Revision 2010	12	0.72
(42) The Marathon Test	15	0.72
(32) Spatial section of The Marathon Test	19	0.72
(43) Test For Genius - Revision 2010	9	0.71
(4) A Paranoiac's Torture: Intelligence Test Utilizing Diabolic Exactitude	22	0.71
(240) Strict Logic Spatial Exam 48 (Jonathan Wai)	5	0.70
(45) Numerical and spatial sections of The Marathon Test	19	0.70
(0) Test of the Beheaded Man	21	0.70
(11) Isis Test	30	0.70
(1) Cartoons of Shock	13	0.69
(75) Analogies of Long Test For Genius	6	0.69
(103) Problems In Gentle Slopes of the second degree	15	0.68
(7) The Final Test	16	0.68
(36) Reflections In Peroxide	19	0.68
(68) Numbers	7	0.66
(106) Cooijmans Intelligence Test - Form 4	21	0.65
(21) Psychometric Qrosswords	9	0.64
(44) Associative LIMIT	20	0.64
(28) The Test To End All Tests	24	0.63
(31) Numerical section of The Marathon Test	19	0.63
(211) Culture Fair Numerical Spatial Examination - Final version (Etienne Forsström)	5	0.63
(27) Spatial section of Test For Genius - Revision 2004	18	0.63
(112) Combined Numerical and Spatial sections of Test For Genius - Revision 2016	18	0.63
(30) Verbal section of The Marathon Test	16	0.62
(33) Problems In Gentle Slopes of the first degree	10	0.62
(10) Genius Association Test	22	0.62
(12) Cooijmans On-Line Test - Two-barrelled version	10	0.62
(82) Reason	6	0.61
(16) Lieshout International Mesospheric Intelligence Test	26	0.59
(110) Cooijmans Intelligence Test 5	22	0.59
(220) Cattell Culture Fair	6	0.59
(109) The Bonsai Test - Revision 2016	17	0.58
(23) Gliaweb Riddled Intelligence Test - Revision 2011	14	0.58
(3) Qoymans Multiple-Choice #5	26	0.56
(47) Psychometrically Activated Grids Acerbate Neuroticism	7	0.55
(105) Space, Time, and Hyperspace - Revision 2016	19	0.54
(25) The Sargasso Test	22	0.54
(40) Reason Behind Multiple-Choice - Revision 2008	25	0.52
(5) Daedalus Test	12	0.51
(2) Cooijmans Intelligence Test - Form 3	26	0.51
(80) Qoymans Multiple-Choice #4	10	0.48
(234) Strict Logic Sequences Exam I (Jonathan Wai)	5	0.47
(37) Problems In Gentle Slopes of the third degree	15	0.46
(41) The LAW - Letters And Words	6	0.45
(15) Letters	6	0.44
(260) Tests by Nikolaos Soulios (aggregate)	5	0.43
(84) Bonsai Test	4	0.33
(242) Unknown and miscellaneous tests	16	0.32
(62) Reason Behind Multiple-Choice	6	0.32
(215) Tests by Jason Betts (aggregate)	5	0.31
(24) Reason - Revision 2008	25	0.30
(223) Strict Logic Sequences Exam II (Jonathan Wai)	4	0.27
(225) Logima Strictica 36 (Robert Lato)	8	0.25
(29) Words	7	0.18
(216) Tests by Ivan Ivec (aggregate)	5	0.18
(231) Tests by Greg Grove (aggregate)	5	0.10
(243) Scholastic Aptitude Test (old)	4	0.05

Weighted average of correlations: 0.625 (N = 1011, weighted sum = 631.87)

Conservatively estimated minimum g loading: 0.79

Ranking in above table is based on the unrounded correlations. All available data is present in this table, no tests are left out except for those with less than 4 score pairs. All known pairs are used, including possible floor/ceiling scores or outliers.

Estimated loadings of The Nemesis Test on particular item types

These are estimated g factor loadings, but against homogeneous tests (containing only particular item types) as opposed to non-compound heterogeneous tests. Although tending to surprise the lay person, it is not uncommon for tests to have high loadings on item types they do not actually contain themselves. Such loadings reflect the empirical fact that most tests for mental abilities measure primarily g, regardless of their contents; that the major part of test score variance is caused by g, and only a minor part by factors germane to particular item types. It is of key importance to understand that this is a fact of nature, a natural phenomenon, and not something that was built into the tests by the test constructors.

Type	n	g loading of The Nemesis Test on that type
Verbal	201	0.79
Numerical	60	0.82
Spatial	95	0.80
Logical	43	0.63
Heterogeneous	352	0.81

N = 751

Compound tests have been left out of this table to avoid overlap.

Balanced g loading = 0.77

National medians for The Nemesis Test

Country	n	median score
Spain	3	8.0
Canada	2	4.5
Netherlands	5	3.0
Turkey	2	2.5
United_States	15	2.0
Italy	4	1.8
Korea_South	8	1.5
Greece	5	1.0
Finland	6	0.5
Germany	3	0.0

For reasons of privacy, only countries with 2 or more candidates are included in this table. Ranking is based on the medians, and then alphabetic.

Correlation with national I.Q.'s of The Nemesis Test

Correlation of this test with national average I.Q.'s published by Lynn and Vanhanen:

r = 0.08 (n = 63)

Correlation of The Nemesis Test with personal details

Personalia	n	r
Observed associative horizon	8	0.87
Observed behaviour	16	0.54
P.S.I.A. True - Revision 2007	13	0.42
P.S.I.A. Ethics factor - Revision 2007	13	0.38
P.S.I.A. Orderly - Revision 2007	13	0.38
Educational level	59	0.37
P.S.I.A. Rational - Revision 2007	13	0.36
P.S.I.A. Neurotic - Revision 2007	13	0.23
Sex	64	0.16
P.S.I.A. System factor - Revision 2007	13	0.14
P.S.I.A. Cold - Revision 2007	13	0.13
P.S.I.A. Introverted - Revision 2007	13	0.05
P.S.I.A. Antisocial - Revision 2007	13	-0.02
Mother's educational level	53	-0.06
P.S.I.A. Extreme - Revision 2007	13	-0.07
Father's educational level	53	-0.08
P.S.I.A. Deviance factor - Revision 2007	13	-0.09
P.S.I.A. Just - Revision 2007	13	-0.14
P.S.I.A. Aspergoid - Revision 2007	13	-0.15
Disorders (parents and siblings)	57	-0.15
P.S.I.A. Cruel - Revision 2007	13	-0.17
Disorders (own)	58	-0.20
Year of birth	64	-0.24
P.S.I.A. Rare - Revision 2007	13	-0.25
Gifted Adult's Inventory of Aspergerisms	13	-0.50
Cooijmans Inventory of Neo-Marxist Attitudes	6	-0.56

Estimated g factor loadings upward and downward of particular scores

In parentheses the number of score pairs on which that estimated g factor loading is based. The goal of this is to verify the hypothesis that g becomes less important, accounts for a smaller proportion of the variance, at higher I.Q. levels. The mere fact of restricting the range like this also depresses the g loading compared to computing it over the test's full range, so it would be normal for both values to be lower than the test's full-range g loading.

Raw score	Upward g (N)	Downward g (N)
0	0.79 (1011)	NaN (0)
0.5	0.75 (660)	NaN (0)
2	0.71 (466)	0.54 (483)
3.5	0.66 (172)	0.70 (704)
5	0.54 (29)	0.75 (838)
6.5	0.52 (23)	0.75 (849)
8	0.62 (18)	0.79 (894)

Reliability

Split-half (odd-even) = 0.84
Split-half (1, 2… 5, 6… vs. 3, 4… 7, 8…) = 0.92
Cronbach's alpha = 0.89

This is an example of a relatively short test that nevertheless has enough reliability to serve as a standalone test. This is a result of it being composed of almost exclusively very difficult items of extremely high quality. In such a case, you need fewer items to achieve high reliability than with easier and/or mediocre items. A raw score mode of zero is typical of such tests.

Error

Standard error = 1.3 raw score points

Scores by age

Age class	n	median score
65 to 69	1	2.0
60 to 64	1	0.0
50 to 54	3	1.0
45 to 49	5	4.0
40 to 44	7	6.0
35 to 39	7	2.0
30 to 34	10	1.5
25 to 29	15	1.0
22 to 24	7	3.0
20 or 21	2	1.5
18 or 19	4	1.0
17	1	1.0
16	1	3.0

N = 64

Scores by year taken

Year taken	n	median score
1997	1	0.0
1998	1	10.5
2001	1	6.0
2002	2	2.5
2005	5	3.0
2006	2	0.0
2007	5	14.0
2008	3	3.0
2009	1	3.0
2010	1	9.0
2011	1	0.0
2012	5	2.0
2013	1	0.0
2014	4	0.5
2015	2	4.0
2016	1	1.0
2017	5	1.0
2018	5	1.0
2019	2	2.0
2020	5	1.0
2021	8	2.5
2022	3	1.5

r_{year taken × median score} = -0.37 (N = 64)

Robustness and overall test quality

Robustness by month = 0.80 (r_{raw scores × months} = -0.29)
Quality = 0.769

Item analysis

Item statistics are not published as that would help candidates. To detect bad items, answers and comments from candidates are studied, as well as, for each problem, the correlation with total score on the remaining problems (item-rest correlation) and the proportion of candidates getting it wrong (hardness of the item). Possible bad items are revised, replaced, or removed, possibly resulting in a revised version of the test.

[To main page]