Contents type: Logical. Period: 1999-present
-1 | ******* |
0 | *********** |
132 | * |
204 | * |
206 | ** |
208 | * |
211 | * |
220 | * |
221 | *** |
222 | ** |
(Test index) Test name | n | r |
---|---|---|
(33) Problems In Gentle Slopes of the first degree | 4 | 0.93 |
(47) Psychometrically Activated Grids Acerbate Neuroticism | 7 | 0.92 |
(68) Numbers | 4 | 0.89 |
(106) Cooijmans Intelligence Test - Form 4 | 5 | 0.89 |
(46) Labyrinthine LIMIT | 11 | 0.88 |
(104) The Final Test - Revision 2013 | 4 | 0.88 |
(111) Test For Genius - Revision 2016 | 4 | 0.86 |
(107) The Alchemist Test | 4 | 0.70 |
(43) Test For Genius - Revision 2010 | 5 | 0.70 |
(108) Verbal section of Test For Genius - Revision 2016 | 4 | 0.68 |
(21) Psychometric Qrosswords | 7 | 0.67 |
(42) The Marathon Test | 6 | 0.66 |
(2) Cooijmans Intelligence Test - Form 3 | 8 | 0.64 |
(103) Problems In Gentle Slopes of the second degree | 7 | 0.63 |
(35) Intelligence Quantifier by assessment | 9 | 0.61 |
(30) Verbal section of The Marathon Test | 6 | 0.61 |
(1) Cartoons of Shock | 5 | 0.58 |
(18) The Nemesis Test | 7 | 0.55 |
(39) Combined Numerical and Spatial sections of Test For Genius - Revision 2010 | 7 | 0.53 |
(12) Cooijmans On-Line Test - Two-barrelled version | 8 | 0.53 |
(31) Numerical section of The Marathon Test | 8 | 0.53 |
(11) Isis Test | 9 | 0.50 |
(48) Narcissus' last stand | 6 | 0.49 |
(45) Numerical and spatial sections of The Marathon Test | 8 | 0.46 |
(44) Associative LIMIT | 5 | 0.42 |
(4) A Paranoiac's Torture: Intelligence Test Utilizing Diabolic Exactitude | 7 | 0.39 |
(25) The Sargasso Test | 6 | 0.39 |
(36) Reflections In Peroxide | 7 | 0.38 |
(19) Numerical section of Test For Genius - Revision 2010 | 11 | 0.38 |
(32) Spatial section of The Marathon Test | 8 | 0.37 |
(0) Test of the Beheaded Man | 9 | 0.35 |
(7) The Final Test | 6 | 0.33 |
(28) The Test To End All Tests | 9 | 0.32 |
(112) Combined Numerical and Spatial sections of Test For Genius - Revision 2016 | 5 | 0.30 |
(3) Qoymans Multiple-Choice #5 | 9 | 0.27 |
(10) Genius Association Test | 5 | 0.27 |
(105) Space, Time, and Hyperspace - Revision 2016 | 6 | 0.19 |
(16) Lieshout International Mesospheric Intelligence Test | 13 | 0.10 |
(26) Verbal section of Test For Genius - Revision 2004 | 8 | 0.09 |
(27) Spatial section of Test For Genius - Revision 2004 | 8 | 0.05 |
(40) Reason Behind Multiple-Choice - Revision 2008 | 9 | 0.05 |
(66) Test For Genius - Revision 2004 | 4 | 0.04 |
(57) Space, Time, and Hyperspace | 4 | -0.11 |
(24) Reason - Revision 2008 | 9 | -0.12 |
Weighted average of correlations: 0.445
Conservatively estimated minimum g loading: 0.67
Ranking in above table is based on the unrounded correlations. All available data is present in this table, no tests are left out except for those with less than 4 score pairs. All known pairs are used, including possible floor/ceiling scores or outliers.
Remark: The correlations with other tests, and therewith the g loading, are depressed by the fact that this test, because of its unique nature, effectively works as a single item. Single items have limited reliability (which is why tests normally have several dozens of items at least; the error of individual items is then averaged out and their true score components are piled up) and reliability determines the upper limit of a test's (significant) correlation with any other variable. Reliability and g loading of this test are lower than those of regular I.Q. tests.
As some have pointed out, the reliability and g loading of the test could be raised by letting candidates send in multiple solutions at once, and reporting on the highest scoring solution among those. However, that would give candidates feedback on the correctness or falseness of the various solutions, and thus endanger the security of the test. The current policy will remain effective therefore: only one solution is allowed, and the possible lower reliability that results from thus limiting candidates is to be considered a price paid to keep the test's solutions secret.
(Test index) Test name | n | r |
---|---|---|
(242) Unknown and miscellaneous tests | 9 | -0.19 |
(225) Logima Strictica 36 | 4 | -0.23 |
(234) Strict Logic Sequences Exam I | 4 | -0.38 |
Weighted average of correlations: -0.244
Ranking in above table is based on the unrounded correlations. All available data is present in this table, no tests are left out except for those with less than 4 score pairs. All known pairs are used, including possible floor/ceiling scores or outliers.
Please be aware that correlations with these external tests are in most cases affected (depressed, typically) by one or more of the following: (1) Little overlap with the object test because of the much lower ceilings and inherent ceiling effects of the tests used in regular psychology; (2) Candidates reporting scores selectively, for instance only the higher ones while withholding lower ones; (3) Candidates reporting, or having been reported by psychometricians, incorrect scores.
These are estimated g factor loadings, but against homogeneous tests (containing only particular item types) as opposed to non-compound heterogeneous tests. Although tending to surprise the lay person, it is not uncommon for tests to have high loadings on item types they do not actually contain themselves. Such loadings reflect the empirical fact that most tests for mental abilities measure primarily g, regardless of their contents; that the major part of test score variance is caused by g, and only a minor part by factors germane to particular item types. It is of key importance to understand that this is a fact of nature, a natural phenomenon, and not something that was built into the tests by the test constructors.
Type | g loading of Daedalus Test on that type |
---|---|
Verbal | 0.57 |
Numerical | 0.68 |
Spatial | 0.37 |
Logical | 0.18 |
Heterogeneous | 0.73 |
Compound tests have been left out of this table to avoid overlap.
Balanced g loading = 0.50
Country | n | mean score |
---|---|---|
Sweden | 2 | 111.00 |
Italy | 2 | 110.00 |
Germany | 2 | 104.00 |
Spain | 2 | 65.50 |
United_States | 4 | 55.25 |
China | 2 | -1.00 |
For reasons of privacy, only countries with 2 or more candidates are included in this table. Ranking is based on the unrounded means, and then alphabetic.
Correlation of this test with national average I.Q.'s published by Lynn and Vanhanen:
Personalia | n | r |
---|---|---|
P.S.I.A. Neurotic | 3 | 0.98 |
P.S.I.A. Antisocial | 3 | 0.93 |
P.S.I.A. Cruel | 3 | 0.87 |
P.S.I.A. System factor | 3 | 0.79 |
Observed behaviour | 8 | 0.68 |
Educational level | 20 | 0.57 |
P.S.I.A. Cold | 3 | 0.55 |
P.S.I.A. Rare | 3 | 0.50 |
P.S.I.A. Just | 3 | 0.37 |
Candidate's self-estimated I.Q. | 5 | 0.22 |
Sex | 30 | 0.15 |
Observed associative horizon | 5 | 0.07 |
P.S.I.A. Ethics factor | 4 | 0.04 |
P.S.I.A. Introverted | 3 | 0.01 |
Mother's educational level | 20 | -0.03 |
P.S.I.A. True | 3 | -0.18 |
P.S.I.A. Deviance factor | 4 | -0.18 |
Year of birth | 27 | -0.27 |
Disorders (parents and siblings) | 19 | -0.34 |
P.S.I.A. Aspergoid | 3 | -0.35 |
P.S.I.A. Extreme | 3 | -0.43 |
Disorders (own) | 19 | -0.45 |
Father's educational level | 20 | -0.47 |
P.S.I.A. Orderly | 3 | -0.61 |
P.S.I.A. Rational | 3 | -0.85 |
Gifted Adult's Inventory of Aspergerisms | 3 | -0.87 |
In parentheses the number of score pairs on which that estimated g factor loading is based. The goal of this is to verify the hypothesis that g becomes less important, accounts for a smaller proportion of the variance, at higher I.Q. levels. The mere fact of restricting the range like this also depresses the g loading compared to computing it over the test's full range, so it would be normal for both values to be lower than the test's full-range g loading.
Raw score | Upward g (n) | Downward g (n) |
---|---|---|
-1 | 0.65 (330) | NaN (0) |
0 | 0.48 (208) | 0.69 (205) |
104 | -0.36 (88) | 0.69 (205) |
135 | 0.74 (54) | 0.67 (229) |
208 | 0.79 (48) | 0.61 (251) |
215 | 0.53 (14) | 0.61 (253) |
220 | 0.53 (14) | 0.61 (253) |
221 | 0.53 (14) | 0.67 (321) |
222 | NaN (0) | 0.65 (330) |
This table illustrates that the test tends to work as a single item in that it separates the candidates largely into two groups; those who succeed and those who fail, with some discrimination within each group and some discrimination between the groups.
Due to the fact that this test, effectively, consist of only one item, its reliability can not be computed from internal statistics as with multi-item tests. A single item will normally have much lower reliability than a multi-item test. Considering the apparent g loading of the test, the reliability should be at least about .7.
Age class | n | mean score |
---|---|---|
60 to 64 | 1 | 220.00 |
50 to 54 | 1 | -1.00 |
45 to 49 | 3 | 43.67 |
40 to 44 | 3 | 137.33 |
35 to 39 | 2 | -0.50 |
30 to 34 | 2 | 110.50 |
25 to 29 | 5 | 88.40 |
22 to 24 | 3 | 73.33 |
20 or 21 | 1 | -1.00 |
18 or 19 | 3 | 68.67 |
17 | 2 | 105.50 |
16 | 1 | 0.00 |
Year taken | n | mean score |
---|---|---|
2001 | 5 | 127.80 |
2002 | 5 | 129.80 |
2004 | 3 | 0.00 |
2008 | 1 | 0.00 |
2009 | 2 | 220.50 |
2011 | 1 | 0.00 |
2012 | 2 | 176.50 |
2013 | 2 | -1.00 |
2014 | 1 | 208.00 |
2015 | 2 | -0.50 |
2016 | 1 | -1.00 |
2017 | 1 | 0.00 |
2018 | 4 | 50.25 |
ryear taken × median score = -0.48 (n = 30)
Effectively, this test has only one item, so that item statistics in the usual sense can not be computed.