Statistics of behaviour scores

© June 2015 Paul Cooijmans


Over the past twenty years I have been collecting various kinds of data for thousands of test candidates. This report deals with the predictive validity of a number of variables with regard to certain types of objectively observable misbehaviour that, sadly, occur in a small fraction of the candidates.

Insults, threats, and (re)tests under false names are common responses of these people when they are disappointed about their scores or otherwise dismayed. Notice that the false names are an implicit insult, as they result from their thinking "if he sees my real name he will deliberately give me a too low score to make it match my earlier score(s) on his tests or because he does not like me".

The fact they use this reasoning reveals their own dishonesty; this thought can only occur to them because it reflects what they would do in my place, or, as one says, "Ill doers are ill deemers".

To correlate misbehaviour to other statistics I added a field that contains 0 in the case of misbehavers and 1 in the case of people with objectively observable markedly positive, constructive behaviour. I left it blank in all other cases. Leaving these "neutral" cases blank rather than giving them, for instance, .5, makes this study more sensitive. The following statistics result:

Scores on Observed behaviour

Contents type: Assessment.   

0 ******************************************************************
1 *******************************************************************

Scores by males

n = 113

0 ****************************************************************
1 *************************************************

Scores by females

n = 20

0 **
1 ******************

Correlation of Observed behaviour with other tests by Paul Cooijmans

(Test index) Test name n r
(21) Psychometric Qrosswords50.86
(19) Numerical section of Test For Genius - Revision 201050.84
(52) Qoymans Multiple-Choice #260.83
(56) Short Test For Genius130.81
(1) Cartoons of Shock100.73
(65) Long Test For Genius (Netherlandic)50.65
(63) Long Test For Genius140.64
(84) Bonsai Test70.59
(77) Analogies #150.58
(18) The Nemesis Test90.57
(12) Cooijmans On-Line Test (Two-barrelled)60.54
(81) Association subtest of Long Test For Genius (Netherlandic)50.53
(35) Intelligence Quantifier by assessment700.49
(11) Isis Test150.47
(85) Cooijmans Intelligence Test - Form 1160.46
(69) Odds70.44
(2) Cooijmans Intelligence Test - Form 3110.41
(75) Analogies of Long Test For Genius150.41
(70) A-22 - Early experimental association test in Netherlandic, 27 items (maximum score 31)50.40
(76) Analogies subtest of Long Test For Genius (Netherlandic)50.38
(44) Associative LIMIT110.37
(72) Qoymans Automatic Test #160.35
(57) Space, Time, and Hyperspace280.34
(68) Numbers280.34
(7) The Final Test260.33
(3) Qoymans Multiple-Choice #5160.32
(40) Reason Behind Multiple-Choice - Revision 2008160.31
(53) Qoymans Multiple-Choice #3130.28
(16) Lieshout International Mesospheric Intelligence Test190.28
(87) Cooijmans Intelligence Test - Form 2120.26
(79) Association subtest of Long Test For Genius160.25
(28) The Test To End All Tests160.23
(83) KIT Intelligence Test - first attempts80.22
(27) Spatial section of Test For Genius - Revision 2004240.21
(10) Genius Association Test210.19
(24) Reason - Revision 2008160.18
(66) Test For Genius - Revision 2004210.16
(51) Qoymans Multiple-Choice #1130.12
(80) Qoymans Multiple-Choice #4230.07
(54) Test of Shock and Awe70.06
(62) Reason Behind Multiple-Choice130.05
(26) Verbal section of Test For Genius - Revision 2004230.03
(33) Problems In Gentle Slopes of the first degree5-0.02
(82) Reason14-0.08
(25) The Sargasso Test7-0.30
(55) Spatial Insight Test6-0.36
(74) Cooijmans On-Line Test6-0.50
(50) Qoymans Automatic Test #25-0.53
(0) Test of the Beheaded Man5-0.91

Weighted average of correlations: 0.301

Conservatively estimated minimum g loading: 0.55

Ranking in above table is based on the unrounded correlations. All available data is present in this table, no tests are left out except for those with less than 5 score pairs. All known pairs are used, including possible floor/ceiling scores or outliers.

Note: One should realize this is a point-biserial correlation (where one of the variables is a dichotomy, which through its limited variance suppresses the size of the correlation) so the actual g loading of positive, constructive behaviour combined with the reverse of insulting behaviour may be higher than shown above.

Correlation of Observed behaviour with tests by others

(Test index) Test name n r
(210) Drenth analogies31.00
(228) Miller Analogies Test (raw; old version)30.99
(212) Raven's Advanced Progressive Matrices (raw)50.88
(227) Concep-T30.86
(205) Cito-toets50.30
(238) 916 Test60.23
(242) Unknown and miscellaneous tests410.23
(211) Culture Fair Numerical Spatial Examination - Final version140.11
(239) Titan Test150.07
(208) California Test of Mental Maturity70.03
(235) Nonverbal Cognitive Performance Examination140.02
(248) Gliaweb Riddled Intelligence Test (old version)80.00
(229) Mega Test14-0.01
(236) International High IQ Society Miscellaneous tests13-0.04
(206) W-875-0.07
(231) Mysterium Entrance Exam13-0.10
(218) Raven's Advanced Progressive Matrices (I.Q.)9-0.12
(204) Chimera High Ability Riddle Test5-0.12
(237) Sigma Test7-0.13
(201) Wechsler Adult Intelligence Scales15-0.14
(217) G-test4-0.19
(200) American College Testing program4-0.26
(234) Strict Logic Sequences Exam I18-0.27
(233) Hoeflin Power Test7-0.28
(202) Cattell Verbal6-0.36
(220) Cattell Culture Fair13-0.46
(225) Logima Strictica 3619-0.49
(246) Sequentia Numerica Form I3-0.50
(213) Encephalist - R4-0.56
(230) Omega Contemplative Items Pool5-0.61
(243) Scholastic Aptitude Test (old)9-0.63
(223) Strict Logic Sequences Exam II4-0.70
(219) Graduate Record Examination7-0.76
(240) Strict Logic Spatial Exam 485-0.83

Weighted average of correlations: -0.095

Ranking in above table is based on the unrounded correlations. All available data is present in this table, no tests are left out except for those with less than 3 score pairs. All known pairs are used, including possible floor/ceiling scores or outliers.

Please be aware that correlations with these external tests are in most cases affected (depressed, typically) by one or more of the following: (1) Little overlap with the object test because of the much lower ceilings and inherent ceiling effects of the tests used in regular psychology; (2) Candidates reporting scores selectively, for instance only the higher ones while withholding lower ones; (3) Candidates reporting, or having been reported by psychometricians, incorrect scores.

Remark: The non-positive correlation with tests by others may indicate that insulting, misbehaving persons are (even) more dishonest in reporting their other scores than are candidates in general; for instance, more likely to report only their highest scores, or to report incorrect high scores.

Correlation of Observed behaviour with other tests by Paul Cooijmans - for females

(Test index) Test name n r
(68) Numbers40.87
(57) Space, Time, and Hyperspace50.71
(35) Intelligence Quantifier by assessment10-0.13

Weighted average of correlations: 0.303

Conservatively estimated minimum g loading among females: 0.55

Ranking in above table is based on the unrounded correlations. All available data is present in this table, no tests are left out except for those with less than 3 score pairs. All known pairs are used, including possible floor/ceiling scores or outliers.

Estimated loadings of Observed behaviour on particular item types

These are estimated g factor loadings, but against homogeneous tests (containing only particular item types) as opposed to non-compound heterogeneous tests. Although tending to surprise the lay person, it is not uncommon for tests to have high loadings on item types they do not actually contain themselves. Such loadings reflect the empirical fact that most tests for mental abilities measure primarily g, regardless of their contents; that the major part of test score variance is caused by g, and only a minor part by factors germane to particular item types. It is of key importance to understand that this is a fact of nature, a natural phenomenon, and not something that was built into the tests by the test constructors.

Typeg loading of Observed behaviour on that type

Compound tests have been left out of this table to avoid overlap.

Balanced g loading = 0.47

National means for Observed behaviour

Country n mean score

For reasons of privacy, only countries with 2 or more candidates are included in this table. Ranking is based on the unrounded means, and then alphabetic.

Correlation with national I.Q.'s of Observed behaviour

Correlation of this test with national average I.Q.'s published by Lynn and Vanhanen:

Correlation of Observed behaviour with personal details

Personalia n r
P.S.I.A. True290.70
Observed associative horizon460.61
P.S.I.A. Ethics factor410.55
P.S.I.A. Extreme290.39
P.S.I.A. Orderly290.32
Educational level780.32
P.S.I.A. Rare290.30
P.S.I.A. Introverted290.25
P.S.I.A. Deviance factor410.18
P.S.I.A. Rational290.17
Disorders (parents and siblings)710.16
P.S.I.A. Aspergoid290.16
P.S.I.A. System factor240.14
P.S.I.A. Neurotic290.10
P.S.I.A. Cold290.05
Father's educational level66-0.08
Mother's educational level68-0.15
Gifted Adult's Inventory of Aspergerisms31-0.17
P.S.I.A. Just29-0.19
Disorders (own)82-0.24
Year of birth123-0.24
P.S.I.A. Antisocial29-0.37
Candidate's self-estimated I.Q.7-0.38
P.S.I.A. Cruel29-0.40
Cooijmans Inventory of Neo-Marxist Attitudes9-0.68

P.S.I.A. True and Ethics (the latter combines True with the reverse of Cruel) are good at identifying misbehavers, while Extreme, Cruel, and Antisocial also help. But the question is if the P.S.I.A. would still work this well if explicitly used for selection; candidates might try to manipulate the outcome by answering dishonestly.

Interesting newcomers are Cooijmans Inventory of Neo-Marxist Attitudes and Candidate's self-estimated I.Q., but their n is too low to make these promising correlations with observed behaviour significant. Observed associative horizon is an obvious good correlate of behaviour, but as it concerns an assessment it is of less practical value than an actual personality test score.

Correlation with personal details of Observed behaviour - within females

Personalia n r
Disorders (own)100.33
Disorders (parents and siblings)90.32
Observed associative horizon60.25
Year of birth180.20
Mother's educational level7-0.50
Father's educational level7-0.61
Educational level10-0.62