Statistics of behaviour scores

Introduction

Over the past twenty-eight years I have been collecting various kinds of data for thousands of test candidates. This report deals with the predictive validity of a number of variables with regard to certain types of objectively observable misbehaviour that, sadly, occur in a small fraction of the candidates. Insults, threats, and (re)tests under false names are common responses of these people when disappointed about their scores or otherwise dismayed.

For this purpose I added a data field that contains 0 in the case of misbehavers and 1 in the case of people with objectively observable markedly positive, constructive behaviour. I left it blank in all other cases. The following statistics result:

Scores on Observed behaviour as of 27 January 2023

Contents type: Observation.

n: 164
Mean: 0.43
Standard deviation: 0.50
Range: 1.0
Maximum possible: 1
Male mean: 0.36 (n = 142)
Female mean: 0.90 (n = 21)
Unknown sex mean: 0.00 (n = 1)
Proportion of males among candidates: 0.866

0	**********************************************************************************************
1	**********************************************************************

Notice that since the possible values are 0 and 1, a mean as given above can be directly interpreted as the proportion of candidates having a 1. For instance, the male mean of .36 means that 36 % of the 142 males have a 1.

Correlation of Observed behaviour with mental ability tests

Test name	n	r
The Marathon Test	5	0.96
Verbal section of The Marathon Test	6	0.96
Raven's Advanced Progressive Matrices (raw)	5	0.88
Labyrinthine LIMIT	7	0.85
Test For Genius - Revision 2010	7	0.85
The Final Test - Revision 2013	7	0.83
Short Test For Genius	12	0.80
Cartoons of Shock	12	0.79
Dicing with death	4	0.74
Combined Numerical and Spatial sections of Test For Genius - Revision 2010	10	0.73
The Alchemist Test	9	0.71
Association subtest of Long Test For Genius	17	0.70
Cooijmans Intelligence Test 5	13	0.70
Daedalus Test	10	0.69
Test of Inductive Reasoning / J.C.T.I. (Xavier Jouve)	7	0.66
Long Test For Genius (Netherlandic)	5	0.65
Numerical section of Test For Genius - Revision 2010	20	0.65
Long Test For Genius	14	0.65
The Bonsai Test - Revision 2016	9	0.60
Cooijmans Intelligence Test - Form 4	14	0.58
Analogies #1	5	0.58
Test For Genius - Revision 2016	8	0.57
Bonsai Test	6	0.56
The Nemesis Test	16	0.54
Association subtest of Long Test For Genius (Netherlandic)	5	0.53
A Paranoiac's Torture: Intelligence Test Utilizing Diabolic Exactitude	13	0.53
Problems In Gentle Slopes of the third degree	10	0.51
Verbal section of Test For Genius - Revision 2016	8	0.50
Cooijmans Intelligence Test - Form 3	23	0.50
Cooijmans Intelligence Test - Form 1	15	0.50
Numerical section of The Marathon Test	11	0.50
Qoymans Multiple-Choice #2	7	0.48
Associative LIMIT	19	0.47
Spatial section of The Marathon Test	10	0.47
Psychometric Qrosswords	6	0.45
Numerical and spatial sections of The Marathon Test	9	0.44
Qoymans Multiple-Choice #5	26	0.42
The Test To End All Tests	21	0.42
Narcissus' last stand	8	0.40
A-22 - Early experimental association test in Netherlandic, 27 items (maximum score 31)	5	0.40
Words	5	0.40
Analogies subtest of Long Test For Genius (Netherlandic)	5	0.38
Isis Test	22	0.37
Genius Association Test	29	0.37
Odds	6	0.37
The Final Test	28	0.36
The Hammer Of Test-Hungry - Revision 2013	7	0.36
Analogies of Long Test For Genius	15	0.35
Combined Numerical and Spatial sections of Test For Genius - Revision 2016	11	0.35
Cooijmans On-Line Test - Two-barrelled version	11	0.35
Reason Behind Multiple-Choice - Revision 2008	26	0.34
KIT Intelligence Test - first attempts	8	0.34
Space, Time, and Hyperspace	29	0.34
Qoymans Multiple-Choice #3	14	0.32
Problems In Gentle Slopes of the first degree	8	0.32
Qoymans Automatic Test #1	5	0.32
Cito-toets	5	0.30
Problems In Gentle Slopes of the second degree	13	0.29
European I.Q. Test	5	0.28
Numbers	26	0.28
Reflections In Peroxide	13	0.28
Cooijmans Intelligence Test - Form 2	12	0.26
Qoymans Multiple-Choice #1	12	0.26
Space, Time, and Hyperspace - Revision 2016	12	0.25
Gliaweb Riddled Intelligence Test (old version)	8	0.24
Unknown and miscellaneous tests	75	0.21
Lieshout International Mesospheric Intelligence Test	31	0.19
Gliaweb Riddled Intelligence Test - Revision 2011	14	0.19
Mega Test (Ronald K. Hoeflin)	14	0.16
Spatial section of Test For Genius - Revision 2004	27	0.16
The Sargasso Test	13	0.14
Verbal section of Test For Genius - Revision 2004	25	0.12
916 Test (Laurent Dubois)	7	0.12
Reason - Revision 2008	26	0.11
Test For Genius - Revision 2004	21	0.11
Culture Fair Numerical Spatial Examination - Final version (Etienne Forsström)	14	0.11
Titan Test (Ronald K. Hoeflin)	16	0.06
Test of Shock and Awe	7	0.06
Qoymans Multiple-Choice #4	23	0.05
California Test of Mental Maturity	7	0.05
Tests by Greg Grove (aggregate)	17	0.04
Tests by Ivan Ivec (aggregate)	7	0.03
Test of the Beheaded Man	10	-0.02
Tests by Nicolas Elenas (aggregate)	6	-0.03
Non-Verbal Cognitive Performance Examination (Xavier Jouve)	13	-0.05
Queendom Culture Fair	5	-0.07
International High IQ Society tests (aggregate)	13	-0.07
W-87 (International Society for Philosophical Enquiry)	5	-0.07
Reason Behind Multiple-Choice	13	-0.07
Chimera High Ability Riddle Test (Bill Bultas)	5	-0.12
Raven's Advanced Progressive Matrices (I.Q.)	8	-0.14
Wechsler Adult Intelligence Scales	17	-0.16
Strict Logic Sequences Exam I (Jonathan Wai)	19	-0.19
G-test (Nikos Lygeros)	4	-0.19
Reason	14	-0.20
American College Testing program	4	-0.26
Cattell Culture Fair	14	-0.28
Hoeflin Power Test (Ronald K. Hoeflin)	7	-0.28
Epiq Tests (aggregate)	6	-0.30
Sigma Test (Melão Hindemburg)	6	-0.30
Scholastic Aptitude Test (old)	9	-0.40
Cooijmans On-Line Test	5	-0.41
F.N.A. (Xavier Jouve)	4	-0.41
Cattell Verbal (also known as Cattell B)	5	-0.42
Logima Strictica 36 (Robert Lato)	20	-0.47
Strict Logic Sequences Exam II (Jonathan Wai)	5	-0.49
Qoymans Automatic Test #2	4	-0.52
New York High I.Q. Society tests	4	-0.52
Strict Logic Spatial Exam 48 (Jonathan Wai)	7	-0.61
Omega Contemplative Items Pool (Tommy Smith)	5	-0.61
Spatial Insight Test	4	-0.73
Graduate Record Examination	7	-0.76
Tests by Mislav Predavec (aggregate)	4	-0.81

Weighted average of correlations: 0.250 (N = 1355, weighted sum = 338.51)

Conservatively estimated minimum g loading: 0.50

Ranking in above table is based on the unrounded correlations. All available data is present in this table, no tests are left out except for those with less than 4 score pairs. All known pairs are used, including possible floor/ceiling scores or outliers.

So, observed positive behaviour does have a moderate loading on general intelligence, but lower than that of any actual I.Q. test that one would trust.

Correlation of Observed behaviour with other tests - for females

Test name	n	r
(65) Long Test For Genius (Netherlandic)	2	1.00
(76) Analogies subtest of Long Test For Genius (Netherlandic)	2	1.00
(81) Association subtest of Long Test For Genius (Netherlandic)	2	1.00
(68) Numbers	4	0.87
(57) Space, Time, and Hyperspace	6	0.72
(242) Unknown and miscellaneous tests	7	0.44
(220) Cattell Culture Fair	2	-1.00

Weighted average of correlations: 0.595 (N = 25, weighted sum = 14.87)

Conservatively estimated minimum g loading among females: 0.77

Ranking in above table is based on the unrounded correlations. All available data is present in this table, no tests are left out except for those with less than 2 score pairs. All known pairs are used, including possible floor/ceiling scores or outliers.

Estimated loadings of Observed behaviour on particular item types

These are estimated g factor loadings, but against homogeneous tests (containing only particular item types) as opposed to non-compound heterogeneous tests. Although tending to surprise the lay person, it is not uncommon for tests to have high loadings on item types they do not actually contain themselves. Such loadings reflect the empirical fact that most tests for mental abilities measure primarily g, regardless of their contents; that the major part of test score variance is caused by g, and only a minor part by factors germane to particular item types. It is of key importance to understand that this is a fact of nature, a natural phenomenon, and not something that was built into the tests by the test constructors.

Type	n	g loading of Observed behaviour on that type
Verbal	274	0.61
Numerical	82	0.55
Spatial	120	0.41
Logical	62	0.39
Heterogeneous	372	0.57

N = 910

Compound tests have been left out of this table to avoid overlap.

Balanced g loading = 0.51

The highest loadings of observed behaviour are on verbal and heterogeneous tests, which more or less concords with the long-term anecdotal observation that one-sided non-verbal tests let through misbehavers when used for society admission.

National means for Observed behaviour

Country	n	mean score
Finland	5	1.00
Malta	2	1.00
Turkey	2	1.00
Netherlands	23	0.65
United_Kingdom	5	0.60
Australia	6	0.50
Belgium	6	0.50
China	2	0.50
Portugal	2	0.50
South_Africa	2	0.50
United_States	52	0.48
Sweden	7	0.43
Canada	5	0.40
Italy	4	0.25
Germany	5	0.20
Brazil	2	0.00
France	2	0.00
Greece	4	0.00
Korea_South	3	0.00
Mexico	2	0.00
New_Zealand	2	0.00
Poland	2	0.00
Spain	3	0.00

For reasons of privacy, only countries with 2 or more candidates are included in this table. Ranking is based on the unrounded means, and then alphabetic.

Correlation with national I.Q.'s of Observed behaviour

Correlation of this test with national average I.Q.'s published by Lynn and Vanhanen:

r = 0.09 (n = 160)

Correlation of Observed behaviour with personal details

Personalia	n	r
Observed associative horizon	45	0.65
PSIA Rational - Revision 2007	16	0.56
PSIA Extreme - Revision 2007	16	0.45
PSIA True - Revision 2007	16	0.44
PSIA Ethics factor - Revision 2007	16	0.44
PSIA Deviance factor - Revision 2007	16	0.39
PSIA Cold - Revision 2007	16	0.36
PSIA System factor - Revision 2007	16	0.33
PSIA Just - Revision 2007	16	0.31
PSIA Introverted - Revision 2007	16	0.29
Educational level	99	0.26
PSIA Rare - Revision 2007	16	0.23
Disorders (parents and siblings)	93	0.21
PSIA Aspergoid - Revision 2007	16	0.15
PSIA Orderly - Revision 2007	16	0.11
Father's educational level	84	-0.05
PSIA Neurotic - Revision 2007	16	-0.09
Gifted Adult's Inventory of Aspergerisms	32	-0.10
PSIA Antisocial - Revision 2007	16	-0.13
Disorders (own)	104	-0.17
Mother's educational level	88	-0.19
Year of birth	151	-0.30
PSIA Cruel - Revision 2007	16	-0.35
Sex	163	-0.37
Cooijmans Inventory of Neo-Marxist Attitudes	12	-0.46

The significant correlations are those with Observed associative horizon, PSIA Rational, Sex, Year of birth, and Education level.

Correlation with personal details of Observed behaviour - within females

Personalia	n	r
Disorders (parents and siblings)	9	0.32
Observed associative horizon	5	0.17
Year of birth	18	0.16
Disorders (own)	10	0.00
Mother's educational level	7	-0.50
Father's educational level	7	-0.61
Educational level	9	-0.61

Correlation with personal details of Observed behaviour - within males

Personalia	n	r
Observed associative horizon	40	0.70
PSIA Rational - Revision 2007	14	0.62
PSIA True - Revision 2007	14	0.51
PSIA Ethics factor - Revision 2007	14	0.51
PSIA Extreme - Revision 2007	14	0.48
PSIA Deviance factor - Revision 2007	14	0.40
PSIA Just - Revision 2007	14	0.39
PSIA Cold - Revision 2007	14	0.33
PSIA System factor - Revision 2007	14	0.32
Educational level	90	0.26
PSIA Introverted - Revision 2007	14	0.26
PSIA Rare - Revision 2007	14	0.22
Disorders (parents and siblings)	84	0.19
PSIA Aspergoid - Revision 2007	14	0.15
PSIA Orderly - Revision 2007	14	0.07
PSIA Antisocial - Revision 2007	14	-0.06
Father's educational level	77	-0.07
PSIA Neurotic - Revision 2007	14	-0.15
Mother's educational level	81	-0.17
Disorders (own)	94	-0.18
Gifted Adult's Inventory of Aspergerisms	27	-0.18
Year of birth	133	-0.28
PSIA Cruel - Revision 2007	14	-0.39
Cooijmans Inventory of Neo-Marxist Attitudes	11	-0.51

[More statistical reports]