Statistics of Bonsai Test

© Paul Cooijmans

Introduction

This test was too short to have sufficient reliability and validity. It is no longer used in its own right but forms part of The Bonsai Test - Revision 2016. These statistics are from the period when it was used in its own right.

Scores on Bonsai Test as of 10 February 2023

Contents type: Verbal, numerical, spatial.   Period: 2002-2006

3 **
5 **
6 **
7 *
7.5 *
8 ***
9 *
10 ****
11 *****
12 ***
13 *

Correlation of Bonsai Test with other mental ability tests

Test name                                                    n      r
Analogies of Long Test For Genius                            8   0.92
Long Test For Genius                                         8   0.79
Lieshout International Mesospheric Intelligence Test         7   0.78
Cartoons of Shock                                           11   0.76
Analogies #1                                                 4   0.71
Strict Logic Sequences Exam I (Jonathan Wai)                 6   0.68
Association subtest of Long Test For Genius                  8   0.67
KIT Intelligence Test - first attempts                       6   0.67
Reason                                                       5   0.63
Reason - Revision 2008                                       5   0.61
Reason Behind Multiple-Choice                                4   0.59
Qoymans Multiple-Choice #1                                   7   0.57
Test of Shock and Awe                                        9   0.54
Space, Time, and Hyperspace                                 14   0.53
Spatial section of Test For Genius - Revision 2004          10   0.47
The Final Test                                              12   0.46
Qoymans Multiple-Choice #3                                   9   0.45
Logima Strictica 36 (Robert Lato)                            6   0.45
Mega Test (Ronald K. Hoeflin)                                4   0.42
Qoymans Multiple-Choice #4                                   8   0.40
Reason Behind Multiple-Choice - Revision 2008                5   0.37
Odds                                                         6   0.36
Test For Genius - Revision 2004                             10   0.36
Cooijmans Intelligence Test - Form 1                        10   0.36
Isis Test                                                    8   0.34
The Nemesis Test                                             4   0.33
Genius Association Test                                      8   0.29
Short Test For Genius                                        5   0.27
The Sargasso Test                                            6   0.21
Cooijmans On-Line Test                                       4   0.19
Titan Test (Ronald K. Hoeflin)                               4   0.18
Numbers                                                     13   0.17
The Test To End All Tests                                    9   0.16
Verbal section of Test For Genius - Revision 2004           10   0.15
Unknown and miscellaneous tests                             14   0.09
Sigma Test (Melão Hindemburg)                                6   0.08
Wechsler Adult Intelligence Scales                           4   0.05
Qoymans Multiple-Choice #5                                   5  -0.16
Tests by Greg Grove (aggregate)                              6  -0.27
Cooijmans Intelligence Test - Form 2                         6  -0.28
Cattell Culture Fair                                         4  -0.35
Encephalist - R (Xavier Jouve)                               4  -0.37
Non-Verbal Cognitive Performance Examination (Xavier Jouve)  5  -0.44
Spatial Insight Test                                         5  -0.74

Weighted average of correlations: 0.340 (N = 312, weighted sum = 106)
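The published totals suggest that each correlation is weighted by its number of score pairs, since 106 / 312 rounds to 0.340. A minimal sketch of that arithmetic follows, assuming the weights are simply n; the function name is illustrative, and the three sample rows use the rounded correlations from the table above, so they will not reproduce the unrounded figures exactly.

```python
def weighted_mean_correlation(pairs):
    """Return (total n, weighted sum, weighted mean) for (n, r) pairs."""
    total_n = sum(n for n, _ in pairs)
    weighted_sum = sum(n * r for n, r in pairs)
    return total_n, weighted_sum, weighted_sum / total_n


# Three (n, r) rows taken from the table above, with rounded r values.
sample = [(8, 0.92), (14, 0.53), (5, -0.74)]
total_n, weighted_sum, mean_r = weighted_mean_correlation(sample)
print(total_n, round(weighted_sum, 2), round(mean_r, 3))

# Check of the published totals under the same assumption (weights = n).
published_mean = 106 / 312
print(round(published_mean, 3))
```

Weighting by n keeps correlations based on very few score pairs from dominating the average.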

Estimated g factor loading: 0.58

Ranking in the above table is based on the unrounded correlations. All available data are included in this table; no tests are left out except those with fewer than 4 score pairs. All known pairs are used, including possible floor or ceiling scores and outliers.

Estimated loadings of Bonsai Test on particular item types

These are estimated g factor loadings, but against homogeneous tests (containing only particular item types) as opposed to non-compound heterogeneous tests. Although tending to surprise the lay person, it is not uncommon for tests to have high loadings on item types they do not actually contain themselves. Such loadings reflect the empirical fact that most tests for mental abilities measure primarily g, regardless of their contents; that the major part of test score variance is caused by g, and only a minor part by factors germane to particular item types. It is of key importance to understand that this is a fact of nature, a natural phenomenon, and not something that was built into the tests by the test constructors.

Type             n  g loading of Bonsai Test on that type
Verbal          88  0.65
Numerical       25  0.58
Spatial         36  0.62
Logical         14  0.71
Heterogeneous   78  0.53

N = 241

Compound tests have been left out of this table to avoid overlap.

Balanced g loading = 0.62
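How the balanced g loading is computed is not stated here. One reading that reproduces the published 0.62 is an unweighted mean of the five item-type loadings, so that each type counts equally regardless of its n. The sketch below checks that reading; it is an assumption, not a description of the actual procedure.

```python
# g loadings by item type, copied from the table above.
loadings = {
    "Verbal": 0.65,
    "Numerical": 0.58,
    "Spatial": 0.62,
    "Logical": 0.71,
    "Heterogeneous": 0.53,
}

# Assumption: "balanced" means every item type gets the same weight.
balanced = sum(loadings.values()) / len(loadings)
print(round(balanced, 2))
```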

National medians for Bonsai Test

Country          n  median score
Germany          2  10.0
United Kingdom   3  10.0
Finland          2   9.8
United States    6   7.0

For reasons of privacy, only countries with 2 or more candidates are included in this table. Ranking is by median, then alphabetical.
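The inclusion and ranking rules just described can be sketched as follows. The records are hypothetical placeholders rather than candidate data, and the function name and the `min_candidates` parameter are illustrative.

```python
from statistics import median


def national_medians(records, min_candidates=2):
    """Group (country, score) records and return (country, n, median) rows."""
    by_country = {}
    for country, score in records:
        by_country.setdefault(country, []).append(score)
    rows = [
        (country, len(scores), median(scores))
        for country, scores in by_country.items()
        if len(scores) >= min_candidates  # privacy rule: drop single candidates
    ]
    # Highest median first; ties are broken alphabetically by country name.
    return sorted(rows, key=lambda row: (-row[2], row[0]))


# Hypothetical records, not the actual candidate data.
records = [("B", 10), ("B", 12), ("A", 11), ("A", 11), ("C", 9), ("D", 7), ("D", 8)]
result = national_medians(records)
for country, n, med in result:
    print(country, n, med)
```

Country "C" is dropped because it has a single candidate, and "A" precedes "B" because their medians are equal.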

Correlation with national I.Q.'s of Bonsai Test

Correlation of this test with national average I.Q.'s published by Lynn and Vanhanen:

Correlation of Bonsai Test with personal details

Personalia                                 n      r
Observed behaviour                         6   0.56
Observed associative horizon               6   0.51
Sex                                       25   0.42
Year of birth                             25   0.27
Mother's educational level                17   0.26
Disorders (parents and siblings)          17   0.24
Disorders (own)                           17   0.16
Gifted Adult's Inventory of Aspergerisms   4   0.14
Educational level                         17  -0.08
Father's educational level                17  -0.17

Estimated g factor loadings for restricted ranges

The number in parentheses is the number of score pairs on which that estimated g factor loading is based. The goal is to test the hypothesis that g becomes less important, that is, accounts for a smaller proportion of the variance, at higher I.Q. levels. Restricting the range in this way by itself also depresses the g loading compared to computing it over the test's full range, so it is normal for these values to be lower than the test's full-range g loading.

Below 1st quartile   0.65 (34)
Below median         0.63 (136)
Above median        -0.41 (190)
Above 3rd quartile  -0.58 (137)
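The depressing effect of range restriction mentioned above can be illustrated with simulated data. The sketch below assumes two normally distributed variables with a population correlation of 0.6 (an arbitrary illustrative value) and compares the correlation over the full range with the correlation among cases above the median of the first variable. It illustrates the general statistical effect only and does not use data from this test.

```python
import random


def pearson(xs, ys):
    """Pearson correlation of two equally long lists of numbers."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy / (sxx * syy) ** 0.5


random.seed(0)
rho = 0.6  # assumed population correlation, for illustration only
xs, ys = [], []
for _ in range(20000):
    x = random.gauss(0, 1)
    y = rho * x + (1 - rho ** 2) ** 0.5 * random.gauss(0, 1)
    xs.append(x)
    ys.append(y)

full_r = pearson(xs, ys)

# Restrict the range: keep only cases above the median of x.
cut = sorted(xs)[len(xs) // 2]
upper = [(x, y) for x, y in zip(xs, ys) if x > cut]
upper_r = pearson([x for x, _ in upper], [y for _, y in upper])

print(round(full_r, 2), round(upper_r, 2))
```

In such a simulation the restricted correlation comes out lower than the full-range one even though the underlying relation is the same everywhere, which is why a drop in the restricted-range values alone does not confirm the hypothesis.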

Reliability

Error

Scores by age

Age class   n  Median score
55 to 59    1   7.5
50 to 54    3   5.0
45 to 49    2  10.0
40 to 44    3  11.0
35 to 39    3  11.0
30 to 34    3  11.0
25 to 29    6   9.0
22 to 24    1  12.0
20 or 21    1   6.0
18 or 19    1   9.0
17          1   7.0

N = 25

Scores by year taken

Year taken  n  median score
2002        7  10.0
2003        4   6.0
2004        8   8.0
2005        4  11.0
2006        2   9.3

r(year taken × median score) = 0.29 (N = 25)

Robustness and overall test quality

Item analysis

Item statistics are not published as that would help candidates. To detect bad items, answers and comments from candidates are studied, as well as, for each problem, the correlation with total score on the remaining problems (item-rest correlation) and the proportion of candidates getting it wrong (hardness of the item). Possible bad items are revised, replaced, or removed, possibly resulting in a revised version of the test.
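The two item statistics named above can be sketched as follows. The example assumes items scored 0 or 1 and uses a small hypothetical response matrix; actual scoring may differ (for instance, partial credit), and the function names are illustrative.

```python
def pearson(xs, ys):
    """Pearson correlation; returns None when either variable is constant."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    if sxx == 0 or syy == 0:
        return None
    return sxy / (sxx * syy) ** 0.5


def item_statistics(responses):
    """Return a (hardness, item-rest correlation) tuple for each item.

    responses: one list of 0/1 item scores per candidate.
    """
    stats = []
    for i in range(len(responses[0])):
        item = [row[i] for row in responses]
        # Total score on the remaining problems, excluding this item.
        rest = [sum(row) - row[i] for row in responses]
        # Hardness: proportion of candidates getting the item wrong.
        hardness = item.count(0) / len(item)
        stats.append((hardness, pearson(item, rest)))
    return stats


# Hypothetical responses: 5 candidates by 3 items, not real test data.
responses = [
    [1, 1, 0],
    [1, 0, 0],
    [1, 1, 1],
    [0, 0, 0],
    [1, 1, 1],
]
stats = item_statistics(responses)
for number, (hardness, item_rest) in enumerate(stats, start=1):
    print(number, hardness, item_rest)
```

An item with a low or negative item-rest correlation would be a candidate for closer inspection, since it does not rank candidates the way the rest of the test does.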