You may download a copy of this article for private use, provided you leave this copyright warning attached. However, the distribution of multiple copies, electronically or in hard copy, or the use of this article for any commercial purpose, is a copyright violation.
Sex differences in human problem-solving behaviour have been reliably found across many studies over the past few decades, and in several cultures. The major findings are summarized in Table I. The list is not exhaustive, but men and women clearly display different cognitive strengths, hence different patterns of ability. It must be kept in mind that these are average differences, and that the overlap between the sexes on many tests is extensive. Predicting an individuals performance level based on sex alone would be highly inaccurate.


Figure 1. An example of a complex task requiring rotation in "three" dimensions. The subject must decide whether the two figures are the the same figure rotated, or are two different figures.
The Vandenberg Mental Rotations test, based on the Shepard and Metzler designs [91] (Figure 1), depicts complex 3-dimensional figures on a 2-dimensional page. However, the mental rotation of a figure in three dimensions is not a prerequisite for producing a large sex difference. Difficult figures requiring imaginal rotation in only two dimensions may yield an equally large effect size [14]. Performance on a mental rotation test has been found to be significantly related to performance on a route-finding task [23] and a computerized labyrinth [78], suggesting that such ability may contribute substantially to navigation.
A quite different spatial task, favouring women, is depicted in Figure 2. The subject is asked to indicate, from memory, which objects have changed locations from the first to the second array [18]. Other studies have confirmed a female advantage in processing the locations of multiple objects in an array [46, 69, 83]. However, when the task emphasizes location per se, regardless of object identity, women perform less well than men [43, 83], leading to the suggestion that women may use a common system to process object identity and location, whereas men do not [43, 46]. Such an interpretation would be consistent with findings that females tend more often to employ landmarks in navigating, in contrast to males employment of geometric cues [105, 109].


School marks on mathematics tests may not show sex differences, or girls may even do better, as they do on most school subjects on average. However, when mathematical aptitude tests are given which are not simply variations of previously learned problems, and which therefore emphasize math reasoning, there is a consistent male advantage [60, 66]. On strictly calculation tests, females more often perform better.
The male advantage appears to increase with increasing difficulty or complexity of the problems; and males are over-represented at the high end of most math aptitude tests. In an extensive study in the United States spanning decades that identified mathematically talented youth, males have always outnumbered females [4, 61]. However, the preponderance of males is especially marked at the upper end of this population on the Scholastic Aptitude Test-Mathematics (SAT-M), by more than 10 to one.
On another very demanding mathematical reasoning test available to math-oriented North American undergraduates, the Putnam competition, male and female scores are also very divergent. In 1999, of the top 138 contestants, only five could be unequivocally identified (by name) as female. In a more recent competition in which sex of contestants was identified, there were again four times as many men as women taking the test, but 84% of women and 54% of men failed the test completely, that is, they got no score. 37.4% of men but only 11% of women got passing scores of 10 and higher.
So for many tests of ability, we must look not merely at the average difference, but at the extremes of the scores. Even when the average sex difference on any test is small, the extremes of the distribution may show a sharp divergence. For most professions, it is the upper end which is critical.Although popular opinion holds that women are superior on most verbal tasks, this is not the case. The assumption probably stems from the earlier onset of fluent articulate speech in young girls [see 64 for an early review, 68]. In adults, however, reliable differences to date favouring women are found primarily on tasks of verbal memory [7, 55] and verbal fluency [19], not on vocabulary or verbal reasoning. Verbal fluency tests usually require the generation of words or sentences which have some constraint on the letters they contain, e.g., listing words that begin with a particular letter. Verbal memory taps the recall of material employing words, whether in lists or meaningful paragraphs, and whether the words are abstract or concrete. Verbal memory tests have shown the largest effect sizes so far of tests favouring women, in one study from .58 to .97 on recall of a word list [13].
In contrast, women do better than men on small amplitude fine motor skills, such as required on the Purdue Pegboard. Some studies have found this advantage to be related to womens smaller finger size but this is not so in other studies [e.g., 31]. Women also tend to be better at performing manual sequences, relative to their speed on the individual movements [80]; and this advantage is enhanced when performing without vision [12], suggesting that womens motor skills may be more closely related to intrapersonal than to extrapersonal neural systems.
There are some nonhuman parallels to human cognitive sex differences, which bolster evolutionary interpretations. In polygynous vole species, in which the males territory is larger than in monogamous species, males perform better than females on a maze learning task [24]. Male rats make fewer errors than females in a radial arm maze, and moreover the sexes employ different methods of solution [109]. Male baboons perform better on a mental rotation task [102], consistent with their navigational role in guiding the troop.
Whatever the ultimate determinants for human cognitive sexual differentiation, the evidence suggests that the proximate mechanisms have relied heavily on sex hormones, discussed later.


It might be expected, to the extent that sex differences are determined by lifetime experiences, that they should have declined in the last few decades. Such claims have indeed been made [21], but there are difficulties in comparing the heterogenous tests and heterogeneous populations over time. For example, the samples of people taking the tests over time has probably changed [32]. Moreover, some tests have been deliberately altered to dilute sex differences, by removing some items that differentiate the sexes.
We are on firmer ground comparing sex differences over different time periods when we know that the same test has been used throughout, and that the samples we are comparing are equivalent. For example, sex differences on the Vandenberg Mental Rotation test have remained quite stable over the last three decades. We have already noted that this task shows a large sex difference, the effect size hovering around 1.0. Masters and Sanders [67] reviewed 14 studies employing the task between 1975 and 1992, nearly all on college students. Effect sizes varied from .69 to 1.27 across samples, but there was no systematic change across time. The 1975 study showed an effect size of 1.0, and the two most recent papers they cited found effects sizes of 1.27 and 1.14. We can add to that from our own studies in 1997 we found an effect size of .86 [14], and in 1999, effect sizes of .98 and .90 in two samples [13]. In 2001 Geary and DeSoto found an effect size of approximately 1.06 [27].
Alternative methods of evaluating experiential contributions to sex differences in cognition have employed attempts at relating past activities which were rated as more spatial (e.g., skiing, building model planes), to current scores on a spatial visualization test [79]. A small but significant correlation was found. However, this kind of result does not allow us to determine whether life activities influence spatial ability, or the reverse. A review of studies in which men and women were given intensive practice on a variety of spatial tasks, to see whether women might be differentially advantaged by such practice (as might be expected if lack of past experience were the critical factor), reported negative findings. The sex differences were not erased with short-term intensive practice in adulthood [3].
Of course we would not conclude that life experience plays no role in the appearance of sex differences in cognition. However, it seems inevitable that such experience will manifest itself within the context of nervous systems already differently predisposed at birth.
The "simple" version of mammalian sexual differentiation [see 30] is that the undifferentiated or default form is the female, and that active processes, largely under control of androgens and their derivatives, are needed to produce a male. The story appears to be very well established for formation of genitalia and general physical type, and for some reproductive behaviours. It has also been convincingly applied to some non-reproductive behaviours, including rough-and-tumble-play and certain problem-solving activities.
Within this framework it has been proposed that there are critical developmental periods, prenatal and immediately postnatal, during which the action of androgens irreversibly masculinizes the brain. Some of these actions depend on the aromatization of testosterone to estrogen; hence the female brain is thought to be protected from such influence by the sequestering of estrogen by a substance called alpha-fetoprotein. Such early developmental infl uence is called "organizational". In adulthood, sex hormones are again required to activate the hormonally predisposed brain along male or female lines, as appropriate called an activational influence. Some behaviours organized by androgens, such as rough-and-tumble play, need no further activational influence to appear.
Since this pioneering schema, the story has inevitably become more complex:1) The line between organizational and activational influence has become less distinct, with suggestions that these phases overlap in their mechanisms of operation, if not in time [110].
2) Doubts have been raised about assumptions concerning the negligible role of ovarian hormones in organizing female behaviour [see 22 for review].
3) There is clearly variation in the critical hormonesensitive time period for different behaviours [29], raising the possibility that early individual variation in timing of hormone surges also influences cognitive patterns.
4) It is unclear whether the demonstrated contribution to sexual differentiation by aromatization to estrogen in rodents, is as important in primates [99], including humans [98].
5) In addition to the genomic effects of hormones, acting through the cell nucleus to bind to DNA, and slowly taking effect, there are more immediate nongenomic effects acting on the cell membrane [70].
These considerations may all have a bearing on the interpretation of the role of sex hormones in human cognition. Generalizing from non-humans has provided useful models, but these must ultimately be assessed in humans. Such models can usually only be tested in people by means of naturally-occurring hormonal variation, or in hormonal anomalies, since the opportunity for manipulation of hormones for other than therapeutic purposes is very limited. However, since such anomalies are often the result of a genetic variant, it is sometimes difficult to rule out non-hormonal genetic influences.
Some of the naturally occurring hormonal anomalies studied in the context of cognitive makeup include congenital adrenal hyperplasia (CAH), androgen insensitivity (AI), idiopathic hyogonadotropic hypogonadism (IHH), and Turners syndrome. Cognitive pattern has also been studied in relation to individual hormone levels across normal men and women, the underlying assumption being that there is a stable base which can vary from person to person.
If after training the maze was surrounded by a circular curtain, so that the geometry of the room could no longer be used as a cue, the performance of normal males and hormone-treated females, but not of the other two groups, was negatively affected. Conversely, if after training no curtain was used, but the landmarks (pictures on the wall, furniture, etc) around the maze were moved, then the performance of only the normal females and the castrated males was negatively affected. Thus not only the overall performance, but also the cues that were used, were influenced by early hormonal manipulation.
Such studies can be performed in rodents because the brain continues to be susceptible to hormone influences in the first few days after birth, even though the genitals are already formed. No precisely analagous preparations, in whom hormonal disruption is limited to a period right after birth, exist in humans. Several human hormonal anomalies have resulted in prenatal, but due to surgical or therapeutic intervention, minimal anomalous postnatal influence of sex hormones (CAH, cloacal exstrophy, DES exposure). Others have had anomalies throughout their lives (IHH, AI). The former would of course be more informative about any early organizing effects on cognition.
The most compelling evidence for prenatal androgenic influences on cognition in humans comes from CAH cases, in whom high levels of androgens are produced by the adrenals before birth. The androgen overproduction is stopped by cortisol therapy after birth and any virilization of the genitals in females is repaired by surgery. The latter are raised as girls, and most of them are heterosexually oriented and marry men, though their interest in infants is lower than is the case in unaffected women [see 5 for review].
Of interest for cognitive processes is the fact that spatial ability has often been reported to be better in CAH girls than unaffected girls [6, 36, 86]. This advantage appears selective, that is, so far, other abilities are not enhanced. In fact, Hampson et al. [36] report that perceptual speed, an ability favouring women, is reduced in CAH girls. Boys may also suffer from CAH and the consequent overproduction of androgens; but their spatial skills are reduced relative to unaffected boys [36]. The reasonable inference, that the optimal level of androgens for the expression of spatial ability is in the low normal male range, is consistent also with studies on normal young adults described below. No cognitive effects of prenatal exposure to diethylstilbestrol (DES), which has some behavioural effects that mimic androgens, have so far been found in humans [41, 42].
Boys born with cloacal exstrophy may have normal prenatal levels of androgens but because they are born without a penis they have in the past usually been castrated within days of birth and raised as girls. An astonishingly high number of them opt to live as males when they reach adolescence [85], in which case one might speculate that their gender identity was determined prenatally. We do not yet have information on their cognitive patterns.
Hypogonadal men who have lower than normal levels of testosterone throughout their lives, with syndromes like Kallmanns, Klinefelters, and Idiopathic Hypogonadotropic Hypogonadism (IHH), have poorer spatial ability than normal males even though their Vocabulary scores may not differ from normal [1, 40]. Another study found poorer performance not only on a visuo-constructional task, but also on several verbal tasks [81]. In order to rule out current hormonal environment as the causative factor in these studies, we would need to know the cognitive profile of men who had only recently experienced androgen deficiency. Hier and Crowley [40] provide data on five such men and report that their spatial scores are normal, but more data are needed.
Androgen insensitivity (AI) is a syndrome in which genetic males have testes (hidden in the body cavity), and produce testosterone, but the bodys cells do not have androgen receptors. Hence, the androgens are ineffective, the external genitalia are female, and the "girls" are raised as such without any indication of their male genetic makeup until they fail to menstruate. Information on cognitive pattern is incomplete, studies to date employing standard intelligence tests, which generally yield no or small sex differences. Even if AI individuals resembled females, as claimed [44], it would be difficult to evaluate the relative contribution of hormonal influences and of being reared as girls.
Variations in testosterone (T) across seasons, time of day, etc., presumably occur on an underlying stable baseline. We do not yet have information on the relation between T levels early and late in life in the same individuals, but correlations between individuals T levels across a year, calculated from Smals at al. data [97], are highly significant. There is also evidence from a twin study that a large proportion of the variation in T from person to person is genetically based [74]. These facts suggest that some part of the relationship of T to cognitive function refl ects a lifelong or "organizational" effect, particularly when chronological factors are controlled.
It is generally found that levels of testosterone are non-linearly related to cognitive function, in particular to spatial abilities of the kind favouring men [28, 76, 92]. Normal young men with lower levels of T perform better than young men with higher levels; but young women with higher levels perform better than those with lower levels. Such findings are consistent with the idea that spatial ability is enhanced in the low normal male range, at least in the 2030 age group.
One study found a similar trend for math reasoning, for lowT men to have better scores than highT men [28]. This result has not yet been confirmed.
Thus, variations in estrogen appear to selectively affect cognitive pattern. Such effects have been confirmed by others [37, 88, 94]. Further evidence for the specificity of the effects of estrogen in the motor sphere comes from a report that in the high-estrogen midluteal phase, performance on a manual sequencing task was predictably enhanced, whereas on a male-favouring throwing accuracy task, there was no overall effect on performance [90]. Several studies measuring behavioural asymmetry have also suggested that right-hemisphere cerebral function is depressed, relative to left, in high-estrogen phases [72, 73, 88, 90].
Since we know from across-subject studies that men with T levels in the low normal range perform best on certain spatial tasks, it might be expected that we would see both seasonal and diurnal fluctuations in spatial ability. We found, as expected, that men performed better on a composite spatial measure in spring, when T levels were lower, than in fall [52]. Similarly, Moffat and Hampson [76] found performance on either a mental rotations task or a composite spatial measure to be worst in men in early morning, when T levels are highest. Note that not all "masculine" tests need be similarly affected. In our seasonal study, neither math reasoning nor throwing accuracy were substantially influenced by the season. This does not of course preclude early organizing effects on these skills.
Obviously, the size of the sex difference would also be expected to vary with time of day or season of the year, with sex differences smaller in the highT phases. What little data we have suggests that this might be so. In the one study comparing time of day, scores on a mental rotation task did not differ between the sexes early in the day, but was significant later in the day [76]. These effects were not accounted for by changes in cortisol levels. In our data on seasonal changes on a spatial composite score, although the sex difference was significant in autumn, it was smaller than in spring [52]. Were seasonal and diurnal changes in men, combined with menstrual changes in women, studied systematically, there might be other extreme situations where sex differences are diminished or even disappear. However, this would not be expected to happen on a random basis.
Women on hormone replacement therapy usually have been taking it for prolonged periods, so it is not clear whether any beneficial effects are due to the immediate or long-term influence. In one study comparing medicated women during on-therapy and off-therapy phases, it was found that women performed better in the on-therapy phase on tests of manual sequencing and perceptual speed (both of which normally favour women), but not on two spatial tasks [53]. Yet the same women showed a more generalized cognitive advantage when compared with women not on therapy [50], suggesting that the immediate and long-term effects of estrogen therapy may be somewhat different.
Estrogen therapy, combined with anti-androgen therapy, is common in cases of transsexuals seeking sex re-assignment from male to female. This research is in its early stages, and further controls are needed, but there is some indication that male-favouring spatial ability may not be impaired by the treatment combination [75, 96], suggesting that in men such ability may be largely determined early in life. Neither were female-favouring tasks such as verbal fl uency or perceptual speed enhanced. However, Miles et al. [75] report that scores on a verbal memory task (Paired Associate Learning) were better in the hormone-treated group than in a similar transsexual group who had not yet undergone hormone therapy.
In contrast, administration of T to a younger group of men, perhaps raising them to above-optimum levels, depressed scores on Block Design relative to a placebo group, at least in the initial weeks of treatment [81]. Verbal fluency and manual dexterity, however, usually better in women, were both improved. The interpretation of another study showing apparent improvement on a mental rotation task after T administration [1] is complicated by the fact that the T levels of the treated group appear lower initially than the comparison group. Since different studies use different assay methods and cite different measures, the elusive "optimal level" is usually difficult to determine.
Androgen treatment is also given to female-to-male transsexuals. Investigations into cognitive function so far have not used control groups who did not undergo therapy. Some studies [96, 101] report that FM transsexuals showed improved scores on male-favouring mental rotation tasks, but female-favouring verbal fluency in one study was unchanged, and in the other got worse.
It would of course be of interest to demonstrate even short-lived cognitive effects of T administration in genetic females, since this might indicate an extension of the role of androgens in cognition into adulthood. However, apart from the lack of control subjects, we should also be cautious about the generalizability of such findings to the non-transgendered population. One study reported that the level of androgens was higher in FM transsexuals than in control females [8], before any hormone treatment. It is reasonable to assume that individuals of either sex who wish to have their sex re-assigned have a somewhat different brain from those who do not. Some recent anatomical evidence to this effect also exists [56]. Thus, FM individuals may have brains more susceptible to androgenic influences. Nevertheless, the transsexual population is potentially a rich source of information on the effects of sex hormones on cognition in adults.There is substantial evidence in humans that androgens present before birth influence human cognitive abilities into adulthood, at least for certain spatial abilities of the kind at which men excel. Prenatal androgens may also depress certain functions on which women typically excel, but this is not well established. Across young adults, current levels of both androgens and estrogens are systematically related to cognitive pattern, apparently due in part to a stable individual baseline of these hormones. As well, fluctuations in sex hormones across daily, monthly and yearly cycles are accompanied by changes in several cognitive tests, especially those that are sexually differentiated suggesting a continuing sensitivity of the nervous system to hormonal changes in the adult. The administration of exogenous hormones, usually for therapeutic purposes, at present provides modest support for such a proposition, but this is still a rich field of information to be tapped.
For references please consult the original paper.
This research was supported by a grant to the first author from the Natural Sciences and Engineering Council.