Sex-Specific Effects of Psilocybin Versus Escitalopram on Anxiety and Anhedonia: A Bayesian Reanalysis of Antidepressant Treatment Outcomes
This reanalysis of a six-week double-blind randomised controlled trial compared psilocybin with escitalopram in adults with moderate-to-severe depression and found sex-specific patterns in response. Women given psilocybin had greater reductions in anxiety, while women given escitalopram had greater reductions in anhedonia; other differences were small and uncertain.
Authors
- David Erritzoe
- Robin Carhart-Harris
- David Nutt
Published
Abstract
Rationale
Major depressive disorder (MDD) shows marked sex differences in prevalence, symptomatology, and treatment response. However, women remain underrepresented in many clinical trials, and sex-specific treatment outcomes are rarely examined.
Objectives
This study reanalyzed data from a randomized controlled trial comparing psilocybin and escitalopram for MDD to evaluate sex differences across multiple psychological domains.
Methods
We reanalyzed data from a six-week, double-blind randomized controlled trial comparing psilocybin with escitalopram in adults with moderate-to-severe MDD. Post-treatment depressive symptoms (MADRS, QIDS-SR-16, BDI), anhedonia (SHAPS), anxiety (STAI), thought suppression (WBSI), and well-being (WEMWBS) were modeled as a function of sex, treatment condition, their interaction, and baseline symptom severity. Sexual dysfunction severity (PRSexDQ-SALSEX), assessed only at the six-week follow-up, was analyzed separately as an ordinal outcome.
Results
Sex-related patterns emerged for anxiety and anhedonia. Women receiving psilocybin showed greater reductions in anxiety than men (STAI: 95% CrI − 17.5 to − 3.29), whereas women receiving escitalopram showed greater reductions in anhedonia than men (SHAPS: 95% CrI − 4.63 to 0.00). For the remaining continuous outcomes, sex differences were generally small and uncertain. Sexual dysfunction severity was lower overall in the psilocybin group than in the escitalopram group and lower in women than in men, although the treatment-by-sex interaction was not significant.
Conclusions
This reanalysis identified domain-specific sex-related patterns in anxiety and anhedonia, suggesting that responses to psilocybin and escitalopram may differ between women and men. These preliminary findings support adequately powered, sex-balanced, and hormone-informed trials of serotonergic treatments for MDD.
Research Summary of 'Sex-Specific Effects of Psilocybin Versus Escitalopram on Anxiety and Anhedonia: A Bayesian Reanalysis of Antidepressant Treatment Outcomes'
βBlossom's Take
Introduction
Major depressive disorder is described as a common and disabling condition with substantial sex differences in prevalence, symptom expression, and clinical course. The authors note that women are often underrepresented in antidepressant and psychedelic trials, and that earlier studies have rarely examined whether treatment responses differ by sex. They also argue that this matters because psilocybin and escitalopram act on the serotonin system through different mechanisms, and ovarian hormones may influence serotonergic signalling, stress responsivity, and antidepressant effects. Frick and colleagues therefore set out to reanalyse data from a randomised controlled trial comparing psilocybin with escitalopram in people with moderate-to-severe major depressive disorder, focusing on whether sex-by-treatment interactions were present across several psychological domains. Their aim was to test whether men and women differed in post-treatment outcomes for depression, anxiety, anhedonia, thought suppression, well-being, and sexual dysfunction, and whether any differences appeared clinically meaningful. They frame the work as a retrospective Bayesian reanalysis intended to generate more nuanced and conservative evidence from a small existing trial.
Methods
The paper is a retrospective Bayesian reanalysis of a six-week, double-blind randomised controlled trial in adults with moderate-to-severe major depressive disorder. Fifty-nine participants were included, of whom 20 were female; 30 were allocated to psilocybin and 29 to escitalopram. The authors state that participants met a depression severity threshold at entry and that all provided written informed consent for the original trial. Because this was a secondary analysis of previously collected data, separate consent for the reanalysis was not required. The original intervention involved two dosing visits spaced three weeks apart. Participants in the psilocybin group received 25 mg psilocybin, while those in the escitalopram group received 1 mg psilocybin as an active low-dose control intended to preserve expectancy and procedural similarity. Between dosing sessions, participants took daily oral capsules, with the number increased from one to two after three weeks. Trained clinicians provided psychological support before, during, and after dosing, including integration therapy at intermediate visits, and optional follow-up support was available after the final visit. Outcomes were assessed at baseline and after six weeks using validated scales for depressive symptoms (MADRS, QIDS-SR-16, BDI), anxiety (STAI), anhedonia (SHAPS), thought suppression (WBSI), and well-being (WEMWBS). Sexual dysfunction was measured with PRSexDQ-SALSEX only at the six-week follow-up, so it was analysed separately as an ordinal outcome. The main continuous outcomes were modelled as post-treatment scores adjusted for baseline severity. For the seven continuous measures, the authors fitted Bayesian linear regression models with treatment condition, sex, and their interaction as predictors, along with baseline score as a covariate. They used sceptical priors that shrink estimates towards zero, and repeated the analyses with wider priors as a sensitivity analysis. Evidence was summarised with posterior means, 95% credible intervals, and Bayes factors for whether effects were non-zero, clinically meaningful, or negligible. The sexual dysfunction outcome was analysed using cumulative link ordinal regression, followed by within-arm Mann-Whitney U tests. Missing data were handled by complete case analysis, and participants were analysed according to their randomised assignment.
Results
Across the depression scales, sex-related effects were small and uncertain. For MADRS, BDI, and QIDS-SR-16, the posterior estimates for female-versus-male differences within each treatment arm, and for the sex-by-treatment interactions, all had wide credible intervals that included zero. Bayes factors were generally weak or inconclusive, indicating no clear evidence for meaningful sex differences or for strict equivalence. The QIDS-SR-16 showed the most attenuated pattern of the three depression measures. The clearest finding was for anxiety. On the STAI, women in the psilocybin group showed substantially lower post-treatment anxiety than men, with a posterior mean female-male contrast of -10.5 and a 95% credible interval from -17.5 to -3.29. The corresponding contrast in the escitalopram group was smaller and uncertain, and the sex-by-treatment interaction also trended in the same direction. The Bayes factor suggested strong evidence that the female advantage under psilocybin exceeded a minimally clinically important difference. Anhedonia also showed a possible sex-related pattern, though less strongly than anxiety. On the SHAPS, women in the escitalopram group tended to have lower post-treatment anhedonia than men, with a posterior mean contrast of -2.36 and an upper credible limit touching zero. Under psilocybin, the sex difference was smaller and more uncertain. The interaction estimate suggested some difference between treatment arms in the size of the female-male contrast, but the evidence remained imprecise and did not establish a robust interaction. Thought suppression on the WBSI showed no meaningful sex-specific pattern. Female-male contrasts were small, credible intervals were wide, and Bayes factors were close to 1, indicating that the data were compatible with both small effects and practical equivalence. Well-being on the WEMWBS was numerically slightly higher in women across both treatment groups, but again the estimates were imprecise and the evidence was modest rather than definitive. The sensitivity analyses using wider priors produced very similar posterior estimates and did not change the qualitative interpretation. Depression, well-being, and thought suppression remained inconclusive, while the psilocybin-related female advantage in anxiety and the possible escitalopram-related female advantage in anhedonia persisted. For sexual dysfunction at six weeks, the descriptive distribution suggested lower severity in women than in men in both treatment arms, and lower severity overall in the psilocybin group than in the escitalopram group. In the escitalopram arm, 89% of women were classified as having no sexual dysfunction compared with 25% of men; in the psilocybin arm, 91% of women and 68% of men were classified as having no sexual dysfunction. Ordinal regression showed lower odds of more severe sexual dysfunction with psilocybin than with escitalopram and lower odds in women than in men, but the treatment-by-sex interaction was not significant. Within-arm testing found a significant sex difference in the escitalopram group, with men reporting greater sexual dysfunction than women, whereas no significant sex difference was detected in the psilocybin group.
Discussion
The authors interpret the findings as evidence that sex-related variation in response to psilocybin and escitalopram may be domain-specific rather than broad and uniform across depressive symptoms. They emphasise two main patterns: women in the psilocybin arm had greater reductions in anxiety, and women in the escitalopram arm had greater reductions in anhedonia. By contrast, sex differences in overall depression severity, thought suppression, and well-being were small or uncertain. They also note that sexual dysfunction severity was lower in women than in men and lower overall in the psilocybin group, although this did not amount to a clear sex-by-treatment interaction. In discussing possible mechanisms, the authors suggest that ovarian hormones may help explain the anxiety finding with psilocybin, because estradiol and progesterone influence fear and stress circuitry, synaptic plasticity, and anxiety regulation. They also connect the escitalopram-related anhedonia finding to previous work suggesting that women may respond more favourably to SSRIs, potentially because of sex differences in serotonergic metabolism, transporter binding, and neurosteroid pathways such as allopregnanolone. These explanations are presented as speculative because the dataset contained no direct hormonal measures. The authors place their findings in the context of earlier research showing sex differences in depression prevalence, symptom profiles, and antidepressant response. They argue that the present results are consistent with the idea that psilocybin and escitalopram may differ in how they interact with sex-related biology, but that this is more apparent in specific symptom domains than in global depression scores. They also note that the QIDS-SR-16, which was the primary outcome in the original trial, showed little evidence of sex-related effects, and they suggest this may reflect the conservativeness of that measure in this small dataset. Several limitations are highlighted. The study was a secondary, retrospective analysis of a trial that was not designed or powered to test sex-by-treatment interactions, and the sample was small with relatively few women. The authors also lacked information on hormonal contraceptive use, menstrual cycle phase, premenstrual symptom worsening, and menopausal status, all of which could have influenced outcomes. They further note that the analysis relied only on questionnaire data and did not include biological or neurophysiological markers, so mechanistic explanations remain untested. As a result, they describe the findings as hypothesis-generating rather than confirmatory. For future work, the authors recommend larger, sex-balanced trials that incorporate hormone-related information and recruit women across different reproductive stages. They suggest that cycle-aware or hormone-informed designs could improve interpretation of psychedelic and SSRI treatment effects, reduce unexplained variability, and help distinguish biological sex effects from hormone-context effects. They also argue that mechanistic studies combining clinical outcomes with hormonal, neuroimaging, or other biological measures will be needed to clarify how serotonergic and stress-related pathways shape differential treatment response.
View full paper sections
HORMONAL MODULATION OF SEROTONERGIC SYSTEMS
Sex hormones exert profound effects on mood regulation, stress responsivity, and antidepressant responseEstradiol has been shown to exert neuroprotective and mood-stabilizing effects through modulation of serotonergic transmission by reducing monoamine oxidase (MAO) activity, increasing glutamatergic signaling and synaptic plasticity within the hippocampus and prefrontal cortex, and upregulating 5-HT₂A receptor expression and brain-derived neurotrophic factor (BDNF). This is particularly relevant in the context of psychedelic treatment, given the central role of 5-HT₂A receptor signaling, neuroplasticity, and large-scale network reorganization in proposed mechanisms of psilocybin action. In addition, progesterone and its neuroactive metabolite allopregnanolone act on GABA-A receptors to modulate stress sensitivity, emotional lability, and inhibitory regulation of anxiety-related states. Through these effects, ovarian hormones may shape both the neurobiological and affective context in which serotonergic treatments are experienced. Fluctuations in estrogen and progesterone across the menstrual cycle are associated with predictable changes in mood, emotional regulation, and stress sensitivity. In particular, periods of rapid hormonal change, rather than absolute hormone levels alone, may be especially relevant for affective vulnerability. Furthermore, fluctuations in these hormones across the menstrual cycle, peripartum, and perimenopausal phases are closely linked with serotonergic sensitivity and mood vulnerability, as seen in conditions such as premenstrual dysphoric disorder (PMDD), peripartum depression, and perimenopausal affective disturbances. Relatedly, premenstrual exacerbation (PME) in major depressive disorder highlights that cyclical hormone-related changes may not only trigger distinct reproductive mood disorders but may also worsen symptoms of ongoing psychiatric conditions. This is particularly relevant in MDD, where anxiety, anhedonia, and affective instability may vary across the cycle and potentially influence both baseline symptom severity and apparent treatment response. In contrast, androgens such as testosterone appear also to influence affective regulation through dopaminergic and GABAergic mechanisms. Although testosterone is present in both sexes, it is the primary sex hormone in males and typically varies more across developmental stages and diurnal rhythms than across a monthly cycle. This distinction is relevant because it suggests that sex-related variability in affective symptoms may arise from different temporal endocrine patterns in males and females. Psilocybin and SSRIs appear to engage partly different serotonergic and plasticity-related mechanisms, which may be differentially modulated by ovarian hormones. A better understanding of these interactions and their pharmacological targets may support the development of more sex-sensitive and mechanism-based treatments for depression. More broadly, sex-related neuromodulatory processes may contribute to differences in stress vulnerability and antidepressant response. However, in clinical trials there is a widespread requirement for effective contraceptives for women of childbearing age, most commonly achieved through hormonal contraceptives, which may suppress endogenous ovarian hormone fluctuations. Recent PET evidence indicates that women using combined oral contraceptives show reduced cortical and subcortical 5-HT₂A/2C receptor binding compared with naturally cycling women, suggesting that suppressed estradiol levels may lower serotonergic receptor availability and alter sensitivity to 5-HT₂Amediated treatments such as psilocybin. Despite increasing therapeutic interest, women remain underrepresented in psilocybin trials (≈ 25-35%)To address this gap, we conducted a retrospective Bayesian analysis of data from a phase 2 double-blind randomized controlled trial (N = 59; 20 females; NCT03429075) comparing psilocybin and escitalopram for moderate-to-severe MDD (Carhart-Harris et al., 2021). The original trial found that both treatments reduced depressive symptoms, with no significant difference on the primary outcome (QIDS-SR-16), although several secondary measures favored psilocybin. Importantly, no sex-specific analyses were conducted.
PRESENT STUDY
In the present study, we evaluated sex-by-treatment interactions across multiple clinical domains using eight validated psychological measures. We test the hypothesis that males and females may differ in post-treatment outcomes and that these differences may vary across treatment conditions. To examine these questions, we implemented Bayesian linear models with sceptical priors, which shrink estimates toward zero and provide a conservative inferential framework, and conducted sensitivity analyses using wider priors to assess robustness. This approach was intended to provide a more nuanced assessment of possible sex-specific treatment effects while accounting for the uncertainty inherent in a relatively small sample. In addition, sexual dysfunction severity at the 6-week follow-up was examined separately as an ordinal outcome.
STUDY DESIGN
Detailed study procedures have been described in a previous publication. In brief, adults with moderate-to-severe depression (HAM-D ≥ 17) were randomly assigned to the psilocybin (n = 30) or escitalopram (n = 29) group and completed a total of six study visits over a six-week period. All participants gave written informed consent prior to participation. The baseline visit (V1) included a preparatory psychotherapy session. On the first and second dosing days (V2 and V4 spaced three weeks apart), individuals in the psilocybin group received 25 mg of psilocybin, while those in the escitalopram group were given 1 mg of psilocybin. This is a sub-threshold dose intended to maintain expectancy consistency and align procedural elements across groups. Between the two dosing sessions, participants were instructed to take daily oral capsules. Initially, one capsule per day was administered. This was increased to two capsules daily starting at the three-week mark. Therapeutic support was provided by trained clinicians before, during, and after each dosing session, including integration therapy during V3 and V5. Optional follow-up support was also available after the final visit (V6), which marked the end of the primary six-week study window. All participants consented to take part in the study; however, consent was not required for this reanalysis of previously collected data.
PARTICIPANT
A total of 59 participants (20 females) were included in the trial, with 30 allocated to the psilocybin group and 29 to the escitalopram group. Prior to enrollment, 39% of participants (n = 23) were taking psychiatric medications, all of which were discontinued before the start of the intervention. In addition, four participants (7%) withdrew from ongoing psychotherapy in accordance with predefined exclusion criteria (Carhart-Harris et al. 2021). Within the escitalopram condition, four individuals discontinued treatment prematurely due to adverse effects attributed to the medication. In the psilocybin group, one participant reported continued cannabis use throughout the study period, and two others were unable to attend the second dosing session as a result of COVID-19-related restrictions. The average participant age was 41 years.
MEASURES AND OUTCOMES
Data were collected at baseline and post-treatment (post 6 weeks) using a validated battery assessing depressive symptom severity, anxiety, anhedonia, well-being, thought suppression, and sexual dysfunction severity. Depressive symptoms were measured with the Montgomery-Åsberg Depression Rating Scale (MADRS,, Quick Inventory of Depressive Symptomatology (QIDS-SR-, and Beck Depression Inventory (BDI,. Anxiety was assessed with the State-Trait Anxiety Inventory (STAI,, anhedonia with the Snaith-Hamilton Pleasure Scale (SHAPS,, well-being with the Warwick-Edinburgh Mental Well-being Scale (WEMWBS,, thought suppression with the White Bear Suppression Inventory (WBSI,) and sexual dysfunctioning with the Psychotropic-Related Sexual Dysfunction Questionnaire (PRSexDQ-SALEX,. For the MADRS, QIDS-SR-16, BDI, SHAPS, STAI, WEMWBS, and WBSI, baseline and 6-week post-treatment scores were available, and post-treatment outcomes were modeled with adjustment for baseline symptom severity. Because the PRSexDQ-SALSEX was collected only at the 6-week follow-up, it was analyzed as an endpoint measure of sexual dysfunction severity. Higher scores indicated greater symptom severity on the MADRS, QIDS-SR-16, BDI, SHAPS, STAI, and WBSI, greater well-being on the WEMWBS, and greater sexual dysfunction on the PRSexDQ-SALSEX
STATISTICAL ANALYSIS
Analysis was conducted on data collected at two timepoints, baseline and 6 weeks post-treatment, for all continuous clinical measures. Participants were analyzed according to their randomized treatment assignment. Initial data preprocessing was conducted using Microsoft Excel (Office 16 for macOS), and all statistical modeling and visualization procedures were implemented in RStudio using the brms and rstanarm packages. Missing data were handled using complete case analysis. Participants were included in the analysis only if they had complete data for all variables of interest: baseline scores, post-treatment scores, treatment condition, and sex. Sexual dysfunction severity was analyzed separately as an endpoint outcome.
DESCRIPTIVE AND VARIANCE ANALYSIS
Descriptive statistics were computed to characterize the sample, including age distributions to provide contextual information on the possible proportion of female participants in the menopausal transition. For the seven continuous self-reported and clinician-rated questionnaires (MADRS, SHAPS, STAI, QIDS-SR-16, BDI, WEMWBS, and WBSI), group-level means, standard deviations, and change scores were calculated and stratified by treatment condition (psilocybin vs. escitalopram) and sex (male vs. female). To assess the assumption of equal variance across sex, F-tests were used to compare post-treatment score variances between males and females within each treatment arm. For the PRSexDQ-SALSEX, descriptive distributions across severity categories (none, mild, moderate, and severe) were summarized within each treatment-by-sex subgroup.
BAYESIAN MODELING FRAMEWORK
Sex differences in treatment outcomes were analyzed using Bayesian linear regression models applied separately to each of the seven continuous clinical scales (MADRS, SHAPS, STAI, QIDS, BDI, WEMWBS, and WBSI). The dependent variable was the post-treatment score, and baseline scores were included as covariates. The model structure was defined as Condition was coded as 0 for escitalopram and 1 for psilocybin, and sex was coded as 0 for males and 1 for females. Three main contrasts were evaluated for each outcome: (1) female minus male differences within the escitalopram group, (2) female minus male differences within the psilocybin group, and (3) sex by treatment interaction. Posterior draws from each model were transformed into these three contrasts and used for subsequent Bayesian hypothesis testing.
INTERPRETATION AND EVIDENCE THRESHOLDS
To quantify evidence for sex differences, we computed two-sided Bayes factors for each of the three contrasts. These Bayes factors were evaluated: (1) whether a given contrast differed from zero (BF any ), (2) whether it exceeded a minimally clinically important difference (BF MCID ), and (3) whether it lay within a predefined non-inferiority margin indicating a small or negligible effect (BF NI ). Bayes factors were calculated by comparing the ratio of posterior to prior odds that each contrast lay inside the relevant region of interest. Values greater than 3 were interpreted as moderate-to-strong evidence for an effect, between 0.33 and 3 as weak or inconclusive, and below 0.33 as moderate evidence favoring the null. Where MCIDs were not available for a given instrument, we applied standardized effect size benchmarks: medium effects as 0.5 times the baseline standard deviation (SD). This approach allowed for interpretation grounded in clinical relevance while addressing the limitations of small-sample Bayesian estimation (Mouheli et al., 2020). Sceptical priors were applied to promote conservative inference, with Normal(0, 7.5) distributions assigned to condition, sex, and interaction coefficients; Normal(0, 5) to the baseline covariate; and Normal(0, 10) to the intercept. Posterior estimates were summarized using posterior means and 95% credible intervals (CrIs), providing probabilistic estimates of sex differences and their interaction with treatment condition.
SENSITIVITY ANALYSES
To examine the robustness of our findings, all seven continuous outcome models were re-estimated with wider priors (Normal(0, 15)) for the regression coefficients. Comparisons of posterior distributions and BFs under these relaxed prior assumptions were used to assess the consistency of inference. PRSexDQ-SALSEX sexual dysfunction severity was analyzed as an ordinal outcome using cumulative link ordinal regression with treatment condition, sex, and their interaction as predictors. Follow-up Mann-Whitney U tests compared males and females separately within the escitalopram and psilocybin groups.
DESCRIPTIVE AND VARIANCE STATISTICS
Information on the use of hormonal contraceptives was not available. The median age of female participants was 38 years (SD = 9.17; range = 22-60), and the median age of male participants was 42.8 years (SD = 11.46; range = 21-64). Descriptive statistics for each sex-by-treatment subgroup across continuous outcome measures indicated that female participants tended to show greater symptom reductions than male participants within each treatment condition (Supplementary Table). To assess equality of score variability between sexes, Levene's F-tests were conducted for baseline, posttreatment, and change scores (Supplementary Table). Levene's F-tests indicated broadly comparable variability of post-treatment scores between men and women within each treatment arm. The only clear exception was SHAPS post-treatment scores under escitalopram, where males showed greater dispersion than females (F = 4.15, p = .045). For WEMWBS, variance in change scores under escitalopram also differed by sex (F = 0.31, p = .048), reflecting more heterogeneous improvements among women. All other variance comparisons were non-significant (all p > .10). Welch's t-tests of mean differences within treatment arms identified two statistically significant findings. First, post-treatment anhedonia scores on the SHAPS were lower in women than in men in the escitalopram group (t = 2.61, p = .015), consistent with greater reduction in anhedonia among women. Second, change scores on the STAI in the psilocybin arm indicated larger anxiety reductions in women than in men (t = 2.64, p = .018). No other questionnaire showed statistically significant sex differences in mean baseline, posttreatment, or change scores within either treatment arm (Supplementary Table).
POSTERIOR ESTIMATES AND BAYES FACTORS
Across the seven psychological domains, the strongest sex-related patterns emerged for anxiety and anhedonia, whereas depression severity, thought suppression, and well-being showed smaller and more uncertain effects. Bayesian results for depression outcomes are presented in Table, followed by anxiety, anhedonia, thought suppression, and well-being in Tables. Sensitivity analyses are provided in the Supplementary Materials.
DEPRESSION SEVERITY (MADRS, QIDS, BDI)
Across all three depression scales, posterior estimates of sex differences were small with wide CrIs that included zero. For MADRS, the female-male contrast in the escitalopram group had a posterior mean of 1.12 (95% CrI - 4.37 to 7.03), while under psilocybin, the mean contrast was - 2.57 (CrI - 8.63 to 3.65). The interaction contrast (difference of differences) was - 3.70 (CrI - 11.1 to 3.81). Bayes factors for any non-zero effect were close to or below 1 (BF_any = 1.79, 0.25, and 0.20 for escitalopram, psilocybin, and the interaction, respectively), providing at most weak evidence and favoring neither substantial sex differences nor strong equivalence. For the BDI, in the escitalopram group, the female-male contrast had a posterior mean of - 1.58 (CrI - 7.82 to 4.78); in the psilocybin group, it was - 5.46 (CrI - 12.1 to 1.09). The interaction mean was - 3.88 (CrI -12.0 to 4.31). All CrIs comfortably overlapped zero, and BFs again indicated weak evidence (BF_any = 0.43, 0.05, and 0.21, respectively), suggesting that the data did not clearly support either meaningful sex differences or strict non-inferiority. For QIDS, posterior contrasts were closest to zero. The mean female-male differences were - 0.55 (CrI -4.65 to 3.47) for escitalopram and - 2.04 (CrI - 6.15 to 1.90) for psilocybin, with an interaction mean of -1.49 (CrI - 6.83 to 3.82). All associated Bayes factors fell within the inconclusive range (BF_any between 0.17 and 0.65). Across the depression scales, all credible intervals included zero, and corresponding Bayes factors remained in the weak or inconclusive range. Notes: BF_any = evidence for any nonzero effect; BF_MCID = effect ≥ minimally clinically important difference. BF_NI = non-inferiority. Posterior values represent the mean estimate with the associated 95% credible interval (CrI). BFs < 1 favor the null; BFs > 1 favor the alternative. Interpretive thresholds: BF < 0.33 = moderate evidence for the null, BF 0.33-3 = weak/inconclusive evidence, and BF > 3 = moderate-to-strong evidence for an effect.
ANXIETY (STAI)
The STAI revealed a pronounced sex difference within the psilocybin group. The posterior mean femalemale contrast under psilocybin was - 10.5, with a 95% CrI from - 17.5 to - 3.29, indicating that women had substantially lower post-treatment anxiety than men after adjusting for baseline scores. The corresponding contrast under escitalopram was smaller and more uncertain (mean - 2.54, CrI - 9.33 to 4.07). The sex × treatment interaction contrast was also negative (mean - 7.97, CrI - 16.3 to 0.45), suggesting a tendency for the female advantage to be more pronounced under psilocybin than escitalopram, although the CrI narrowly included zero. Bayes factors provided strong evidence that the psilocybin female-male difference exceeded a minimally clinically important difference (MCID; BF_MCID = 13.0). Corresponding Bayes factors for escitalopram and for the interaction contrast were smaller and remained below conventional thresholds for strong evidence. Notes: BF_any = evidence for any nonzero effect; BF_MCID = effect ≥ minimally clinically important difference. BF_NI = non-inferiority. Posterior values represent the mean estimate with the associated 95% credible interval (CrI). BFs < 1 favor the null; BFs > 1 favor the alternative. Interpretative thresholds: BF < 0.33 = moderate evidence for the null, BF 0.33-3 = weak/inconclusive evidence, BF > 3 = moderateto-strong evidence for an effect.
ANHEDONIA (SHAPS)
For SHAPS, both drugs reduced anhedonia in men and women, but posterior contrasts suggested a possible female advantage under escitalopram. The female-male contrast in the escitalopram group had a posterior mean of - 2.36 (CrI - 4.63 to 0.00), indicating lower anhedonia scores among women, with the upper bound of the CrI just touching zero. Under psilocybin, the mean contrast was - 1.03 (CrI - 3.25 to 1.16), reflecting a smaller and more uncertain difference. The interaction contrast, representing differential female benefit between conditions, had a mean of 1.34 (CrI - 1.87 to 4.49). Bayes factors for escitalopram on SHAPS supported the presence of some sex difference (BF_any ≈ 0.03 when framed against a null centered at zero and the observed negative effect), although evidence that this exceeded the MCID was modest, and non-inferiority BFs were close to 1. For psilocybin, BFs favored neither strong effects nor strict equivalence. The interaction contrast yielded a BF_any of about 4, suggesting moderate evidence for some difference between treatment arms in the size of the female-male contrast, but CrIs remained wide. Taken together, SHAPS findings suggest a tendency for greater reduction in anhedonia in women, particularly under escitalopram, but do not establish a robust treatment-by-sex interaction.
THOUGHT SUPPRESSION (WBSI)
For thought suppression, female-male contrasts were small and highly uncertain. In the escitalopram group, the posterior mean contrast was - 0.76 (CrI - 7.38 to 5.84); in the psilocybin group it was - 2.52 (CrI - 9.51 to 4.49). The interaction mean was - 1.76 (CrI - 10.4 to 6.71). All Bayes factors were close to 1 (BF_any between 0.31 and 0.70), indicating that the data were consistent with both small effects and practical equivalence. Thus, WBSI provided no evidence for sex-specific treatment differences. Notes: BF_any = evidence for any nonzero effect; BF_MCID = effect ≥ minimally clinically important difference. BF_NI = non-inferiority. Posterior values represent the mean estimate with the associated 95% credible interval (CrI). BFs < 1 favor the null; BFs > 1 favor the alternative. Interpretative thresholds: BF < 0.33 = moderate evidence for the null, BF 0.33-3 = weak/inconclusive evidence, BF > 3 = moderateto-strong evidence for an effect
WELL-BEING (WEMWBS)
Posterior estimates for WEMWBS indicated slightly higher post-treatment well-being in women than in men across both treatment arms. In the escitalopram group, the mean female-male contrast was 1.81 (CrI -4.62 to 8.37); in the psilocybin group, it was 2.50 (CrI - 4.25 to 9.38). The interaction contrast was close to zero (mean 0.69, CrI - 7.59 to 9.04). Bayes factors for any effect were modest (BF_any ≈ 2.4-3.2) and did not reach thresholds typically interpreted as strong evidence. Thus, while women tended to report numerically higher well-being, uncertainty remained substantial, and no clear sex-specific treatment effect was demonstrated.
SENSITIVITY ANALYSES
All analyses were repeated using wider prior distributions to assess the robustness of Bayes factors and posterior intervals to prior specification. Posterior means and CrIs for all contrasts were very similar to those obtained with the main sceptical prior, and the direction of effects was unchanged across all questionnaires. Bayes factors varied slightly in magnitude but did not alter the qualitative interpretation for any outcome: Depression (MADRS, BDI, QIDS), well-being (WEMWBS), and thought suppression (WBSI) measures remained inconclusive with respect to sex differences, whereas STAI continued to show a pronounced female advantage under psilocybin, and SHAPS suggested a possible, but uncertain, female benefit under escitalopram. Overall, the results appear robust to reasonable changes in prior specification but remain constrained by the limited sample size and associated uncertainty (Supplementary Tableand Fig.).
SEXUAL DYSFUNCTION SEVERITY (PRSEXDQ-SALSEX)
Descriptive distributions of PRSexDQ-SALSEX severity at the 6-week follow-up suggested lower sexual dysfunction severity in women than in men across both treatment conditions and a more favorable overall distribution in the psilocybin group than in the escitalopram group (Fig.). In the escitalopram group, 89% of women were classified as having no sexual dysfunction, compared with 25% of men; among men receiving escitalopram, the remaining cases were distributed across mild (10%), moderate (25%), and severe (40%) dysfunction. In the psilocybin group, 91% of women and 68% of men were classified as having no sexual dysfunction. Ordinal regression supported these descriptive patterns (Table). Participants receiving psilocybin had lower odds of being classified in a more severe sexual dysfunction category than those receiving escitalopram (p = .013), and women had lower odds of greater dysfunction severity than men (p = .008). The treatment-by-sex interaction was not significant (p = .411), indicating no clear evidence that the sex difference differed between treatment conditions. Follow-up within-arm comparisons showed that males had significantly greater sexual dysfunction severity than females in the escitalopram arm (p = 0.0026), whereas no significant sex difference was observed in the psilocybin arm (p = 0.1306). Notes: Estimates are from a cumulative link ordinal logistic regression model of PRSexDQ-SALSEX severity at a 6-week follow-up. Severity categories were ordered from none to severe, with higher categories reflecting greater sexual dysfunction. Escitalopram and male served as the reference categories. Negative coefficients indicate lower odds of greater dysfunction severity. The treatment × sex interaction tests whether the female-male difference differed between treatment conditions.
DISCUSSION
This study presents a retrospective reanalysis of sex differences in treatment response to psilocybin and escitalopram, using a well-characterized dataset from a recent clinical trial in participants with MDD. Eight validated psychological and psychiatric scales were examined, including measures of depression, anxiety, anhedonia, thought suppression, well-being, and sexual dysfunction severity. The primary aim was to determine whether men and women differed in their responses to these treatments and whether any differences reached clinically meaningful magnitudes. Across the examined domains, the clearest sex-related patterns emerged for anxiety and anhedonia. Women in the psilocybin arm showed greater reductions in anxiety than men, whereas women in the escitalopram arm showed greater reductions in anhedonia than men. For the remaining domains, credible intervals generally overlapped zero, and Bayes factors were weak or inconclusive, consistent with either small effects or insufficient precision. In addition, sexual dysfunction severity at follow-up was lower in women than in men and lower overall in the psilocybin group than in the escitalopram group, although the treatment-by-sex interaction was not significant. Taken together, these findings suggest that sex-related variation in treatment response may be more apparent in specific clinical domains than across broad depressive symptom severity alone. In the psilocybin group, females exhibited greater reductions in anxiety compared with males. One possible explanation involves the combined influence of estradiol and progesterone on fear and anxiety circuitry, particularly within the amygdala and hippocampus, where ovarian hormones shape synaptic plasticity, stress sensitivity, and inhibitory regulation. Progesterone withdrawal and altered sensitivity to changing progesterone levels have also been linked to heightened anxiety in some women, particularly when HPA-axis responsivity is elevated. Within this framework, psilocybin may interact with hormonally sensitive stress-regulation pathways in a way that preferentially facilitates anxiolytic effects in at least a subset of female participants. However, given the absence of direct hormonal measures in the present dataset, these mechanisms remain speculative. Preclinical evidence also suggests that stress may differentially modulate psilocybin's anxiolytic effects in males and females. Farinha-Ferreira et al.reported that psilocybin reduced anxiety-like behaviour in both sexes under non-stressed conditions. However, when animals were exposed to an additional acute stressor during the psychedelic phase, these anxiolytic effects were attenuated, particularly in males. In contrast, females retained more of the post-acute anxiolytic benefit and showed a less reactive corticosterone profile, suggesting that ovarian hormones may buffer stress-related interference with psilocybin's effects. By contrast, reductions in anhedonia were more pronounced in females receiving escitalopram than in males. This aligns with evidence that women often respond more favorably to SSRIs, potentially due to sex differences in serotonin transporter binding, pharmacokinetics, and serotonergic metabolism. One possible implication is that stronger serotonergic enhancement in women may translate more readily into improvements in reward processing and hedonic tone, particularly when baseline anhedonia is closely tied to serotonergic dysregulation. Moreover, progesterone and its metabolite allopregnanolone may further enhance SSRI efficacy in women through GABAergic mechanisms. SSRIs have been shown to increase allopregnanolone, which in turn enhances GABA-A-mediated inhibitory signaling and may reduce stress-related interference with reward responsiveness. Through this pathway, serotonergic and neurosteroid effects could jointly contribute to improved hedonic tone in women, whereas these hormonal influences may interact less directly with the mechanisms primarily targeted by psilocybin. A related pattern was observed for sexual dysfunction severity. Participants receiving psilocybin showed lower sexual dysfunction severity overall than those receiving escitalopram, and women showed lower severity than men across treatment conditions. Although the treatment-by-sex interaction was not significant, this finding is clinically relevant because sexual side effects are among the most common reasons for reduced tolerability and poorer adherence during SSRI treatment. In that sense, the sexual dysfunction results complement the domain-specific findings for anxiety and anhedonia by suggesting that sex-related differences may extend beyond efficacy outcomes to tolerability-related domains. Given the well-established burden of SSRIrelated sexual side effects on quality of life and treatment adherence, this aspect may be particularly important in future comparative trials of serotonergic interventions. At the same time, these findings should be interpreted cautiously, as sexual functioning is influenced by multiple biological and psychosocial factors, and the female subgroup likely spanned different reproductive stages, including possible perimenopausal or menopausal transition, which may independently affect sexual functioning. Taken together, the stronger female response to psilocybin in anxiety and to escitalopram in anhedonia suggests that distinct neurobiological mechanisms underlie these domain-specific outcomes. By contrast, sex differences across the broader depression measures were small and uncertain, with the QIDS showing the most attenuated pattern of the three depression scales. This is notable because in the original trial, the QIDS-SR-16 was the primary endpoint and did not significantly differentiate psilocybin from escitalopram, despite several secondary outcomes favoring psilocybin. Accordingly, the nearnull QIDS findings in the present reanalysis may reflect the relative conservativeness of this measure in this dataset rather than the absence of sex-related effects across all symptom domains. More broadly, the lack of robust sex differences on depressive symptom scales may also reflect limited statistical power, smaller domain-general effects, and unmeasured hormonal variables such as menstrual phase or contraceptive use that could be used in our analysis. These mechanistic patterns generate specific, testable hypotheses for future hormone-stratified psychedelic trials. Beyond these clinical findings, the results contribute to the growing recognition that sex hormones modulate antidepressant response (
LIMITATIONS
Several limitations should be considered when interpreting these findings. First, this was a secondary analysis of a trial that was not originally designed or powered to detect sex-bytreatment interactions. Although the Bayesian framework enabled a more graded assessment of uncertainty, the relatively small sample size and unequal sex distribution, particularly the low number of female participants, limited statistical precision. Further stratification by treatment arm reduced power still further, meaning that the observed sex-related differences should be interpreted cautiously and regarded as preliminary. Secondly, information on hormonal status was unavailable, including hormonal contraceptive use, menstrual cycle phase, premenstrual exacerbation of symptoms, and menopausal stage. This heterogeneity among female participants likely introduced variability in estradiol and progesterone exposure, both of which influence emotional regulation and treatment responsiveness. Ovarian hormones and their neuroactive metabolites modulate 5-HT₂A receptor signaling, glutamatergic plasticity, and GABAergic tone, thereby shaping subjective and clinical responses to serotonergic compounds. The use of combined oral contraceptives may blunt endogenous hormonal fluctuations and alter serotonergic receptor architecture, as described in a recent brain imaging study in which women using combined oral contraceptives reported lower cortical and subcortical 5-HT₂A receptor binding, suggesting a potential mechanism through which synthetic hormones could influence treatment sensitivity (Kaufmann et al., 2025). Consequently, the observed outcomes may reflect not only biological sex differences but also the hormonal context shaped by contraceptive use or perimenopausal transition. This limits the translatability of the findings of this and other clinical studies that do not account for hormonal phase or recruit women across hormonal life stages and with and without exogenous hormone exposure. In addition, the population had a wide age range of female participants, which raises the possibility that some may have been in the menopausal transition. Age-related endocrine changes may therefore have contributed to variability in treatment response, but this could not be formally evaluated in the present analysis. Finally, this retrospective analysis relied on clinical questionnaire data and did not include biological or neurophysiological measures that could directly test the mechanisms discussed in the interpretation of the findings. Accordingly, explanations involving ovarian hormones, serotonergic sensitivity, or stress-related neurobiology remain speculative. The present results should therefore be viewed as hypothesis-generating rather than confirmatory.
IMPLICATIONS FOR FUTURE RESEARCH
Future research should examine hormonal context as a potentially meaningful contributor to antidepressant and psychedelic treatment response. In the present study, the absence of information on menstrual cycle phase, hormonal contraceptive use, and menopausal status limited interpretation of the observed sex-related patterns. Future trials should therefore incorporate more systematic characterization of these factors as they may differentially influence serotonergic signaling, stress responsivity, and affective regulation. Such characterization is important not only for mechanistic understanding but also for improving interpretability of clinical outcomes. Hormone-related fluctuations in anxiety, anhedonia, and mood symptoms may affect both baseline severity and post-treatment assessment, thereby influencing apparent treatment response. Future studies should also examine whether hormonal context is associated not only with treatment efficacy but also with tolerability, acute psychological distress, and adverse event profiles, particularly in psychedelic interventions where transient increases in emotional intensity may form part of the treatment process. In this context, accounting for within-person hormonal variability may help reduce unexplained variance, improve signal detection, and strengthen inference, particularly in psychedelic trials that often rely on relatively small samples and a limited number of dosing sessions. Future studies should also aim to recruit women across a broader reproductive spectrum, including naturally cycling, hormonally contracepting, perimenopausal, and postmenopausal participants. Rather than collapsing female participants into a single category, stratified or phase-aware designs may offer a more informative approach to understanding when and for whom treatment effects differ. Where feasible, adaptive or cycle-informed trial designs could be used to evaluate whether the timing of psychedelic administration relative to hormonal state influences acute experience, post-acute symptom change, or durability of response. In addition, mechanistic work will be needed to clarify how ovarian hormones interact with the neurobiological systems implicated in psychedelic and SSRI response. This includes serotonergic, glutamatergic, and GABAergic pathways, as well as stress-related and network-level processes that may shape domain-specific outcomes such as anxiety and anhedonia. Integrating clinical outcomes with hormonal measures, neuroimaging, and other biological markers may be particularly valuable for distinguishing general sex effects from more specific hormone-sensitive processes. Taken together, future work should aim not only to include women more adequately in psychopharmacology and psychedelic trials but also to characterize female biological heterogeneity with greater precision. Such approaches will be important for determining whether the patterns observed here reflect true hormone-sensitive differences in treatment response and for advancing more biologically informed and equitable models of depression treatment.
Full Text PDF
Full Paper PDF
Create a free account to open full-text PDFs.
Study Details
- Study Typeindividual
- Populationhumans
- Characteristicsplacebo controlleddouble blindrandomizedre analysis
- Journal
- Compounds
- Topics
- Authors
- APA Citation
References (5)
Papers cited by this study that are also in Blossom
Carhart-Harris, R. L., Giribaldi, B., Watts, R. et al. · New England Journal of Medicine (2021)
Dos Santos, R. G., Hallak, J. E., Baker, G. et al. · Journal of Psychopharmacology (2021)
Gill, H., Puramat, P., Patel, P. et al. · Psychiatry Research (2022)
Reiff, C. M., Richman, E. E., Nemeroff, C. B. et al. · American Journal of Psychiatry (2020)
Roseman, L., Demetriou, L., Wall, M. B. et al. · Neuropharmacology (2018)