The proliferation of digital mental health tools over the past decade has generated a persistent and consequential debate: does the human element matter? Can a well-designed app, a structured set of modules, an intelligent chatbot produce meaningful reductions in depression without any involvement from a real person?
The evidence is now sufficient to answer this question with reasonable confidence. The answer is yes — but significantly less so than when a human is involved, and the difference is large enough to have major implications for how the field designs, funds, and scales digital mental health programs.
This post reviews what the research tells us about guided versus unguided digital self-help for depression, with particular focus on what the five RCTs of WHO Step-by-Step contribute to this question.
Defining the Terms
In the digital mental health literature, the distinction between guided and unguided self-help is well-established but sometimes applied inconsistently. For the purposes of this review:
Guided self-help (GSH) refers to programs that combine structured digital or bibliotherapy content with some form of human support contact — typically provided by a therapist, counselor, or trained lay supporter. The support is usually brief (15-30 minutes per week) and may be delivered by phone, video call, or asynchronous message. The supporter’s role is not to provide therapy per se, but to encourage engagement, answer questions, monitor progress, and facilitate referral if needed.
Unguided self-help (USH) refers to programs that provide structured content without any human contact. Participants work through the program entirely independently, using only the material itself as a guide.
This distinction matters because the two categories produce substantially different outcomes — and because many technology-focused discussions of digital mental health elide the difference, treating any evidence-based digital program as interchangeable regardless of whether it includes human support.
The Headline Finding
The most recent and comprehensive meta-analysis comparing guided and unguided digital self-help for depression found effect sizes of approximately 0.78 for guided programs and 0.30 for unguided programs — a difference of nearly 0.5 standard deviations.
To put that in clinical terms: guided self-help produces effects in the medium-to-large range, comparable to face-to-face cognitive behavioral therapy (effect size approximately 0.99). Unguided self-help produces effects in the small range, comparable to what one might observe from low-intensity interventions like psychoeducation or self-monitoring.
The difference between guided and unguided is larger than the difference between guided self-help and in-person therapy. This is a striking finding. It means that whether you include a human support component is a more important determinant of outcomes than whether you deliver the intervention digitally or in person.
For program designers and funders, this has a direct implication: investing in guided programs is substantially more evidence-backed than investing in unguided apps, regardless of how sophisticated, engaging, or well-designed the latter may be.
What the WHO Step-by-Step Trials Add
The five RCTs of WHO Step-by-Step contribute to this literature in several important ways.
First, they extend the guided self-help evidence base to low- and middle-income country contexts. The majority of prior GSH trials have been conducted in high-income settings with relatively well-resourced participants. The Step-by-Step trials — Lebanon, China, Pakistan, Egypt, South Africa — demonstrate that guided self-help produces comparable effects in resource-limited contexts with populations facing compound adversity.
Second, they use a specific variant of the guided support model — lay counselors providing 15-minute weekly calls — rather than therapist-delivered support. This is important because most prior GSH research used therapist or trainee-therapist contact, which is more expensive and harder to scale. Step-by-Step demonstrates that the human support component can be provided by trained community members without clinical qualifications and still produce the effect sizes associated with GSH in the broader literature.
Third, several of the trials included analyses of the dose-response relationship between counselor contact and outcomes. These analyses consistently showed that participants who completed more counselor calls had better outcomes — a within-study replication of the guided vs. unguided finding at the level of individual participant engagement patterns.
Why Does Human Support Make the Difference?
Several mechanisms likely explain the guided/unguided effect size gap, and understanding them has implications for program design.
Engagement and completion. The most straightforward explanation is differential completion. People with depression face significant motivational barriers to sustained self-directed activity. The low mood, anhedonia, and reduced self-efficacy that characterize depression are precisely the factors that make independent engagement with a self-help program difficult. A weekly check-in call — even a brief one — provides accountability, encouragement, and a social commitment that supports continued engagement.
Completion rates in unguided digital programs for depression are typically low — often below 10-20 percent for all recommended sessions. In guided programs, including Step-by-Step, completion rates are higher, though still not high by clinical standards (approximately 25 percent completed all five sessions across the Step-by-Step RCTs). But partial engagement still produces partial benefits, and guided programs generate more of both.
Therapeutic alliance. A substantial body of research in psychotherapy demonstrates that the quality of the therapeutic relationship — the alliance between therapist and client — is one of the strongest predictors of outcomes. Even brief, non-specialist support creates a form of relational engagement that may activate some of the alliance-related factors associated with better outcomes. Unguided digital programs, by definition, cannot create this.
Problem-solving support. When participants encounter difficulties with program content — a concept that doesn’t resonate, an activity that feels impossible given their circumstances — guided programs provide a mechanism for working through those difficulties. The lay counselor call is an opportunity to troubleshoot, adapt, and maintain momentum. Unguided programs have no equivalent mechanism; participants who hit a barrier simply stop.
Risk monitoring. From a safety perspective, guided programs provide a mechanism for detecting and responding to clinical deterioration. A participant who is worsening — showing signs of increasing suicidality, psychotic features, or severe functional impairment — can be identified during counselor check-ins and referred to appropriate care. Unguided programs have no equivalent safety net.
Implications for Program Design
The evidence on guided versus unguided self-help has several practical implications for program designers operating in LMIC contexts.
The minimum effective dose of human support. The Step-by-Step model uses 15-minute weekly calls. This appears sufficient to produce the effect sizes associated with guided self-help. Whether even briefer contact — 10-minute calls, less frequent calls, asynchronous messaging rather than synchronous calls — would produce comparable effects is not yet well-established. This is an important research question with significant cost implications.
The current evidence suggests that some human contact, at some minimum threshold, is necessary for guided-level effects. Below that threshold, outcomes converge toward unguided self-help results. Where exactly the threshold lies is unknown, but the data point against very minimal contact (e.g., a single introductory call with no further follow-up) being sufficient.
Counselor training requirements. The Step-by-Step evidence demonstrates that lay counselors — community members without clinical qualifications — can provide the support that differentiates guided from unguided self-help. This has profound implications for scalability. Programs that require therapist-delivered support are fundamentally limited by therapist availability; programs that can use lay counselors can scale in contexts where therapists are scarce.
The key training requirements appear to be: the ability to communicate clearly and empathetically, knowledge of the program content sufficient to answer participant questions, ability to recognize signs of clinical deterioration requiring referral, and capacity to maintain appropriate boundaries. These are learnable competencies that do not require clinical training, though they do require careful selection, adequate training, and ongoing supervision.
Technology platform selection. The evidence from Step-by-Step supports WhatsApp as a viable delivery platform in LMIC contexts with high WhatsApp penetration. The shift from a dedicated app to WhatsApp delivery in later iterations of Step-by-Step appears to have improved accessibility — particularly for participants with limited smartphone experience or literacy barriers — without sacrificing outcomes.
The broader implication is that delivery platform should be chosen based on participant accessibility rather than technological sophistication. The most impressive app in the world is less effective than a program that participants actually use.
The Equity Dimension
There is an equity argument for guided programs that goes beyond efficacy alone. In high-income contexts, people who are struggling with a digital self-help program have fallback options: they can seek professional help, call a crisis line, discuss their difficulties with a well-resourced support network. In low-income contexts, the Step-by-Step program may be the only formal support available. This makes the safety and effectiveness of the program more, not less, important.
Unguided programs in LMIC contexts are particularly concerning from this perspective. A program that fails to engage — or worse, that produces modest benefits that mask the absence of more effective treatment — in a context where no alternatives exist is not merely ineffective. It may delay or displace the limited access to help that participants have.
The argument for guided programs in LMICs is therefore not only that they produce better outcomes, but that the equity stakes of program quality are higher when there is no safety net of alternative options.
Outstanding Questions
Despite the strength of the guided/unguided evidence, several important questions remain.
What is the minimum counselor contact that preserves guided-level effects? Current evidence supports 15-minute weekly calls, but the lower bound of effective support intensity is not established. Research directly manipulating contact frequency and duration would help optimize the cost-effectiveness of future programs.
How do different counselor characteristics and training protocols affect outcomes? The lay counselor model is promising, but the relative importance of selection criteria, training content, supervision intensity, and ongoing support for counselors is not well-characterized. More research on the “active ingredients” of effective lay counselor delivery would help programs invest training resources efficiently.
Can asynchronous digital support (messaging rather than calls) substitute for synchronous voice contact? The interpersonal and accountability functions of counselor contact may or may not require synchronous voice communication. If asynchronous messaging produces comparable effects, it could substantially reduce program costs and remove barriers related to scheduling and connectivity.
How do guided self-help outcomes compare in populations with significant trauma histories or comorbid conditions? The Step-by-Step trials appropriately excluded participants with severe depression or active safety concerns, but many participants in LMIC contexts present with trauma-related symptoms alongside depression. Research on the program’s effectiveness and safety in trauma-exposed populations would extend its clinical applicability.
Conclusion
The evidence is clear: human support makes the difference in digital mental health programs for depression. The guided/unguided effect size gap is clinically meaningful, consistent across the literature, and present within individual trials at the level of counselor contact dose-response relationships.
For the WHO Step-by-Step program specifically, the five RCTs demonstrate that brief lay counselor contact — 15 minutes per week, no clinical qualifications required — is sufficient to produce guided-level effects in LMIC populations across diverse cultural contexts. This combination of efficacy and scalability is what makes the program, and the organizations scaling it, such a compelling focus for both research attention and programmatic investment.
The question for the field is not whether to include human support in digital mental health programs — that question has been answered. The question is how to optimize and scale that support most efficiently. The research is pointing the way.
For more on the WHO Step-by-Step program and the work of Kaya Guides, visit besidehealth.org.
Related reading
- What 5 Randomized Controlled Trials Tell Us About Guided Digital Self-Help for Depression in LMICs — the trial-by-trial evidence review.
- The Lay Counselor Model: Delivering Evidence-Based Mental Health Care at Scale Without Psychiatrists — what effective lay counsellor delivery actually looks like.
- You Don’t Need a Therapist to Get Better: How a WhatsApp Program Is Treating Depression in the World’s Hardest-to-Reach Places — accessible overview for a general audience.
- Beside’s evidence base