banner

News

Oct 14, 2024

Personal protective effect of wearing surgical face masks in public spaces on self-reported respiratory symptoms in adults: pragmatic randomised superiority trial | The BMJ

Surgical face masks to prevent respiratory symptoms

The polarised discourse around face masks is hindering constructive debate

Objective To evaluate the personal protective effects of wearing versus not wearing surgical face masks in public spaces on self-reported respiratory symptoms over a 14 day period.

Design Pragmatic randomised superiority trial.

Setting Norway.

Participants 4647 adults aged ≥18 years: 2371 were assigned to the intervention arm and 2276 to the control arm.

Interventions Participants in the intervention arm were assigned to wear a surgical face mask in public spaces (eg, shopping centres, streets, public transport) over a 14 day period (mask wearing at home or work was not mentioned). Participants in the control arm were assigned to not wear a surgical face mask in public places.

Main outcome measures The primary outcome was self-reported respiratory symptoms consistent with a respiratory infection. Secondary outcomes included self-reported and registered covid-19 infection and self-reported sick leave.

Results Between 10 February 2023 and 27 April 2023, 4647 participants were randomised of whom 4575 (2788 women (60.9%); mean age 51.0 (standard deviation 15.0) years) were included in the intention-to-treat analysis: 2313 (50.6%) in the intervention arm and 2262 (49.4%) in the control arm. 163 events (8.9%) of self-reported symptoms consistent with respiratory infection were reported in the intervention arm and 239 (12.2%) in the control arm. The marginal odds ratio was 0.71 (95% confidence interval (CI) 0.58 to 0.87; P=0.001) favouring the face mask intervention. The absolute risk difference was −3.2% (95% CI −5.2% to −1.3%; P<0.001). No statistically significant effect was found on self- reported (marginal odds ratio 1.07, 95% CI 0.58 to 1.98; P=0.82) or registered covid-19 infection (effect estimate and 95% CI not estimable owing to lack of events in the intervention arm). Self-reported sick leave was equally distributed between the intervention and control groups (marginal odds ratio 1.00, 0.81 to 1.22; P=0.97).

Conclusion Wearing a surgical face mask in public spaces over 14 days reduces the risk of self-reported symptoms consistent with a respiratory infection, compared with not wearing a surgical face mask.

Trial registration ClinicalTrials.gov NCT05690516.

As of 3 November 2023, more than 76.9 million confirmed SARS-CoV-2 infections and more than 6.9 million deaths with covid-19 have been recorded worldwide.1 Although public health and social measures, such as wearing face masks and school closures, were widely implemented to limit the spread of the virus,2 evidence on the effectiveness and unintended consequences of these measures is limited.34

Systematic reviews of observational studies have reported an association between wearing face masks and lower risk of respiratory infections.56 On the basis of findings from 10 randomised trials, however, the authors of a recent Cochrane review concluded that use of a face mask in the community had little or no effect on risk of developing a respiratory viral infection.7 They also noted that adverse effects were rarely measured and poorly reported.7 Several factors could explain the seemingly discrepant findings from observational studies and randomised trials, including the higher risk of bias inherent to observational studies, insufficient power of the randomised controlled trials, or low adherence to the intervention.7

We carried out a pragmatic randomised trial to evaluate the personal protective effect of wearing surgical face masks in public spaces over 14 days on self-reported symptoms consistent with respiratory infection, compared with not wearing face masks.

We conducted a pragmatic parallel two arm individually 1:1 randomised superiority trial. Details on the rationale, design, and statistical analysis plan can be found elsewhere.89 The trial was performed according to a published protocol, with exceptions (see Protocol Amendments section), and the principles outlined in the Declaration of Helsinki. We followed the Consolidated Standards of Reporting Trials (CONSORT) guidelines (see supplementary material, table 1).

The trial took place in Norway between 10 February 2023 and 27 April 2023. This was after the most acute phase of the covid-19 pandemic, but during the normal influenza season in the Nordic countries.10 No public health or social measures were enforced by Norwegian authorities on the general population during the trial.

To be eligible for inclusion, individuals had to be aged at least 18 years, be willing to be randomly assigned to either wear face masks (intervention) or not wear face masks (control) in public spaces when near to other people for a period of 14 days, and provide written informed consent (online consent form). No exclusion criteria were applied.

Participants were recruited from multiple locations across Norway, using a diverse range of methods, with participant entry occurring predominantly in three phases. The initial phase was triggered by publicity through Norwegian national TV, radio, and various media channels, including paid print advertisements. The two following phases commenced after engaging two data collection firms that invited members of their survey panels to take part in our study.

One amendment was made to the protocol before the start of the trial. In the original protocol, we proposed evaluating the effectiveness of wearing either FFP3 respirators or surgical face masks, or not wearing a mask. We revised this design in a protocol update published on 6 December 2022, narrowing the scope of the study to investigate the effectiveness of surgical face masks only. This modification was done based on sample size considerations and to simplify the procedure for trial participants and pharmacy staff. After the 14 day trial period, but before unblinding, we introduced four modifications: two sensitivity analyses were added to deal with missing data; no exploratory analysis of immune status and covid-19 was to be carried out owing to low numbers of reported covid-19 infections; the main analysis was unadjusted rather than adjusted; and we decided to primarily report the main results as odds ratios rather than risks ratios. The supplementary material provides details of the adjusted analyses and risk ratios.

Participants in the intervention arm were assigned to wear a surgical face mask when close to people in public spaces (eg, shopping centres, streets, and public transport) over a 14 day period (mask wearing at home or work was not mentioned). These participants collected a pack of 50 three ply, disposable, surgical face masks (type II/IIR, compliant with the EN 14683 standard) from their nearest pharmacy, provided at no cost using a unique verification email. The email also contained instructions on the proper use of face masks in line with Word Health Organization (WHO) recommendations.11 Participants assigned to the control arm were to remain mask-free when close to people in public spaces.

Eligible participants were randomised 1:1 to the intervention or control arm. We used Nettskjema, an independent web based survey tool, to randomise participants using a computer generated pseudorandom sequence over which we had no influence.12 Randomisation occurred after consent had been obtained and the baseline survey completed.

It was not possible to blind participants owing to the nature of the intervention. The researchers and study statistician were blinded to intervention allocation throughout the trial, and all main analyses were performed blinded. After analysis but before unblinding, we agreed how we would interpret the results, including possible explanations for the degree and direction of imbalance of missing outcome data.13

After consent had been obtained, trial participants immediately completed an online questionnaire about sociodemographic and lifestyle factors, beliefs about face masks and risk of infection, and face mask use in the two weeks before the study period. On completion of the questionnaire, the participants were randomised and notified of the arm to which they had been assigned both in Nettskjema and by email (see supplementary material, table 2). The email encouraged participants to see their doctor for a covid-19 polymerase chain reaction (PCR) test if they experienced symptoms of respiratory symptoms or covid-19. In Norway, covid-19 PCR tests were analysed by laboratories that directly notified a national registry, the Norwegian Surveillance System for Communicable Diseases. We sent a follow-up questionnaire (see supplementary material, table 3) on day 17, three days after the 14 day intervention, asking participants about outcomes, use of public transport, testing behaviours, adherence to the face mask intervention, and any adverse events. Participants identified themselves using their national identification number and therefore could be linked to the Norwegian Surveillance System for Communicable Diseases, the National Population Register, and the Norwegian Immunisation Registry.

Nettskjema was used to record baseline and outcome data. The primary outcome was self-reported respiratory symptoms consistent with a respiratory infection. This outcome is a composite that required participants to give a positive response to having experienced symptoms of a cold or covid-19, and having experienced fever and one respiratory symptom (stuffy or runny nose, sore throat, coughing, sneezing, heavy breathing); or one respiratory symptom and at least two other symptoms (body ache, muscular pain, fatigue, reduced appetite, stomach pain, headache, loss of smell).

Secondary outcomes were self-reported positive covid-19 test results (confirmed by either PCR or rapid antigen self-test), positive covid-19 test result registered with the Norwegian Surveillance System for Communicable Diseases, self-reported sick leave (participants who answered “I don’t know” were coded as missing data in the complete case analysis), self-reported healthcare use for respiratory symptoms (eg, visit to the family doctor), and self-reported healthcare use for any injury. All outcomes from the follow-up questionnaire, Norwegian Surveillance System for Communicable Diseases, and Norwegian Immunisation Registry were binary and assessed over a 17 day period after randomisation (days 1-17 of the trial).

We calculated that a minimum of 2692 participants (1346 in each arm) would be required to detect a risk reduction of 30% from an assumed 10% risk of infection in the control arm to a 7% risk in the intervention arm, with a two sided α of 0.05 (significance criterion) and 80% power. The power calculation is described in detail elsewhere.8

We estimated marginal odds ratios14 using unadjusted logistic regression for all outcomes following the intention-to-treat principle. The intention-to-treat analysis included all eligible participants who had provided consent. We report point estimates with two sided 95% confidence intervals and P values.

As prespecified, missing primary outcome data was accounted for using multiple imputation via chained equations.15 We imputed and analysed 50 completed datasets and combined the estimates using Rubin’s rules.16 The prespecified imputation model included intervention and all auxiliary variables collected in the baseline form.

In a non-prespecified sensitivity analysis, we estimated Manski-type bounds: best case and worst case bounds on intervention effect calculated by assuming that missing outcome data either maximally favoured wearing a surgical face mask or maximally favoured no face mask.17 In another non-prespecified analysis, we also considered three less extreme scenarios using a method similar to the mean score method suggested by White et al (table 1).18

Three scenarios of missing outcome data on incidence of infection

Scenario 1 assumes that the incidence of infection in the intervention arm is the same between those who did and did not drop out, but 50% lower for those who dropped out versus did not drop out in the control arm. Scenario 2 assumes that the incidence of infection in the control arm is the same between those who did and did not drop out, but 50% higher for those who dropped out versus did not drop out in the intervention arm. Scenario 3 assumes that the incidence of infection in the control arm is 50% lower for those who dropped out versus did not drop out, but 50% higher for those who dropped out versus did not drop out in the intervention arm.

The non-prespecified sensitivity analyses, including choice of scenarios, were decided on during the blinded assessment of the trial results.13 We also performed complete case analyses, including participants with complete data at baseline and follow-up. Prespecified subgroup analyses were also performed to explore potential effect modification owing to sex (male or female), age (<30 years, 31-59 years, or ≥60 years), household includes children (yes or no), regular use of face masks (<50% of the time or ≥50% of the time), and beliefs about wearing face masks and risk of infection (reduce risk, no effect, or increase risk). We conducted all analyses using R version 4.2.2. No data monitoring committee was involved in the trial.

Neither patients nor public representatives were involved in the design, implementation, or analysis of this study, largely due to time constraints as the trial was conducted in a window of opportunity during the influenza season. Participation in the trial was open to all adults (≥18 years) residing in Norway.

Between 10 February 2023 and 27 April 2023, 5086 individuals read the consent form. Of these, 4647 (91%) provided consent, completed the baseline form, and were randomised (2371 to the intervention arm and 2276 to the control arm; fig 1). Of those randomised, we excluded 72 from the intention-to-treat analysis owing to ineligibility (<18 years; n=20) or withdrawal of consent (n=52). This left 4575 participants for inclusion in the intention-to-treat analysis (fig 1). At follow-up, 479 (20.7%) participants in the intervention arm and 295 (13.1%) in the control arm did not respond to the questionnaire (fig 1). Table 2 shows the baseline characteristics of the participants.

Flow of participants through trial

Baseline characteristics of participants assigned to either wearing a surgical face mask or not wearing a face mask in public spaces during the day. Values are number (percentage) unless stated otherwise

Overall, 163 (8.9%) participants in the intervention arm and 239 (12.2%) in the control arm self-reported respiratory symptoms. In the intention-to-treat analysis, which used the prespecified multiple imputation via chained equations analysis and included data from 4575 participants, the estimated effect on the primary outcome of self-reported respiratory symptoms was in favour of the face mask intervention (odds ratio 0.71, 95% CI 0.58 to 0.87; P=0.001; absolute risk difference −3.2%, 95% CI −5.2% to −1.3%; P<0.001) (table 3). The complete case analysis (n=3801) supported the findings of the main analysis (odds ratio 0.71, 95% CI 0.57 to 0.87; P<0.001; absolute risk difference −3.3, 95% CI −5.2 to −1.3; P<0.001). Supplementary material, tables 5 and 6 provide details of the adjusted analysis and relative risks. The results from the adjusted analysis were essentially identical to those of the main unadjusted analysis. The non-prespecified sensitivity analyses comparing different scenarios of the missing outcome data (table 3) suggest that the intervention was effective in scenarios 1 and 2 (odds ratio 0.76, 95% CI 0.62 to 0.92; P=0.006 and 0.79, 0.65 to 0.95; P=0.01, respectively) but not in scenario 3 (0.85, 0.70 to 1.03; P=0.08). The Manski-type bounds (table 3) covered all possible missingness mechanisms and therefore included large beneficial and detrimental effects.

Effects of wearing a surgical face mask on primary and secondary outcomes. Values are number (percentage) unless stated otherwise

Overall, 42 participants, equally distributed between the two arms, self-reported covid-19 either by PCR or antigen test (odds ratio 1.07, 95% CI 0.58 to 1.98; P=0.82; absolute risk difference 0.1%, 95% CI −6.0% to 8.0%; P=0.82) (table 3).

The Norwegian Surveillance System for Communicable Diseases registered covid-19 test results for 37 participants in the control arm and 32 in the intervention arm (n=69 (1.5%); P=0.06 for difference in proportions). Two tests were positive in the control arm and none in the intervention arm (effect estimate and 95% CI not estimable owing to lack of events in the intervention arm; table 3).

In total, 144 (6.3%) participants in the control arm and 102 (4.5%) in the intervention arm reported needing healthcare during the trial. Of these participants, 29 (20%) in the control and 23 (23%) in the intervention arm reported that this was due to respiratory symptoms, whereas 40 (28%) participants in the control arm and 27 (26%) in the intervention arm reported other reasons.

Self-reported sick leave was equally distributed between the intervention and control groups (odds ratio 1.00, 95% CI 0.81 to 1.22; P=0.97). In total 29 participants answered, “I don’t know” and were coded as missing data in this analysis.

Among participants in the intervention arm, 450 (25%) reported always wearing a face mask, 753 (41%) wearing face masks more than 75% of the time, 265 (14%) wearing face masks 75-50% of the time, and 357 (19%) wearing face masks less than 50% of the time. Among participants in the control arm, 1865 (95%) reported not wearing face masks.

At follow-up, the percentage of participants who reported commuting to work by public transport was comparable between the control and intervention arms (60% and 58%, respectively; P=0.32). Attendance at cultural events was, however, more frequent among participants in the control arm compared with those in the intervention arm (39% and 32%, respectively; P<0.001). Similarly, a larger percentage of participants in the control arm (62%) visited restaurants compared with those in the intervention arm (53%; P<0.001).

In total, 155 participants (3.4%; 143 in the intervention arm) reported adverse effects, with 128 participants describing these experiences using the free text field in the questionnaire. The most reported adverse event (80 participants) was unpleasant comments from other people when wearing a face mask in public spaces and feeling “silly” being the only one wearing a face mask in public. Some participants (n=40) reported that wearing face masks was uncomfortable or tiring owing to difficulty breathing, fogging of glasses, and poor fit.

Figure 2 shows results for the prespecified subgroup analyses. The only analysis for which significant effect modification was estimated was for participants’ beliefs about wearing face masks and risk of infection (P=0.04 for interaction). A beneficial effect was estimated for participants who reported that they believed face masks reduced the risk of infection. Estimates for participants who reported that they believed face masks had no effect or increased risk were consistent with benefit, no effect, and harm. However, owing to the small number of events in these subgroups, the confidence intervals were wide and therefore these estimates lack precision.

Prespecified subgroup analysis on the intervention effect of wearing surgical face masks

We found that wearing surgical face masks in public spaces reduced the risk of self-reported respiratory symptoms among Norwegian adults. The results support the claim that face masks may be an effective measure to reduce the incidence of self-reported respiratory symptoms consistent with respiratory tract infections, but the effect size was moderate. With a 12.2% risk of being infected and an absolute risk reduction of −3.2% (95% CI −5.2% to −1.3%), wearing a face mask reduced the risk to 8.9%, equivalent to around 3300 fewer infections per 100 000 people. Wearing face masks in public spaces was safe and generally well tolerated. The most reported adverse effects were unpleasant comments from other people.

Our findings are consistent with the two randomised trials of face masks conducted during the covid-19 pandemic. Although the Danish face mask trial from 2020 was similar in many respects to ours, a major difference was it used a positive covid-19 result based on rapid antigen testing as the main outcome, whereas we relied on self-reporting of respiratory symptoms.19 The Danish trial reported a point estimate similar to ours, but its findings were uncertain study owing to low statistical power (odds ratio 0.82, 95% CI 0.54 to 1.23). In a trial in Bangladesh, 600 rural villages were randomised to face mask promotion strategies or no intervention at community level.20 The trial reported a smaller, but statistically significant reduction in symptomatic seroprevalence at nine weeks (adjusted prevalence ratio 0.91, 95% CI 0.82 to 1.00) in favour of face mask promotion.20 Compared with the earlier face mask trials, our findings provide a more precise estimate of effect. When our study is compared with not only the earlier face masks studies to prevent covid-19, but also the face mask studies for influenza prevention, our findings indicate a somewhat larger effect.37

The proposed mechanisms of action for face masks include limiting droplet and aerosol transmission.21 We decided to evaluate the effect of surgical face masks that were recommended by WHO during the covid-19 pandemic.22 It has been presumed that FFP2 masks (N95 by American standards) protect people better than surgical masks because of their higher filtering rate, but randomised trials23 and meta-analyses2425 suggest that surgical masks offer similar protection to FFP2/N95 masks in healthcare settings. We did not, however, study mechanisms of action or effectiveness in healthcare settings.

Our trial has several limitations. Firstly, outcome data were missing for 13.7% and 20.7% of the participants in the control arm and intervention arm, respectively. This is broadly similar to the 19% missing outcome data in the Danish face mask trial.19 We mitigated the impact of missing outcome data by prespecifying and using multiple imputation. Since it is plausible that the imbalance in missing data was due to outcome (ie, outcome data may be missing not at random), we performed non-prespecified sensitivity analyses to explore whether our main finding was robust to more extreme assumptions about missingness. Only under extreme and arguably implausible assumptions (eg, scenario 3 and the upper end of the Manski-type bounds) do the non-prespecified sensitivity analyses suggest that the intervention is not beneficial.13 Moreover, the estimate from multiple imputation is similar to the complete case estimate.

Secondly, our primary outcome was self-reported rather than an objective outcome or based on immunological biomarkers. Although an outcome based on a PCR test result would have provided more specific information about infections, our primary outcome assessed symptoms that are important to both individuals and the public in a real world setting. For example, self-reported respiratory symptoms had important consequences during the covid-19 pandemic, such as denial of air travel. We encouraged our participants to take a covid-19 test when they felt unwell, but testing was neither mandatory nor recommended at the time of our trial, and the number of participants taking a test was low.

Thirdly, blinding of participants was not possible owing to the nature of the intervention and it is not intended in a pragmatic trial aiming to provide evidence in a real world setting. However, we cannot deny that awareness of intervention allocation may have introduced bias in reporting of symptoms either way. The group allocation might also have led to additional effects on participants’ behaviour. For instance, a higher proportion of participants in the control arm reported attending cultural events and restaurants during the trial period. Furthermore, some participants in the intervention arm reported feeling awkward wearing face masks when there was no official requirement to do so. This finding may imply that non-participants tended to keep a larger social distance from participants in the intervention arm, which could be seen as an inherent effect of wearing face masks.

Fourthly, our results apply under trial conditions and therefore caution is needed when generalising the findings to other settings. The duration of our trial period was only 14 days, so our findings might not apply to situations where face masks are used for longer periods. The most acute phase of the covid-19 pandemic was over when we conducted our study, but the trial period occurred during a normal influenza season in the Nordic countries.10

Fifthly, although we acknowledge environmental concerns associated with face mask usage (eg, manufacture and transport emissions, littering, landfill), these were not measured in our study.

Finally, we focused on evaluating the personal protective effects of wearing surgical face masks, specifically examining how effectively they safeguard wearers against infection. We did not study source control (preventing the spread of infection from the mask wearer to other people). The total effect of wearing face masks, including both personal protection and source control, therefore could be higher than our findings suggest.

Study strengths include the pragmatic randomised design, prespecified analyses, and transparent treatment of missing outcome data, which included non-prespecified sensitivity analyses.1718 Trial staff were blinded. The outcomes we studied are likely important to individuals as well as from a public health perspective. People could participate regardless of where they lived in Norway, an approach that allowed broad recruitment, and probably increased the wider applicability of our findings. The results from our trial represent real world evidence on the effect of wearing surgical face masks. Similar trials should consider investigating other public health and social measures.

Future research should study face masks for source control. Research should also concentrate on the protective effectiveness of face masks for vulnerable populations, such as elderly people and individuals with pre-existing health conditions. For example, people with lung disease are typically at higher risk of respiratory tract infections, so the benefits of wearing face masks for personal protection need to be weighed against potential adverse effects, such as discomfort and breathing difficulties. It is vital to explore alternatives to single use masks that are sufficiently effective but minimise environmental harm, tackling the ecological problems linked to extensive and long term face mask use, as occurred during the covid-19 pandemic. Finally, future studies should consider including cost-benefit analyses.

Wearing surgical face masks is superior to not wearing surgical face masks in reducing the risk of respiratory symptoms over 14 days. The effect size was moderate, but wearing a face mask is a simple intervention with low burden and of relatively low cost and is one of several public health and social measures that may be worth considering for reducing the spread of respiratory infections.

The effectiveness of face masks as a protective measure against infection is uncertain

Observational studies suggest that face masks reduce the risk of respiratory tract infections

Findings from randomised trials are, however, highly uncertain owing to methodological limitations such as insufficient statistical power

Our pragmatic trial provides evidence that wearing surgical face masks in public spaces reduces the incidence of self-reported respiratory symptoms consistent with respiratory infections in real world settings

Unlike most earlier trials of face mask, our study was sufficiently powered

Similar trials can and should be conducted for other public health and social measures

This study was approved by the Regional Ethics Committee South East Norway (reference 36544).

The final anonymised trial dataset and statistical codes will be freely available to the public through GitHub (https://github.com/folkehelseinstituttet/2024-facemask-trial-bmj).

Contributors: All authors conceived and designed the study. RS is the guarantor. PE performed the statistical analysis. RS and IHE independently verified the data and statistical analyses. CR supervised the statistical analysis and reporting. RS wrote the first draft of the manuscript. AF, IHE, MF, BGI, LGH, CR, and PE critically reviewed and commented on each draft of the manuscript and approved the final manuscript for submission. RS, AF, IHE, MF, BGI, LGH, CR, and PE were responsible for the decision to submit the manuscript. The corresponding author attests that all listed authors meet authorship criteria and that no others meeting the criteria have been omitted.

Funding: The costs of running the trial were covered by the Centre for Epidemic Interventions Research, Norwegian Institute of Public Health.

Competing interests: All authors have completed the ICMJE uniform disclosure at www.icmj.org/disclosure-of-interest/ and declare: support from the Centre for Epidemic Interventions Research, Norwegian Institute of Public Health; no financial relationship with any organisation that might have an interest in the submitted work in previous three years; AF has received financial support from the World Health Organization (WHO) to attend meetings it has hosted; CR is named inventor on a patent application related to vaccine development. LGH’s institution (RC2NB) was contracted by WHO for the development of study protocol templates to evaluate public health and social measures; LGH received travel support from WHO; no other relationships or activities that could appear to have influenced the submitted work.

Transparency: The lead author (the manuscript’s guarantor) affirms that the manuscript is an honest, accurate, and transparent account of the study being reported; that no important aspects of the study have been omitted; and that any discrepancies from the study as planned (and if relevant, registered) have been explained.

Dissemination to participants and related patient and public communities: The advisory board of our research centre, which includes representatives from various civil society organisations (eg, Norwegian Pensioners’ Association) will be invited to engage in the dissemination of the findings. The research findings will be disseminated via press release through several of the National Institute of Public Health channels. We will also disseminate the findings on social media, in Norwegian and international newspapers, on television, at academic conferences, and to relevant patient and public organisations. Finally, we will inform every trial participant by email of the trial findings.

Provenance and peer review: Not commissioned; externally per reviewed.

This is an Open Access article distributed in accordance with the Creative Commons Attribution Non Commercial (CC BY-NC 4.0) license, which permits others to distribute, remix, adapt, build upon this work non-commercially, and license their derivative works on different terms, provided the original work is properly cited and the use is non-commercial. See: http://creativecommons.org/licenses/by-nc/4.0/.

ObjectiveDesignSettingParticipantsInterventionsMain outcome measuresResultsConclusionTrial registration
SHARE