Normative data for instrumented posturography: a systematic review and meta-analysis

1 Introduction

Balance arises from a multisensory adaptive system performing predictive (anticipatory) and/or reactive (compensatory) actions, with varying degrees of accuracy, to impact balance in changing environmental contexts (Pollock et al., 2000). Numerous techniques and methods are employed to evaluate postural control at a functional and neurophysiological level, in static and dynamic conditions (Paillard and Noé, 2015). Basic non-instrumented tests, such as the Timed Up and Go test (Podsiadlo et al., 1991), the Berg Balance Scale (Berg et al., 1992), and the Tinetti Balance Scale (Tinetti, 1986), among others, are well known and widely used by clinicians. However, these tests give a broad overview of the functional state. In contrast, instrumented tests offer the possibility to carry out in depth quality of movement assessment under different conditions and difficulty levels. Instrumented tests evaluating balance are carried out through computerized force platforms calculating the displacements of the center of pressure in static and dynamic conditions (Bizzo et al., 1985). When standing and walking, the point of application of the ground reaction forces can be measured as the center of the pressure signal using linear and non-linear variables (Quijoux et al., 2021).

Many instrumented devices are available on the market for clinical and research use. Common instrumentation includes static and dynamic force platforms. Both platforms can assess postural control; the added sway-referenced perturbations on the dynamic platform constitute its main advantage. These perturbations allow the evaluation of sensory integration, reweighting, and the use of sensory strategies while standing and walking. The most used static platforms are laboratory-grade force plates manufactured by AMTI (AMTI, Watertown, MA, United States) and Kistler (Kistler, Winterthur, Switzerland). Other low-cost alternatives also exist, such as the Wii Balance Board (WBB; Nintendo, Kyoto, Japan), which is considered a reliable and valid tool [for review, see Clark et al. (2018)]. In general, studies investigating the reliability and validity of the Wii Balance Board mostly used path length as a sway parameter to evaluate healthy participants. Numerous studies have used conventional force platforms manufactured by Neurocom International, Inc., Clackamas, OR, USA. EquiTest® device—a Computerized Dynamic Posturography (CDP) system—is considered a gold standard instrument. The concept of CDP—an apparatus and method for sensory integration and motor coordination analysis—was developed and patented by Nashner in 1988. The sensory and motor components of human postural control can be assessed by different standardized tests (Sensory Organization Test [SOT], Motor Coordination Test [MCT], Adaptation Test [ADT], and Limits of Stability) for which manufacturers have commercially published normative data. For these tests, balance is measured using equilibrium and composite scores as well as sway energy and maximal voluntary movement parameters (e.g., directional control and reaction time).

Furthermore, instrumented posturographic tools have been used to investigate balance deficits, especially in populations at higher risk of falls (Achour Lebib et al., 2006; Bloem et al., 2003). For example, the CDP’s role has been widely studied in neurotology (Black et al., 1983; Horak et al., 1990; Nashner et al., 1994; Nashner and Peters, 1990; Wall et al., 1983). Increased instability has been reported in patients with vestibular disorders. SOT results have shown that patients exhibit impaired performances during conditions 5 (fixed visual surround and sway-referenced platform) and 6 (sway-referenced visual surround and platform), which is indicative of a vestibular deficit affecting the postural functioning of the individual (Black et al., 1988). Furthermore, the results of dynamic posturography in a broad spectrum of patients with peripheral and central vestibular disorders revealed varying degrees of functional capacity within each group. Psychometric properties for dynamic posturography have been mainly studied in this specific population (Di Fabio, 1995, 1996; Hamid et al., 1991). Analogous to CDP, computerized static posturography, using a foam-supported platform as a perturbation, offers a low-cost and portable alternative. Thus, the Clinical Test of Sensory Interaction on Balance (CTSIB) can evaluate balance in eyes open (EO) and eyes closed (EC) conditions with/without foam. Increased sway was observed in patients compared to controls during foam posturography, particularly with EC, measured using amplitude and velocity (Baloh et al., 1998) as well as velocity, area, and Romberg’s ratio (Fujimoto et al., 2009). In the CTSIB, integrated with virtual reality, instability was condition dependent—patients with vestibular disorders sway more during more challenging sensory conditions (i.e., foam surface, EC, and optokinetic stimulus) (Macedo et al., 2015).

Clinical applications notwithstanding, normative data are needed for interpretation in clinical settings. Normative data (SpringerReference) are observational data summarizing and describing a population’s characteristics at a specific time. Posturography norms are used to compare a reference group (healthy individuals) and a specific patient population. Thus, variations between patients and healthy individuals can be observed and interpreted in a clinical setting to determine pathological profiles. Descriptive information (e.g., total sample size, age groups, and sex) and statistics (e.g., median; other summaries of distribution; indicators of central tendency and dispersion; and standard scores) are needed to report such datasets. Some studies have focused on establishing reference values for CDP, mostly using the NeuroCom EquiTest (NeuroCom International, Inc), for which normative data are already available. Normative data have also been published by the BTrackS Balance Plate (Balance Tracking systems) parent company for static computerized posturography.

However, one significant limitation has been noted: the lack of stratification and socio-demographic information. Many studies have demonstrated that sex and age influence postural control (Farenc et al., 2003; Hageman et al., 1995; Ionescu et al., 2005). For static computerized posturography, a systematic review of postural sway in children revealed that age stratification was arbitrary in majority of studies, leading to conflicting results concerning postural control development (Verbecque et al., 2016b). Furthermore, results showed that stability increased with age, and children swayed more when visual input (in EC condition) was removed. These results have also been observed in adults. Other variables such as anthropometric characteristics (Alonso et al., 2012; Chiari et al., 2002), feet positioning (Gibbons et al., 2019; Kollegger et al., 1989), and physical activity (García-Soidán et al., 2020; Gauchard, 2003; Lelard and Ahmaidi, 2015) can also affect balance. Thus, lack of stratification and reporting might lead to misinterpretation of patient postural performance in clinical practice and healthy individuals undergoing behavioral and balance assessments. Therefore, there is room for improvement, and this work aspires to complement previous studies.

The purpose of this study is to systematically review the reported normative data, assessed by computerized posturography, in a healthy population and to discuss the strengths and limitations of these norms to outline future perspectives and needs. Overall, we aimed to identify methodological characteristics of computerized posturography assessment in this systematic review and investigate the impact of age and sex in the meta-analysis.

2 Methods 2.1 Information sources, search strategy, and inclusion criteria

This systematic review and meta-analysis included studies that provide normative data assessed using instrumented platforms in healthy populations (Registration: CRD42023378144). It was conducted in accordance with the Preferred Reporting Items for Systematic Review and Meta-Analysis (PRISMA) statement. A comprehensive literature search was performed in PubMed, Web of Science, and Science Direct electronic database platforms. A search string was used with keywords (1) “normative data”, (2) “posturography”, (3) “instrumented platform” combined using the Boolean operators “OR” and “AND” for interaction among sets of keywords. The searches were conducted in May 2022 and updated in January 2023. The studies were screened based on a priori defined inclusion and exclusion criteria using Rayyan, a web-based application (Ouzzani et al., 2016). The literature search was limited to articles published between 1 January 1980 and 1 January 2022. Computerized posturography was created in 1980 (Nashner, 1982). Therefore, only studies published as of this date were included in this review. In each article, the use of instrumented platforms as evaluation tools was sought. Reference lists and citations of the included articles were manually screened to identify additional studies of interest. The literature search was limited to full-text articles that were written in English and French. The articles meeting the criteria based on their titles and abstracts were included, and their full-text versions were then extracted. AJ and EV performed screening. Full-text articles were checked if information in the title and abstract was insufficient to determine eligibility.

2.2 Selection criteria and strategy

To select relevant literature, the following selection criteria were applied:

• Healthy participants were a combination of men and women (absence of any other impairments that can influence balance). The normative data had to be representative of the average population of healthy people. However, studies that included athletes, soldiers, and one of the sexes were exclusively considered relevant to the research question. A minimum sample size of 30 participants was considered an adequate representative sample.

• Instrumented equipment during static or dynamic bipedal balance measurement had to be included. Citations were excluded when data were collected during gait and functional measures (e.g., Timed Up and Go, climbing stairs, 6-min walk test, Y-Balance Test, Star Excursion Balance Test, treadmill walking or during running, turning, stepping tasks lateral, forward, backward, etc.). Balance assessment through mobile devices was excluded as well.

• Original research including full-length articles that were written in English or French. Reviews, meta-analyses, case–control studies/series, conference proceedings, abstract only, books/book chapters, letters to the editor, study protocols, pilot studies, editorials, or opinion pieces were excluded.

• Studies that had explicitly reported normative data were included.

2.3 Data extraction and study quality assessment

Data extraction was a priori defined using a codebook which included the following information: (1) Study characteristics (authors, year of publication, etc.); (2) methodological details, such as population and measurement characteristics, tasks, and sway parameters; and (3) equations used and psychometric properties (if provided). Authors were contacted to obtain any unavailable data. At a systematic review level, we evaluated the stratification of age and sex and the significance of other factors tested in each study. These are expressed as significant (S), not significant (NS), and not reported (NP).

Studies used a combination of different support and visual conditions, which can be divided into distinct groups and subgroups. In this review, balance tasks were described according to measurement and movement control type as well as sensory perturbations based on a classification process used by Herssens et al. (2020): without sensory perturbations, with proprioceptive perturbations (foam, moving platform and/or foot placement), with visual perturbations (deprivation and/or altered surround), with vestibular perturbations (head movements), and with multiple sensory perturbations. A distinction between measurement and movement control types was made to describe performance in the present review. External conditions determine measurement type during balance assessment. It is classified as dynamic when tasks are executed in altered visual and/or tactile environments and static when tasks are executed in stable visual and/or tactile environments. Movement control type is considered activity-based conditions represented as tasks consisting of quiet standing (static) and voluntary body movements (dynamic). In addition, we used the Systems Framework of Postural Control (Sibley et al., 2015) to determine the nine components of balance (functional stability limits, underlying motor systems, static stability, verticality, reactive postural control, anticipatory postural control, dynamic stability, sensory integration, cognitive influences) which were assessed in the different tasks.

To the best of our knowledge, no quality assessment (QA) tool exists for reference values. However, we adapted the quality assessment tool for diagnostic accuracy studies (QUADAS) scale reported? (Whiting et al., 2003, 2006) and selected the following relevant questions (1, 2, 5, 9, 12, and 13) based on our study objectives: (1) Was there adequate sampling of normative data? (2) Were selection criteria clearly described? (5) Was a motor scale (i.e., other than posturographic measurement) used as part of the test battery? (9) Was the description of instruction and position/setup for the patient, type of equipment, and the outcomes at least reported? (12) Was the age and sex of a sample at least available? (13) Were drop-outs and falls collected (e.g., was it reported that a portion of the participants fell) and reported? Answers were scored in “Yes” or “No” forms, and missing or ambiguous details were scored as “Unclear.” The total score has not been defined for the QUADAS tool. Therefore, studies were considered high quality (low risk of bias) if four or more items out of six items were scored “Yes.” AJ and EV independently performed a QA for each article, and each assessor was blind to the score the other gave. Any disagreement over the final score for each article was discussed until consensus.

Studies providing summary statistics were included in the meta-analysis. Quantitative analysis was done after extraction by categorizing and grouping according to population type (adults, children, and both) and sex distribution (females and males) as well as posturography type (static and dynamic), tasks, and sway parameters. We performed a meta-analysis on sway parameters used by at least three publications in similar measurement conditions. Priority was given to age and/or sex-stratified reference values to mitigate inter study variability. When possible, participants’ mean age and groups were converted and pooled according to decade for adults and biological ages for children to reduce heterogeneity. We used a broad age categorization to ensure enough studies were included in the quantitative analysis to allow for meaningful interpretation. Using to the Cochrane Handbook (Higgins et al., 2020), subgroups were combined into a single group to compare males and females separately and to compare specific age groups whenever possible. When possible, specific terms were attributed to sway parameters based on the definitions in the literature (Prieto et al., 1996) to aggregate data and comparison for meta-analysis purposes. Homogeneity should have been particularly present across age/sex stratification and test conditions. Additionally, quantitative analysis was not conducted when an insufficient number of identified studies were available per task condition (EO and EC, foam, etc.) (n > 5). We found that 38 studies used sway parameters used by at least three publications. After further inspection, we removed studies that had no age and/or sex comparisons, insufficient statistics, or were owned by a company (n = 21). Instead, these studies were summarized narratively (see Figure 1). The majority of the excluded studies had reported a decrease in sway with increasing age for children whereas an increase was observed for aging adults. Sensory perturbations tended to increase instability, and overall, females outperformed males. Studies assessing children did not investigate the influence of other factors.

www.frontiersin.org

Figure 1. Flowchart for the systematic review and meta-analysis.

2.4 Statistical analysis

Statistical analyses were performed in R version 4.2.3 using the packages, metafor (Viechtbauer, 2010), clubSandwich (Pustejovsky, 2024), and netmeta (Balduzzi et al., 2023). Age and sex were analyzed in each sensory condition using different models with effect sizes nested within outcomes, and the latter within studies. We undertook meta-analyses if participants, interventions, outcomes, and comparisons were sufficiently similar. Potential modifiers included population characteristics and intervention details, which are considered sources of heterogeneity.

2.4.1 Measures of group effect

The standardized mean difference was calculated to pool effect sizes to quantify the difference in performance between age and sex groups. For measures where a higher score represented better performance (e.g., equilibrium scores in EO and EC), the direction of the scale was reversed by multiplying by −1 to ensure interpretability across studies. Furthermore, we accounted for a hierarchical dependence structure as multiple effect sizes were extracted from the same studies, experiments, and outcomes across samples for the multilevel meta-analysis.

2.4.2 Outliers and influential cases

Standardized deleted residuals and Cook’s distance were used to identify outliers and influential studies. Residuals larger than ±1.96 indicated that studies did not fit the model and thus represented outliers. Moreover, influential cases were removed when Cook’s distance was over 4/n, where n is the total number of data points.

2.4.3 Assessment of heterogeneity

In the multilevel meta-analysis, we assessed heterogeneity using the I2 statistic to quantify the percentage of variation attributable to each level – study, outcome, and effect size (low: 25%, moderate: 50%, high: 75%; Higgins et al., 2020). In the network meta-analysis, global and local approaches were used to assess heterogeneity (i.e., variation in effect modifiers within comparisons) and inconsistency (i.e., imbalance in effect modifiers between comparisons). Specifically, we fitted Cochran’s Q (χ2) statistic decomposition to evaluate the contribution of each design to the heterogeneity “within designs” and consistency “between designs.” Moreover, a design by treatment (in our case, age groups) interaction model was also fitted for the global approach. Local inconsistency between direct and indirect evidence was evaluated using the Separate Indirect from Direct Evidence (SIDE) method (Dias et al., 2010). To explore sources of heterogeneity, a meta-regression was conducted.

2.4.4 Assessment of publication bias

The potential publication bias was addressed by estimating the funnel plot asymmetry and was tested via Egger’s regression when more than 10 studies were included.

2.4.5 Multilevel meta-analysis using robust variance estimation

We conducted a multilevel meta-analysis using robust variance estimation (RVE) to synthesize studies comparing females and males. To account for the complexity of the data structure, an RVE method can be used to model dependencies between effect sizes and their correlated sampling errors. To obtain robust confidence intervals and p-values, the Sandwich estimator (package clubSandwich) was used in combination with the model. For smaller numbers of included studies, valid analysis results are ensured by an adjustment matrix based on the bias-reduced linearization CR2 method. An assumption of the degree of correlation was estimated at r = 0.6. The RVE model provides a valid overall average effect size even if the correlation assumption is inaccurate. Several sensitivity analyses for varying values of r were nonetheless conducted.

2.4.6 Frequentist network meta-analysis: multiple interventions

We conducted a network meta-analysis (NMA) to estimate the relative effects for all possible comparisons between different age groups. The frequentist random-effects network meta-analysis considers direct and indirect evidence, also called mixed evidence (i.e., direct—studies comparing A vs. B; indirect—studies comparing A vs. C) (Rouse et al., 2017). Thus, this method estimates the relative effects of comparisons between groups via one or more intermediate comparators or direct ones. The network structure was inspected, and subnetworks (i.e., not fully connected networks) were analyzed separately. Furthermore, studies with missing or inconsistent group estimates and variances were also excluded from the network meta-analysis. The age groups (i.e., interventions) were ranked using the surface under the cumulative ranking curve (SUCRA) method (Salanti, 2011) to account for the entire distribution of the relative effects. This score is expressed as a percentage, with the best ranking corresponding to 100%. This analysis allowed us to include all possible direct and indirect comparisons of different age groups and determine their rankings. Furthermore, we assessed the certainty of the evidence for each comparison using Confidence in Network Meta-Analysis (CINeMA), a web application used to evaluate the confidence of the evidence estimates from network meta-analysis. It is based on the following six domains: within-study bias (risk of bias), reporting bias (publication bias), indirectness (i.e., relevance to the research question), imprecision (comparing the range of effects with the range of equivalence), and incoherence (disagreement between direct and indirect evidence). We excluded two studies (Libardoni et al., 2018; Micarelli et al., 2020) from the meta-analysis in EO and EC conditions due to inconsistent group estimates and variances.

3 Results 3.1 Information sources and search strategy

The search strategy resulted in the retrieval of 1,244 articles from PubMed, Web of Science, and Science Direct, on 9 May 2022 and a second search on 31 December 2022. After deduplication, 689 articles were screened based on title and abstract. One hundred and seven articles met the criteria after the first screening. In-depth, full-text screening resulted in the inclusion of 44 studies for the systematic review. An overview of the study design is summarized in the PRISMA flowchart (Figure 1).

3.2 Descriptive synthesis: risk of bias assessment and methodological characteristics

The risk of bias assessment reveals extensive variation in methodological characteristics (see Supplementary Table S1). Ten studies had high-quality assessment and took measures to reduce risk of bias (Figure 2). The majority of the problematic aspects were in questions 1, 5, and 13 of the adapted QUADAS.

www.frontiersin.org

Figure 2. Assessment of study quality (QA) and risk of bias. The risk of bias, using QUADAS’s tool, for each study, is included in the systematic review. Answers were scored “Yes” or “No,” and missing or ambiguous details were scored as “Unclear.” Unreported details were scored as an ‘unclear’ risk of bias.

3.2.1 Participants

The general characteristics of the included studies are presented in Table 1. The sample size of publications included in the systematic review ranged between 30 and 16,357 participants. In total, 42,408 participants (12,043 women, 13,761 males) were included and represented mostly regular individuals (77%). Children and adults combined were primarily represented in the sample (48%). Specific cohorts were included in a few studies, such as athletes (11%), military (6.8%), and mixed (4.5%). The majority of the studies were conducted in Europe (41%) and North America (36%). The study population represented a large range of ages and participants comprising of children, adolescents, and adults. Age and sex were stratified, respectively, in 64 and 45% of the publications. Regarding age stratification, majority of the studies did not provide age groups (34%) (Albertsen et al., 2017; Beauchet et al., 2016; Carrick et al., 2007; de la Torre et al., 2017; Goble et al., 2019a; Henry et al., 2022; Krityakiarana and Jongkamonwiwat, 2016; Lara et al., 2018; Letz et al., 1996; Matsuda et al., 2010; Owen et al., 1998; Pletcher et al., 2017; Scaglioni-Solano and Aragón-Vargas, 2014; Schmidt et al., 2012; Weismiller et al., 2021); some reported values per biological age (18%) (Barozzi et al., 2014; Casselbrant et al., 2010; di Fabio and Foudriat, 1996; Ionescu et al., 2005; Libardoni et al., 2018; Micarelli et al., 2020; Mnejja et al., 2022; Verbecque et al., 2016a) and per decade (16%) (Black et al., 1982; Eriksen and Hougaard, 2023; Faraldo-García et al., 2012, 2016; Goble and Baweja, 2018b; Sackley and Lincoln, 1991; Trueblood et al., 2018), or academic level associated with school year (11%) (Charpiot et al., 2010; Goble et al., 2019b; Massingale et al., 2018; Shams et al., 2020; Sinno et al., 2021). Others used an unconventional stratification method (Alburquerque-Sendín et al., 2009; Domènech-Vadillo et al., 2019; Goble and Baweja, 2018a; Kollegger et al., 1992; Masui et al., 2005; Nishino et al., 2021; Patti et al., 2018; Perucca et al., 2021; Roberts et al., 2021). Several studies did not provide explicit reasons for omitting age stratification. However, a few studies decided to pool data as they did not observe age-based differences. We speculate that sample size constraints and the complexities of recruiting older participants may have been contributing factors, as suggested by a few studies.

www.frontiersin.org

Table 1. General characteristics of included studies.

3.2.2 Tasks and sway parameters

The included studies investigated postural sway using a variety of tasks. Measurement conditions varied largely regarding arm and foot position, use of the visual target, the number and duration of trials and sequence of the conditions. Twenty-three studies used static measurement, whereas static movement control was described in 40 studies. When possible, tasks were converted into standardized tests commonly used in clinical practice based on specified conditions. If a task did not correspond to a known test, we coded it according to precise sensory conditions. Overall, 17 different tasks were used. The majority of studies used multiple sensory perturbations (70%) and did not report psychometric properties (80%). Using the Systems Framework of Postural Control, components of postural control measured in each task were determined (Figure 3A). Three out of 17 tasks evaluated the majority of the components of the systems framework for postural control (SFPC) (4 items); only one task was used in more than three studies, the Motor Control Test (MCT). Other mostly used tasks were the Modified Clinical Test of Sensory Interaction in Balance (mCTSIB) and the SOT, which evaluated three components of the SFPC (static stability, underlying motor systems, and sensory integration) as well as Vision + FIRM (visual perturbation with firm surface) and EC + FIRM (EC with firm surface) which evaluated two components (static stability and sensory integration). Finally, EO + FIRM (EO with firm surface) was also used and evaluated one component, static stability. The four SFPC components evaluated in the MCT were static stability, underlying motor systems, reactive postural control, and sensory integration.

www.frontiersin.org

Figure 3. Methodological characteristics for tasks and sway parameters. (A) Tasks reported in n ≥ 3 number of studies based on the Systems Framework of Postural Control components evaluated in each task. (B) Sway parameters used in n ≥ 3 number of studies and grouped by domain.

A large variety of parameters for postural sway led to semantic heterogeneity in the reported data. Therefore, whenever possible, specific terms were attributed to sway parameters in this review based on the definitions in the literature to allow comparison between different studies. The publications used 45 different sway parameters to assess balance performance; only a few provided equations for calculating sway parameters (27%). Thirteen sway parameters were used by more than three studies, which were distance domain features: review of center of pressure (COP) area, COP path, and COP velocity in either the anteroposterior (AP) and mediolateral (ML) direction or in a combined plane, COP range in AP and ML direction as well as SOT ratios, SOT equilibrium scores, SOT and MCT composite scores which are non-domain specific features (Figure 3B).

Other variables were also reported as factors influencing postural control (Supplementary Figure S2A). Five studies reported a significant effect of sensory conditions in adults, whereas height mainly was reported as a non-significant variable in children (n = 3) and mixed populations (n = 4). However, the majority of the studies did not report the influence of other variables on postural sway.

3.3 Quantitative analysis

In the multilevel or network meta-analysis, we pooled data from seventeen studies involving 5,194 participants (females: 2,630; males: 2,564). Furthermore, we did not conduct quantitative analysis for Foam conditions and sway-referenced conditions (SOT 3, SOT 4, SOT 5, and SOT 6) as well as for composite score and ratio outcomes as there was insufficient number of studies (n < 5).

3.3.1 Effect of sex on postural sway

In the EO condition (Figure 4), an overall summary meta-analysis found an average sex difference of sensory modulation disorder (SMD) = −0.17, t(7.6) = −2.02, p = 0.08, 95% CI: −0.36, 0.01. The overall I2 value indicates that 79.1% was due to heterogeneity, with the variance component accounting for about 28.6% of the total variance at the study level, 25.3% at the outcome level, and 25.3% at the effect size level. The remaining 20.9% was a sampling variance. There were no influential outliers found. Subgroup analyses were conducted since high heterogeneity was detected, and no influential cases were found. Population group, region, and risk of bias were determined a priori as potential modifiers. However, none of them could explain explicitly the overall high heterogeneity (Supplementary Table S5). The overall average effect size was not sensitive to our assumption about the correlation between effect size estimates, with estimates varying from −0.187, 95% CI: −0.382, 0.008 assuming ? = from 0.0 to −0.168, 95% CI: −0.372, 0.036 assuming ? = 0.9. The total variation in true effect sizes was similarly insensitive, with a total standard deviation (SD) ranging from 0.220 to 0.255. However, individual variance component estimates were more sensitive to the assumed ? (see Supplementary Table S4A).

www.frontiersin.org

Figure 4. Forest plot visualizing the overall effect in EO, calculated using robust variance estimation (RVE). SMD, standardized mean difference between females and males, CI, confidence interval.

In EC condition (Figure 5), an overall summary meta-analysis found an estimated overall significant SMD = −0.22, t(4.6) = −4.20, p = 0.01, 95% CI (−0.34, −0.10), with non-significant low heterogeneity (I2 = 11.6%) after removing one outlier. The overall average effect size was not sensitive to our assumption about the correlation between effect size estimates, with estimates varying from −0.233, 95% CI: −0.396, −0.069 assuming ? = from 0.0 to −0.211, 95% CI: −0.347, −0.076 assuming ? = 0.9. The estimated total variation in true effect sizes was similarly insensitive, with a total SD ranging from 0.11 to 0.141. However, individual variance component estimates were more sensitive to the assumed ? (see Supplementary Table S4B).

www.frontiersin.org

Figure 5. (A–D) Forest plot visualizing the overall effect in EC, calculated using robust variance estimation (RVE). SMD, standardized mean difference between females and males; CI, confidence interval.

3.3.2 Effect of age on postural sway

Network meta-analysis in EO and EC conditions included each of the 10 studies and revealed two subnetworks, which consisted of 22 groups, 358 comparisons, and 8 study designs. The 10 characteristics were relatively similar across groups in each subnetwork, indicating an acceptable transitivity assumption (see Supplementary Table S5). The first and second subnetworks represented adults aged between 20 and 79 years old and children aged between 3 and 14 years old (Supplementary Figure S4). The subnetworks were analyzed separately.

3.3.2.1 Effect of age within adults +20 years

The first subnetwork analysis, 20–29-year-old participants were used as the reference group. Figures 6A,B shows the first connected subnetworks for both EO and EC conditions. The most frequently examined comparisons were among age groups (in years) 20–29 vs. 30–69 and 30–39 vs. 40–69, and 40–49 vs. 50–69 and 50–59 vs. 60–69. Supplementary Table S6 summarizes the main results of both direct and indirect evidence and the first subnetwork meta-analysis for EO and EC. Figures 7A,B show the treatments’ relative rankings with the group of the age range (in years) 20–29 as the reference group. Figures 7C,D shows the relative rankings with group 14 as the reference. The lowest ranked was the group of the age range (in years) 70–79 (SUCRA: 0.09 for EO and 0.09 for EC) followed by 60–69 (SUCRA: 0.28 for EO and 0.14 for EC) and 50–59 (SUCRA: 0.29 for EO and 0.38 for EC). We present the summary of relative effects in league tables (see top table, Figure 8).

www.frontiersin.org

Figure 6. Forest plot showing the SUCRA ranking and effect estimates for EO (left) and EC (right) conditions. (A,B) The age group 20–29 years was used as a reference for the first subnetwork analysis. (C,D) The age group 14 years was used as a reference for the second subnetwork analysis. SMD, standardized mean difference; CI, confidence intervals.

www.frontiersin.org

Figure 7. Network plots for EO condition (left) and EC condition (right) of the (A,B) first and (C,D) the second subnetworks. The width of the lines represents the number of studies comparing each pair of groups. The size of the circle represents the sample size in each arm.

www.frontiersin.org

Figure 8. Net league tables for the EO and EC conditions of first subnetwork (upper) and second subnetwork (lower). Relative effect estimates for the contrasts between the different intervention and control arms for the EO condition (lower triangle) and the EC condition (upper triangle). Statistically significant effects are shown in bold letters and highlighted in white cells. SMD, standardized mean difference; CI, confidence intervals.

In the EO condition, older groups of the age ranges (in years) 50–59, 60–69, and 70–79 were significantly more instable than the reference group 20–29 (EO: SMD range 0.43–0.66). The groups of the age ranges (in years) 30–39 and 40–49 were more stable than older groups 60–69 (EO: SMD range 0.61–0.75) and 70–79 (EO: SMD range 0.53–0.59).

In EC condition, older groups of the age ranges (in years) 50–59, 60–69, and 70–79 were significantly more instable than the reference group 20–29 (EC: SMD range 0.51–0.85). The groups of the age ranges (in years) 30–39 and 40–49 were more stable than the older group 70–79 (EC: SMD range 0.70–0.84). A statistically significant difference was also found for the age ranges (in years) 50–59 compared to 30–39 (SMD = 0.50, 95% CI: 0.10, 0.90) and 60–69 (SMD = 0.25, 95% CI: 0.03, 0.46); indicating that older individuals swayed more for EC.

The heterogeneity and global inconsistency were low in the first subnetwork (EO: I2 = 0, 95% CI: 0, 62.4%, χ2 = 8.59, df = 9, p = 0.47; EC: I2 = 0, 95% CI: 0, 62.4%, χ2 = 8.41, df = 9, p = 0.49). Inconsistency between designs appeared to be not an issue in this subnetwork (EO: p = 0.7; EC: p = 0.6). Supplementary Tables S8A,B summarizes the local inconsistency between direct and indirect evidence using the SIDE method for the EO and EC conditions. The results show that there are 11 pairwise comparisons contributing both direct and indirect evidence, for none of which there is evidence of inconsistency for the EO condition (Supplementary Table S8A). However, a handful of comparisons with inconsistency for EC was observed (p < 0.10 refer to Supplementary Table S8B).

3.3.2.2 Effect of age within children ≤14 years

In the second subnetwork analysis, 14-year-old participants were used as the reference group. Figures 6C,D shows the second connected subnetworks for both EO (Figure 5C) and EC (Figure 5D) conditions. The most frequently examined comparisons were between 8 vs. 9, 8 vs. 10, and 9 vs. 10. Supplementary Table S7 summarizes the main results of both direct and indirect evidence and the first subnetwork meta-analysis for EO and EC. Figures 6C,D shows the relative rankings with group 14 as the reference. The lowest ranked was group 5 (SUCRA: 0.06 for EO and 0.04 for EC), followed by 4 (SUCRA: 0.13 for EO and 0.08 for EC), 3 (SUCRA: 0.19 for EO and 0.17 for EC), 6 (SUCRA: 0.21 for EO and 0.26 for EC), and 7 (SUCRA: 0.19 for EO and 0.17 for EC). The summary relative effects from the network meta-analysis are presented in league tables of each subnetwork (see bottom table, Figure 8).

In the EO condition, children aged between 3 and 7 years swayed significantly more than the reference group 14 (EO: SMD range 1.00–1.78). Between 3 and 6 years of age group, children were more instable than older groups: 9 years (EO: SMD range 0.93–1.36), 10 years (EO: SMD range 0.90–1.33), 11 years (EO: SMD range 1.0–1.50), 12 years (EO: SMD range 1.26–1.68), and 13 years (EO: SMD range 1.38–1.80). A statistically significant difference was also found for the group of age 7 years compared to that of 11 years (EO: SMD = 0.73, 95% CI: 0.03, 1.42), 12 years (EO: SMD = 0.91, 95% CI: 0.23, 1.59) and 13 years (EO: SMD = 1.03, 95% CI: 0.34, 1.72). A significant difference was also observed between groups 5 vs. 7 for EO (SMD = 0.77, 95% CI: 0.03; 1.51). For children aged 8 years, a significant difference was reported compared to the younger children for EO (SMD for the age group of 4–6 years range 0.87–1.10) and to group 13 years (SMD = 0.70, 95% CI: 0.01, 1.39). Children aged 5 years swayed significantly more than those aged 7 for EO (SMD = 0.77, 95% CI: 0.03, 1.51).

In EC condition, children aged between 3 and 7 years swayed significantly more than the reference group of age 14 years (EC: SMD range 0.75–1.56). Between 3 and 6 years of age, children were more instable than the older groups of age 9 years (EC: SMD range 0.65–1.15), 10 years (EC: SMD range 0.44–0.94), 11 years (EC: SMD range 0.78–1.28), and 12 years (EC: SMD range 0.86–1.35), and 13 years (EC: SMD range 0.80–1.30). A statistically significant difference was also found for the age group of 7 years compared to 11 years (EC: SMD = 0.47, 95% CI: 0.09, 0.86), 12 years (EC: SMD = 0.55, 95% CI: 0.18, 0.91), and 13 (EC: SMD = 0.49, 95% CI: 0.12, 0.87). A significant difference was also observed between younger children and the age group of 7 years for EC (SMD for groups 3–6 years range 0.31–0.81). A significant difference was reported for children aged 8 years compared to older children (EC SMD for 11–12) range 0.27–0.34 and 14 (EC SMD = 0.55, 95% CI: 0.03, 1.06). Children aged 5 years swayed significantly more than the age group of 3 years for EC (SMD = −0.29, 95% CI: −0.57, −0.02). A significant difference was observed between age groups 7 and 9 years (SMD = 0.34, 95% CI: 0.00, 0.69) as well as between age groups 10 years and older children (SMD for 12–14 years range 0.36–0.62).

The heterogeneity and global inconsistency were significantly high for EO (I2 = 79.8, 95% CI: 70.2, 86.3%, χ2 = 104.06, df = 21, p < 0.0001) and low for EC (I2 = 35.4, 95% CI: 0, 61.5%, χ2 = 32.53, df = 21, p = 0.0517). Inconsistency between designs appeared to be an issue in this subnetwork (p < 0.0001 for EO and EC). The results for detaching single designs show that the between-design heterogeneity can largely be traced back to the comparison of 4 vs. 5 for EO (χ2 = 46.45, df = 4, p < 0.0001) and EC (χ2 = 15.90, df = 4, p < 0.0001). Results for the full design-by-treatment interaction model show evidence of inconsistency between designs for EO (χ2 = 91.11, df = 5, p < 0.0001) and for EC (χ2 = 26.72, df = 5, p < 0.0001). Supplementary Tables S8C,D. summarizes the local inconsistency between direct and indirect evidence using the SIDE method for EO and EC. The results show 47 pairwise comparisons contributing both direct and indirect evidence. There is evidence of inconsistency for a few comparisons among age groups (in years) (p < 0.10 for 3 vs. 4, 3 vs. 5, 4 vs. 6, 5 vs. 8 for EO, and 5 vs. 7 for EC).

3.3.2.3 Certainty of evidence

We estimated within-trial bias as the weighted average of the overall risk of bias and reporting bias was considered of some concern in the first subnetwork. Furthermore, we had no concerns about indirectness, and a clinically significant threshold was SMD = 0.20. For EO, no concern was reported about incoherence because no comparison had a global or local inconsistency. The certainty of the evidence was low or very low for the majority of the comparisons, mainly due to within-study bias and reporting bias (Supplementary Table S9A). For EC, some concerns were reported about incoherence because some comparisons had a global or local inconsistency. The certainty of the evidence was low or very low for the majority of the comparisons, mainly due to within-study bias and reporting bias (Supplementary Table S9B).

We estimated within-trial bias as the weighted average of the overall risk of bias, and reporting bias was considered low in the second subnetwork. Furthermore, we had no concerns about indirectness, and a clinically significant threshold was SMD = 1.0. We were concerned about incoherence because a few comparisons had a global or local inconsistency. The certainty of the evidence was low or very low for majority of the comparisons, mainly due to within-study bias and incoherence (Supplementary Table S9C). For EC, the certainty of the evidence was moderate or low for majority of the comparisons, mainly due to within-study bias and incoherence (Supplementary Table S9D).

3.3.2.4 Publication bias

There was no clear evidence of publication bias according to a cluster robust Egger’s test for EO of sex comparisons (p = 0.46). For the funnel plot, see Supplementary Figure S3. We could not run this analysis for EC of sex comparisons after excluding the influential outliers (≥10 needed).

4 Discussion 4.1 Summary of main results

This review aimed to map age and gender-related reference values in a healthy population, assessed by computerized posturography in static and dynamic conditions, and discuss the strengths and limitations of these norms to outline future perspectives and needs. To the best of our knowledge, this work represents the first attempt to synthesize the literature on normative data for computerized posturography using a combined mixed method. The main findings of this systematic review are as follows: (1) Extensive heterogeneity was found in methodological characteristics, (2) there was insufficient risk of bias mitigation, (3) the majority of tasks evaluated less than four components of the SFPC, and (4) studies mostly used distance domain sway parameters and did not report the influence of other variables on postural sway.

Furthermore, we conducted quantitative analyses on the age- and sex-stratified data from these sway parameters of studies with similar interventions, comparisons, and groups. We included 17 studies for sex and age comparisons, involving 5,194 participants (females: 2,630; males: 2,564), in the multilevel or network meta-analyses. We could only conduct quantitative analysis for EO and EC conditions as there was an insufficient number of studies for other conditions. Based on the multilevel meta-analyses, females appeared to significantly outperform males in EC conditions. A significant difference was observed only in EC condition. Based on the network meta-analyses, we found a significant difference in sway among individuals between 50 and 79 years of age and younger individuals, with more instability observed in older participants in both EO and EC conditions. The results also revealed that younger children swayed significantly more than those aged between 8 and 14 years in EO condition. In EC condition, older children were significantly more stable than those aged 8 years and younger.

4.2 Methodological characteristics

We were able to draw some conclusions on methodological characteristics of normative data assessed by computerized posturography. Experimental standardization and reporting are lacking, leading to a low risk of bias mitigation. Specific details are provided in the following.

Only a few studies provided age-stratified reference values. However, some reported values per biological age, decade, or academic level, which made their comparison difficult. We also observed that various tasks and sway parameters were used, leading to semantic heterogeneity. The performances were most often evaluated through distance domain features. However, some questions have been raised about the relevance and reliability of these parameters (Hébert-Losier and Murray, 2020; Ruhe et al., 2010). Hébert-Losier and Murray (2020) found that the sway area and path of the center of pressure appeared to be the most reliable. However, an evaluation of the quality of the studies revealed that only four had good methodological quality. We also found that measurement conditions varied largely regarding arm and foot position, use of visual target, the number and duration of trials, and sequence of the conditions. The majority of studies did not report the influence of other variables (i.e., height, weight, foot placement) on postural sway. These factors need to be considered as they may influence postural control (i.e., anthropometric characteristics) (Chiari et al., 2002), feet positioning (Gibbons et al., 2019), and physical activity (Lelard and Ahmaidi, 2015). These discrepancies highlight methodological disparities that may impact the generalizability of these normative data. These findings agree with systematic reviews by Verbecque et al. (2016a,b) and Quijoux et al. (2020), where a need for standardization in p

Comments (0)

No login
gif