Skip to main content

Hidden Markov model-based similarity measure (HMM-SM) for gait quality assessment of lower-limb prosthetic users using inertial sensor signals

Abstract

Background

Gait quality indices, such as the Gillette Gait Index or Gait Profile Score (GPS), can provide clinicians with objective, straightforward measures to quantify gait pathology and monitor changes over time. However, these methods often require motion capture or stationary gait analysis systems, limiting their accessibility. Inertial sensors offer a portable, cost-effective alternative for gait analysis. This study aimed to evaluate a novel hidden Markov model-based similarity measure (HMM-SM) for assessing gait quality directly from gyroscope and accelerometer data captured by inertial sensors.

Methods

Walking trials were conducted with 26 lower-limb prosthetic users and 30 able-bodied individuals, using inertial sensors placed at various lower body locations. We computed the HMM-SM score along with other established inertial sensor-based methods, including the Movement Deviation Profile, Dynamic Time Warping, IMU-based Gait Normalcy Index, and Multifeature Gait Score. Spearman correlations with the GPS, a validated measure of gait quality, were assessed, as well as correlations among the inertial sensor methods. Welch’s t-tests were used to evaluate the ability to distinguish between prosthetic subgroups.

Results

The HMM-SM and other inertial sensor-based methods demonstrated moderate-to-strong correlations with the GPS (0.49 <|r|< 0.77 for significant correlations). Comparisons between different measures highlighted key similarities and differences, both in correlations and in their ability to differentiate between subgroups. Overall, the pelvis and lower leg sensors achieved significant correlations and outperformed the upper leg sensors, which did not achieve significant correlations with the GPS for any of the signal-based measures.

Conclusion

Results suggest inertial sensors located at the pelvis and lower leg provide valid markers for monitoring overall gait quality, offering the potential to develop nonobtrusive, wearable systems to facilitate long-term monitoring. Such systems could enhance rehabilitation by enabling continuous gait assessment that can be easily integrated in clinical and everyday settings.

Background

Traditional clinical practice heavily relies on observational gait analysis, despite its subjectivity and evidence indicating limitations in validity, reliability, and responsiveness [1, 2]. Instrumented gait analysis systems offer a more objective and comprehensive approach for clinicians to assess gait patterns, make informed treatment decisions, and evaluate intervention effectiveness [3, 4], ultimately improving patient outcomes [5]. Recent interest has turned to developing summarized gait quality indices as an alternative to assessing individual gait parameters. These indices provide straightforward, interpretable measures of overall gait patterns, which can be used for assessing gait problems and monitoring gait changes over time [2, 6]. Widely recognized metrics include the Gillette Gait Index (GGI) [7], Gait Deviation Index [8], and Gait Profile Score (GPS) [9], validated across various populations with gait disability. However, these indices require motion capture systems to accurately measure a full range of lower-body spatiotemporal and kinematic parameters [10]. Such systems are not readily accessible, costly and time-intensive to use, thus limiting their integration in clinical settings [3, 4]. Moreover, such systems are restricted to the lab or clinic environment, which may not be representative of real-world gait dynamics [11,12,13]. Development of methods tailored for wearable systems could allow for more convenient and portable solutions, facilitating their use in both clinical practice and everyday environments.

Several methods have been proposed for gait quality assessment using inertial sensors, which offer advantages such as affordability, portability, and versatility [4, 14]. The IMU-based Gait Normalcy Index (INI) [10] and Multifeature Gait Score (MGS) [15] both employ principal component analysis (PCA) on a set of gait parameters to derive a single gait quality score. The INI measures 9 parameters (3 spatiotemporal and 6 kinematic), which are normalized and transformed into orthogonal features using PCA. Deviations from a reference group of able-bodied individuals are then evaluated to determine overall gait abnormality. In contrast, the MGS defines 6 aspects of gait: amplitude, temporal, distribution, complexity, symmetry, and regularity. Each aspect incorporates various gait parameters or properties of the inertial sensor signals (e.g., skew, kurtosis, entropy). Mansour et al. then apply a PCA-driven approach to identify the most important parameters, which are compared to normative values to compute an overall gait quality score [15].

These methods still have some key drawbacks. Both algorithms require accurate and reliable measurement of many gait parameters simultaneously, which remains a significant challenge for inertial sensor systems, particularly for non-sagittal kinematics and spatiotemporal variability and symmetry parameters [16]. This may also require two or more sensors, negatively influencing the comfort and wearability of these systems [17, 18]. Furthermore, the INI and MGS rely on a predefined set of gait parameters. While some parameters such as speed are commonly regarded as important to gait quality, parameters of interest vary significantly across different populations with gait disability [19,20,21]. Choice of parameters may impact the performance of an overall gait quality measure when applied to different populations, so these would need to be validated in other populations. Alternatively, non-parameter-based approaches could potentially mitigate these challenges by eliminating the need for predefined, extensively validated parameter sets and circumventing the dependence on accurate parameter measurement for gait quality assessment.

Non-parameter-based approaches directly analyze the time-series signals from gait analysis systems. Barton et al. introduced the Movement Deviation Profile (MDP) [22], which evaluates deviations in time-series data using a Self-Organizing Map (SOM) trained on normative gait patterns. However, the MDP has only been validating using the pelvis, hip, knee, and ankle kinematic signals obtained from optical motion capture systems, limiting its use with inertial sensors. A robust benchmark for many machine learning tasks [23], dynamic time warping (DTW) is a distance measure for comparing time-series signals which has been used to analyze kinematics in Parkinson’s [24] and classify gait changes in lower-limb prosthetic users (LLPU) based on gyroscope signals [25]. Hidden Markov models (HMM) are another method suitable for time-series analysis, and they have been commonly used for wearable sensor gait data classification [26,27,28]. In our previous work, we have proposed a hidden Markov model-based similarity measure (HMM-SM) for unsupervised gait assessment using accelerometer and gyroscope data from a small set (1 to 2) of inertial sensors. The HMM-SM involves training HMMs on the inertial sensor data and measuring the similarity of these HMMs to quantify overall deviation between gait patterns. Previously, we validated that the HMM-SM could assess changes from an individual’s baseline gait patterns [29]. The current study aims to assess the performance of the HMM-SM for assessing gait normalcy and gait quality using inertial sensors in LLPU. This is accomplished through 2 main objectives. (1) Compare the HMM-SM to a validated gait quality measure to explore the clinical relevance of the HMM-SM. This study used the GPS, which is a widely cited gait quality index that has been validated in multiple populations, including LLPU [30,31,32]. (2) Compare other inertial sensor-based methods to the GPS to benchmark HMM-SM performance. As part of objectives 1 and 2, we also compared the scores to two functional gait measures, the prosthetic evaluation questionnaire mobility section (PEQ-MS) [33] and locomotor capabilities index (LCI-5) [34] to further explore any relationships between the gait quality methods and measures of amputee gait function. To our knowledge, this is the first study to develop and evaluate a non-parameter-based, inertial sensor gait quality assessment method with respect to a validated gait quality index such as the GPS. Additionally, no previous studies have compared inertial sensor gait quality measure with existing validated indices or compared multiple inertial sensor-based methods. Therefore, this research provides valuable evidence regarding the validity and relative performance of the HMM-SM and other methods—both parameter-based and non-parameter-based—for gait quality assessment using inertial sensor data.

Methods

Participants

26 LLPU completed the study protocol (23 unilateral, 3 bilateral). Individuals were included if they were above the age of 5 years and could ambulate independently on level ground. Participants included 4 types of prosthetic users: transtibial amputees (TT), transfemoral amputees (TF), Van-Nes rotationplasty (VN), and individuals with limb shortening/limb length difference (LS). 30 able-bodied participants were also recruited. Able-bodied participants were defined as individuals exhibiting no obvious gait abnormalities and having no previous history of musculoskeletal, neurological, or cardiovascular disorders. Able-bodied participants were used to form the normative group (i.e., reference) that all the gait quality measures used to calculate their scores. This is comparable to initial validations for the GGI and GPS, which used 24 and 38 participants for their reference group, respectively [7, 35]. Characteristics for the LLPU and able-bodied participants are shown in Table 1, including characteristics for each of the LLPU subgroups. Informed consent was obtained from each participant at the beginning of the data collection session. The recruitment and experimental procedure were approved by the Research Ethics Board at Holland Bloorview Kids Rehabilitation Hospital (REB-0176).

Table 1 Participant demographics

Data acquisition

Participants were instrumented with the Xsens Awinda system (Xsens Technologies BV, Enschede, Netherlands) and wore 8 inertial sensors located on the lower body and sternum, as shown in Fig. 1. This system has been well-validated for kinematic measurements, demonstrating excellent reliability in the sagittal plane and fair-to-excellent reliability in the other planes [36]. The placement of the sensors followed anatomical markers suggested in the Xsens user manual [37]. The Xsens collected orientation-free accelerometer (range ± 16 g) and angular velocity data (range ± 35 rad/s) as well as foot contacts, 3D position and orientation, and kinematic signals at 100 Hz. The accelerometer and angular velocity signals were down-sampled to 40 Hz and used by the signal-based ML models as well as to calculate any signal parameters (e.g., skew, kurtosis), while the foot contact, position, orientation, and kinematics were used to calculate spatiotemporal and kinematic gait parameters.

Fig. 1
figure 1

Example participant setup for the Xsens Awinda. Able-bodied and LLPU participants followed the same setup. Sensors on bridge of both feet, lower legs below the knee (shank), outside of upper legs parallel with sagittal plane (thigh), center of pelvis on the sacrum, and center of sternum. Free acceleration and angular velocity were used to train the ML models and calculate signal parameters

All participants completed walking trials along a 15 m straight pathway, stopping and turning around at either end. The first and last gait cycles of each pass were excluded to ignore the starting and stopping portion. LLPU participants completed 20 passes, which was able to capture at least 100 steady-state gait cycles for each participant. 100 gait cycles were sampled randomly from each of the LLPU participants for calculating the gait quality scores. Able-bodied participants were instructed to complete 10 passes, which resulted in at least 50 gait cycles. We randomly sampled 10 gait cycles from each able-bodied participant and aggregated these to form the normative/reference group used for each of the gait quality measures, similarly to the process by Wang et al. [10].

LLPU participants also completed 2 self-report functional measures, the Prosthesis Evaluation Questionnaire-Mobility Section (PEQ-MS) and the Locomotor Capabilities Index (LCI-5). These were included to assess relationships between gait mobility capacity as measured by the questionnaires and outputs from the gait quality scores, and to potentially provide more information concerning the clinical relevance of the inertial sensor methods.

Gait quality measures

Multiple gait quality measures were used for this study. All methods were able to provide a continuous output of gait similarity or distance (as opposed to a discrete classification scheme), analogous to the previously validated gait indices such as the GPS, GDI, and GGI. The primary inertial sensor-based method used was the HMM-SM, which was compared against 4 other inertial sensor-based measures. Detailed descriptions for these methods are provided in the subsequent sections. We categorize inertial sensor-based methods into two types: signal-based and parameter-based. Signal-based methods utilize raw time-series data from wearable sensors (such as gyroscopes and accelerometers) directly as inputs to the model for determining gait quality scores, and includes the HMM-SM, MDP, and DTW. In contrast, parameter-based methods involve calculating discrete parameters which can be derived from inertial sensor data, which are then used to assess overall gait quality, and include the INI and MGS.

For this study, the GPS was used as the gold standard gait quality measure to evaluate the performance of the inertial sensor gait quality methods. It was calculated as outlined in literature [35] using kinematic data obtained from the Xsens. Specifically, the GPS takes into account the pelvic orientation, foot progression, and joint angles from the hips, knees, and ankles. A Euclidean distance is calculated throughout the gait cycle between a participant’s gait and a set of normative gait (e.g., able-bodied individuals) to determine a gait deviation score. To calculate an individual’s GPS score, we used the mean score of the 100 sampled gait cycles.

All the gait quality measures in this study, including the GPS, derive their score by comparing LLPU participant data to a normative dataset representing ideal gait, typically able-bodied gait. The normative (i.e., reference) dataset in this study was formed as detailed in the previous section (Data acquisition) using the 10 randomly sampled gait cycles from each able-bodied participant.

Hidden Markov model similarity measure (HMM-SM)

The HMM-SM leverages a computationally low-cost method developed by Sahraeian and Yoon [38] to assess the similarity of HMM models. Each HMM \(\lambda\) is characterized by 2 main matrices A and B. A is an N × N matrix where each element \({a}_{i, j}\) represents the probability the system would transition from state i to state j at a given time. The number of states, N, is a hyperparameter of the system. B is the emissions probability matrix consisting of N elements \({b}_{1},\dots ,{b}_{N}\) where each element represents the likelihood of system outputs for a given state. These likelihoods are represented by a \(\mu\) (mean) and covariance matrix. To quantify the similarity of two HMMs (\({\lambda }_{a}\text{ and }{\lambda }_{b}\)), we first calculate a state-correspondence matrix Q where each element \({q}_{i,j}\) indicates the overall similarity between the emission matrix elements \({b}_{a}[i] \text{and }{b}_{b}[j]\), evaluated using symmetric Kullback–Leibler divergence. Subsequently, a similarity measure \(S({\lambda }_{a}||{\lambda }_{b})\) is calculated based on the sparsity of Q:

$$S({\lambda }_{a}|\left|{\lambda }_{b}\right)\triangleq \frac{1}{2}\left[\frac{1}{M}\sum_{i=1}^{M}H\left({r}_{i}\right)+\frac{1}{{M}{\prime}}\sum_{i=1}^{{M}{\prime}}H\left({c}_{j}\right)\right]$$
(1)

where ri is the i-th row of Q, cj is the j-th column of Q, and M and \({M}{\prime}\) are the number of rows and columns in Q. \(H({\varvec{u}})\) represents the normalized Gini Index which returns the sparsity of vector u from 0 to 1. A similarity score, \(S({\lambda }_{a}||{\lambda }_{b})\), closer to 1 indicates a higher degree of similarity between the HMMs. Using these properties, we can compare individual’s gait to that of ideal or normative gait to calculate a single gait quality score, where greater similarity to normative gait represents higher gait quality.

We conducted preliminary validation simulating gait perturbations in able-bodied individuals, which provided insights into hyperparameters and sensor configurations to use for the HMM-SM [29]. For this study, 5-state HMMs were used. Experiments demonstrated including multiple gait cycles increased HMM training consistency. This is also done for other HMM-based methods such as in Cuzzolin et al. which uses gait data from a 10 m pass to train their HMM models [26]. For each participant, we transformed their randomly sampled set of 100 gait cycles into a set of multi-cycle sequences. This was accomplished by using a sliding window to iteratively select groups of 10 gait cycles, which were then concatenated along the time axis. The concatenated sequences were used to train the HMM model for each participant (\({\lambda }_{p}\)). Each HMM, \({\lambda }_{p}\), was then compared to the HMM trained on the normative dataset (\({\lambda }_{control}\)) using the similarity measure from Eq. 1 to determine the participant’s gait quality score. An overview of this is provided in Table 2, along with the other inertial sensor-based gait quality measures.

Table 2 Summary of the inertial sensor-based gait quality measures

For this study, we tested three sensor configurations identified from preliminary investigations: pelvis, combined upper leg signals, and combined lower leg signals [29]. The single-sensor data were 6 × T arrays (tri-axial gyroscope and accelerometer signals, over T time points following the concatenation process). This meant for the pelvis configuration, the HMMs were trained on 6 × T arrays, whereas for the combined sensor configurations (upper leg and lower leg), the sensor data was stacked such that the HMMs were trained on 12 × T arrays.

Other inertial sensor gait quality measures

In addition to our method, the HMM-SM, we calculated scores for the INI, MGS, and MDP to serve as benchmarks against which to compare the HMM-SM’s performance. All methods were calculated as outlined in literature [10, 15, 22]. As an additional benchmark, we also evaluated DTW, which is a common baseline for evaluating time-series model performance [23]. For this work, we computed an average DTW distance between LLPU and able-bodied gait cycles. This was done using the tslearn library for determining the DTW distance of multivariate time series [39]. Table 2 provides a summary of the processes for the inertial sensor-based methods.

Neither the INI nor MGS publications included whether the right or left side were used for calculating the non-symmetry parameters, so gait parameters from the prosthetic side were used for unilateral prosthetic users, while the averages of right and left were used for bilateral prosthetic users. For the signal-based measures, the MDP and DTW, we tested the same sensor configurations as for the HMM-SM (pelvis, upper legs, and lower legs). For the MDP, instead of using the different kinematic signals as demonstrated by Barton et al. [22], the SOM was trained using the tri-axial accelerometer and gyroscope signals. To our knowledge, the MDP has yet to be evaluated using just wearable sensor signals. To calculate the INI, MGS, and MDP scores for each participant, we averaged the scores of the 100 sampled gait cycles, the same as for the GPS.

Data analysis

The Spearman’s rank correlation coefficient was used to evaluate the relationship between the various gait quality measures used in the study. The primary correlations of interest were between the GPS and the HMM-SM from the 3 sensor locations as well as between the GPS and other inertial sensor-based gait quality measures. These were used to evaluate concurrent validity with respect to the GPS and benchmark the performance of the HMM-SM.

Because of how the scores are constructed, we would expect the HMM-SM to have a negative correlation with the GPS (i.e., r-value < 0). This is due to the HMM-SM being a similarity measure between 0 and 1, where a score of 1 indicates no deviation, and decreasing scores (closer to 0) indicate greater deviation (i.e., lower similarity) with respect to the reference dataset. This is opposite to the GPS, whose range starts at 0 (no deviation) and in which increasing scores correspond to greater deviation with respect to the reference gait. Like the GPS, the remaining inertial sensor-based measures (MDP, DTW, INI, and MGS) should anticipate positive correlations with the GPS, as they are all formulated such that a score of 0 indicates no deviation, and increasing scores correspond to greater deviation from reference gait.

We also calculated the correlation coefficients between the different inertial sensor-based measures to identify any significant relationships. Lastly, we assessed correlations between the inertial sensor-based methods and the self-report measures. For this study, we considered an absolute value of r less than 0.1 as negligible correlation, 0.1–0.4 as weak, 0.4–0.7 as moderate, 0.7–0.9 as strong, and greater than 0.9 as very strong [40]. An alpha value of 0.05 was used to determine significance for all the statistical tests. To mitigate the risks arising from multiple comparisons, we applied a Benjamini-Hochberg (BH) correction with a false discovery rate (q) of 0.05 to determine significance of the correlations [41]. We used the protocol described by Benjamini and Hochberg [42] to calculate adjusted p-values which we report in this paper.

We also aimed to evaluate the discriminative validity by assessing differences in the gait quality measures between the different LLPU subgroups (TT, TF, VN, and LS). A Shapiro–Wilk test was used to evaluate normality of the data within each subgroup. A repeated measures ANOVA (RM-ANOVA) was used to determine whether any differences were present between the different subgroups for each of the inertial sensor-based algorithms. Post-hoc Welch’s t-tests were then performed to identify which groups significantly differed (alpha = 0.05).

Results

Correlations between gait quality measures

We analyzed eleven scores derived from inertial sensor gait quality measures in relation to the GPS. This includes three sensor configurations for the HMM-SM, MDP, and DTW, and one score each for the INI and MGS. Figure 2 displays the comparison of inertial sensor scores with GPS results, and Table 3 summarizes the Spearman’s rank correlation tests comparing all the different gait quality measures. Because the HMM-SM is a similarity measure, higher scores correspond to better gait quality (i.e., greater similarity to reference dataset). In contrast, higher scores for the other measures (INI, MGS, MDP, DTW, and GPS) should reflect worse gait quality as it indicates increased gait deviation. The results showed many significant correlations between the inertial sensor-based measures and the GPS. Additionally, there were many correlations between the different sensor configurations and between the different inertial sensor-based methods.

Fig. 2
figure 2

Correlations between the GPS (horizontal axis) and inertial sensor-based measure scores (vertical axis). The HMM-SM, DTW, and MDP algorithms are grouped, with plots for each of the sensor configurations evaluated in the study (upper leg sensors, lower leg sensors, and pelvis sensor). Plots with bolded correlation and adjusted p-values indicate significant correlation with the GPS (α = 0.05) for that configuration. R-values and p-values obtained using Spearman’s rank correlation coefficient as outlined in the methodology

Table 3 Correlations across gait quality measures

All the signal-based measures had at least one sensor configuration which was significantly correlated with the GPS, and these were all moderate correlations. The HMM-SM lower leg configuration showed a moderate negative correlation with the GPS (r = −0.55, p = 0.011). Other time-series measures which were significantly correlated with the GPS were the DTW lower leg (r = 0.68, p < 0.001), DTW pelvis (r = 0.57, p = 0.007), MDP lower leg (r = 0.49, p = 0.028), and MDP pelvis (r = 0.57, p = 0.008). HMM-SM and MDP exhibited comparable performance, with moderate correlations with the GPS for the lower leg configurations. Among the signal-based measures, DTW demonstrated the strongest correlation with the GPS, with the lower leg configuration results approaching a strong correlation. In general, across the signal-based measures (HMM-SM, MDP, and DTW), the sensor location on the upper legs performed worse than the pelvis and the lower limbs.

The INI demonstrated the strongest correlation with the GPS (r = 0.76, p < 0.001), while the MGS showed a moderate negative correlation (r = −0.49, p = 0.029). Notably for the MGS, the negative correlation was opposite what would be expected, indicating MGS scores improved as gait kinematics deviated from the reference dataset. This was the only measure to report significant correlations in the opposite direction as expected.

There were also many correlations among the different inertial sensor configurations, as seen in the Table 3 non-GPS columns. Comparing different sensor configurations with a given algorithm (e.g., DTW upper leg vs. DTW pelvis), the MDP exhibited strong correlations between the sensor configurations, while DTW exhibited moderate correlation between upper and lower leg (r = 0.61) and strong correlations between the pelvis and both the upper and lower leg configurations (r = 0.73 and r = 0.72, respectively). The HMM-SM sensor configurations did not show significant correlations with each other.

Comparing different measures, the HMM-SM lower leg configuration showed moderate correlation with the MDP pelvis, DTW lower leg and pelvis, INI, and the MGS (HMM-SM Lower column, Table 3). The MDP and DTW measures were highly correlated, with significant correlations observed for all the MDP and DTW comparisons. Only the comparison between MDP-Upper and DTW-Lower was not strong (r = 0.59), and the MDP-DTW upper and MDP-DTW pelvis correlations were both very strong (r > 0.90). The DTW lower leg configuration also showed moderate correlations with both the parameter-based measures (DTW Lower column, Table 3). The INI and MGS had a significant negative correlation.

Lastly, we compared the inertial sensor-based scores to the results from the functional measure questionnaires – the PEQ-MS and LCI-5 – to identify any relationships between the measures and assess whether the inertial sensor-based measures could be used to estimate functional ability in LLPU. As shown in Table 4, neither functional measure demonstrated significant correlations with any of the gait quality measures.

Table 4 Correlations with functional measures

Differences between prosthetic levels

We evaluated differences among prosthetic levels by analyzing gait quality measures across different prosthetic user types (TT, TF, VN, and LS). Mean and standard deviation values for each subgroup are presented in Fig. 3. Welch’s t-tests were used to identify significant differences. The results from the Welch’s test are presented in Table 5 and demonstrate that the gait quality measures varied in which subgroups they differentiated between.

Fig. 3
figure 3

Results from post-hoc t-tests comparing the different amputee levels. Significance bars indicate levels where statistical significance was detected, with corresponding significance level (*≤0.05, **≤0.01, and ***≤0.001). The GPS (reference measure) results are shown in the top left. The VN group scores were significantly higher than TT or TF amputees. Differences between subgroups were also found in the following inertial sensor configurations: INI, MGS, HMM-SM Lower Leg, HMM-SM Pelvis, MDP Lower Leg, and DTW Lower Leg. TT Transtibial, TF Transfemoral, VN Van Nes, LS Limb Shortening

Table 5 Comparison of prosthetic levels

The GPS detected 2 subgroup differences. Based on the GPS the VN group had significantly worse gait quality compared to both the TT (p = 0.008) and TF (p = 0.006) groups, as evidenced by higher scores. The mean GPS score for the LS group was 5.5° higher than those of the TT or TF groups, well above the minimal clinically important difference of 1.7° in LLPU identified by Carse et al. [30]. However, the differences between the TT-LS and TF-LS groups were borderline significant (p = 0.065 and p = 0.062, respectively), but did not reach the significance threshold of 0.05, potentially due to the smaller sample size of the LS group (n = 3).

All measures detected differences between at least one set of subgroups. The HMM-SM captured a total of 5 significant sub-group differences, across 2 sensor locations, which was the most among the signal-based measures. Specifically, for the HMM-SM lower leg configuration, LS participants scored significantly lower than the TT and VN groups, while for the pelvis configuration, LS participants scored lower than all three other prosthetic types. The DTW lower leg configuration best matched the GPS as it showed the same 2 significant differences as the GPS, between the LS group and both the TT and TF participants. The other two locations showed no differences. The MDP only revealed one significant sub-group difference in total across all three sensor locations, between TF and VN. In general, across the signal-based measures, the upper legs configuration performed worse than the pelvis and lower legs, as none of the HMM-SM, DTW, or MDP upper leg configurations revealed any significant differences between the prosthetic types. The INI detected 3 differences, including between the LS group and both the TT and TF groups, as well as between the TT and TF groups. The MGS revealed a single difference, between TF and VN.

There were no clear patterns for which subgroups the measures discriminated, as these varied between measures and sensor locations. For example, the HMM-SM only detected differences between LS and the other subgroups, while DTW results differentiated between TF and VN. TF-VN difference was most commonly identified (4 instances), followed by 3 for TT-LS, 1 for TT-TF, 2 for the remaining subgroups.

We also explored partitioning the LLPU into only two groups. To do this, we considered the VN and the TT as one group and the LS and TF as the second group. For instance, some studies consider VN also as below knee amputees in their analysis [43]. However, in this configuration, there were no statistically significant differences between the two groups for any of the gait quality measures.

Discussion

This study aimed to evaluate the proposed HMM-SM as a novel method for assessing gait quality using inertial sensor signals. To accomplish this, we examined the concurrent validity of the HMM-SM comparing its results with those obtained from the GPS, which were derived from the same walking trials. We also compared the HMM-SM and GPS with other inertial signal-based gait quality measures and time-series analysis techniques from the literature to benchmark its performance against existing methods. Additionally, we investigated the discriminative capabilities of these methods by analyzing how they differentiated between different types of prosthetic users. At a high level, all of the signal-based measures including the HMM-SM were at least moderately correlated with the GPS and were able to discriminate between at least 2 LLPU subgroups. Hence, these findings provide both confirmatory and new evidence (particularly in the case of HMM-SM) that inertial sensor signals could be used to assess overall gait quality. A summary of advantages and disadvantages of the inertial sensor-based algorithms, based on findings from the study, is included in Table 6 (excluding the MGS), with in depth discussion in the following sections.

Table 6 Summary of characteristics for inertial sensor-based gait quality measures

Inertial sensor-based measures for gait quality assessment

Among the signal-based methods, DTW exhibited the strongest correlations with the GPS, suggesting that even a relatively simple time-series analysis method like DTW could be used to evaluate overall gait quality. Studies in other domains such as time-series classification demonstrate that even 1-Nearest Neighbor DTW is a robust benchmark, with state-of-the-art algorithms failing to surpass it or achieving only marginal accuracy improvements [23, 44]. However, the HMM-SM may offer additional advantages over DTW beyond correlation with the GPS. For example, previous studies have used HMMs for gait event identification and gait phase analysis [45,46,47]. Therefore, the HMM-SM framework could be used to assess overall gait quality while simultaneously allowing for extraction of gait events and parameters which may be clinically useful.

Results suggest that optimal locations for assessing gait patterns using inertial sensors are the lower legs and pelvis. DTW and MDP both demonstrated significant correlations with the GPS for lower leg and pelvis configurations. The HMM-SM only showed significant correlations for the lower-leg, but this is in line with findings from initial investigations with able-bodied individuals that showed decreased HMM-SM reliability using the pelvis compared to both the upper leg and lower leg configurations [29]. None of the signal-based algorithms (HMM-SM, MDP, and DTW) showed significant correlation with the GPS using upper leg sensors. Although sensor placement and quantity vary across gait studies, results from this study support previous literature which indicates that the most common locations are the pelvis and lower legs [48,49,50].

Comparing GPS correlations between the signal-based and parameter-based measures, the HMM-SM and other signal-based measures (MDP and DTW) outperformed the MGS, whose scores improved as GPS scores worsened. In contrast, the INI showed the strongest correlation among the inertial sensor-based measures (r = 0.76, p < 0.001), indicating it could be a valid measure of gait quality. This was notable given that the INI uses a different set of gait parameters than the GPS, suggesting that a PCA-based approach can effectively quantify overall gait deviation. However, the MGS, which also used a PCA-based approach, decreased as the GPS and INI scores worsened. This discrepancy highlights how the choice of parameters and techniques can significantly impact the performance of parameter-based gait quality measures. Variations in parameter importance across different disability populations could contribute to this. Wang et al. suggest that parameters for a gait quality measure may need adjustments for different populations, particularly since symmetry measures, which are crucial for hemiplegic or asymmetric gait, were not included in the INI [10].

The HMM-SM, DTW, and MDP methods, which do not require extensive gait parameter validation and tuning, could be more adaptable across clinical populations. This would also reduce the reliance on accurately and reliably measuring a broad set of gait parameters. This is advantageous given that wearable systems can be less accurate than non-wearable systems, require complex algorithms for parameter assessment (e.g., obtaining the motion trajectory and kinematics for the INI), and typically assess a limited number of gait parameters [1, 3].

Correlations between inertial-sensor based methods

None of the HMM-SM sensor configurations showed significant correlations with each other. We might have expected some correlations given that movements in the pelvis, upper legs, and lower legs might impact each other. Baker et al. observed in their original validation of the GPS that individual component scores for different kinematics did not strongly correlate with each other [35]. This suggests that changes in gait patterns may have varied effects across different lower body locations and movement planes, potentially explaining the lack of significant correlations among HMM-SM configurations.

In contrast, comparing the sensor configurations within each of the DTW and MDP results, the three configurations were highly correlated with each other. The DTW lower leg versus upper leg exhibited a moderate correlation (r = 0.61), while the other comparisons all showed strong correlations. This indicates the HMM-SM may be more sensitive to changes in sensor configuration, whereas DTW and MDP are relatively invariant to sensor placement. Several factors could explain this. Firstly, the HMM training might be more inconsistent due to limitations in sample size or issues with hyperparameter selection. This inconsistency can affect the performance of HMM-SM independently of any changes in the sensor configuration. Another reason could be the fundamental differences between HMM-SM and the other methods. DTW and MDP are relatively simple distance measures that directly compare the time-series data. In contrast, HMM-SM involves a more complex analysis, as it compares the learned patterns from the sensor signals through a probabilistic model. This more abstract approach might make HMM-SM more sensitive to variations in sensor configuration. The increased consistency across the sensor configurations in the DTW and MDP algorithms could offer greater flexibility when integrating them into wearable systems. This flexibility might allow for sensor placement decisions based on other factors such as type of ambulatory analysis [51], accurate gait event detection [14], or activity analysis [52].

Correlations between the inertial sensor methods suggest some convergent validity among the different inertial sensor-based measures, indicating that various methods can extract similar relevant features from gait. For example, the HMM-SM lower leg results were significantly correlated with both the MDP pelvis configuration and the DTW lower leg and pelvis configurations. The DTW lower leg configuration showed significant correlation with the INI, suggesting that parameter-based and inertial signal-based approaches can provide similar evaluations of LLPU gait. This supports the idea that we can use inertial signals to implicitly assess changes in overall spatiotemporal and kinematic gait aspects such as those used by the INI and the GPS.

The DTW and MDP algorithms were highly correlated with each other. At a conceptual level, the MDP is similar to DTW in that for each point along the gait cycle, the MDP uses Euclidean distance to find the closest matching point in the SOM trained on reference gait. DTW employs an analogous strategy but includes additional constraints on the warping window (i.e., how far before or after to search for matching points) and the allowable warping path [53]. Although training a SOM can be computationally expensive [54], it needs to be trained only once on the reference dataset. Thus, the trade-off between computational complexity and performance should be evaluated when determining the most applicable method.

Correlations with self-report measures

Neither the LCI-5 nor the PEQ-MS exhibited significant correlations with the gait quality measures including GPS. This could be due to the sample size, and larger sample sizes may be needed to reveal potential existing correlations. However, previous studies have underscored the potential differences between gait quality measures and measures of gait function or self-reported capacity [55, 56]. While some research has indicated positive correlations between improvements in gait quality measures (e.g., quantified gait parameters, summarized gait indices like the Gait Deviation Index) and functional assessments like the Functional Gait Assessment [57] or the Timed Up and Go test [55], others have found low correlates between gait measures and with functional measures like gait speed [56] or self-report measures of ambulation [55]. The results of this study further highlight the distinction between gait quality and ambulatory function or capacity, emphasizing the importance of developing methods that can monitor gait quality over the long term.

Gait quality results across prosthetic levels

All measures (at various sensor locations) detected differences between at least one set of subgroups. The HMM-SM pelvis configuration and INI discriminated between the most sub-groups (n = 3). The rest of the measures including the GPS discriminated between one or two sub-groups. Additionally, if we consider only tracking a single sensor location (in cases where the goal is to maximize wearability of the gait assessment system), then the HMM-SM is the only measure to discriminate between 3 subgroups.

The variability in the discriminative performance among gait quality measures correlated with the GPS further underscores that while different inertial sensor-based measures can assess overall gait quality, each measure may have its own strengths and specific performance characteristics. For example, the HMM-SM may be particularly effective for monitoring pelvic kinematics (or other kinematic changes for individuals with lower-limb difference), whereas DTW could more closely approximate GPS performance.

Contrary to other studies comparing TT and TF gait [32, 58, 59], neither the GPS nor any of the signal-based measures (HMM-SM, MDP, and DTW) identified differences between the TT and TF groups. This might be attributed to the fact that 4 out of the 7 TF amputees used microprocessor knee joints, which significantly improve gait patterns, potentially approaching able-bodied gait kinematics [60].

The HMM-SM was particularly effective at discriminating the LS group. Individuals with limb discrepancy often use pelvic compensatory strategies and display increased pelvic obliquity (i.e., left or right side of the hip higher than the other) [61]. This may explain why the pelvis configuration consistently differentiated the LS group from the others, suggesting the HMM-SM may be more responsive to changes in pelvic kinematics than the GPS. It also could indicate that the HMM-SM is particularly effective at monitoring the compensatory strategies employed by individuals with limb discrepancy (pelvic or otherwise), as evidenced by the performance of both the lower leg and pelvis configurations.

Future work and limitations

This study identified significant correlations between the GPS and the HMM-SM as well as between the GPS and other inertial sensor-based methods, supporting the use of inertial sensors for monitoring overall gait quality. Furthermore, the study involved a diverse LLPU group from 9 to 63 years of age, comprising various prosthetic types—transfemoral, transtibial, Van Nes, and congenital limb shortening—as well as a mix of unilateral and bilateral LLPU. This suggests that the HMM-SM and other inertial signal-based measures can be applied to assess a wide range of gait characteristics and deviations.

However, future research should seek to address several study limitations. The study protocol did not allow for repeatability testing, as we only collected gait data during a single session for each participant. Among the inertial sensor methods evaluated, only the MGS reported repeatability outcomes [15]. Reliability is crucial for gait metrics to detect significant changes in gait patterns [62], so future studies should include test–retest assessments over multiple time periods without intervention to evaluate the repeatability of inertial sensor-based gait quality measures. It should be highlighted that the inertial sensor methods were validated against GPS using kinematics from an inertial sensor system (the Xsens MVN) as opposed to kinematics from an optical/camera motion capture system. Although XSens MVN has been well-validated as described in the methodology, this could affect the results or be a source of differences if future studies were to use alternative methods (e.g., optical systems) for calculating the GPS.

Additionally, future research should attempt to involve larger sample sizes. While the sample size used for the LLPU in this study (n = 26) was comparable to those used in existing studies such as for the INI (n = 8) [10] and LLPU GPS validation (n = 20) [55], more comprehensive validation studies for the GGI and GPS involved 64 and 407 participants for the gait disability cohorts, respectively [7, 35]. Larger sample sizes would enhance our ability to assess the correlation between the HMM-SM and the GPS and determine if any non-linear correlations exist. It could also enable us to assess correlations among the specific LLPU subgroups, which could reveal additional relationships not seen in the study or provide further evidence for correlations between the inertial sensor measures and the GPS. Future work should also continue hyperparameter testing, particularly for the HMM-SM, to explore the effects on correlation with the GPS and whether the methods from this study are broadly applicable to larger data sets. The HMM-SM exhibited more variability in performance than DTW or MDP among the different sensor configurations, which could also be indicative of overfitting by the model during the HMM training. Future studies should assess variance in the hyperparameter learning between sets of similar gait patterns (e.g., trials/sets of 100 gait cycles where gait is expected to not have changed, verified by similar GPS scores) to evaluate whether HMM training is consistent. Potential addition of regularization to the HMM training step or hyperparameter tuning (such as number of HMM states) may be necessary in the case of inconsistent HMM training or overfitting.

Although this study included correlation results for the GPS compared to parameter-based inertial sensor methods (INI and MGS), it did not explore correlations between individual parameters and the GPS. This was primarily due to this work’s focus on exploring the validity of signal-based measures, which could reduce the need for more complex multi-sensor systems. Future studies with larger sample sizes could investigate correlations among gait parameters, including but not limited to those used in INI and MGS, to determine whether a minimal set of gait parameters and number of sensors could accurately estimate the GPS. Furthermore, it could elucidate gait parameters which strongly contribute to (or are affected by) overall gait quality and gait kinematics.

Long-term research should also test the HMM-SM and other methods in more natural, free-living conditions, such as in environments without laboratory assistance or during natural walking conditions like on flat sidewalks. Inertial sensor-based methods are intended for integration into systems that clinicians and individuals can use during routine activities. Therefore, it is important to examine whether variations in sensor placement and orientation or less controlled walking conditions affect their performance.

While able-bodied gait patterns were used in this study, as in other studies evaluating gait quality measures [7, 10, 35], future applications could also investigate using other reference sets such as high-functioning LLPU individuals to determine whether able-bodied gait is a relevant and attainable gold standard for LLPU or other disability populations. Finally, future studies should evaluate the validity of inertial sensor-based gait quality measures in other populations to ensure its generalizability and effectiveness across different disability groups.

Conclusions

The HMM-SM, along with several other inertial sensor-based measures, demonstrated significant correlation with a clinically validated measure of gait quality when tested on a diverse group of prosthetic users. This preliminary evidence suggests that the HMM-SM could be an effective tool for assessing gait quality. Additionally, results from the HMM-SM, DTW, and MDP indicate that overall gait quality could be assessed using just the signals from a small set of inertial sensors. These methods could offer an easily interpretable assessment of gait pattern deviations without requiring extensive parameter tuning or model training, making them adaptable to a wide range of gait deviations.

Given the substantial time pressures faced by clinicians, it is crucial to develop systems that are quick and easy to use to encourage their adoption in clinical settings. Furthermore, inertial sensor-based methods can enable continuous gait monitoring outside the clinic and support the ongoing assessment of gait progress and long-term changes in gait patterns.

Availability of data and materials

No datasets were generated or analysed during the current study.

Abbreviations

DTW:

Dynamic Time Warping

GD:

Gait duration

GGI:

Gillette Gait Index

GPS:

Gait Profile Score

HMM:

Hidden Markov Model

HMM-SM:

Hidden Markov Model-based Similarity Measure

INI:

Inertial Measurement Unit-based Gait Normalcy Index

LCI-5:

Locomotor Capabilities Index-5

LLPU:

Lower-Limb Prosthetic User

LS:

Individual with Limb Shortening

MAB:

Maximum ankle abduction

MAD:

Maximum ankle adduction

MDP:

Movement Deviation Profile

MGS:

Multifeature Gait Score

MH:

Maximum ankle height

MHD:

Ankle horizontal displacement at MH

MV:

Maximum ankle velocity

PEQ-MS:

Prosthetic Evaluation Questionnaire-Mobility Section

PSP:

Percentage swing phase

PCA:

Principal Component Analysis

RM-ANOVA:

Repeated Measures Analysis of Variance

SL:

Stride length

SRM:

Shank range of motion in swing phase

TF:

Transfemoral amputee

TT:

Transtibial amputee

VN:

Van Nes amputee

References

  1. Hulleck AA, Menoth Mohan D, Abdallah N, El Rich M, Khalaf K. Present and future of gait assessment in clinical practice: Towards the application of novel trends and technologies. Front Med Technol. 2022;4:901331. https://doiorg.publicaciones.saludcastillayleon.es/10.3389/FMEDT.2022.901331.

    Article  PubMed  PubMed Central  Google Scholar 

  2. Gonçalves SB, Lama SBC, da Silva MT. Three decades of gait index development: a comparative review of clinical and research gait indices. Clin Biomech. 2022;96: 105682. https://doiorg.publicaciones.saludcastillayleon.es/10.1016/J.CLINBIOMECH.2022.105682.

    Article  Google Scholar 

  3. Muro-De-La-Herran A, Garcia-Zapirain B, Mendez-Zorrilla A. Gait analysis methods: an overview of wearable and non-wearable systems, highlighting clinical applications. Sensors. 2012;14:3362–94. https://doiorg.publicaciones.saludcastillayleon.es/10.3390/s140203362.

    Article  Google Scholar 

  4. Chen S, Lach J, Lo B, Yang GZ. Toward pervasive gait analysis with wearable sensors: a systematic review. IEEE J Biomed Health Inform. 2016;20(6):1521–37. https://doiorg.publicaciones.saludcastillayleon.es/10.1109/JBHI.2016.2608720.

    Article  PubMed  Google Scholar 

  5. Wren TAL, Tucker CA, Rethlefsen SA, Gorton GE, Õunpuu S. Clinical efficacy of instrumented gait analysis: systematic review 2020 update. Gait Posture. 2020;80:274–9. https://doiorg.publicaciones.saludcastillayleon.es/10.1016/J.GAITPOST.2020.05.031.

    Article  PubMed  Google Scholar 

  6. Cimolin V, Galli M. Gait scores – interpretations and limitations. In: Handbook of human motion, p. 1–15, 2016, https://doiorg.publicaciones.saludcastillayleon.es/10.1007/978-3-319-30808-1_34-1.

  7. Schutte LM, Narayanan U, Stout JL, Selber P, Gage JR, Schwartz MH. An index for quantifying deviations from normal gait. Gait Posture. 2000;11(1):25–31. https://doiorg.publicaciones.saludcastillayleon.es/10.1016/S0966-6362(99)00047-8.

    Article  CAS  PubMed  Google Scholar 

  8. Schwartz MH, Rozumalski A. The gait deviation index: a new comprehensive index of gait pathology. Gait Posture. 2008;28:351–7. https://doiorg.publicaciones.saludcastillayleon.es/10.1016/j.gaitpost.2008.05.001.

    Article  PubMed  Google Scholar 

  9. Baker R. Gait analysis methods in rehabilitation. J Neuroeng Rehabil. 2006;3:4. https://doiorg.publicaciones.saludcastillayleon.es/10.1186/1743-0003-3-4.

    Article  PubMed  PubMed Central  Google Scholar 

  10. Wang L, Sun Y, Li Q, Liu T, Yi J. IMU-based gait normalcy index calculation for clinical evaluation of impaired gait. IEEE J Biomed Health Inform. 2021;25(1):3–12. https://doiorg.publicaciones.saludcastillayleon.es/10.1109/JBHI.2020.2982978.

    Article  PubMed  Google Scholar 

  11. Carcreff L, et al. Comparison of gait characteristics between clinical and daily life settings in children with cerebral palsy. Sci Rep. 2020;10(1):2091. https://doiorg.publicaciones.saludcastillayleon.es/10.1038/S41598-020-59002-6.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  12. Shah VV, et al. Laboratory versus daily life gait characteristics in patients with multiple sclerosis, Parkinson’s disease, and matched controls. J Neuroeng Rehabil. 2020;17(1):1–12. https://doiorg.publicaciones.saludcastillayleon.es/10.1186/S12984-020-00781-4.

    Article  Google Scholar 

  13. Waddell KJ, Lang CE. Comparison of self-report versus sensor-based methods for measuring the amount of upper limb activity outside the clinic. Arch Phys Med Rehabil. 2018;99(9):1913–6. https://doiorg.publicaciones.saludcastillayleon.es/10.1016/J.APMR.2017.12.025.

    Article  PubMed  PubMed Central  Google Scholar 

  14. Caldas R, Mundt M, Potthast W, Buarque de Lima Neto F, Markert B. A systematic review of gait analysis methods based on inertial sensors and adaptive algorithms. Gait Posture. 2017;57:204–10. https://doiorg.publicaciones.saludcastillayleon.es/10.1016/j.gaitpost.2017.06.019.

    Article  PubMed  Google Scholar 

  15. Ben Mansour K, Gorce P, Rezzoug N. The multifeature gait score: an accurate way to assess gait quality. PLoS ONE. 2017;12(10):185741. https://doiorg.publicaciones.saludcastillayleon.es/10.1371/JOURNAL.PONE.0185741.

    Article  Google Scholar 

  16. Kobsar D, et al. Validity and reliability of wearable inertial sensors in healthy adult walking: A systematic review and meta-analysis. J Neuroeng Rehabil. 2020;17(1):1–21. https://doiorg.publicaciones.saludcastillayleon.es/10.1186/S12984-020-00685-3/METRICS.

    Article  Google Scholar 

  17. Ng G, Gouda A, Andrysek J. Convolutional neural network for estimating spatiotemporal and kinematic gait parameters using a single inertial sensor. In: 45th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC), 2023.

  18. Mobbs RJ, et al. Gait metrics analysis utilizing single-point inertial measurement units: a systematic review. Mhealth. 2022;8:9. https://doiorg.publicaciones.saludcastillayleon.es/10.21037/MHEALTH-21-17.

    Article  PubMed  PubMed Central  Google Scholar 

  19. Lang CE, Barth J, Holleran CL, Konrad JD, Bland MD. Implementation of wearable sensing technology for movement: pushing forward into the routine physical rehabilitation care field. Sensors. 2020. https://doiorg.publicaciones.saludcastillayleon.es/10.3390/s20205744.

    Article  PubMed  PubMed Central  Google Scholar 

  20. Baan H, Dubbeldam R, Nene AV, van de Laar MAFJ. Gait analysis of the lower limb in patients with rheumatoid arthritis: a systematic review. Semin Arthritis Rheum. 2012;41(6):768-788.e8. https://doiorg.publicaciones.saludcastillayleon.es/10.1016/J.SEMARTHRIT.2011.11.009.

    Article  PubMed  Google Scholar 

  21. Sagawa Y, Turcot K, Phane Armand S, Thevenon A, Vuillerme N, Watelain E. Biomechanics and physiological parameters during gait in lower-limb amputees: a systematic review. Gait Posture. 2011;33:511–26. https://doiorg.publicaciones.saludcastillayleon.es/10.1016/j.gaitpost.2011.02.003.

    Article  PubMed  Google Scholar 

  22. Barton GJ, Hawken MB, Scott MA, Schwartz MH. Movement deviation profile: a measure of distance from normality using a self-organizing neural network. Hum Mov Sci. 2012;31(2):284–94. https://doiorg.publicaciones.saludcastillayleon.es/10.1016/J.HUMOV.2010.06.003.

    Article  PubMed  Google Scholar 

  23. Bagnall A, Lines J, Bostrom A, Large J, Keogh E. The great time series classification bake off: a review and experimental evaluation of recent algorithmic advances. Data Min Knowl Discov. 2017;31(3):606–60. https://doiorg.publicaciones.saludcastillayleon.es/10.1007/S10618-016-0483-9/.

    Article  PubMed  Google Scholar 

  24. Wang X, Kyrarini M, Ristić-Durrant D, Spranger M, Gräser A. Monitoring of gait performance using dynamic time warping on IMU-sensor data. In: Proceedings - 2016 IEEE International Symposium on Medical Measurements and Applications, IEEE, May 2016, p. 1–6. https://doiorg.publicaciones.saludcastillayleon.es/10.1109/MeMeA.2016.7533745.

  25. Ng G, Andrysek J. Classifying changes in amputee gait following physiotherapy using machine learning and continuous inertial sensor signals. Sensors. 2023;23(3):1412. https://doiorg.publicaciones.saludcastillayleon.es/10.3390/S23031412.

    Article  PubMed  PubMed Central  Google Scholar 

  26. Cuzzolin F, et al. Metric learning for Parkinsonian identification from IMU gait measurements. Gait Posture. 2017;54:127–32. https://doiorg.publicaciones.saludcastillayleon.es/10.1016/j.gaitpost.2017.02.012.

    Article  PubMed  Google Scholar 

  27. Bae J, Tomizuka M. Gait phase analysis based on a hidden Markov model. Mechatronics. 2011;21(6):961–70. https://doiorg.publicaciones.saludcastillayleon.es/10.1016/J.MECHATRONICS.2011.03.003.

    Article  Google Scholar 

  28. Chattopadhyay S, Nandy A. Human gait modelling using hidden markov model for abnormality detection. In: TENCON 2018 - 2018 IEEE Region 10 Conference, IEEE, 2018, p. 0623–0628. https://doiorg.publicaciones.saludcastillayleon.es/10.1109/TENCON.2018.8650194.

  29. Ng G, Gouda A, Andrysek J. Quantifying asymmetric gait pattern changes using a hidden Markov model similarity measure (HMM-SM) on inertial sensor signals. Sensors. 2024;24(19):6431. https://doiorg.publicaciones.saludcastillayleon.es/10.3390/S24196431.

    Article  PubMed  PubMed Central  Google Scholar 

  30. Carse B, Scott H, Davie-Smith F, Brady L, Colvin J. Minimal clinically important difference in walking velocity, gait profile score and two minute walk test for individuals with lower limb amputation. Gait Posture. 2021;88:221–4. https://doiorg.publicaciones.saludcastillayleon.es/10.1016/J.GAITPOST.2021.06.001.

    Article  PubMed  Google Scholar 

  31. Carse B, Scott H, Brady L, Colvin J. A characterisation of established unilateral transfemoral amputee gait using 3D kinematics, kinetics and oxygen consumption measures. Gait Posture. 2020;75:98–104. https://doiorg.publicaciones.saludcastillayleon.es/10.1016/j.gaitpost.2019.09.029.

    Article  PubMed  Google Scholar 

  32. Kark L, Vickers D, Mcintosh A, Simmons A. Use of gait summary measures with lower limb amputees. Gait Posture. 2011;35:238–43. https://doiorg.publicaciones.saludcastillayleon.es/10.1016/j.gaitpost.2011.09.013.

    Article  PubMed  Google Scholar 

  33. Legro MW, Reiber GD, Smith DG, Del Aguila M, Larsen J, Boone D. Prosthesis evaluation questionnaire for persons with lower limb amputations: assessing prosthesis-related quality of life. Arch Phys Med Rehabil. 1998;79(8):931–8. https://doiorg.publicaciones.saludcastillayleon.es/10.1016/S0003-9993(98)90090-9.

    Article  CAS  PubMed  Google Scholar 

  34. Franchignoni F, Traballesi M, Monticone M, Giordano A, Brunelli S, Ferriero G. Sensitivity to change and minimal clinically important difference of the Locomotor Capabilities Index-5 in people with lower limb amputation undergoing prosthetic training. Ann Phys Rehabil Med. 2019;62(3):137–41. https://doiorg.publicaciones.saludcastillayleon.es/10.1016/J.REHAB.2019.02.004.

    Article  PubMed  Google Scholar 

  35. Baker R, et al. The gait profile score and movement analysis profile. Gait Posture. 2009;30:265–9. https://doiorg.publicaciones.saludcastillayleon.es/10.1016/j.gaitpost.2009.05.020.

    Article  PubMed  Google Scholar 

  36. Al-Amri M, Nicholas K, Button K, Sparkes V, Sheeran L, Davies JL. Inertial measurement units for clinical movement analysis: reliability and concurrent validity. Sensors. 2018;18(3):719. https://doiorg.publicaciones.saludcastillayleon.es/10.3390/S18030719.

    Article  PubMed  PubMed Central  Google Scholar 

  37. MVN User Manual, Enschede, Netherlands, 2021. https://www.xsens.com/hubfs/Downloads/usermanual/MVN_User_Manual.pdf. Accessed 13 Dec 2022.

  38. Sahraeian SME, Yoon BJ. A novel low-complexity HMM similarity measure. IEEE Signal Process Lett. 2011;18(2):87–90. https://doiorg.publicaciones.saludcastillayleon.es/10.1109/LSP.2010.2096417.

    Article  Google Scholar 

  39. Tavenard R, et al. Tslearn, a machine learning toolkit for time series data. J Machine Learning Res. 2020;21(118):1–6.

    Google Scholar 

  40. Schober P, Boer C, Schwarte LA. Correlation coefficients: appropriate use and interpretation. Anesth Analg. 2018;126(5):1763–8. https://doiorg.publicaciones.saludcastillayleon.es/10.1213/ANE.0000000000002864.

    Article  PubMed  Google Scholar 

  41. Chen SY, Feng Z, Yi X. A general introduction to adjustment for multiple comparisons. J Thorac Dis. 2017;9(6):1725. https://doiorg.publicaciones.saludcastillayleon.es/10.21037/JTD.2017.05.34.

    Article  PubMed  PubMed Central  Google Scholar 

  42. Benjamini Y, Hochberg Y. Controlling the false discovery rate: a practical and powerful approach to multiple testing. J Roy Stat Soc Ser B (Methodol). 1995;57(1):289–300. https://doiorg.publicaciones.saludcastillayleon.es/10.1111/J.2517-6161.1995.TB02031.X.

    Article  Google Scholar 

  43. Anton A, Legault Z, Dudek N. Validity of the comprehensive high-level activity mobility predictor in a heterogeneous population with lower extremity amputations. Prosthet Orthot Int. 2020;44(2):60–5. https://doiorg.publicaciones.saludcastillayleon.es/10.1177/0309364619887559.

    Article  PubMed  Google Scholar 

  44. Petitjean F, Forestier G, Webb GI, Nicholson AE, Chen Y, Keogh E. Faster and more accurate classification of time series by exploiting a novel dynamic time warping averaging algorithm. Knowl Inf Syst. 2016;47(1):1–26. https://doiorg.publicaciones.saludcastillayleon.es/10.1007/S10115-015-0878-8.

    Article  Google Scholar 

  45. Mannini A, Genovese V, Sabatini AM. Online decoding of hidden markov models for gait event detection using foot-mounted gyroscopes. IEEE J Biomed Health Inform. 2014;18(4):1122–30. https://doiorg.publicaciones.saludcastillayleon.es/10.1109/JBHI.2013.2293887.

    Article  PubMed  Google Scholar 

  46. Taborri J, Scalona E, Palermo E, Rossi S, Cappa P. Validation of inter-subject training for hidden markov models applied to gait phase detection in children with cerebral palsy. Sensors. 2015;15(9):24514–29. https://doiorg.publicaciones.saludcastillayleon.es/10.3390/s150924514.

    Article  PubMed  PubMed Central  Google Scholar 

  47. Liu L, et al. Ambulatory human gait phase detection using wearable inertial sensors and hidden Markov model. Sensors. 2021;21(4):1347. https://doiorg.publicaciones.saludcastillayleon.es/10.3390/s21041347.

    Article  PubMed  PubMed Central  Google Scholar 

  48. Vienne A, Barrois RP, Buffat S, Vidal RD. Inertial sensors to assess gait quality in patients with neurological disorders: a systematic review of technical and analytical challenges. Front Psychol. 2017;8:817. https://doiorg.publicaciones.saludcastillayleon.es/10.3389/fpsyg.2017.00817.

    Article  PubMed  PubMed Central  Google Scholar 

  49. Brognara L, Palumbo P, Grimm B, Palmerini L. Assessing gait in Parkinson’s disease using wearable motion sensors: a systematic review. Diseases. 2019;7(1):18. https://doiorg.publicaciones.saludcastillayleon.es/10.3390/diseases7010018.

    Article  PubMed  PubMed Central  Google Scholar 

  50. Kobsar D, et al. Wearable inertial sensors for gait analysis in adults with osteoarthritis—a scoping review. Sensors. 2020;20(24):7143. https://doiorg.publicaciones.saludcastillayleon.es/10.3390/s20247143.

    Article  PubMed  PubMed Central  Google Scholar 

  51. Benson LC, Clermont CA, Bošnjak E, Ferber R. The use of wearable devices for walking and running gait analysis outside of the lab: a systematic review. Gait Posture. 2018;63:124–38. https://doiorg.publicaciones.saludcastillayleon.es/10.1016/j.gaitpost.2018.04.047.

    Article  PubMed  Google Scholar 

  52. Saboor A, et al. Latest research trends in gait analysis using wearable sensors and machine learning: a systematic review. IEEE Access. 2020;8:167830–64. https://doiorg.publicaciones.saludcastillayleon.es/10.1109/ACCESS.2020.3022818.

    Article  Google Scholar 

  53. Xi X, Keogh E, Shelton C, Wei L, Ratanamahatana CA. Fast time series classification using numerosity reduction. In: Proceedings of the 23rd international conference on Machine learning - ICML ’06, New York, NY: ACM Press; 2006. p. 1033–1040. https://doiorg.publicaciones.saludcastillayleon.es/10.1145/1143844.1143974.

  54. Zhang J, Feng X, Zhou B, Ren D. An overall-regional competitive self-organizing map neural network for the Euclidean traveling salesman problem. Neurocomputing. 2012;89:1–11. https://doiorg.publicaciones.saludcastillayleon.es/10.1016/J.NEUCOM.2011.11.024.

    Article  CAS  Google Scholar 

  55. Kark L, Odell R, McIntosh AS, Simmons A. Quantifying prosthetic gait deviation using simple outcome measures. World J Orthop. 2016;7(6):383. https://doiorg.publicaciones.saludcastillayleon.es/10.5312/WJO.V7.I6.383.

    Article  PubMed  PubMed Central  Google Scholar 

  56. Shin SY, Lee RK, Spicer P, Sulzer J. Does kinematic gait quality improve with functional gait recovery? A longitudinal pilot study on early post-stroke individuals. J Biomech. 2020;105:109761. https://doiorg.publicaciones.saludcastillayleon.es/10.1016/J.JBIOMECH.2020.109761.

    Article  PubMed  Google Scholar 

  57. Price R, Choy NL. Investigating the relationship of the functional gait assessment to spatiotemporal parameters of gait and quality of life in individuals with stroke. J Geriatric Physical Therapy. 2019;42(4):256–64. https://doiorg.publicaciones.saludcastillayleon.es/10.1519/JPT.0000000000000173.

    Article  Google Scholar 

  58. Highsmith MJ, Schulz BW, Hart-Hughes S, Latlief GA, Phillips SL. Differences in the spatiotemporal parameters of transtibial and transfemoral amputee gait. J Prosthetics Orthotics. 2010;22(1):26–30. https://doiorg.publicaciones.saludcastillayleon.es/10.1097/JPO.0B013E3181CC0E34.

    Article  Google Scholar 

  59. Keklicek H, et al. Comparison of gait variability and symmetry in trained individuals with transtibial and transfemoral limb loss. J Orthop Surg. 2019;27(1):2–9. https://doiorg.publicaciones.saludcastillayleon.es/10.1177/2309499019832665.

    Article  Google Scholar 

  60. Jayaraman C, et al. Impact of powered knee-ankle prosthesis on low back muscle mechanics in transfemoral amputees: a case series. Front Neurosci. 2018;12:307876. https://doiorg.publicaciones.saludcastillayleon.es/10.3389/FNINS.2018.00134/BIBTEX.

    Article  Google Scholar 

  61. Aiona M, Do KP, Emara K, Dorociak R, Pierce R. Gait patterns in children with limb length discrepancy. J Pediatr Orthop. 2015;35(3):280–4. https://doiorg.publicaciones.saludcastillayleon.es/10.1097/BPO.0000000000000262.

    Article  PubMed  Google Scholar 

  62. Hopkins WG. Measures of reliability in sports medicine and science. Sports Med. 2000;30(1):1–15. https://doiorg.publicaciones.saludcastillayleon.es/10.2165/00007256-200030010-00001.

    Article  CAS  PubMed  Google Scholar 

Download references

Acknowledgements

The authors would like to thank Alexandra Danks and the Orthotics & Prosthetics department at Holland Bloorview for their involvement in participant recruitment and clinical input. We would also like to thank Aliaa Gouda for her assistance in data collection.

Funding

This work was funded by Collaborative Health Research Projects (NSERC partnered), Canadian Institutes of Health Research - CPG-163967 and Natural Sciences and Engineering Research Council of Canada (NSERC) Discovery Grant-RGPIN 2018-05046, and Alliance Grant - Missions - ALLRP 570717-2021. It was also funded in part by the Loo Geok Eng Foundation Scholarship and Kimel Family Graduate Student Scholarship in Pediatric Rehabilitation.

Author information

Authors and Affiliations

Authors

Contributions

G.N. was involved in all aspects of the study including conception and design of the study, data collection and analysis, and manuscript revisions. J.A. was involved in study design and conception, data interpretation, and manuscript revisions. All authors have reviewed and approved the final manuscript.

Corresponding author

Correspondence to Jan Andrysek.

Ethics declarations

Ethics approval and consent to participate

The study recruitment and protocol were approved by the Research Ethics Board at Holland Bloorview Kids Rehabilitation Hospital (REB-0176). Individuals were assessed for capacity to consent in accordance with the approved ethics procedure. Individuals who did not meet the criteria provided assent with a parent present, with the parent providing consent for participation in the study.

Consent for publication

Consent for publication was obtained through the ethics process and signed consent forms as approved and required by the Research Ethics Board at Holland Bloorview Kids Rehabilitation Hospital.

Competing interests

The authors declare no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Ng, G., Andrysek, J. Hidden Markov model-based similarity measure (HMM-SM) for gait quality assessment of lower-limb prosthetic users using inertial sensor signals. J NeuroEngineering Rehabil 22, 109 (2025). https://doiorg.publicaciones.saludcastillayleon.es/10.1186/s12984-025-01638-4

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doiorg.publicaciones.saludcastillayleon.es/10.1186/s12984-025-01638-4

Keywords