Grip strength as a simple method to predict postoperative delirium after lower extremity surgery in the elderly: a prospective diagnostic evaluation

Article information

Anesth Pain Med. 2026;21(1):101-110
Publication date (electronic) : 2025 November 24
doi : https://doi.org/10.17085/apm.25254
Department of Anesthesiology and Pain Medicine, Korea University Guro Hospital, Korea University College of Medicine, Seoul, Korea
Corresponding author Seok Kyeong Oh, M.D., Ph.D. Department of Anaesthesiology and Pain Medicine, Korea University Guro Hospital, Korea University College of Medicine, 148 Gurodong-ro, Guro-gu, Seoul 08308, Korea Tel: 82-2-2626-3234 Fax: 82-2-2626-1438 E-mail: nanprayboy@korea.ac.kr
This study in part has been accepted as an abstract presentation at the World Congress of Anaesthesiologists (WCA2024), February 2024, in Singapore.
Received 2025 March 31; Revised 2025 August 21; Accepted 2025 August 23.

Abstract

Background

Grip strength (GS), an indicator of sarcopenia, declines with age and is associated with frailty, which increases the risk of postoperative delirium (POD). Assessing frailty in patients undergoing lower extremity surgery is challenging owing to limited mobility. We investigated whether GS, a simple screening tool, can independently predict POD and aimed to determine the most appropriate criteria for low GS in the Korean population.

Methods

This prospective study included patients aged ≥ 65 years undergoing lower extremity surgery. Preoperative GS was measured, and sarcopenia was diagnosed according to the Asian Working Group for Sarcopenia (AWGS) cut-offs and three other established criteria for GS weakness. POD was assessed with the Confusion Assessment Method for the Intensive Care Unit. Receiver operating characteristic (ROC) curves were used to evaluate the sensitivity and specificity of low GS in predicting POD. Logistic regression analysis was performed to identify variables independently associated with POD.

Results

GS was measured in 150 patients with a median age of 73 years. POD was diagnosed in 17 patients (11.3%), 13 of whom had low GS. Among the four sarcopenia criteria, the AWGS showed the highest area under the ROC curve (0.796). Of the variables analyzed—including age, American Society of Anesthesiologists class, body weight, intraoperative opioid use, and postoperative pain—only low GS was identified as an independent predictor of POD (odds ratio: 15.543, P < 0.001).

Conclusions

GS is a simple, reliable measure that may serve as an independent predictor of POD in elderly undergoing lower extremity surgery.

INTRODUCTION

Postoperative delirium (POD) is a common and serious complication in elderly surgical patients, with reported incidence ranging from 13–50% depending on the diagnostic criteria used [1]. POD is associated with cognitive dysfunction, increased morbidity and mortality, prolonged hospital stays, and increased medical expenses [2,3]. Early identification and management of high-risk patients are therefore critical. Among the various risk factors, frailty and sarcopenia have emerged as important predictors of POD [1,4]. Frailty, defined as a decline in physiological reserve and functional capacity, is particularly relevant in older adults and is closely associated with sarcopenia, the age-related loss of muscle mass and strength. As a predisposing factor, frailty is considered a major contributor to POD [4].

Fried et al. defined frailty using five phenotypic criteria: unintentional weight loss (shrinking), weakness (low grip strength), poor endurance and energy, slowness, and reduced physical activity levels. Frailty is identified when at least three of these five criteria are present [5]. However, assessing slowness and physical activity often requires measuring walking speed or documenting recent activity history, which may not be feasible in patients with limited mobility (such as those awaiting lower extremity surgery). Furthermore, poor endurance and energy are based on subjective self-reports (e.g., the Center for Epidemiologic Studies Depression Scale), introducing additional complexity to frailty assessment.

Grip strength (GS) is a widely recognized indicator of muscle strength and function and offers a convenient, noninvasive method to assess sarcopenia and the degree of associated frailty [4]. Reduced GS is associated with increased postoperative complications, longer hospital stays, functional limitations, and disability [6]. Sarcopenia, in turn, is closely related to frailty, a condition of declining physiological reserves that negatively impacts the prognosis of elderly patients and increases the risk of POD [7].

We therefore hypothesized that GS alone may offer a practical and objective predictor of POD in elderly patients with impaired mobility undergoing lower extremity surgery. While previous studies have evaluated frailty or sarcopenia as predictors of POD, few have explored the relationship between POD and GS alone in this patient group. To our knowledge, no prior study has specifically focused on patients undergoing lower extremity surgery—a population with inherently limited mobility, in whom comprehensive frailty assessments are often challenging.

Given that GS is a simple and reliable marker of muscle strength, this study aims to investigate whether GS can independently predict POD in this population. The primary objective is to determine whether a simple measurement of GS can predict POD in older patients with impaired mobility due to lower extremity dysfunction. Additionally, given the varying definitions of low GS across different criteria, we assessed the most appropriate diagnostic criterion for reduced GS for the Korean population.

MATERIALS AND METHODS

This prospective observational study was approved by the Institutional Review Board (IRB) of Korea University (IRB No.: 2021GR0133) and conducted at Korea University Guro Hospital between March 2021 and December 2022. The trial was retrospectively registered in the University Hospital Medical Information Network (UMIN) Clinical Trials Registry (UMIN000057442).

Study population

Patients aged ≥ 65 years who were scheduled for lower extremity surgery under general anesthesia and had no preoperative cognitive dysfunction were eligible for inclusion. Exclusion criteria were hemodynamic instability, intellectual disability, or refusal to participate. Written informed consent was obtained from all participants before surgery. Preoperative cognitive function was screened using the animal verbal fluency test; patients who named fewer than seven animals were considered to have a Clinical Dementia Rating score of 1 and were excluded [8,9].

Grip strength measurement

GS was measured by two trained physicians in the preoperative waiting room following a standardized procedure based on the Southampton protocol [10]. A mechanical Jamar Hydraulic Hand Dynamometer (Performance Health Supply) was used. Each patient was instructed to compress the dynamometer handle three times using the right hand at 1-minute intervals, followed by three measurements with the left hand. Although the Southampton protocol recommends a seated position, this was not feasible owing to the patients’ lower extremity conditions; therefore, measurements were taken in a semi-recumbent position. The highest value from either hand was used for analysis.

Diagnostic criteria for GS weakness vary across definitions of sarcopenia (Table 1). In this study, we considered four commonly used criteria: the Asian Working Group for Sarcopenia (AWGS) [11], the European Working Group on Sarcopenia in Older People 2 (EWGSOP2) [6,12], the Korea National Health and Nutrition Examination Survey (KNHANES) [13], and the Simpler Modified Fried Frailty Scale (SMFFS), which adjusts GS thresholds according to body mass index (BMI) [14,15]. We primarily applied AWGS criteria but compared all four sets of diagnostic cut-offs.

Estimation Tools for Sarcopenia and Criteria for Identifying Hand Grip Weakness

Anesthesia management

Upon entering the operating room, standard monitoring was applied, including electrocardiography, pulse oximetry, an esophageal temperature probe, noninvasive blood pressure monitoring, neuromuscular transmission monitoring, and processed electroencephalography.

Anesthesia was induced with propofol (1–2 mg/kg) and rocuronium (0.6 mg/kg). Maintenance was achieved using desflurane, targeting a Bispectral Index of 40–60 or a Patient State Index of 25–50. Intraoperative analgesia was provided with remifentanil, titrated according to hemodynamic changes at the anesthesiologist’s discretion. At the end of surgery, a bolus of fentanyl was administered as preemptive analgesia upon discontinuation of remifentanil. The use (yes/no) and total doses of intraoperative remifentanil and fentanyl were recorded.

At the end of surgery, the inhalation agent was discontinued, and extubation was performed when the patient exhibited spontaneous breathing and eye-opening. Patients were then transferred to the post-anesthesia care unit (PACU), where discharge was determined according to the Aldrete score. Intravenous patient-controlled analgesia was initiated after recovery of consciousness, typically when patients achieved a full score in the consciousness domain of the Aldrete system. If patients reported significant pain (visual analogue scale ≥ 5), additional analgesics such as fentanyl or ketorolac were administered at the discretion of the PACU physician.

Regional analgesia, which could reduce opioid consumption (a known risk factor for POD), was intentionally excluded from anesthetic management in this study. Postoperative pain management followed a standardized protocol, consisting primarily of non-opioid intravenous medications such as nonsteroidal anti-inflammatory drugs or acetaminophen (if not contraindicated), administered at 4–6 hour intervals in addition to patient-controlled analgesia.

Postoperative delirium and outcomes assessment

Before PACU discharge, the Confusion Assessment Method for the Intensive Care Unit (CAM-ICU) was used to evaluate the presence of POD [16]. CAM-ICU assessments were conducted once daily for up to five consecutive days during the hospital stay by a trained research nurse blinded to GS results. To enhance consistency and minimize variability owing to the fluctuating nature of delirium, assessments were performed at a fixed time each day, typically before noon.

The primary endpoint was the incidence of POD. Secondary endpoints included length of hospital stay, 30-day discharge rate, and in-hospital mortality.

Statistical analyses

Sample size estimation was based on a retrospective pilot study at Korea University Guro Hospital, which reported a 20% incidence of POD in elderly patients undergoing lower extremity surgery. Previous literature, including a review by Inouye et al. [1], reported a POD incidence of 13–50% following orthopedic surgery, while a prospective cohort study by Gleason et al. [17] reported a 24% incidence following elective surgery in elderly patients. We conservatively estimated a 20% POD incidence and assumed that reduced GS would increase the risk with an odds ratio (OR) of 2, as reported by Arita et al. [18]. Based on these assumptions, a sample size of 150 was calculated as sufficient using a two-sided chi-square test with an alpha of 0.05 and a power of 0.8.

Patients were allocated into two groups based on the presence or absence of GS weakness, as defined by the AWGS criteria. The incidence of POD was compared between the groups. Statistical analyses were performed using IBM SPSS Statistics software (ver. 20.0, IBM Co.). Continuous variables were analyzed using Student’s t-test or the Mann–Whitney U test, depending on data normality. Categorical variables, including the presence or absence of complications in the PACU or ward, were analyzed using the chi-square test.

To assess the diagnostic value of GS, receiver operating characteristic (ROC) curve analysis was performed. The performance of the four criteria for low GS was further evaluated using diagnostic metrics, including the area under the curve (AUC), sensitivity, specificity, positive predictive value (PPV), and negative predictive value (NPV). Pairwise comparisons of AUCs between the AWGS criteria and each of the other models were performed using DeLong’s test to determine whether observed differences in discriminatory ability were statistically significant.

Logistic regression analyses were conducted to evaluate associations between POD and various variables, including age, sex, height, weight, American Society of Anesthesiologists physical status (ASA PS) classification, BMI, type of surgery, duration of anesthesia, verbal fluency test results, intraoperative fentanyl and remifentanil use, PACU pain scores, use of PACU analgesics, and GS. For multivariate modeling, two approaches were used: in Model 1, GS was entered as a binary variable based on the AWGS cut-off (presence or absence of sarcopenia), whereas in Model 2, GS was entered as a continuous variable (kg) to allow direct interpretation and exploration of potential risk factors beyond dichotomized sarcopenia status. These two models were compared to assess whether binary classification using cut-offs and continuous GS measurement yielded different covariates and predictive performance for POD.

In both models, variables with a P value < 0.1 in univariate analysis were entered into multivariate logistic regression using a stepwise forward selection method. Predictive performance for each model was assessed using the optimal cut-off determined based on the Youden Index.

Data are presented as mean ± standard deviation, median (1Q, 3Q), or number of patients (%). A two-tailed P value < 0.05 was considered statistically significant.

RESULTS

A total of 150 patients with a median age of 73 years (IQR: 69, 78) were enrolled in the study (Fig. 1). Based on the AWGS criteria, patients were categorized into two groups: normal GS (n = 113) and low GS (n = 37) (Table 2). The low GS group was older (median 74.5 vs. 73 years; P = 0.026), had a higher proportion of ASA PS ≥ III (P = 0.008), and scored lower on the animal verbal fluency test (P = 0.007). Other baseline characteristics, including sex, BMI, duration of surgery, intraoperative opioid use, and PACU pain scores, were comparable between the groups.

Fig. 1.

Flow diagram of included patients.

Demographic and Clinical Data

The incidence of POD was significantly higher in the low GS group (36.1%) than in the normal GS group (3.5%) (P < 0.001). Overall, 17 patients (11.3%) developed POD (Table 3). No significant differences were observed between groups in length of hospital stay, 30-day discharge rate, or inpatient mortality.

Postoperative Outcomes

To evaluate the diagnostic utility of different GS weakness definitions, ROC curve analysis was performed for all four criteria (Fig. 2). The AUC and 95% confidence intervals (CIs) were calculated for each model. The AWGS model demonstrated the highest diagnostic performance, with an AUC of 0.796 (95% CI: 0.673–0.919; P < 0.001). The SMFFS model yielded an AUC of 0.739, while the EWGSOP2 and KNHANES models each had AUCs of 0.719.

Fig. 2.

Prediction of delirium across four grip strength models for sarcopenia. AUC and 95% CIs for EWGSOP2, AWGS, KNHANES, and SMFFS were 0.719 (95% CI, 0.574–0.864, P = 0.003), 0.796 (0.673–0.919, P < 0.001), 0.719 (0.574–0.864, P = 0.003), and 0.739 (0.614–0.865, P = 0.001), respectively. Pairwise comparison of AUCs between AWGS and the other models using DeLong’s test showed no significant differences (P > 0.638 for all). AUC: area under the curve, CI: confidence interval, EWGSOP2: European Working Group on Sarcopenia in Older People 2, AWGS: Asian Working Group for Sarcopenia, KNHANES: Korea National Health and Nutrition Examination Survey, SMFFS: Simpler Modified Fried Frailty Scale.

In addition to AUC, diagnostic performance was further evaluated using sensitivity, specificity, PPV, NPV, and likelihood ratios (Supplementary Table 1). The AWGS model showed the most balanced performance, with the highest sensitivity (0.765), NPV (0.965), and positive likelihood ratio (4.422), supporting its potential value in identifying patients at risk for POD. However, none of the pairwise AUC comparisons (using DeLong’s test) demonstrated a statistically significant difference between AWGS and the other models (all P > 0.638; Supplementary Table 2), suggesting that although AWGS may have marginally better diagnostic performance, the overall discriminative ability was comparable across models.

In multivariate logistic regression, Model 1 (GS as a binary variable per AWGS) retained only low GS as a significant predictor of POD (OR 15.543; 95% CI: 4.647–51.989; P < 0.001) (Table 4). Model 2 (GS as a continuous variable) retained ASA PS ≥ III, intraoperative use of remifentanil, and GS value; for each 1-kg increase in GS, the odds of POD decreased by 9.4% (OR 0.906; 95% CI: 0.839–0.979) (Table 5). The AUC, sensitivity, and NPV of the two models were compared using ROC curves, and the optimal cut-off for each model was determined based on the Youden Index. Model 2 showed marginally higher AUC (0.821 vs. 0.796), sensitivity (0.824 vs. 0.765), and NPV (0.974 vs. 0.965) compared with Model 1 (Supplementary Table 3). However, these findings based on the Youden Index were descriptive, and no formal statistical test confirmed a significant difference.

Logistic Regression of Delirium Occurrence and Variables Using AWGS Criteria for Low GS

Logistic Regression of Delirium Occurrence and Variables Using Raw GS Values (kg)

DISCUSSION

In this study, 13 of the 17 patients with POD had low GS. The incidence of POD was 36.1% in the low GS group, significantly higher than the 3.5% observed in the normal GS group, indicating a strong association between low GS and the development of POD. These findings support the use of GS as an independent marker to stratify delirium risk preoperatively, enabling targeted perioperative strategies and resource allocation in high-risk individuals.

The proportion of elderly patients undergoing surgery is steadily increasing, and the risk of complications increases with age, leading to disability, loss of independence, reduced quality of life, and increased healthcare costs [19,20]. Identifying risk factors for adverse outcomes in older adults is therefore essential. Among these, POD a common and clinically significant complication, with an incidence ranging from 13–50% in non-cardiac surgeries, depending on the type of surgery and diagnostic criteria used [1,17,18].

In our study of elderly patients undergoing orthopedic surgery, POD occurred in 17 patients (11.3%), a marginally lower rate than previously reported [1,17,18]. Notably, the median ages in the two key reference studies [17,18], which informed our study design, were 77 and 80 years, respectively—higher than in our cohort. This age difference, along with heterogeneity in surgical and perioperative contexts, likely contributed to the relatively lower POD rate observed. Interestingly, a recent study [21] in an Asian population with a mean age of approximately 70.6 years reported a POD incidence of only 5.2% following total hip or knee arthroplasty, suggesting that population characteristics, surgical context, and baseline frailty may all influence observed POD rates.

Although the observed incidence of POD (11.3%) was lower than the 20% assumed for sample size estimation, the strong association observed (OR 15.543) suggests that the study retained sufficient statistical power to detect meaningful differences. This large effect size may have compensated for the lower event rate, supporting the robustness of our findings. However, the CI for the OR was relatively wide (15.543; 95% CI: 4.647–51.989), likely due to the limited number of POD cases. A larger study population would be needed in future investigations to provide more precise risk estimates and narrower CIs.

Several prior studies have reported a direct relationship between GS and POD [18,22,23], particularly in Asian populations (Japan and China). Arita et al. [18] retrospectively studied patients with colorectal cancer and found that older age, lower Mini-Mental State Examination scores, higher Geriatric Depression Scale scores, and low GS were significant independent predictors of POD. Their ROC analysis suggested optimal GS cut-off values of 21.8 kg for men and 15.4 kg for women, which are lower than AWGS thresholds (28 kg for men, 18 kg for women) applied in our study. In contrast, Kotani et al. [22] used the revised Japanese Cardiovascular Health Study criteria and observed a 37% incidence of POD in patients undergoing cardiovascular surgery with cardiopulmonary bypass. They reported a strong association between low GS and POD, with an OR of 4.58 (95% CI: 1.57–13.2). A prospective study by Qian et al. [23], closely aligned with our design, examined POD following arthroplasty and found an incidence of 14.36%. Their ROC analysis identified GS thresholds of 22.05 kg for men and 18.05 kg for women, with the threshold for men being marginally lower than that of the AWGS. While previous studies primarily categorized patients by the presence or absence of delirium, our study stratified patients based on GS status, focusing on risk prediction. Despite differences in study design and populations, these findings consistently demonstrate an association between reduced GS and increased POD risk, supporting GS as a simple and effective screening tool to identify high-risk patients for early preventive strategies.

Low GS is considered a marker of general physical decline and frailty, both of which are associated with vulnerability to delirium. The mechanisms underlying this association may involve shared pathways linking sarcopenia to cognitive impairment, including systemic inflammation, reduced physical activity, and neurodegenerative changes [18,24]. In our study, patients with low GS were older, had higher ASA PS classifications, and scored lower on cognitive screening, further supporting the link between frailty and POD risk.

GS is widely used as a diagnostic indicator of sarcopenia, although thresholds vary according to guideline. In this study, we evaluated four different criteria, and the AWGS definition (< 28 kg for men, < 18 kg for women) demonstrated the highest predictive value, with an AUC of 0.796. This finding aligns with that of previous studies in Asian populations, which often apply similar GS thresholds unless data-driven cut-offs are calculated [18,22,23]. Although AWGS had the highest AUC, SMFFS (0.739), EWGSOP2 (0.719), and KNHANES (0.719) also showed acceptable predictive performance, and DeLong’s test revealed no significant difference between AWGS and the other models. Thus, in clinical environments where rapid screening is needed and precision may be less critical, these criteria may provide useful guidance.

We also developed two logistic regression models to assess whether different approaches to GS—binary classification using the AWGS cut-off (Model 1) and continuous GS measurement (Model 2)—would influence covariate selection and predictive performance. Both models identified GS as an important predictor, but Model 2 additionally retained ASA PS classification and intraoperative remifentanil use, suggesting that modeling choices can affect variable selection. Ultimately, the goal is to proactively identify and reduce the risk of POD, regardless of the criterion applied.

This study has several limitations. First, because of patients’ lower extremity conditions, we could not strictly follow the Southampton protocol, which recommends GS measurement in a seated position; measurements were instead taken in a semi-recumbent position. Nevertheless, our findings suggest that semi-recumbent measurements can provide clinically meaningful results, offering preliminary evidence for this adapted method in mobility-limited populations. Second, although our analysis focused on preoperative GS, POD is multifactorial and influenced by numerous perioperative and postoperative variables. Not accounting for additional factors such as sleep disturbance, nutritional status, or postoperative medications may have limited the comprehensiveness of our model. Third, while the OR was large and statistically significant, the relatively small number of POD cases contributed to the wide CI, potentially affecting reliability. Fourth, although our study focused on lower extremity surgery, the cohort included a variety of procedures. Postoperative pain management in the ward was determined by each orthopedic surgeon’s protocol, introducing some heterogeneity in analgesic strategies.

In conclusion, preoperative GS is a simple, objective, and independent predictor of POD. In elderly patients undergoing lower extremity orthopedic surgery in the Korean population, the AWGS criteria appear marginally better for identifying sarcopenia-related delirium risk, although its discriminative ability was comparable to other criteria. Routine pre-anesthetic GS assessment may facilitate early risk stratification and targeted interventions to reduce POD incidence.

SUPPLEMENTARY MATERIALS

Supplementary data is available at https://doi.org/10.17085/apm.25254.

Supplementary Table 1.

Diagnostic Performance of GS Criteria for Predicting POD Based on CAM-ICU Assessment

apm-25254-Supplementary-Table-1.pdf
Supplementary Table 2.

Pairwise Comparison of AUCs Using DeLong’s Test.

Each Comparison is Between Two Grip Strength Criteria (AWGS, EWGSOP2, KNHANES, Fried), Showing the AUC for Each Criterion, the Z Statistic for their Difference, and the P value. The First Three Comparisons (AWGS vs. EWGSOP2, AWGS vs. KNHANES, AWGS vs. Fried) are as Previously Reported, and the Remaining Three (EWGSOP2 vs. KNHANES, EWGSOP2 vs. Fried, KNHANES vs. Fried) are Newly Calculated from the Provided Dataset

apm-25254-Supplementary-Table-2.pdf
Supplementary Table 3.

Predictive Performance Summary (with Youden Index Cutoff for Model 1 and 2)

apm-25254-Supplementary-Table-3.pdf

Notes

FUNDING

This work was supported by grant No. KSGAP-2020-002 from the Korean Society of Geriatric Anesthesia and Pain.

CONFLICTS OF INTEREST

No potential conflict of interest relevant to this article was reported.

DATA AVAILABILITY STATEMENT

The datasets generated during and/or analyzed during the current study are available from the corresponding author on reasonable request.

AUTHOR CONTRIBUTIONS

Conceptualization: Seok Kyeong Oh. Funding acquisition: Seok Kyeong Oh. Supervision: Seok Kyeong Oh, Young Sung Kim. Data curation: Hyo Sung Kim, Hyun Ah Lee. Formal analysis: Seok Kyeong Oh, Young Sung Kim. Writing - original draft: Hyo Sung Kim, Seok Kyeong Oh. Writing - review & editing: Hyo Sung Kim, Young Sung Kim, Hyun Ah Lee, Seok Kyeong Oh.

References

1. Inouye SK, Westendorp RG, Saczynski JS. Delirium in elderly people. Lancet 2014;383:911–22. 10.1016/s0140-6736(13)60688-1. 23992774.
2. Crocker E, Beggs T, Hassan A, Denault A, Lamarche Y, Bagshaw S, et al. Long-term effects of postoperative delirium in patients undergoing cardiac operation: a systematic review. Ann Thorac Surg 2016;102:1391–9. 10.1016/j.athoracsur.2016.04.071. 27344279.
3. Brown C 4th, Deiner S. Perioperative cognitive protection. Br J Anaesth 2016;117(Suppl 3):iii52–61. 10.1093/bja/aew361. 27940456.
4. Bahat G, Tufan A, Tufan F, Kilic C, Akpinar TS, Kose M, et al. Cut-off points to identify sarcopenia according to European Working Group on Sarcopenia in Older People (EWGSOP) definition. Clin Nutr 2016;35:1557–63. 10.1016/j.clnu.2016.02.002. 26922142.
5. Fried LP, Tangen CM, Walston J, Newman AB, Hirsch C, Gottdiener J, et al, ; Cardiovascular Health Study Collaborative Research Group. Frailty in older adults: evidence for a phenotype. J Gerontol A Biol Sci Med Sci 2001;56:M146–56. 10.1093/gerona/56.3.m146. 11253156.
6. Cruz-Jentoft AJ, Baeyens JP, Bauer JM, Boirie Y, Cederholm T, Landi F, et al. European Working Group on Sarcopenia in Older People. Sarcopenia: European consensus on definition and diagnosis: Report of the European Working Group on Sarcopenia in Older People. Age Ageing 2010;39:412–23. 10.1093/ageing/afq034. 20392703.
7. Quinlan N, Marcantonio ER, Inouye SK, Gill TM, Kamholz B, Rudolph JL. Vulnerability: the crossroads of frailty and delirium. J Am Geriatr Soc 2011;59(Suppl 2):S262–8. 10.1111/j.1532-5415.2011.03674.x. 22091571.
8. Radanovic M, Carthery-Goulart MT, Charchat-Fichman H, Herrera E Jr, Lima EEP, Smid J, et al. Analysis of brief language tests in the detection of cognitive decline and dementia. Dement Neuropsychol 2007;1:37–45. 10.1590/s1980-57642008dn10100007. 29213366.
9. Susano MJ, Grasfield RH, Friese M, Rosner B, Crosby G, Bader AM, et al. Brief preoperative screening for frailty and cognitive impairment predicts delirium after spine surgery. Anesthesiology 2020;133:1184–91. 10.1097/aln.0000000000003523. 32898243.
10. Roberts HC, Denison HJ, Martin HJ, Patel HP, Syddall H, Cooper C, et al. A review of the measurement of grip strength in clinical and epidemiological studies: towards a standardised approach. Age Ageing 2011;40:423–9. 10.1093/ageing/afr051. 21624928.
11. Chen LK, Woo J, Assantachai P, Auyeung TW, Chou MY, Iijima K, et al. Asian Working Group for Sarcopenia: 2019 consensus update on sarcopenia diagnosis and treatment. J Am Med Dir Assoc 2020;21:300–7.e2. 10.1016/j.jamda.2019.12.012. 32033882.
12. Cruz-Jentoft AJ, Bahat G, Bauer J, Boirie Y, Bruyère O, Cederholm T, et al. Sarcopenia: revised European consensus on definition and diagnosis. Age Ageing 2019;48:16–31. 10.1093/ageing/afz046. 30312372.
13. Kim M, Won CW, Kim M. Muscular grip strength normative values for a Korean population from the Korea National Health and Nutrition Examination Survey, 2014-2015. PLoS One 2018;13e0201275. 10.1371/journal.pone.0201275. 30125289.
14. Bahat G, Ilhan B, Tufan A, Dogan H, Karan MA. Success of simpler modified fried frailty scale to predict mortality among nursing home residents. J Nutr Health Aging 2021;25:606–10. https://doi.org/10.1007/s12603-020-1582-2. 10.1007/s12603-020-1582-2. 33949626.
15. Bahat G, Ilhan B, Erdogan T, Catikkas NM, Karan MA, Drey M, et al. Simpler modified fried frailty scale as a practical tool to evaluate physical frailty: methodological report for its cross-cultural adaptation and validation. Exp Gerontol 2022;166:111887. 10.1016/j.exger.2022.111887. 35803479.
16. Arbabi M, Dezhdar Z, Amini B, Dehnavi AZ, Ghasemi M. Depression and anxiety increase the odds of developing delirium in ICU patients; a prospective observational study. Cogn Neuropsychiatry 2022;27:1–10. 10.1080/13546805.2021.1991295. 34676803.
17. Gleason LJ, Schmitt EM, Kosar CM, Tabloski P, Saczynski JS, Robinson T, et al. Effect of delirium and other major complications on outcomes after elective surgery in older adults. JAMA Surg 2015;150:1134–40. 10.1001/jamasurg.2015.2606. 26352694.
18. Arita A, Takahashi H, Ogino T, Miyoshi N, Uemura M, Akasaka H, et al. Grip strength as a predictor of postoperative delirium in patients with colorectal cancers. Ann Gastroenterol Surg 2021;6:265–72. 10.1002/ags3.12519. 35261952.
19. Polanczyk CA, Marcantonio E, Goldman L, Rohde LE, Orav J, Mangione CM, et al. Impact of age on perioperative complications and length of stay in patients undergoing noncardiac surgery. Ann Intern Med 2001;134:637–43. 10.7326/0003-4819-134-8-200104170-00008. 11304103.
20. Gajdos C, Kile D, Hawn MT, Finlayson E, Henderson WG, Robinson TN. Advancing age and 30-day adverse outcomes after nonemergent general surgeries. J Am Geriatr Soc 2013;61:1608–14. 10.1111/jgs.12401. 23927841.
21. Tomite T, Saito H, Kijima H, Hatakeyama Y, Tazawa H, Wachi T, et al. Delirium following total hip or knee arthroplasty: a retrospective, single-center study. J Orthop Sci 2025;30:837–43. 10.1016/j.jos.2024.11.006. 39638735.
22. Kotani T, Ida M, Inoue S, Naito Y, Kawaguchi M. Association between preoperative hand grip strength and postoperative delirium after cardiovascular surgery: a retrospective study. J Clin Med 2023;12:2705. 10.3390/jcm12072705. 37048787.
23. Qian J, Shen X, Gao X, Sun Q. Grip strength is a predictor for subsyndromal delirium among older adults following joint replacement. Clin Interv Aging 2023;18:1675–85. 10.2147/cia.s423727. 37810954.
24. Sternäng O, Reynolds CA, Finkel D, Ernsth-Bravell M, Pedersen NL, Dahl Aslan AK. Grip strength and cognitive abilities: associations in old age. J Gerontol B Psychol Sci Soc Sci 2016;71:841–8. 10.1093/geronb/gbv017. 25787083.

Article information Continued

Fig. 1.

Flow diagram of included patients.

Fig. 2.

Prediction of delirium across four grip strength models for sarcopenia. AUC and 95% CIs for EWGSOP2, AWGS, KNHANES, and SMFFS were 0.719 (95% CI, 0.574–0.864, P = 0.003), 0.796 (0.673–0.919, P < 0.001), 0.719 (0.574–0.864, P = 0.003), and 0.739 (0.614–0.865, P = 0.001), respectively. Pairwise comparison of AUCs between AWGS and the other models using DeLong’s test showed no significant differences (P > 0.638 for all). AUC: area under the curve, CI: confidence interval, EWGSOP2: European Working Group on Sarcopenia in Older People 2, AWGS: Asian Working Group for Sarcopenia, KNHANES: Korea National Health and Nutrition Examination Survey, SMFFS: Simpler Modified Fried Frailty Scale.

Table 1.

Estimation Tools for Sarcopenia and Criteria for Identifying Hand Grip Weakness

Estimation tool of sarcopenia Criteria of grip weakness
AWGS [11] GS M < 28 kg, F < 18 kg
Physical performance: 6 m walk
Appendicular skeletal muscle mass: DXA
EWGSOP2 [6,12] Muscle strength: GS, chair stand test M < 27 kg, F < 16 kg
Muscle quantity: DXA, BIA, CT, MRI
Physical performance: gait speed, SPPB, 400 m walk
KNHANES [13] GS M < 28 kg, F < 16 kg
SMFFS [14,15] Weight loss M: ≤ 29 kg (BMI ≤ 24)
Low physical activity ≤ 30 kg (24 < BMI ≤ 28)
Low walking speed: 4 m walk time ≤ 32 kg (28 < BMI)
Exhaustion: CES-D F: ≤ 17 kg (BMI ≤ 23)
Weakness: GS ≤ 17.3 kg (23 < BMI ≤ 26)
≤ 18 kg (26 < BMI ≤ 29)
≤ 21 kg (29 < BMI)

AWGS: Asian Working Group for Sarcopenia, GS: grip strength, EWGSOP: European Working Group on Sarcopenia in Older People, KNHANES: Korea National Health and Nutrition Examination Survey, SMFFS: Simpler Modified Fried Frailty Scale, DXA: dual energy X-ray absorptiometry, BIA: bioimpedance analysis, CT: computed tomography, MRI: magnetic Resonance Imaging, SPPB: short physical performance battery, CES-D scales: center for epidemiologic studies depression scale, BMI: body mass index.

Table 2.

Demographic and Clinical Data

All patients (n=150) Normal GS (n=114) Low GS (n=36) P value
Age (yr) 73 (69, 78) 73 (69, 76) 74.5 (71, 79.8) 0.026*
F 95 (63.3) 71 (62.3) 24 (66.7) 0.635
American Society of Anesthesiologists physical status 0.008*
 II 38 (25.3) 35 (30.7) 3 (8.3)
 ≥ III 112 (74.7) 79 (69.3) 33 (91.7)
Height (cm) 156 (151, 163.6) 156.5 (151, 164) 155.0 (150.5, 163) 0.504
Weight (kg) 61.1 ± 11.0 62.3 ± 10.6 57.4 ± 11.4 0.645
BMI (kg/m2) 24.4 ± 3.6 24.8 ± 3.4 23.2 ± 3.8 0.237
Animal verbal score 10 (8, 11) 10 (9, 11) 9 (8, 10) 0.007*
Grip strength (kg) 23.5 (20, 32) 26 (22, 33.3) 16 (14, 20.8) <0.001*
 M 32 (24, 40) 34 (32, 40) 22 (20.3, 22.8) <0.001*
 F 21 (17, 24) 23 (20, 26) 14.5 (12, 16) <0.001*
Operation type 0.424
Knee 100 (66.7) 78 (68.4) 22 (61.1)
 Fracture/joint 37/63 (37/63) 25/53 (32/68) 12/10 (55/45) 0.093
Hip 50 (33.3) 36 (31.6) 14 (38.8)
 Fracture/joint 29/21 (58/42) 15/21 (42/58) 7/7 (50/50) 0.829
Operation time (min) 75 (63, 135) 80 (64.8, 130.5) 74.5 (58.8, 149.5) 0.551
Anesthesia time (min) 130 (115, 200) 130 (115, 195) 135 (112.5, 218.8) 0.553
Intraop. fentanyl 68 (45.3) 53 (46.5) 15 (41.7) 0.702
Intraop. remifentanil 112 (74.7) 82 (71.9) 30 (83.3) 0.194
PACU pain score (NRS 0–10) 5 (4, 6) 5 (4, 6) 5 (4, 6) 0.341
PACU analgesic 0.285
 0 92 (61.3) 72 (63.2) 20 (55.6)
 1 52 (34.7) 39 (34.2) 13 (36.1)
 ≥ 2 6 (4.0) 3 (2.6) 3 (8.3)

Values are presented as mean ± SD, median (1Q, 3Q), or number (%). Normal vs. low GS classified by AWGS. AWGS: Asian Working Group for Sarcopenia, GS: grip strength, BMI: body mass index, PACU: post-anesthesia care unit, Intraop.: intraoperative, NRS: Numerical Rating Scale.

*

P values indicate statistical significance < 0.05.

Fracture/Joint indicates the distribution of fracture-related and joint-related procedures within each surgical site. Percentages reflect the intra-group ratio for each category.

Table 3.

Postoperative Outcomes

All patients (n=150) Normal GS (n=113) Low GS (n=37) P value
Delirium 17 (11.3) 4 (3.5) 13 (36.1) <0.001*
Length of hospital stay (d) 8 (5, 11) 7 (5, 11) 8.5 (6, 12.8) 0.073
30-day discharge 145 (96.7) 112 (98.2) 33 (91.7) 0.090
Inpatient mortality 1 (0.7) 0 (0) 1 (2.8) 0.240

Values are presented as number (%) or median (1Q, 3Q). GS: grip strength.

*

P values indicate statistical significance of < 0.05.

Table 4.

Logistic Regression of Delirium Occurrence and Variables Using AWGS Criteria for Low GS

B P value Exp (B) 95% CI
Low GS based on AWGS 2.744 < 0.001 15.543 4.647–51.989
Constant –3.314 < 0.001 0.036

Logit P (delirium by CAM-ICU) = –3.314 + 2.744 (low GS by AWGS). Variables included in the models were selected through stepwise forward selection from candidate predictors with univariate P value < 0.1. Exp: exponential, CI: confidence interval, AWGS: Asian Working Group for Sarcopenia, GS: grip strength, CAM-ICU: Confusion Assessment Method for the Intensive Care Unit.

Table 5.

Logistic Regression of Delirium Occurrence and Variables Using Raw GS Values (kg)

B P value Exp (B) 95% CI
American Society of Anesthesiologists physical status ≥ III 1.754 0.100 5.779 0.713–46.806
Raw GS (kg) –0.098 0.013 0.906 0.839–0.979
Intraoperative remifentanil use 1.331 0.095 3.784 0.795–18.006
Constant –2.401 0.119 0.091

Logit P (delirium by CAM-ICU) = –2.401 + 1.754 (ASA physical status ≥ III) – 0.098 (raw GS in kg) + 1.331 (intraoperative remifentanil use). Variables included in the models were selected through stepwise forward selection from candidate predictors with univariate P value < 0.1. GS: grip strength, Exp: exponential, CI: confidence interval, CAM-ICU: Confusion Assessment Method for the Intensive Care Unit.