Research Perspective Advance Articles

Analysis of the current state of frailty indexes and their implementation for aging intervention studies

Oliver G. Frost1,2, , Anna Barkovskaya2, , Michael J. Rae2, , Marcela Atzori2, , Abdelhadi Rebbaa2, *, , Amit Sharma2, *, ,

  • 1 Centre for Biological Engineering, School of Mechanical, Electrical and Manufacturing Engineering, Loughborough University, Loughborough LE11 3TU, UK
  • 2 Lifespan Research Institute, Mountain View, CA 94041, USA
* Equal contribution

Received: March 14, 2025       Accepted: August 13, 2025       Published: August 26, 2025      

https://doi.org/10.18632/aging.206307
How to Cite

Copyright: © 2025 Frost et al. This is an open access article distributed under the terms of the Creative Commons Attribution License (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Abstract

Animal lifespan studies are foundational to developing interventions against the biological aging process. In recent years, there has been rising interest in characterizing the effects of longevity therapeutics on health span. Frailty indexes, originally developed to assess clinical frailty in aging humans, have shown promise as measurements of biological age and have been adopted for use in rodent aging biology. This Perspective looks at the current state of rodent frailty indexes and how they are implemented. The differences in frailty parameters used to calculate these indexes have led to inconsistencies between studies defining frailty. In this Perspective, we have highlighted those differences and made recommendations for implementing protocols for frailty index measurement.

Introduction

Biological aging is the accumulation of damage to the cellular and molecular functional units of tissues over time, leading to breakdowns of physiological systems, age-related disease, and, ultimately, mortality [1]. This process, combined with the global population undergoing a demographic transition in which a rising percentage is living into historically advanced ages, motivates the search for therapies that would intervene in the biological aging process and thus delay, decelerate, or reverse multiple manifestations of age-related ill health [2].

The cornerstone test for longevity therapeutics in animal models is the lifespan study, emphasizing maximum lifespan [3, 4]. However, potential treatments should also increase the time spent in good health, motivating interest in developing quantitative metrics for health span. One approach to this metric is the rodent frailty index (FI). In humans, frailty is a clinical syndrome characterized by a decline in physiological resilience and increased vulnerability to adverse health outcomes, particularly in the elderly [5]. As a population ages, there is an increase in frailty, especially over the age of 65, with a significant correlation between frailty, mortality, and hospitalization being evidenced [6, 7]. Hence, researchers have constructed practical and objective measures of frailty to manage and test new interventions against the syndrome. The frailty index is a quantitative, multidimensional approach that evaluates an individual’s level of frailty based on the accumulation of health deficits [8]. Subsequent research has shown that FIs are robust predictors of mortality in humans, even in people early in middle age who are not colloquially “frail” [9, 10] and similar deficit-based constructs respond to putative longevity interventions in humans [1013].

More recently, rodent FIs have been developed on principles like the human originals, using a panel of physical and/or clinical parameters to measure frailty. However, developing FIs in rodents is not straightforward. Researchers have adopted different approaches by varying the types and numbers of parameters to be included in the FI and cut-off points that classify an animal as frail. This variation results in each study having a different operational definition of frailty, which reduces the ability to compare findings across studies, thereby impeding progress toward longevity therapeutics. Even in humans, where sample sizes are larger and clinical data for FI construction is broader and more robust, different frailty instruments vary in which patients they identify as frail and are challenging to implement in clinical settings [14].

This Perspective highlights these differences and compares rodent and human frailty index scores. We summarise the published strategies for implementing frailty indexes, discuss their strengths and weaknesses, and implement a frailty index based on physical performance with our young and old mice. The Perspective concludes by making recommendations regarding FI use for studies of aging, age-related disease, and longevity therapeutics.

Materials and Methods

Animals

Young (3-4 months), middle-aged (18 months), and old (28 months), mainly male C57BL/6 mice, were acquired from Charles River Laboratories (Wilmington, MA, USA) (n = 3-7 per group). Young 3-month-old (n=5) mice were all male, 4-month-old mice included two males and two females, 18-month-old (n=5) were all male, and 28-month-old (n=4) were three males and one female. Mice were housed in a vivarium with ad libitum access to food (Teklad Global Soy protein-free, Envigo) and water (Aquavive Water, Innovive Inc). Cages were changed every 1-2 weeks depending on the number per cage, and documents were maintained under national and local regulations. All animal experiments in this study are approved by the Lifespan Research Institute IACUC committee following the “Guide for the Care and Use of Laboratory Animals” prepared by the Institute for Laboratory Animal Research, National Academy of Sciences.

Grip strength

C57BL/6 mice were used to establish baseline grip strength using the Grip Strength Meter (47200, Ugo Basile®). The mice were held by the mid-base of the tail, allowing their front paws to grip the bar. Once gripped, the mice were pulled back steadily, keeping them horizontal. The flat bar was used to measure the highest forelimb grip strength of the mice. To account for weight increases in older mice, results were normalised to weight. Each mouse was tested 4 times. The weakest measurement was removed and the average of the remaining scores was calculated.

Open field testing

C57BL/6 mice were used to establish a baseline of mobility and behavior using open-field testing (Noldus Ethovision XT 17.5). Two arenas side by side (50x50cm) were set up with Basler GenICam (acA1300-60, Basler) above. Arena settings were utilized to establish each arena's scale (50cm) and center and periphery. The periphery is defined as the outermost 10cm on either side of the center (30cm) of the arena. Each mouse began the experiment in the bottom left corner and was recorded for 10 minutes. Video analysis software (Ethovision XT 17.5) was used to track the mice using the center body point and calculate the total distance moved, percent of time spent in movement, average velocity, body elongation, and mobility. Between each trial, 70% ethanol was used to clean the arena to remove scent marks, and a 10-minute wait period was used to allow ethanol evaporation.

Measures used to construct the frailty index (FI)

The factors from open-field testing used to comprise the frailty index are previously described [15, 16]. They include the total distance moved (tracked by center-point over 10 minutes in cm), the maximum distance between two consecutive points in the tracks in cm (available in trial statistics of distance moved in the Ethovision analysis profile), total duration of movement (s), proportion of time spent moving (%), meander (change in direction per unit distance moved measured in degrees/cm from 0°-180°), average velocity (cm/s), rearing frequency (occurrence/time), and weight (g). For both studies, the duration of the open-field test was 10 minutes.

Frailty index cutoff points

The young, middle-aged, and old mice required reference values to determine frailty levels. For these, we included our 3-4-month-old mice and the reference values from two previous reports, which were in one case males aged 5 months (n = 5) [15], and in the other males and females aged 13.5 months (n=3 per sex) [16]. When using reference values from 3-month-old male mice, all females from our cohort were removed. For each mouse, each factor of the FI was compared to the reference values. The frailty scores were applied if the measured score differed from the reference by at least 1SD (Standard Deviation). They were graded as follows: less than 1SD scored 0; values differed by ± 1 SD scored 0.25; values that differed by ± 2 SD scored 0.5; values differed by ± 3SD scored 0.75; values that differed by more than 3SD scored maximum frailty of 1. The sum of these scores for each factor was divided by the number of parameters (8) to produce a total frailty score for each mouse [16].

Statistics

GraphPad Prism V10.1.0 software was used to perform statistical analysis. A two-way ANOVA analysis was used to see if two or more independent variables affected the dependent variable (multiple age groups in frailty scores or grip strength). To be considered statistically significant, the P-value must be ≤ 0.05 and was quantified in the following order. * = p ≤ 0.05, ** p = ≤ 0.01, *** = p ≤0.001, **** = p ≤0.0001. For each experiment and each condition, n ≥ 3.

Study selection process

A rigorous literature review with a range of keywords was used to search peer-reviewed journals: frailty, frailty index, aging, mice, longevity, healthspan, rodents, and phenotype. Databases searched included PubMed, EBSCOhost, Google Scholar, and Loughborough University Ex Libris. Only original research articles were included in the criteria, literature reviews and other article types were excluded. Only frailty indexes based on rodents were included with human frailty indexes excluded. This method provided 18 peer-reviewed articles published between 2012 and 2023. Other papers were identified but not included as this Perspective focuses on frailty indexes that are novel, modified, or further validate established versions [1532].

Results

All FIs included in this review provide a numeric score indicating the level of frailty present, but their respective methods differ significantly (Supplementary Table 1). When choosing which established FI to implement, one must examine what cutoff point to utilize, what reference values to use if a scaled frailty score is desired or the cohort is small, the equipment available, and the factors to include. For instance, scoring systems may rely on quantifying physical performance (13/18 studies) [15, 16, 19, 2226, 28, 29, 3133] or clinical observations (13/18 studies) [15, 16, 1822, 25, 27, 2932], with several articles using both [15, 16, 19, 22, 25, 29, 31, 32]. Those measuring physical fitness are modelled on Fried et al., 2001, who created a human frailty index measuring four key factors: weakness, slowness, low activity, and poor endurance [34].

The contents of the FIs vary significantly, depending on whether clinical observations or physical outputs are measured. For instance, those focusing on physical measurements have fewer items in their FI (4-5 items (7/18 studies) [17, 2224, 26, 28, 31] or 8 items (4/18 studies)) [15, 16, 29, 32], whereas those using clinical observations measured 23-34 items (12/18 studies) [15, 16, 1821, 25, 27, 2932]. Additionally, there is variation in the cutoff points used to determine whether a subject is frail and to what extent. This includes 0.8SD from a reference point or the lowest 20% of a cohort (7/18 studies) [2224, 26, 28, 29, 31], 1.5SD (4/18 studies) [17, 23, 27, 31], a staggered cutoff point of 1, 2, 3, 3+SD (9/18 studies) [15, 16, 1821, 25, 27, 30], or visual determination of 0 = not frail, 0.5 = mildly frail, and 1 = frail (9/18 studies) [15, 1821, 27, 29, 30, 32]. The cohort mean is often utilized as the reference point. But if a staggered cutoff point is used, a reference value is required from a control subject group. If the cohort mean-SD is used as the cut-off point, the frailty scoring is binary, either not frail = 0 or frail = 1. 0. When multiple cut-off points are used (1, 2, 3, 3+SD), then frailty is scored in a gradient (0, 0.25, 0.5, 0.75, 1).

While human frailty is the basis for rodent FIs, how to compare data from rodent FIs to equivalent effects if translated to humans is not clear. One approach is to compare deficits among animals and humans of similar biological age. Various investigators have developed systematic methods of equating biological age between mice and humans, including development, epigenetic age clocks, gene expression patterns, disease onset ages, median and maximum lifespan proportions, and/or the trajectory of the survival curve [3538]. As a result, several studies [1517, 2022, 26, 27] have linked their rodent analysis to quantitative human data, whereas others make no direct comparison (Supplementary Table 1) [18, 19, 2325, 2832].

Those making such comparisons often use deficit accumulation as the key metric (natural log of FI vs. age) after normalizing human and mouse data sets to 90% mortality values or comparing the equivalent ages of the two species. The problem with using corresponding ages is the inconsistency across the literature in what age cutoffs are considered equivalent. Liu et al. identified 9% of 27-28-month-old mice as frail, consistent with frailty levels in 80-year-old humans [17]. While Baumann et al. found all mice frail at 32 months of age but compared this to 60+ years in humans, where 5-10% of 60-69-year-olds or 26-65% of 85+-year-olds are frail [26]. Another publication suggests that a 32-month-old mouse is equivalent to a 109-year-old human [35]. Kane et al. observed 16-44% of 23-month-old mice as frail depending on the FI index used, while humans aged 65+ years showed a 22-32% frailty range using comparable indexes [22]. Two FIs quantified different animals of the same age group as frail [22]. Furthermore, another study compared three FIs in a group of 24-month-old males and found inconsistency between which mice were frail [31].

Like the heterogeneity of mouse data, the lack of a universally agreed standard of human frailty scoring ensures that the reference ages for the percentage of the population identified as frail at a given age will also be inconsistent. The difficulty in making precise comparisons is evident, though the underlying fact of deficit accumulation increasing with age remains. Developing a simplified approach for FI calculations in murine models and humans is essential for standardizing assessments, improving reproducibility, and validating medical interventions for their translational potential for human aging and frailty management.

We strived to provide recommendations for implementing a FI in murine models with commonly available equipment and inform our analysis of the challenges in doing so. To this end, we scored our mice on the 8-item FI developed and implemented in the literature, as it suited the available equipment and allowed frailty to be measured as a gradient [15, 16]. To score the mice in our study, we utilized the reference values from these studies. It is clear from this implementation that the reference ranges in these studies are not consistent, and further work will be required to develop reproducible reference ranges. However, our data are underpowered and are intended for illustrative purposes only and should not be used to draw independent conclusions. As the reference values of one study had both sexes, and our cohort did too, we matched them by sex [16]. Notably, the other reference value cohort was female, and most of our subjects were male [15]. However, the 3-4-month-old mice in this study are younger than those used to establish the reference values [15, 16].

Sex as a biological variable in FIs is an important consideration, as there is a known difference between male and female frailty onset and progression. In humans, females show higher frailty index scores in all ages compared to males [39]. For aging studies in mice in this Perspective, males are predominantly used (10/18 studies) [17, 18, 2024, 26, 31, 32] with both sexes (5/18 studies) [16, 19, 25, 27, 29] and females (3/18 studies) [15, 28, 30] used significantly less. One study measuring both sexes found males to have higher frailty scores in an Alzheimer’s model [29]. Another found that in C57BL/6 mice the frailty index implemented altered which sex had the higher frailty score [27]. Therefore, with the key role sex can play in frailty, it is preferred to separate the age groups by sex.

Using the first set of published reference values, the 8-item FI yielded frailty scores of 0.37/1 for our 3-4-month-old mice, 0.52/1 for our 18-month-old mice, and 0.66/1 for our 28-month-old mice (Figure 1A) [16]. The second published reference values (averages of trial 1+2) yielded scores for the 3-4-month-old mice as 0.26/1, 18 months as 0.46/1, and 28 months as 0.48/1 frailty scores (Figure 1A1i) [15]. In both reference sets, the frailty scores among the 3–4-month-old animals were notably high. The parameters responsible for scoring young mice as frail were the maximum distance post inactivity, meander, and movement duration, which affected this group's overall frailty score.

Analysis of the frailty phenotype in 3-4, 18, and 28-month-old C57BL/6 mice. (A) Frailty index scores implemented using Parks et al., 2012 reference values. (A–i) Frailty index scores implemented using Whitehead et al., 2014 reference values. (A–ii) Frailty index scores using our own 3–4-month-old mice as reference values. (B) Grip strength (average forelimb parallel bar score) is normalized to weight for 3-4, 18, and 28-month-old mice. P-values ≤ 0.05 (*), ≤ 0.01 (**), ≤ 0.001 (***), ≤ 0.0001 (****) were calculated using one-way ANOVA for three or more independent groups or unpaired t-tests for just two independent groups. Columns represent the mean with error bars ± SD.

Figure 1. Analysis of the frailty phenotype in 3-4, 18, and 28-month-old C57BL/6 mice. (A) Frailty index scores implemented using Parks et al., 2012 reference values. (Ai) Frailty index scores implemented using Whitehead et al., 2014 reference values. (Aii) Frailty index scores using our own 3–4-month-old mice as reference values. (B) Grip strength (average forelimb parallel bar score) is normalized to weight for 3-4, 18, and 28-month-old mice. P-values ≤ 0.05 (*), ≤ 0.01 (**), ≤ 0.001 (***), ≤ 0.0001 (****) were calculated using one-way ANOVA for three or more independent groups or unpaired t-tests for just two independent groups. Columns represent the mean with error bars ± SD.

High scores in movement duration in our young mice may be due to differences in acclimatization protocols. Parks et al. allowed 5 days of testing in the arena, using the last 2 days for assessment [16]. We acclimatized the mice to the testing room but not the arena for an hour before data collection, as open field testing depends on the inherent explorative nature of mice. The meander can also be measured differently, either as a relative or absolute meander, using Ethovision video software analysis (version 17.0.1). It can also be calculated from either the body point or the head direction. The maximal distance post-inactivity could also benefit from further clarification on how it is precisely measured; for example, how inactivity was defined and what body point was measured.

Based on these potential sources of discrepancy, we recommend each lab use its own reference mice to limit variability. When we did this using our 3-month-old mice to establish the reference values, the overall frailty scores in mice aged 18-28 months were much lower than the reference values that were used from similarly aged mice from published literature (0.03/1 [16] and 0.22/1 [15], respectively (Figure 1Aii)). Most studies (12/18) have used C57BL/6 mice to assess new FIs or modify existing FIs. Therefore, if using a different mouse strain or a transgenic or disease model, identifying the baseline of a lab’s rodent population is even more critical. We recommend using each mouse as its reference point for longitudinal studies, strengthening the analysis without increasing the workload, as the cost is often a significant factor for in vivo studies involving aged subjects, especially longitudinal studies. A common strategy for anti-aging interventions is to collect baseline data before and after treatment.

One challenge when creating reference values from a group of young mice is inherent variation within the group. If the reference group variation is significant, some of the frailty parameters in test mice may not reach a score of 1. One strategy to circumvent this is to reduce diversity in the reference value group. For example, Antoch et al. excluded animals if their scores exceeded the mean by more than one SD [25]. However, inclusion or exclusion of the outliers in small sample sizes risks creating non-representative values. Using each mouse as its reference point can also circumvent this potential issue. Similarly, to ascertain if a treatment can improve frailty we suggest scoring the same mice before treatment and utilizing that as individual reference values, instead of comparing the FIs of an aged, treated group with young, untreated mice as commonly done in rodent aging studies.

Similar to 4-5 item FIs, we included grip strength to characterize the physical health of the mice further. Due to equipment limitations, we were unable to fully implement the 4-5 item FI, which required an inverted cling-grip test, a rotarod, and voluntary wheel running cages. In grip strength scores normalized by weight, the 3- to 4-month-old mice had the highest average score, with 18- and 28-month-old mice having lower scores accordingly. However, the difference between 18 and 28 months was not statistically significant due to a single animal’s exceptionally high score (Figure 1B). Using individual mice as their comparators in longitudinal assessments could help identify any increases or decreases in performance. Normalization by weight is imperfect because it does not account for the changes in muscle mass and fat between young and old mice, as older mice tend to gain fat and lose muscle mass. To address this, body composition measurements would be preferable, although they require specialized and expensive equipment.

One limitation of this study is the low number of animals per group, which limits the results for animals of a given age, as some recommend a minimum of 20 animals per sex to measure physiological changes. However, this Perspective may be helpful in frailty-scoring selections for small-scale studies with limited subjects.

Open-field testing (OFT) or automated video testing simplifies frailty measurements when the methods are clearly defined and well-documented. Furthermore, it eliminates errors introduced by limited inter-rater reliability, a well-recognized problem in clinical observation FIs, as user input is minimal [18, 20]. To determine the most significant factors from OFT to include, an independent study is necessary to identify which parameters best predict mortality. For instance, investigators could make periodic measurements of the various OFT parameters (e.g., every 3 months) with a significant sample size until death, and then assess each factor to determine its weight in predicting mortality compared to an established FI.

The reason for this extensive independent study is the components of the current indexes. For instance, 8 item FIs include the total distance (cm), velocity (cm/s), and movement duration (s and %). Therefore, 4/8 factors measure closely related physiological traits, giving movement great weight in this index. Although walking speed is a well-established mortality indicator and predictor of surgical outcomes in humans, assigning movement-related variables half the overall score in a rodent FI will likely overweight the index [6, 7]. Principal component analysis should be applied to reduce the number of variables and choose those that are both highly predictive of mortality and (amongst those that fall within the same principal component) most convenient for implementation. One study prioritized a diversity of health-related physiological systems over a single key physiological trait, alongside quantitative parameters without visual scoring, while maintaining minimal invasiveness [25]. For future additions to FIs, one study measured gait speed in the cage and on the wheel in C57BL/6 mice and found it correlated with age and the manual frailty index [32].

Further development of frailty indices could improve their accuracy. Quantitative and automated measures would help further reduce inter-rater reliability concerns and experimenter bias, and in turn lab-to-lab variation. Measurements such as bone density, measured non-invasively by micro-CT, have been proven to show differences in murine age, especially in cranio-facial bones [4042]. However, this technique comes with a significant cost. Changes in eating behaviour, such as food dropping, reduced consumption, or altered eating patterns, serve as another quantitative measure observed in aging mice for future frailty indices [43, 44].

Aging also affects the circadian rhythm and feeding patterns in mice, as caloric restriction and feeding during the active phase of the circadian rhythm resulted in extended healthspan and lifespan [45]. One study in this Perspective identified an age-related change in the circadian distribution of wheel running, suggesting the inclusion of the circadian rhythm in frailty indices is warranted [32]. Evidence in humans corroborates the significant changes in the circadian rhythm with age, potentially offering additional translational evidence for future frailty indices [46]. To assess the age-related decline in cognitive function, namely spatial learning and memory, a Barnes-Maze test could be implemented. One group developed a cognitive frailty index (CoFI) to assess various parameters of the Barnes-Maze test in over 400 C57BL/6 mice [47]. While no sex-related differences were observed, an increase in CoFI scores with advancing age and a strong association with mortality were evident [45, 32, 46].

Another aspect is olfactory tests, which measure the loss of smell evident in aged C57BL/6 mice, where the loss of odor discrimination was among the earliest biomarkers compared to cognitive and motor function tests [48]. Olfactory tests have been included in a mouse Social Frailty Index (mSFI), which also analyses urine marking, social interactions, and nest building [49]. Interestingly, sex differences were observed, with females exhibiting lower mSFI scores compared to males, which correlates with data from physical indices. The application of a physical and cognitive/social frailty index could provide greater insight into the holistic effect of any anti-aging intervention.

Conclusion

Studies discussed in this Perspective offer a variety of approaches to measuring frailty. We recommend that investigators carefully consider what aspects of frailty to include in their analyses instead of fully adopting the published scoring systems. It is preferable to avoid including and giving equal weight to multiple parameters that measure closely related physiological traits, such as multiple parameters related to movement. In addition, it is best to use automated measurements where possible to avoid experimenter bias and inter-rater variations. As there is substantial variation even between mice of the same strain, it is optimal to measure baseline frailty in each animal and to track changes over time in response to treatment and during physiological aging, especially in a small sample size. Doing so accounts for variability in baseline values. It increases sensitivity in detecting subsequent changes while reducing the chances of false positives, resulting in a more reliable measurement of physical health in aging rodents and the effects of longevity therapeutic candidates.

Supplementary Materials

Supplementary Table 1

Author Contributions

O.G.F. Analyzed the literature, generated the data, FI scores, table, and graph, and wrote the manuscript. A.B. generated data and edited the manuscript. M.R. edited the manuscript. M.A. helped generate data. A.R. conceptualized the frailty measurement project, supervised its execution, and edited the manuscript. A.S. conceptualized the manuscript, supervised data generation, and edited the manuscript.

Conflicts of Interest

The authors declare no financial interests related to this work.

Ethical Statement

All procedures were conducted in accordance with the National Institutes of Health’s Guide for the Care and Use of Laboratory Animals and were approved by the Lifespan Research Institute Subcommittee on Research and Animal Care (#SRF-01.02).

Funding

We want to acknowledge funding agencies who supported this work at the Longevity Research Institute and Loughborough University (EPSRC, grant number 2610407).

References

  • 1. Moqri M, Herzog C, Poganik JR, Justice J, Belsky DW, Higgins-Chen A, Moskalev A, Fuellen G, Cohen AA, Bautmans I, Widschwendter M, Ding J, Fleming A, et al, and Biomarkers of Aging Consortium. Biomarkers of aging for the identification and evaluation of longevity interventions. Cell. 2023; 186:3758–75. https://doi.org/10.1016/j.cell.2023.08.003 [PubMed]
  • 2. Rae MJ, Butler RN, Campisi J, de Grey AD, Finch CE, Gough M, Martin GM, Vijg J, Perrott KM, Logan BJ. The demographic and biomedical case for late-life interventions in aging. Sci Transl Med. 2010; 2:40cm21. https://doi.org/10.1126/scitranslmed.3000822 [PubMed]
  • 3. Spindler SR. Review of the literature and suggestions for the design of rodent survival studies for the identification of compounds that increase health and life span. Age (Dordr). 2012; 34:111–20. https://doi.org/10.1007/s11357-011-9224-6 [PubMed]
  • 4. Frost OG, Ramkilawan P, Rebbaa A, Stolzing A. A systematic review of lifespan studies in rodents using stem cell transplantations. Ageing Res Rev. 2024; 97:102295. https://doi.org/10.1016/j.arr.2024.102295 [PubMed]
  • 5. Rodríguez-Mañas L, Féart C, Mann G, Viña J, Chatterji S, Chodzko-Zajko W, Gonzalez-Colaço Harmand M, Bergman H, Carcaillon L, Nicholson C, Scuteri A, Sinclair A, Pelaez M, et al, and FOD-CC group (Appendix 1). Searching for an operational definition of frailty: a Delphi method based consensus statement: the frailty operative definition-consensus conference project. J Gerontol A Biol Sci Med Sci. 2013; 68:62–67. https://doi.org/10.1093/gerona/gls119 [PubMed]
  • 6. Kojima G, Iliffe S, Walters K. Frailty index as a predictor of mortality: a systematic review and meta-analysis. Age Ageing. 2018; 47:193–200. https://doi.org/10.1093/ageing/afx162 [PubMed]
  • 7. Kojima G, Iliffe S, Jivraj S, Walters K. Association between frailty and quality of life among community-dwelling older people: a systematic review and meta-analysis. J Epidemiol Community Health. 2016; 70:716–21. https://doi.org/10.1136/jech-2015-206717 [PubMed]
  • 8. Howlett SE, Rutenberg AD, Rockwood K. The degree of frailty as a translational measure of health in aging. Nat Aging. 2021; 1:651–65. https://doi.org/10.1038/s43587-021-00099-3 [PubMed]
  • 9. Li X, Ploner A, Karlsson IK, Liu X, Magnusson PK, Pedersen NL, Hägg S, Jylhävä J. The frailty index is a predictor of cause-specific mortality independent of familial effects from midlife onwards: a large cohort study. BMC Med. 2019; 17:94. https://doi.org/10.1186/s12916-019-1331-8 [PubMed]
  • 10. Belsky DW, Caspi A, Houts R, Cohen HJ, Corcoran DL, Danese A, Harrington H, Israel S, Levine ME, Schaefer JD, Sugden K, Williams B, Yashin AI, et al. Quantification of biological aging in young adults. Proc Natl Acad Sci USA. 2015; 112:E4104–10. https://doi.org/10.1073/pnas.1506264112 [PubMed]
  • 11. Mitnitski AB, Mogilner AJ, Rockwood K. Accumulation of deficits as a proxy measure of aging. ScientificWorldJournal. 2001; 1:323–36. https://doi.org/10.1100/tsw.2001.58 [PubMed]
  • 12. Belsky DW, Huffman KM, Pieper CF, Shalev I, Kraus WE. Change in the Rate of Biological Aging in Response to Caloric Restriction: CALERIE Biobank Analysis. J Gerontol A Biol Sci Med Sci. 2017; 73:4–10. https://doi.org/10.1093/gerona/glx096 [PubMed]
  • 13. Xu X, Hu J, Pang X, Wang X, Xu H, Yan X, Zhang J, Pan S, Wei W, Li Y. Association between plant and animal protein and biological aging: findings from the UK Biobank. Eur J Nutr. 2024; 63:3119–32. https://doi.org/10.1007/s00394-024-03494-9 [PubMed]
  • 14. Ibrahim K, Howson FF, Culliford DJ, Sayer AA, Roberts HC. The feasibility of assessing frailty and sarcopenia in hospitalised older people: a comparison of commonly used tools. BMC Geriatr. 2019; 19:42. https://doi.org/10.1186/s12877-019-1053-y [PubMed]
  • 15. Whitehead JC, Hildebrand BA, Sun M, Rockwood MR, Rose RA, Rockwood K, Howlett SE. A clinical frailty index in aging mice: comparisons with frailty index data in humans. J Gerontol A Biol Sci Med Sci. 2014; 69:621–32. https://doi.org/10.1093/gerona/glt136 [PubMed]
  • 16. Parks RJ, Fares E, Macdonald JK, Ernst MC, Sinal CJ, Rockwood K, Howlett SE. A procedure for creating a frailty index based on deficit accumulation in aging mice. J Gerontol A Biol Sci Med Sci. 2012; 67:217–27. https://doi.org/10.1093/gerona/glr193 [PubMed]
  • 17. Liu H, Graber TG, Ferguson-Stegall L, Thompson LV. Clinically relevant frailty index for mice. J Gerontol A Biol Sci Med Sci. 2014; 69:1485–91. https://doi.org/10.1093/gerona/glt188 [PubMed]
  • 18. Feridooni HA, Sun MH, Rockwood K, Howlett SE. Reliability of a Frailty Index Based on the Clinical Assessment of Health Deficits in Male C57BL/6J Mice. J Gerontol A Biol Sci Med Sci. 2015; 70:686–93. https://doi.org/10.1093/gerona/glu161 [PubMed]
  • 19. Kane AE, Hilmer SN, Boyer D, Gavin K, Nines D, Howlett SE, de Cabo R, Mitchell SJ. Impact of Longevity Interventions on a Validated Mouse Clinical Frailty Index. J Gerontol A Biol Sci Med Sci. 2016; 71:333–39. https://doi.org/10.1093/gerona/glu315 [PubMed]
  • 20. Yorke A, Kane AE, Hancock Friesen CL, Howlett SE, O’Blenes S. Development of a Rat Clinical Frailty Index. J Gerontol A Biol Sci Med Sci. 2017; 72:897–903. https://doi.org/10.1093/gerona/glw339 [PubMed]
  • 21. Rockwood K, Blodgett JM, Theou O, Sun MH, Feridooni HA, Mitnitski A, Rose RA, Godin J, Gregson E, Howlett SE. A Frailty Index Based On Deficit Accumulation Quantifies Mortality Risk in Humans and in Mice. Sci Rep. 2017; 7:43068. https://doi.org/10.1038/srep43068 [PubMed]
  • 22. Kane AE, Huizer-Pajkos A, Mach J, Mitchell SJ, de Cabo R, Le Couteur DG, Howlett SE, Hilmer SN. A Comparison of Two Mouse Frailty Assessment Tools. J Gerontol A Biol Sci Med Sci. 2017; 72:904–09. https://doi.org/10.1093/gerona/glx009 [PubMed]
  • 23. Miller MG, Thangthaeng N, Shukitt-Hale B. A Clinically Relevant Frailty Index for Aging Rats. J Gerontol A Biol Sci Med Sci. 2017; 72:892–96. https://doi.org/10.1093/gerona/glw338 [PubMed]
  • 24. Gomez-Cabrera MC, Garcia-Valles R, Rodriguez-Mañas L, Garcia-Garcia FJ, Olaso-Gonzalez G, Salvador-Pascual A, Tarazona-Santabalbina FJ, Viña J. A New Frailty Score for Experimental Animals Based on the Clinical Phenotype: Inactivity as a Model of Frailty. J Gerontol A Biol Sci Med Sci. 2017; 72:885–91. https://doi.org/10.1093/gerona/glw337 [PubMed]
  • 25. Antoch MP, Wrobel M, Kuropatwinski KK, Gitlin I, Leonova KI, Toshkov I, Gleiberman AS, Hutson AD, Chernova OB, Gudkov AV. Physiological frailty index (PFI): quantitative in-life estimate of individual biological age in mice. Aging (Albany NY). 2017; 9:615–26. https://doi.org/10.18632/aging.101206 [PubMed]
  • 26. Baumann CW, Kwak D, Thompson LV. Assessing onset, prevalence and survival in mice using a frailty phenotype. Aging (Albany NY). 2018; 10:4042–53. https://doi.org/10.18632/aging.101692 [PubMed]
  • 27. Kane AE, Keller KM, Heinze-Milne S, Grandy SA, Howlett SE. A Murine Frailty Index Based on Clinical and Laboratory Measurements: Links Between Frailty and Pro-inflammatory Cytokines Differ in a Sex-Specific Manner. J Gerontol A Biol Sci Med Sci. 2019; 74:275–82. https://doi.org/10.1093/gerona/gly117 [PubMed]
  • 28. Kwak D, Baumann CW, Thompson LV. Identifying Characteristics of Frailty in Female Mice Using a Phenotype Assessment Tool. J Gerontol A Biol Sci Med Sci. 2020; 75:640–46. https://doi.org/10.1093/gerona/glz092 [PubMed]
  • 29. Todorovic S, Loncarevic-Vasiljkovic N, Jovic M, Sokanovic S, Kanazir S, Mladenovic Djordjevic A. Frailty index and phenotype frailty score: Sex- and age-related differences in 5XFAD transgenic mouse model of Alzheimer’s disease. Mech Ageing Dev. 2020; 185:111195. https://doi.org/10.1016/j.mad.2019.111195 [PubMed]
  • 30. Ribeiro AR, Barros C, Barateiro A, Howlett SE, Fernandes A. Improved Assessment of Overall Health in Variably Aged Murine Models of Multiple Sclerosis With a Novel Frailty Index Tool. J Gerontol A Biol Sci Med Sci. 2022; 77:1–9. https://doi.org/10.1093/gerona/glab185 [PubMed]
  • 31. Mach J, Allore H, Gnjidic D, Gemikonakli G, Kane AE, Howlett SE, de Cabo R, Le Couteur D, Hilmer SN. Preclinical frailty assessments: Phenotype and frailty index identify frailty in different mice and are variably affected by chronic medications. Exp Gerontol. 2022; 161:111700. https://doi.org/10.1016/j.exger.2022.111700 [PubMed]
  • 32. Ruby JG, Di Francesco A, Ylagan P, Luo A, Keyser R, Williams O, Spock S, Li W, Vongtharangsy N, Chatterjee S, Sloan CA, Ledogar C, Kuiper V, et al. An Automated, Home-Cage, Video Monitoring-based Mouse Frailty Index Detects Age-associated Morbidity in C57BL/6 and Diversity Outbred Mice. J Gerontol A Biol Sci Med Sci. 2023; 78:762–70. https://doi.org/10.1093/gerona/glad035 [PubMed]
  • 33. Zhang Y, Ikeno Y, Bokov A, Gelfond J, Jaramillo C, Zhang HM, Liu Y, Qi W, Hubbard G, Richardson A, Van Remmen H. Dietary restriction attenuates the accelerated aging phenotype of Sod1(-/-) mice. Free Radic Biol Med. 2013; 60:300–6. https://doi.org/10.1016/j.freeradbiomed.2013.02.026 [PubMed]
  • 34. Fried LP, Tangen CM, Walston J, Newman AB, Hirsch C, Gottdiener J, Seeman T, Tracy R, Kop WJ, Burke G, McBurnie MA, and Cardiovascular Health Study Collaborative Research Group. Frailty in older adults: evidence for a phenotype. J Gerontol A Biol Sci Med Sci. 2001; 56:M146–56. https://doi.org/10.1093/gerona/56.3.m146 [PubMed]
  • 35. Dutta S, Sengupta P. Men and mice: Relating their ages. Life Sci. 2016; 152:244–8. https://doi.org/10.1016/j.lfs.2015.10.025 [PubMed]
  • 36. Wang T, Ma J, Hogan AN, Fong S, Licon K, Tsui B, Kreisberg JF, Adams PD, Carvunis AR, Bannasch DL, Ostrander EA, Ideker T. Quantitative Translation of Dog-to-Human Aging by Conserved Remodeling of the DNA Methylome. Cell Syst. 2020; 11:176–85.e6. https://doi.org/10.1016/j.cels.2020.06.006 [PubMed]
  • 37. Zhuang J, Zhang L, Dai S, Cui L, Guo C, Sloofman L, Yang J. Comparison of multi-tissue aging between human and mouse. Sci Rep. 2019; 9:6220. https://doi.org/10.1038/s41598-019-42485-3 [PubMed]
  • 38. Geifman N, Rubin E. The mouse age phenome knowledgebase and disease-specific inter-species age mapping. PLoS One. 2013; 8:e81114. https://doi.org/10.1371/journal.pone.0081114 [PubMed]
  • 39. Gordon EH, Peel NM, Samanta M, Theou O, Howlett SE, Hubbard RE. Sex differences in frailty: A systematic review and meta-analysis. Exp Gerontol. 2017; 89:30–40. https://doi.org/10.1016/j.exger.2016.12.021 [PubMed]
  • 40. Bellantuono I, de Cabo R, Ehninger D, Di Germanio C, Lawrie A, Miller J, Mitchell SJ, Navas-Enamorado I, Potter PK, Tchkonia T, Trejo JL, Lamming DW. A toolbox for the longitudinal assessment of healthspan in aging mice. Nat Protoc. 2020; 15:540–74. https://doi.org/10.1038/s41596-019-0256-1 [PubMed]
  • 41. Scheuren AC, Kuhn GA, Müller R. Effects of long-term in vivo micro-CT imaging on hallmarks of osteopenia and frailty in aging mice. PLoS One. 2020; 15:e0239534. https://doi.org/10.1371/journal.pone.0239534 [PubMed]
  • 42. Shim J, Iwaya C, Ambrose CG, Suzuki A, Iwata J. Micro-computed tomography assessment of bone structure in aging mice. Sci Rep. 2022; 12:8117. https://doi.org/10.1038/s41598-022-11965-4 [PubMed]
  • 43. Starr ME, Saito H. Age-related increase in food spilling by laboratory mice may lead to significant overestimation of actual food consumption: implications for studies on dietary restriction, metabolism, and dose calculations. J Gerontol A Biol Sci Med Sci. 2012; 67:1043–48. https://doi.org/10.1093/gerona/gls009 [PubMed]
  • 44. Acosta-Rodríguez VA, Rijo-Ferreira F, Green CB, Takahashi JS. Importance of circadian timing for aging and longevity. Nat Commun. 2021; 12:2862. https://doi.org/10.1038/s41467-021-22922-6 [PubMed]
  • 45. Acosta-Rodríguez V, Rijo-Ferreira F, Izumo M, Xu P, Wight-Carter M, Green CB, Takahashi JS. Circadian alignment of early onset caloric restriction promotes longevity in male C57BL/6J mice. Science. 2022; 376:1192–202. https://doi.org/10.1126/science.abk0297 [PubMed]
  • 46. Li Y, Tan Y, Zhao Z. Impacts of aging on circadian rhythm and related sleep disorders. Biosystems. 2024; 236:105111. https://doi.org/10.1016/j.biosystems.2023.105111 [PubMed]
  • 47. Marcozzi S, Bigossi G, Giuliani ME, Lai G, Bartozzi B, Balietti M, Casoli T, Orlando F, Amoroso A, Giacconi R, Cardelli M, Piacenza F, Lattanzio F, et al. A Novel Cognitive Frailty Index for Geriatric Mice. Aging Cell. 2025; 24:e70056. https://doi.org/10.1111/acel.70056 [PubMed]
  • 48. Dan X, Yang B, McDevitt RA, Gray S, Chu X, Claybourne Q, Figueroa DM, Zhang Y, Croteau DL, Bohr VA. Loss of smelling is an early marker of aging and is associated with inflammation and DNA damage in C57BL/6J mice. Aging Cell. 2023; 22:e13793. https://doi.org/10.1111/acel.13793 [PubMed]
  • 49. Collinge CW, Razzoli M, Mansk R, McGonigle S, Lamming DW, Pacak CA, van der Pluijm I, Niedernhofer L, Bartolomucci A. The mouse Social Frailty Index (mSFI): a novel behavioral assessment for impaired social functioning in aging mice. Geroscience. 2025; 47:85–107. https://doi.org/10.1007/s11357-024-01263-4 [PubMed]