Construction and validation of a prediction model for hyperuricemia am

Background

Serum uric acid (SUA), which results from purine metabolism, is normally filtered by the kidneys and excreted in urine.1 However, hyperuricemia (HUA) occurs when SUA production exceeds renal excretion capacity or when impaired kidney clearance reduces uric acid output. These mechanisms elevate serum SUA concentrations.2 HUA is closely related to gout, and asymptomatic HUA can lead to the deposition of urate in the joints of patients and even bone erosion. Studies had found that asymptomatic HUA leading to gout was a continuous pathological process.3 Epidemiological evidence has demonstrated that HUA is both a critical contributor to gout pathogenesis and significantly associated with malignancies. According to a survey conducted in Chinese population, the overall prevalence of HUA among adults was 11.1% during 2015–2019, while the prevalence rate had risen to 14% by 2019.4 Additionally, a national health and nutrition survey conducted in the United States showed that the risk of death from HUA was similar to that of diabetes.5 HUA had become a serious risk factor for public health that could not be ignored.

The Framingham Heart Study indicated that SUA levels in men remain stable after puberty, whereas the SUA levels gradually increase after middle age in women.6 During perimenopause, declining ovarian function reduces estrogen levels. This impairs uric acid excretion, consequently elevating SUA concentrations.7 According to epidemiological studies, the prevalence of HUA increases with age. One study found that the prevalence of HUA among women was around 11%.8 A Meta-analysis indicated that the prevalence of HUA among Chinese perimenopausal women reached 13%, and it continued to rise as women enter menopause.9 This could potentially lead to various health issues in this demographic. Additionally, the hormonal changes make women more prone to obesity and metabolic syndrome during perimenopause, which could further exacerbate elevated SUA levels and consequently contribute to HUA.10

Previous studies showed that HUA was closely linked to the development and progression of cardiovascular disease, diabetes, and other diseases.11 Therefore, perimenopausal women should pay more attention to the changes in serum uric acid levels and take some measures to prevent HUA. According to one study, 88% of women experience menopause at a mean age of 51.4 years, with over 800 million women worldwide currently in this life stage.12 Therefore, predicting the prevalence of HUA in this group is crucial for their sustainable health management.

Currently, there were relatively few reports on the risk factors and predictive models for HUA in perimenopausal women. Eljaaly et al found that HUA was linked to SCR, high-density lipoprotein, triglycerides, hip circumference, total cholesterol.13 Zhang et al explored the risk factors for elevated serum uric acid (SUA) levels in elderly individuals and constructed a predictive model, which identified a SUA level ≥ 360 μmol/L as a common risk factor for both men and women.14 Therefore to advance clinical practice for perimenopausal women’s health promotion, we aimed to identify risk factors for HUA in perimenopausal women and to construct and validate a nomogram model for clinical risk prediction.

Methods

Study Population

The study was conducted in accordance with the provisions of the Declaration of Helsinki, and approval was granted by the First Affiliated Hospital of the University of Science and Technology of China (2025-RE-191). Informed consent was also obtained from all patients prior to the study. We collected clinical data from perimenopausal women who had completed standardized health evaluations at USTC First Affiliated Hospital’s Health Management Center.

The determination of HUA was based on a reasonable standard: the SUA level was equal to more than 360 mmol/L.15 The exclusion criteria included: (1) younger than 45 years old or older than 55 years old; (2) duplicated clinical data; (3) Severe hepatic and renal dysfunction; (4) currently suffering from malignant tumors.

We collected information from 6225 physical examinees. After excluding 512 records with missing medical data and 28 duplicate records, we retained records for 5685 examinees. Further excluding 46 individuals with abnormal liver and kidney function and 2 patients with malignant tumors, a total of 5637 examinees met the inclusion criteria. In this study, we collected a total of 28 different variables, according to the 10 EPV (Events Per Variable) rule,16 the number of positive samples should exceed 270. With a total of 733 positive samples among all study subjects, the sample size meets the requirement for model development. These were randomly divided into a model training set of 3945 individuals and a validation set of 1692 individuals. The flowchart is shown in Figure 1.

Figure 1 The process of determining research subjects.

Data Collection

The data for this study were sourced from the participants’ medical records, including the following information: (1) Demographic characteristics – Age; Gender; Body Mass Index (BMI), (2) Relevant medical history – HUA, fatty liver, (3) Renal function tests: Blood Urea Nitrogen (BUN); Serum Creatinine (SCR), (4) Liver function tests: Alanine Aminotransferase (ALT); Aspartate Aminotransferase (AST); Alkaline Phosphatase (ALP); Total Protein (TP); Albumin (ALB); Globulin (GLB), (5) Random blood glucose (GLU), (6) Blood lipid tests: High-density lipoprotein (HDL); Low-density lipoprotein (LDL); Very Low-Density Lipoprotein (VLDL); Total Cholesterol (TC); Triglyceride (TG), (7) Blood cell tests: White Blood Cell Count (WBC); Red Blood Cell Count (RBC); Hemoglobin (HGB); Platelet count (PLT); Eosinophils percentage (Eos%); Basophils percentage (Baso%); Lymphocyte Percentage (LY%); Neutrophil percentage (NEUT%), (8) Routine Urine index: Urine pH (UPH); Urine Specific Gravity (SG).

Statistical Analysis

Statistical analysis was performed via R 4.4.2 and SPSS 26.0. For Gaussian distribution was used to represent the data, for non-normally distributed data, the median and interquartile range (IQR) were used for representation; For categorical variables, we reported frequencies and proportions. The entire study population was random divided into a model training set and a model validation set at a ratio of 7:3. Continuous variables were analyzed using Student’s t-test (for normally distributed data) or the Mann–Whitney U-test (for nonparametric data), while categorical variables were assessed with the chi-square test. Since the study mainly selected hematological examination results as independent variables, which had strong collinearity, we used the least absolute shrinkage and selection operator (Lasso)17 and binary logistic regression were employed for feature variable screening, and employed Decision Curve Analysis (DCA) to evaluate the maximum net benefit of the prediction model. To visualize the prediction model, we presented it using a nomogram and evaluated its performance through Receiver Operating Characteristic (ROC) analysis and the Area Under the Curve (AUC). All analyses used two-tailed tests, with statistical significance set at P < 0.05. We utilized R packages “glmnet” and “rms” to perform Lasso regression and construct a nomogram model.

Result

Baseline Characteristics

In the group of 5637 perimenopausal women who met the research criteria, a total of 733 individuals suffered from HUA, the prevalence rate of HUA among all participants was 13%. There were statistically significant differences between non-HUA group and HUA group in terms of AGE, BMI, renal function (BUN, SCR), liver function (ALT, AST, ALP, TP, ALB, GLB), blood lipid (HDL, TC, VLDL, LDL, TG), blood cell (WBC, RBC, HGB, PLT, Eos), fasting blood glucose, and medical history (hypertension, fatty liver) (P< 0.05) (Table 1).

Table 1 Baseline Characteristics of HUA Group and Non-HUA Group

The training set comprised 3945 participants, while the validation set included 1692 participants. No statistically significant differences were observed in baseline characteristics or clinical features between the two groups (Table 2).

Table 2 Baseline Characteristics of Training Set and Validation Set

The Related Risk Factors of HUA

Variable selection was performed on the training data through Lasso regression coupled with tenfold cross-validation (Figure 2), where the final model was chosen based on the one standard error criterion, the λ was taken. Finally, thirteen indicators were screened, including BMI, UPH, ALP, WBC, SCR, HDL, LDL, HGB, ALT, TP, TG, hypertension, fatty liver. Subsequently, the above indicators were included in the binary logistic regression analysis, and the results were shown in Table 3. Finally, we found BMI (P<0.001), UPH (P=0.002), ALP (P<0.001), WBC (P=0.011), HDL (P<0.001), LDL (P<0.001), SCR (P<0.001), ALT (P=0.003), TP (P=0.005) and fatty liver (P<0.001) were recognized as independent risk factors for HUA in perimenopausal women.

Table 3 Binary Logistic Regression Analysis of the Risk Factors for HUA in Perimenopausal Women

Figure 2 Lasso regression results. (A) Lasso coefficient profiles of the variables. (B) Demonstrates the process of selecting the optimal parameter (lambda) in Lasso.

Constructing and Evaluating a Prediction Model for HUA in Perimenopausal Women

From the analysis of Lasso-logistic regression, we created a nomogram prediction model (Figure 3). The risk level of the results could be calibrated, and we could obtain the total score for the probability of a certain outcome event by combining them. An increased total score was associated with a higher risk of HUA. The Bootstrap method was used to validate the nomogram by resampling 1000 times for internal validation of the model, and calibration curves were plotted for both the training and validation sets (Figure 4). The Lasso-logistic method had good performance in predicting HUA in perimenopausal women.

Figure 3 Nomogram for predicting the risk of HUA in perimenopausal women.

Figure 4 The calibration plot for nomogram. (A) Calibration plot for the accuracy of the training set model. (B) Calibration plot for the accuracy of the validation set model.

The AUC of the prediction model in training set was 0.819 (95% CI: 0.801–0.838), and the AUC of the prediction model in validation set was 0.787 (95% CI: 0.756–0.818) (Figure 5A). As shown in Figure 5B, the DCA depicted was utilized to ascertain the maximum net benefit of the predictive model.

Figure 5 (A) ROC curve of the risk factor for predicting HUA in perimenopausal women. (B) DCA curve of the predictive model.

Stratified Analysis

In this study, we further performed a stratified analysis by BMI. The perimenopausal women were categorized into two groups according to their BMI: the women with BMI ≥ 25 and the women with BMI < 25.18 Then Lasso regression was used to screen the significant variables, and binary logistic regression analysis was performed in the two groups, and nomogram models were respectively established.

In the population with BMI < 25.0, the prevalence of HUA was 9.4%, while in the population with BMI ≥ 25.0, the prevalence of HUA was 24.3%. In the population with BMI < 25.0, the occurrence of HUA was associated with the following factors: BMI (P<0.001), SCR (P<0.001), HDL (P=0.001), LDL (P<0.001), ALT (P<0.001), TP (P=0.006), TG (P<0.001), Fatty liver (P<0.001) (Table 4 and Figure 6A). In the population with BMI ≥ 25.0, the development of HUA was influenced by the following factors: BMI (P=0.004), ALP (P=0.004), WBC (P=0.006), SCR (P<0.001), HDL (P=0.001), ALT (P=0.023), Hypertension (P=0.034), Fatty liver (P<0.001) (Table 5 and Figure 6B). As presented in Figure 7, the predictive model had an AUC of 0.765 in the population with BMI ≥ 25.0 and an AUC of 0.793 in the population with BMI < 25.0.

Table 4 Binary Logistic Regression Analysis of the Risk Factors for HUA in Perimenopausal Women with BMI < 25.0

Table 5 Binary Logistic Regression Analysis of the Risk Factors for HUA in Perimenopausal Women with BMI ≥ 25.0

Figure 6 (A) Nomogram model for predicting the risk of HUA with BMI < 25. (B) Nomogram model for predicting the risk of HUA with BMI ≥ 25.

Figure 7 ROC curve to evaluate two models.

Discussion

At present, HUA has become a major threat to public health, so it is of great significance to investigate the influencing factors of HUA and then construct a prediction model. In this study, we found that the independent risk factors for HUA were BMI, UPH, ALP, WBC, HDL, LDL, SCR, ALT, TP, and history of fatty liver disease in perimenopausal women through Lasso-logistic regression analysis. We further developed HUA prediction model using these factors in perimenopausal women, achieving an AUC of 0.787. Subsequently, we constructed BMI-stratified prediction models: for individuals with BMI < 25, the model AUC was 0.793; for those with BMI ≥ 25, AUC reached 0.765.

Previous studies have found that estrogen contributes to promote the excretion of SUA in the body.19 For perimenopausal women, the decline in ovarian function leads to a reduction in estrogen secretion, affecting their SUA levels, thereby increasing the prevalence of HUA in this population. In additional, epidemiological research indicates that the abnormal changes of blood lipid levels, especially triglyceride and HDL, are independent risk factors for HUA. Moreover, insulin resistance caused by obesity and the inhibition of uric acid excretion mediated by adipokines can increase serum uric acid levels.20 Hence, HUA is more prevalent in overweight or obese individuals, which is consistent with the findings of our study.21,22 In our study, the increased levels of ALP, ALT, and HDL were related to the increased risk of HUA, which is similar to the previous findings.23

Based on the results of binary logistic regression, HDL and UPH were identified as protective factors against HUA in perimenopausal women. Previous studies suggest that fluctuations in HDL levels can affect the kidneys, thereby influencing the excretion of uric acid.24,25 Acidic urine might aggravate insulin resistance, leading to increased serum uric acid levels, and increased insulin resistance could further reduce UPH. In the preset study, fatty liver was a significant risk factor for HUA in perimenopausal women. However, the exact mechanism is not yet clear. It is speculated that insulin resistance increases hepatic fat synthesis, promoting the development of fatty liver, which finally leads to disordered purine metabolism and elevated serum uric acid levels.26 We also found that SCR was closely related to HUA, which is contrary to a previous study conducted on the general population.27 This discrepancy might be attributed to the specific physiological stage of the study subjects, and the all subjects were perimenopausal women. Specifically, the estrogen levels in women are relatively stable, therefore having a smaller impact on uric acid or SCR levels, before entering perimenopause. However, a significant increase in SCR levels typically requires a longer period of estrogen deficiency, a pathophysiological process that often occurs after women have fully entered perimenopause. Notably, all participants were already in perimenopause in this study, which might influence the interpretation of these results.

After stratifying by different BMI, we also found significant differences in the risk factors for HUA among perimenopausal women with different BMI. Compared to the group with BMI ≥ 25.0, LDL, TP, and TG were identified as different predictors in the group with BMI < 25.0. The reason for this situation might be that visceral fat accumulation leads to increased release of free fatty acids and disordered secretion of adipokines, resulting in increased hepatic lipoprotein synthesis and reduced uric acid excretion. High LDL levels might reflect a disturbance in reverse cholesterol transport, elevated TP suggests hepatic protein metabolism disorder, and abnormal TG directly participates in the formation of insulin resistance. Together, these three factors constitute the core characteristics of “metabolic obesity”.28 In the group with BMI ≥ 25.0, ALP, WBC, and Hypertension were identified as different predictors, and the combination of these risk factors exhibits a stronger characteristic of systemic inflammation. An elevated ALP level might reflect the progression of non-alcoholic fatty liver disease, an increased WBC count suggests a state of chronic low-grade inflammation, and the synergistic effect between hypertension and HUA might originate from reduced renal blood flow and inhibited uric acid excretion caused by the activation of the renin-angiotensin system.29

Previous studies have identified PPARγ gene, BMI, and gender as significant predictors of hyperuricemia (HUA) when developing prediction models.30 However, applying this model requires individuals to possess a high level of professional knowledge and necessitates consideration of other factors, which presents significant challenges for its promotion and practical use. Furthermore, a Japanese study utilized gut microbiota to predict hyperuricemia.31 Although the model demonstrated strong predictive ability, its requirement for stool sample collection makes it difficult to promote clinically.

However, this study still has several limitations. First, as a single-center study, we lacked external data to validate our model, which might affect the generalizability of our findings. Second, the definition of perimenopausal women did not include hormone level testing for all participants. Finally, the collected information lacked data on subjects’ daily lifestyle factors.

In Summary, our study developed a nomogram model for predicting HUA risk in perimenopausal women using ten distinct clinical indicators (BMI, UPH, ALP, WBC, HDL, LDL, SCR, ALT, TP, and history of fatty liver disease). Furthermore, the performance of this model were proven to be quite effective. These findings provide important clues for enhancing the health management of perimenopausal women.

Data Sharing Statement

The data of this study are available from Dr. Tian-Ping Zhang upon reasonable request.

Ethics Statement

This study was approved by the Ethical Committee of the First Affiliated Hospital of USTC (2025-RE-191).

Author Contributions

All authors made a significant contribution to the work reported, whether that is in the conception, study design, execution, acquisition of data, analysis and interpretation, or in all these areas; took part in drafting, revising or critically reviewing the article; gave final approval of the version to be published; have agreed on the journal to which the article has been submitted; and agree to be accountable for all aspects of the work.

Disclosure

Yu-Fei Liu and Xiao-Jing Li are co-first authors for this study. The authors have no conflicts of interest in this work.

References

1. Fathallah-Shaykh SA, Cramer MT. Uric acid and the kidney. Pediatr Nephrol. 2014;29(6):999–1008. doi:10.1007/s00467-013-2549-x

2. Yanai H, Adachi H, Hakoshima M, Katsuyama H. Molecular biological and clinical understanding of the pathophysiology and treatments of hyperuricemia and its association with metabolic syndrome, cardiovascular diseases and chronic kidney disease. Int J Mol Sci. 2021;22(17):9221. doi:10.3390/ijms22179221

3. Dalbeth N, House ME, Aati O, et al. Urate crystal deposition in asymptomatic hyperuricaemia and symptomatic gout: a dual energy CT study. Ann Rheum Dis. 2015;74(5):908–911. doi:10.1136/annrheumdis-2014-206397

4. Zhang M, Zhu X, Wu J, et al. Prevalence of hyperuricemia among chinese adults: findings from two nationally representative cross-sectional surveys in 2015–16 and 2018–19. Front Immunol. 2022;12:791983. doi:10.3389/fimmu.2021.791983

5. Chen PH, Chen YW, Liu WJ, Hsu SW, Chen CH, Lee CL. Approximate mortality risks between hyperuricemia and diabetes in the United States. J Clin Med. 2019;8(12):2127. doi:10.3390/jcm8122127

6. Culleton BF, Larson MG, Kannel WB, Levy D. Serum uric acid and risk for cardiovascular disease and death: the Framingham Heart Study. Ann Intern Med. 1999;131(1):7–13. doi:10.7326/0003-4819-131-1-199907060-00003

7. Lega IC, Jacobson M. Perimenopause. CMAJ. 2024;196(34):E1169. doi:10.1503/cmaj.240337

8. Huang J, Ma ZF, Zhang Y, et al. Geographical distribution of hyperuricemia in mainland China: a comprehensive systematic review and meta-analysis. Glob Health Res Policy. 2020;5(1):52. doi:10.1186/s41256-020-00178-9

9. Li Y, Shen Z, Zhu B, Zhang H, Zhang X, Ding X. Demographic, regional and temporal trends of hyperuricemia epidemics in mainland China from 2000 to 2019: a systematic review and meta-analysis. Glob Health Action. 2021;14(1):1874652. doi:10.1080/16549716.2021.1874652

10. Brinton RD, Yao J, Yin F, Mack WJ, Cadenas E. Perimenopause as a neurological transition state. Nat Rev Endocrinol. 2015;11(7):393–405. doi:10.1038/nrendo.2015.82

11. Li W, Wang Y, Ouyang S, et al. Association between serum uric acid level and carotid atherosclerosis and metabolic syndrome in patients with type 2 diabetes mellitus. Front Endocrinol. 2022;13:890305. doi:10.3389/fendo.2022.890305

12. Harlow SD, Gass M, Hall JE, et al. Executive summary of the stages of reproductive aging workshop + 10: addressing the unfinished agenda of staging reproductive aging. J Clin Endocrinol Metab. 2012;97(4):1159–1168. doi:10.1210/jc.2011-3362

13. Eljaaly Z, Mujammami M, Nawaz SS, Rafiullah M, Siddiqui K. Risk predictors of high uric acid levels among patients with type-2 diabetes. Diabetes Metab Syndr Obes. 2021;14:4911–4920. doi:10.2147/DMSO.S344894

14. Zhang D, Xu X, Ye Z, Zhang Z, Xiao J. One-year risk prediction of elevated serum uric acid levels in older adults: a longitudinal cohort study. Clin Interv Aging. 2024;19:1951–1964. doi:10.2147/CIA.S476806

15. Mei CL, Ge JB, Zou HJ, et al. Multi-disciplinary expert task force on hyperuricemia and its related diseases. Zhonghua Nei Ke Za Zhi. 2017;56(3):235–248. doi:10.3760/cma.j.issn.0578-1426.2017.03.021

16. Dhiman P, Ma J, Qi C, et al. Sample size requirements are not being considered in studies developing prediction models for binary outcomes: a systematic review. BMC Med Res Methodol. 2023;23(1):188. doi:10.1186/s12874-023-02008-1

17. Xie QY, Wang MW, Hu ZY, et al. Screening the influence of biomarkers for metabolic syndrome in occupational population based on the Lasso algorithm. Front Public Health. 2021;9:743731. doi:10.3389/fpubh.2021.743731

18. Wu Y, Li D, Vermund SH. Advantages and limitations of the Body Mass Index (BMI) to assess adult obesity. Int J Environ Res Public Health. 2024;21(6):757. doi:10.3390/ijerph21060757

19. JJung JH, Song GG, Lee YH, Kim JH, Hyun MH, Choi SJ. Serum uric acid levels and hormone therapy type: a retrospective cohort study of postmenopausal women. Menopause. 2018;25(1):77–81. doi:10.1097/GME.0000000000000953

20. Chen Q, Xiao J, Zhang P, Chen L, Chen X, Wang S. Lower serum levels of uric acid in uterine fibroids and fibrocystic breast disease patients in Dongying City, China. Iran J Public Health. 2016;45(5):596–605.

21. Ni Q, Lu X, Chen C, Du H, Zhang R. Risk factors for the development of hyperuricemia: a STROBE-compliant cross-sectional and longitudinal study. Medicine. 2019;98(42):e17597. doi:10.1097/MD.0000000000017597

22. Qi D, Liu J, Wang C, et al. Sex-specific differences in the prevalence of and risk factors for hyperuricemia among a low-income population in China: a cross-sectional study. Postgrad Med. 2020;132(6):559–567. doi:10.1080/00325481.2020.1761133

23. Redon P, Maloberti A, Facchetti R, et al. Gender-related differences in serum uric acid in treated hypertensive patients from central and east European countries: findings from the Blood Pressure control rate and CArdiovascular Risk profilE study. J Hypertens. 2019;37(2):380–388. doi:10.1097/HJH.0000000000001908

24. Almuqrin A, Alshuweishi YA, Alfaifi M, Daghistani H, Al-Sheikh YA, Alfhili MA. Prevalence and association of hyperuricemia with liver function in Saudi Arabia: a large cross-sectional study. Ann Saudi Med. 2024;44(1):18–25. doi:10.5144/0256-4947.2024.18

25. Lee MJ, Khang AR, Kang YH, Yun MS, Yi D. Synergistic interaction between hyperuricemia and abdominal obesity as a risk factor for metabolic syndrome components in Korean population. Diabetes Metab J. 2022;46(5):756–766. doi:10.4093/dmj.2021.0166

26. Kawachi K, Kataoka H, Manabe S, Mochizuki T, Nitta K. Low HDL cholesterol as a predictor of chronic kidney disease progression: a cross-classification approach and matched cohort analysis. Heart Vessels. 2019;34(9):1440–1455. doi:10.1007/s00380-019-01375-4

27. Abudureyimu P, Pang Y, Huang L, et al. A predictive model for hyperuricemia among type 2 diabetes mellitus patients in Urumqi, China. BMC Public Health. 2023;23(1):1740. doi:10.1186/s12889-023-16669-6

28. Chihara Y, Wakabayashi I, Kataoka Y, Yamamoto T. Serum creatinine is more strongly associated with hyperuricemia than eGFR in males but not in females. Mod Rheumatol. 2025;35(2):378–385. doi:10.1093/mr/roae083

29. Shirasawa T, Ochiai H, Yoshimoto T, et al. Cross-sectional study of associations between normal body weight with central obesity and hyperuricemia in Japan. BMC Endocr Disord. 2020;20(1):2. doi:10.1186/s12902-019-0481-1

30. Lee MF, Liou TH, Wang W, et al. Gender, body mass index, and PPARγ polymorphism are good indicators in hyperuricemia prediction for Han Chinese. Genet Test Mol Biomarkers. 2013;17(1):40–46. doi:10.1089/gtmb.2012.0231

31. Miyajima Y, Karashima S, Mizoguchi R, et al. Prediction and causal inference of hyperuricemia using gut microbiota. Sci Rep. 2024;14(1):9901. doi:10.1038/s41598-024-60427-6

Continue Reading