Introduction

Along with thorough research, fatty liver disease was recognized to be associated with metabolic dysfunction, and the all-cause mortality of patients with fatty liver disease was much higher than that of the general population [1]. Hence, a new concept, metabolic dysfunction-associated fatty liver disease (MAFLD), was proposed to replace the non-alcoholic fatty liver disease (NAFLD) [2]. The diagnosis of hepatic steatosis (HS) is the first step in both NAFLD and MAFLD. The expert consensus on MAFLD recommended three approaches for the diagnosis of HS: imaging techniques, blood biomarkers/scores, or liver histology.

Fatty liver index (FLI) is a most acknowledged blood biomarker to diagnose HS at present, but no specific COPs were given. FLI is a simple algorithm developed by Bedogni et al. [3] for the prediction of fatty liver in the general population, which is composed of four components: waist circumference (WC), body mass index (BMI), triglycerides (TG), and gamma-glutamyl transferase (GGT). In Bedogni’s studies [3], they concluded that an FLI < 30 can be used to rule out and an FLI ≥ 60 could be used to rule in fatty liver, respectively. However, this COP of FLI for diagnosis of HS was inaccurate due to the absence of stratification of gender and age, although these factors were not considered predictors of fatty liver when FLI was created. Previous studies demonstrated that age and sex were important modifiers of FLI variability [4, 5]. Furthermore, the prevalence of fatty liver was significantly higher in men than that in women with reproductive age, and male gender is a risk factor for fatty liver [6, 7]. Many factors may contribute to gender differences in the incidence of fatty liver, such as waist-to-hip ratio, estrogen, abdominal fat distribution, etc. [8, 9]. Our preliminary study showed significant difference of COPs of FLI for HS diagnosis between Chinese males (37.25: sensitivity = 81.23, AUROC = 0.856) and females (17.00: sensitivity = 85.94, AUROC = 0.909), respectively [10]. In addition to gender stratification, we further provided Asian population-specific COPs of FLI for the diagnosis of HS with stratification of WC and BMI [11]. The COPs of FLI for diagnosing HS varied with different genders, WC and BMI, and so, these factors should be taken into consideration when using FLI to diagnose HS.

The COPs of FLI depend on the results of ultrasonography, a noninvasive first-line modality for the diagnosis of HS. Liver ultrasound transient elastography (LUTE), a new ultrasound technique, could simultaneously measure the ultrasound attenuation related to the presence of HS and records the controlled attenuation parameter (CAP) as the indicator for the fatness in the liver. CAP obtained from LUTE is a numerical value that has been proved to be significantly in line with the percentage of steatosis and the histological degree of steatosis [12,13,14,15]. Therefore, the diagnosis of HS by LUTE is much more objective and sensitive than conventional ultrasonography, which contributes to early detection and treatment of HS. In this study, we aimed to obtain the population-specific COPs of FLI based on both conventional ultrasonography and LUTE.

The National Health and Nutrition Examination Survey (NHANES) is a program of studies designed to assess the health and nutritional status of adults and children in the United States. Findings from this survey can be used to assess the health status and disease spectrum of the American population, and used in epidemiological studies and health sciences researches. According to data from the NHANES, in this study, we aimed to obtain the population-specific COPs (stratified by gender, WC and BMI) of FLI based on both conventional ultrasonography and LUTE, so as to better apply FLI for the diagnosis of HS in Americans.

Materials and methods

Study design and participants

All study data were collected from the American NHANES (https://www.cdc.gov/nchs/nhanes/index.htm). Participants who underwent LUTE (FibroScan®) (n = 5948) in NHANES 2017–2018 and conventional hepatic ultrasonography (n = 14,979) in NHANES III were enrolled. Invalid or missing data were excluded, and all data were double-checked. Exclusion criteria included: (1) age under 18 years old; (2) without key covariates: BMI, WC, TG and GGT; (3) confidence in HS assessment was “None” or “Doubtful”. After exclusion, the calculation of FLI and determination of COPs were performed on 4633 qualified participants in NHANES 2017–2018 (Additional file 1: Fig. S1) and 9214 participants in NHANES III (Additional file 1: Fig. S2).

Index calculation

FLI was calculated by BMI (kg/m2), WC(cm), TG(mg/dL) and GGT(U/L) of these subjects based on the algorithm [FLI = (e0.953*loge (triglycerides)+0.139*BMI+0.718*loge(GGT)+0.053*WC−15.745)/(1 + e0.953*loge(triglycerides)+0.139*BMI+0.718*loge(GGT)+0.053*WC−15.745) *100] [3]. Taking the HS diagnosed by ultrasonography as the reference standard, the optimal COPs of FLI were determined in a receiver operating characteristics (ROC) curve analysis by maximizing the Youden index. Homeostatic model assessment for insulin resistance (HOMA-IR) calculated by fasting glucose in mmol/L times insulin in μU/mL divided by 22.5 [16].

Definitions and subgroups

In NHANES 2017–2018, the presence of HS was determined by LUTE with FibroScan® (EchosensTM North America), and the categorized assessment of HS based on CAP encompassed normal (CAP < 248), mild (248 ≤ CAP < 268), moderate(268 ≤ CAP < 280) and severe (CAP ≥ 280) [17]. HS assessed by hepatic ultrasonography (Toshiba Sonolayer SSA-90A) in NHANES III was also reported as normal, mild, moderate, or severe. Moreover, all mild to severe HS were considered as HS, regardless of the method used. Based on the BMI and WC criteria for Caucasians made by World Health Organization (WHO), BMI was divided into four groups: underweight (< 18.5 kg/m2), normal (≥ 18.5 kg/m2 and < 25.0 kg/m2), overweight (≥ 25.0 kg/m2 and < 30.0 kg/m2) and obese (≥ 30.0 kg/m2), and WC was divided into normal (< 88 cm in female and < 102 cm in male) and abnormal (≥ 88 cm in female and ≥ 102 cm in male) groups. The optimal COPs of FLI for HS diagnosis in different sex were determined for different WC and BMI stratification. MAFLD was defined and diagnosed according to the international expert consensus statement released in 2020 [2]. The history of drug use and various metabolic indicators (HDL-cholesterol, fasting glucose, HbA1c, insulin, high-sensitivity C-reactive protein, etc.) involved in the diagnostic criteria of MAFLD were obtained from the database, and hypertension and diabetes were diagnosed according to widely accepted international standards.

Statistical analysis

All the continuous variables were tested by normality testing and described by medians (interquartile range), and categorical variables were described by number (proportions). Due to non-normally distributed covariates, the Mann–Whitney U test was performed to compare continuous variables of MAFLD and Non-MAFLD groups, and the Kruskal–Wallis test was used to test the difference among different degrees of HS (mild, moderate and severe). All the analysis was performed at SPSS 26.0. A two-tailed p value < 0.05 was considered statistically. The cut-off points of FLI and corresponding sensitivity, specificity, positive likelihood ratio, negative likelihood ratio, Youden index and area under the receiver operating characteristic curve (AUROC) were performed with MedCalc version 19.0.7. The Youden index is sensitivity plus specificity minus 1, and the larger the Youden index, the higher the accuracy of diagnosis. The optimal COPs of FLI for HS diagnosis were determined in a ROC analysis by maximizing the Youden index.

Results

Prevalence of HS and baseline characteristics of participants

Finally, a total of 4633 eligible participants in NHANES 2017–2018 and 9214 eligible participants in NHANES III were recruited in this study. In NHANES 2017–2018, the incidence of HS diagnosed by CAP obtained from LUTE in the participants was 58.7% (95% Confidence interval (CI) 57.3–60.1%), with mild, moderate and severe HS accounting for 10.3%, 7.7% and 40.7%, respectively. Overall, the prevalence of HS in males [62.9% (95%CI 60.9–64.9%)] was higher than that in females [54.7% (95%CI 52.7–56.7%), and this difference remained after stratification for BMI and WC. Moreover, with the increase of WC and BMI, the incidence of HS increased significantly. Only 25.86% of participants with normal BMI (≥ 18.5 kg/m2 and < 25.0 kg/m2) had HS, while 81.68% of overweight participants had HS. Participants with abnormal WC were nearly twice as likely as those with normal WC to have HS. The specific incidence of HS at different WC and BMI stratification were shown in Table 3.

Combined with the participants' medical history and laboratory results, 56.2% (95%CI 54.8–57.6%) had MAFLD. Given the incomplete metabolic indicators of some participants, the prevalence of MAFLD should be higher than 56.2%. Participants with MAFLD had larger BMI and WC, and higher cholesterol, TG, fasting glucose, insulin, HbA1c level and lower high-density lipoprotein cholesterol (Table 1). The level of hepatic enzymes such as aspartate aminotransferase (AST) and alanine aminotransferase (ALT) of participants with MAFLD was significantly higher than that of Non-MAFLD, and the more the severity of HS, the higher the liver stiffness (Table 1). Meanwhile, with the exception of ALT, there was no difference between mild and moderate MAFLD, but there were remarkable differences in all metabolic indicators compared with severe MAFLD (Table 1).

Table 1 Baseline characteristic of participants with or without MAFLD in NHANES 2017–2018

In NHANES III, the prevalence of HS and MAFLD was 34.4% (95%CI 33.4–35.3%) and 27.9% (95%CI 27.0–28.8%), respectively. Differences in the prevalence of HS diagnosed by conventional hepatic ultrasonography among various stratifications in NHANES III were similar to NHANES 2017–2018, but much lower than NHANES 2017–2018. And there were statistically significant differences in baseline characteristics among different degrees of HS (Table 2).

Table 2 Baseline characteristic of participants with or without MAFLD in NHANES III

COPs

In NHANES 2017–2018, the optimal COP of FLI for HS diagnosis in overall population was 45.60, with an AUROC of 0.833 (0.822–0.844), sensitivity of 80.85% (79.3–82.3%) and specificity of 70.50% (68.4–72.5%). For males, if FLI was greater than 48.57, HS could be considered with a sensitivity of 81.80% and specificity of 69.78%. And the value for female to diagnosis HS was 41.93 (sensitivity = 80.34%, specificity = 70.89%). There was a significant difference in the optimal COPs of FLI for HS diagnosis between normal (< 88 cm in females, < 102 cm in males) and abnormal WC groups (35.60 vs. 85.04 in males and 14.29 vs. 75.28 in females). Similarly, the COPs of FLI for HS diagnosis differed greatly by BMI. After stratification by WC and BMI, the COPs of FLI for diagnosis of HS were generally lower in females than those in males. Furthermore, with the increase of BMI and WC, the COPs increased gradually with significant difference between different groups. Detailed data were presented in Table 3. In the underweight group, the diagnostic ability of FLI in diagnosing HS was low, or ROC analysis could not be performed due to insufficient data. However, with the exception of those participants with low body weight, the AUROC of FLI was almost all greater than 0.700, suggesting that FLI had an acceptable diagnostic ability for the diagnosis of HS.

Table 3 The cut-off points of FLI stratified by waist circumference and BMI based on the results of liver ultrasound transient elastograpy in NHANES 2017–2018

In NHANES III, the COP of FLI for HS diagnosis in overall participants was 59.54 with an AUROC of 0.681 (0.671–0.691). The COP of FLI for men to diagnose HS was 61.47 (AUROC = 0.706, sensitivity = 55.53%, specificity = 75.15%), and the value for women was 51.65 (AUROC = 0.659, sensitivity = 56.6%, specificity = 71.65%). In NHANES 2017–2018, HS was diagnosed by LUTE, while in NHANES III, it was diagnosed by conventional hepatic ultrasonography. The diagnostic ability of FLI for HS diagnosis in NHANES III was low. Compared with ultrasonography, the COPs of FLI based on LUTE obtained from Fibroscan were much more precise, with higher AUROC, sensitivity and specificity. The population-specific COPs of FLI for HS diagnosis stratified by gender, WC, and BMI were tabulated in Table 4. All COPs were simplistically presented in Tables 5 and 6.

Table 4 The cut-off points of FLI stratified by waist circumference and BMI based on the results of conventional ultrasonography in NHANES III
Table 5 The cut-off points of FLI at waist circumference and BMI stratifications in NHANES 2017–2018
Table 6 The cut-off points of FLI at waist circumference and BMI stratifications in NHANES III

Discussion

In this study, we provided, for the first time, detailed population-specific COPs of FLI for the diagnosis of HS among Americans based not only on the common abdominal ultrasonography but also on CAP obtained from LUTE. And all data analyzed were derived from authoritative NHANES. FLI was first proposed by Bedogni et al. for the prediction of fatty liver among Italians (216 subjects with and 280 without suspected liver disease) [3]. Several studies demonstrated that the FLI had excellent discriminative ability to detect ultrasonographic HS, and outperforms other non-invasive markers such as BMI, WC, TG, cholesterol and so on [18,19,20]. Furthermore, in addition to predicting HS, studies had shown that FLI was also associated with metabolic and cardiovascular disease, and all-cause mortality [21,22,23]. Therefore, FLI has great clinical application value in health screening and epidemiologic studies, particularly when ultrasound and other imaging examinations are unavailable. However, there were no acknowledged COPs of FLI for the diagnosis of HS. Bedogni et al. suggested that a FLI < 30 could be used to rule out (sensitivity = 87%; specificity = 64%) and a FLI ≥ 60 to rule in hepatic steatosis (sensitivity = 61%; specificity = 86%), respectively. Koehler et al. [20] further validated that the COPs (30 and 60) had great diagnostic efficacy for HS through a Rotterdam study of 2652 participants. A cross-sectional study that included 8626 middle-aged and elderly Chinese (over 40 years old) found the optimal COP of FLI for the diagnosis of HS was also 30 with a maximum Youden Index of 0.51 [24].

However, this COP was inaccurate for that gender and race differences are not taken into account. The COPs of the main components of FLI for HS diagnosis, BMI and WC, were different with the variations of gender and ethnicity. Thus, the COPs of FLI for the diagnosis of HS should be different in different populations and need to be validated when used in a different population. Moreover, NAFLD/MAFLD has been proved to be a heterogeneous disease [25, 26], which makes it more necessary to obtain specific COPs of FLI stratified by heterogeneous factors (age, sex, reproductive status, coexistence of different comorbidities, etc.) for better use in different populations. The gender-based optimal COPs of FLI for HS diagnosis proposed by Motamed et al. were 46.9 (sensitivity = 82.42%, specificity = 76.87%) in men and 53.8 (sensitivity = 82.33%, specificity = 76.55%) in women [19]. In contrast, in the Asian population, the COPs of FLI for HS diagnosis were higher for men than for women, which is in line with the results of Dehnavi et al.’s study [27]. A Taiwan study [18] analyzed the ultrasonography and laboratory results of 29,797 healthy subjects. It concluded that an FLI of < 25 for males & < 10 for females could rule out and an FLI of > 35 for males & > 20 for females could rule in ultrasonographic HS, respectively. Our previous study which recruited 135,436 health check-up populations also showed significant difference between Chinese males and females, with COPs of 37.25 (sensitivity = 81.23, AUROC = 0.856) and 17.00 (sensitivity = 85.94, AUROC = 0.909), respectively. In addition to gender difference, the cut-off values of WC and BMI of males and females are also different. Therefore, our team further obtained the population-specific COPs of FLI, at gender, WC and BMI stratifications, for the diagnosis of HS by using the above physical examination data [11]. The results suggested that, apart from gender, BMI and WC also had a great impact on the COPs of FLI [11]. If the stratifications mentioned above are not considered, only using the fixed COP of FLI to diagnose HS may cause missed diagnosis or misdiagnosis.

Nonetheless, the above researches about the COPs of FLI for HS diagnosis were all based on the common abdominal ultrasonography. It is generally believed that conventional ultrasound is not sensitive to mild fatty infiltration, and only more than 30% of fat infiltration can be accurately detected [28,29,30]. But, as technology improves, a recent meta-analysis re-evaluated the accuracy of ultrasound in diagnosing hepatic steatosis. Results showed that the overall sensitivity of ultrasonography to detect ≥ 5% histologically defined HS could reach 82% [31]. Ultrasonography is a relatively subjective detecting technology and the results depend on the operator’s experiences. In recent years, a new liver-specific quantitative measurement method, CAP obtained from LUTE, has been developed for HS. CAP uses standardized (controlled) settings, thereby minimizing user influence on the attenuation value, and it can be assessed by an operator who does not have any ultrasound imaging skills. [32] CAP was significantly correlated with the percentage of steatosis [15] and was able to identify steatosis at early stages (> 11%) [32]. A recent study confirmed that the sensitivity of CAP in diagnosing HS was higher than conventional ultrasonography, but without statistical difference. But the difference in specificity of CAP and ultrasonography when using only echogenicity of liver parenchyma of 29% was significant [33]. Hence, the COPs of FLI based on LUTE maybe more accurate than that based on the common ultrasound in the diagnosis of HS. Dehnavi et al. [27] were the first to utilize the fatty liver identified by LUTE to obtain the COPs of FLI, and calculated the COPs of 26.2, 38.3 and 49.7, respectively, in Grades 1, 2, and 3 of HS. However, this study had a small sample size (n = 212) and did not provide criteria for diagnosing HS by CAP nor did further stratification be performed.

To our knowledge, this is the first study to utilize extensive sample size data of the U.S. to provide detailed COPs of FLI, at gender, WC and BMI stratifications, for the diagnosis of HS based on the results of LUTE. According to the Hosmer and Lemeshow guidelines [34] for evaluating predictive abilities, Our study demonstrated that FLI had the excellent diagnostic capability to detect HS with an AUROC of 0.833 (0.822–0.844), sensitivity of 80.85% (79.3–82.3%) and specificity of 70.50% (68.4–72.5%) in the general population. After men and women were divided into two groups, the AUROC could also remain above 0.8, 0.837 (0.821–0.852) in men and 0.829 (0.813–0.844) in women, respectively. Although the diagnostic power of FLI decreased with more detailed stratification, but was still acceptable (AUROC > 0.700). At the same time, the ultrasonography data from NHANES III (1988–1994) was also used to obtain the COPs of FLI, at gender, WC and BMI stratifications, for the diagnosis of HS. The results demonstrated that the prevalence of HS diagnosed by LUTE was much higher than that diagnosed by the common abdominal ultrasonography, and the COPs obtained by LUTE were all smaller than those obtained by the common abdominal ultrasonography in any stratification, which reflected the higher sensitivity of LUTE in the diagnosis of HS from the side.

However, the limitation that cannot be ignored is that the two surveys were performed in different NHANES samples and spanned a long time. In the past 30 years, in addition to the continuous development of ultrasound detection technology, the incidence of HS has been increasing year by year [35,36,37], which led to the lack of comparability between the two surveys. Another limitation of our study is data processing. NHANES uses a complex, multistage, probability sampling design to select participants representing the civilian, non-institutionalized US population. Therefore, we should have fully considered the sample weights when analyzing NHANES data to account for the complex survey design (including oversampling), survey nonresponse, and post-stratification. In our study, we did not perform weights analysis, and samples with missing data were directly excluded, which made the research results would not be representative of the actual United States. Even so, this study was the first to utilize data from a large sample size cross-sectional survey in the United States to produce detailed FLI COPs with acceptable diagnostic power.

In our study, the COPs of FLI based on CAP obtained from LUTE were much more precise, with higher AUROC, sensitivity and specificity, which suggests that the COPs of FLI for HS diagnosis based on CAP obtained from LUTE was accurate and valuable. Therefore, we recommend to apply the COPs of FLI based on CAP obtained from LUTE in epidemiological investigation and clinical practice when ultrasound is not available. It is worth noting that our study demonstrated the COPs of FLI for the diagnosis of HS varied with different genders, WC and BMI, and so, these factors should be considered when using FLI to diagnose HS. Otherwise, a large number of patients may be missed or misdiagnosed, resulting in a considerable disease burden or waste of medical resources. In this study, relatively accurate and detailed diagnostic COPs of FLI were obtained to apply FLI to clinical practice better. Certainly, further studies are warranted to explore more accurate FLI COPs or other blood biomarkers to diagnose HS.

In conclusion, the present study provided more accurate COPs of FLI based on CAP obtained from LUTE for the diagnosis of HS, and demonstrated that FLI, as a noninvasive, convenient and inexpensive index, has an acceptable ability to predict the occurrence of HS.