GLORIA — GEOMAR Library Ocean Research Information Access

Hits per page

hits 1 - 2 | 2 hits

Sorting

Online Resource

Mixture density networks for the indirect estimation of reference intervals

Hepp, Tobias ; Zierk, Jakob ; Rauh, Manfred ; [et al.]

Springer Science and Business Media LLC ; 2022

In: BMC Bioinformatics Vol. 23, No. 1 ( 2022-12)

add to mindlist on the mindlist

Details

In: BMC Bioinformatics, Springer Science and Business Media LLC, Vol. 23, No. 1 ( 2022-12)

Abstract: Reference intervals represent the expected range of physiological test results in a healthy population and are essential to support medical decision making. Particularly in the context of pediatric reference intervals, where recruitment regulations make prospective studies challenging to conduct, indirect estimation strategies are becoming increasingly important. Established indirect methods enable robust identification of the distribution of “healthy” samples from laboratory databases, which include unlabeled pathologic cases, but are currently severely limited when adjusting for essential patient characteristics such as age. Here, we propose the use of mixture density networks (MDN) to overcome this problem and model all parameters of the mixture distribution in a single step. Results Estimated reference intervals from varying settings with simulated data demonstrate the ability to accurately estimate latent distributions from unlabeled data using different implementations of MDNs. Comparing the performance with alternative estimation approaches further highlights the importance of modeling the mixture component weights as a function of the input in order to avoid biased estimates for all other parameters and the resulting reference intervals. We also provide a strategy to generate partially customized starting weights to improve proper identification of the latent components. Finally, the application on real-world hemoglobin samples provides results in line with current gold standard approaches, but also suggests further investigations with respect to adequate regularization strategies in order to prevent overfitting the data. Conclusions Mixture density networks provide a promising approach capable of extracting the distribution of healthy samples from unlabeled laboratory databases while simultaneously and explicitly estimating all parameters and component weights as non-linear functions of the covariate(s), thereby allowing the estimation of age-dependent reference intervals in a single step. Further studies on model regularization and asymmetric component distributions are warranted to consolidate our findings and expand the scope of applications.

Type of Medium: Online Resource

ISSN: 1471-2105

URL: Article

DOI: 10.1186/s12859-022-04846-0

Language: English

Publisher: Springer Science and Business Media LLC

Publication Date: 2022

detail.hit.zdb_id: 2041484-5

SSG: 12

Permalink

	Location	Call Number	Limitation	Availability

Others were also interested in ...

Online Resource

Link to publisher

Online Resource

Latent class distributional regression for the estimation of non-linear reference limits from contaminated data sources

Hepp, Tobias ; Zierk, Jakob ; Rauh, Manfred ; [et al.]

Springer Science and Business Media LLC ; 2020

In: BMC Bioinformatics Vol. 21, No. 1 ( 2020-12)

add to mindlist on the mindlist

Details

In: BMC Bioinformatics, Springer Science and Business Media LLC, Vol. 21, No. 1 ( 2020-12)

Abstract: Medical decision making based on quantitative test results depends on reliable reference intervals, which represent the range of physiological test results in a healthy population. Current methods for the estimation of reference limits focus either on modelling the age-dependent dynamics of different analytes directly in a prospective setting or the extraction of independent distributions from contaminated data sources, e.g. data with latent heterogeneity due to unlabeled pathologic cases. In this article, we propose a new method to estimate indirect reference limits with non-linear dependencies on covariates from contaminated datasets by combining the framework of mixture models and distributional regression. Results Simulation results based on mixtures of Gaussian and gamma distributions suggest accurate approximation of the true quantiles that improves with increasing sample size and decreasing overlap between the mixture components. Due to the high flexibility of the framework, initialization of the algorithm requires careful considerations regarding appropriate starting weights. Estimated quantiles from the extracted distribution of healthy hemoglobin concentration in boys and girls provide clinically useful pediatric reference limits similar to solutions obtained using different approaches which require more samples and are computationally more expensive. Conclusions Latent class distributional regression models represent the first method to estimate indirect non-linear reference limits from a single model fit, but the general scope of applications can be extended to other scenarios with latent heterogeneity.

Type of Medium: Online Resource

ISSN: 1471-2105

URL: Article

DOI: 10.1186/s12859-020-03853-3

Language: English

Publisher: Springer Science and Business Media LLC

Publication Date: 2020

detail.hit.zdb_id: 2041484-5

SSG: 12

Permalink

	Location	Call Number	Limitation	Availability

Others were also interested in ...

Online Resource

Link to publisher

hits 1 - 2 | 2 hits