RESUMO
Quantile regression allows us to estimate the relationship between covariates and any quantile of the response variable rather than the mean. Recently, several statistical distributions have been considered for quantile modeling. The objective of this study is to provide a new computational package, two biomedical applications, one of them with COVID-19 data, and an up-to-date overview of parametric quantile regression. A fully parametric quantile regression is formulated by first parameterizing the baseline distribution in terms of a quantile. Then, we introduce a regression-based functional form through a link function. The density, distribution, and quantile functions, as well as the main properties of each distribution, are presented. We consider 18 distributions related to normal and non-normal settings for quantile modeling of continuous responses on the unit interval, four distributions for continuous response, and one distribution for discrete response. We implement an R package that includes estimation and model checking, density, distribution, and quantile functions, as well as random number generators, for distributions using quantile regression in both location and shape parameters. In summary, a number of studies have recently appeared applying parametric quantile regression as an alternative to the distribution-free quantile regression proposed in the literature. We have reviewed a wide body of parametric quantile regression models, developed an R package which allows us, in a simple way, to fit a variety of distributions, and applied these models to two examples with biomedical real-world data from Brazil and COVID-19 data from US for illustrative purposes. Parametric and non-parametric quantile regressions are compared with these two data sets.
Assuntos
COVID-19 , Modelos Estatísticos , Brasil , COVID-19/epidemiologia , HumanosRESUMO
Response variables in medical sciences are often bounded, e.g. proportions, rates or fractions of incidence of some disease. In this work, we are interested to study if some characteristics of the population, e.g. sex and race which can explain the incidence rate of colorectal cancer cases. To accommodate such responses, we propose a new class of regression models for bounded response by considering a new distribution in the open unit interval which includes a new parameter to make a more flexible distribution. The proposal is to obtain compound power normal distribution as a base distribution with a quantile transformation of another family of distributions with the same support and then is to study some properties of the new family. In addition, the new family is extended to regression models as an alternative to the regression model with a unit interval response. We also present inferential procedures based on the Bayesian methodology, specifically a Metropolis-Hastings algorithm is used to obtain the Bayesian estimates of parameters. An application to real data to illustrate the use of the new family is considered.
Assuntos
Neoplasias Colorretais , Teorema de Bayes , Neoplasias Colorretais/epidemiologia , Humanos , Incidência , Distribuição NormalRESUMO
New generators are required to define wider distributions for modeling real data in survival analysis. To that end we introduce the four-parameter generalized beta-generated Lindley distribution. It has explicit expressions for the ordinary and incomplete moments, mean deviations, generating and quantile functions. We propose a maximum likelihood procedure to estimate the model parameters, which is assessed through a Monte Carlo simulation study. We also derive an additional estimation scheme by means of least square between percentiles. The usefulness of the proposed distribution to describe remission times of cancer patients is illustrated by means of an application to real data.