A "one in 20 rule" has been suggested, indicating the need for
shrinkage of regression coefficients, and a "one in 50 rule" for
stepwise selection with the default
p-value of 5%. Other studies, however, show that the one in ten rule may be too conservative as a general recommendation and that five to nine events per predictor can be enough, depending on the
research question. More recently, a study has shown that the ratio of events per predictive variable is not a reliable statistic for estimating the minimum number of events for estimating a logistic prediction model. Instead, the number of predictor variables, the total sample size (events + non-events) and the events fraction (events / total sample size) can be used to calculate the expected prediction error of the model that is to be developed. One can then estimate the required sample size to achieve an expected prediction error that is smaller than a predetermined allowable prediction error value. The necessary sample size and number of events for model development are then given by the values that meet these requirements. == Other modalities ==