Research output: Contribution to journal › Article

Journal | Computational Statistics |
---|---|

Date | E-pub ahead of print - 5 Nov 2013 |

Date | Published (current) - Jun 2014 |

Issue number | 3-4 |

Volume | 29 |

Number of pages | 25 |

Pages (from-to) | 743-767 |

Early online date | 5/11/13 |

Original language | English |

This paper considers the problem of simple linear regression with interval-censored data. That is, n pairs of intervals are observed instead of the n pairs of precise values for the two variables (dependent and independent). Each of these intervals is closed but possibly unbounded, and contains the corresponding (unobserved) value of the dependent or independent variable. The goal of the regression is to describe the relationship between (the precise values of) these two variables by means of a linear function. Likelihood-based Imprecise Regression (LIR) is a recently introduced, very general approach to regression for imprecisely observed quantities. The result of a LIR analysis is in general set-valued: it consists of all regression functions that cannot be excluded on the basis of likelihood inference. These regression functions are said to be undominated. Since the interval data can be unbounded, a robust regression method is necessary. Hence, we consider the robust LIR method based on the minimization of the residuals' quantiles. For this method, we prove that the set of all the intercept-slope pairs corresponding to the undominated regression functions is the union of finitely many polygons. We give an exact algorithm for determining this set (i.e., for determining the set-valued result of the robust LIR analysis), and show that it has worst-case time complexity O(n^{3}log n. We have implemented this exact algorithm as part of the R package linLIR.

- Exact algorithm, Interval-censored data, Least median of squares, Nonparametric likelihood inference, R package, Robust regression

Find related publications, people, projects, datasets and more using interactive charts.