Generalized linear models
In the standard linear model, we did not make any assumptions about the distribution of Y, though in some cases we can gain better estimates if we know that Y is, for example restricted to non-negative integers 0,1,2,…, or restricted to the interval [0,1]. A framework for analyzing such cases is referred to as generalized linear models, commonly abbreviated as GLMs. The two key components of the GLM are the link function and a probability distribution. The link function g connects our familiar matrix product Xβ to the Y values through:
E(Y)=g−1(Xβ)
There is a function in base R for fitting GLMs, which is glm and uses a familiar form as lm, with additional arguments including family which specifies the distributional assumption of Y. Some examples of the use of GLMs are shown at the Quick R website. There are a number of references for GLMs on the Wikipedia page.
|