分層廣義線性模型

在統計學中，分層廣義線性模型（hierarchical generalized linear models (HGLM)）可視為廣義線性模型的推廣。在廣義線性模型中，誤差分量是統計獨立的，^[1] 然而這一假設並非總是成立的。即在有些情況下，誤差項之間有函數關係。分層廣義線性模型允許有不同的誤差分量，誤差分量可以統計相關的，並不必要滿足常態分布。當有不同的聚類存在時，同一聚類中的觀測值是相關的，並且是正相關的。在這種情況下，廣義線性模型是不適用的，忽略這些關聯會引起造成一些問題^[2] 。

概論

模型

在分層模型中，觀測值可進行聚類，並且觀測值的分布不僅由所有聚類的共同結構決定，也由聚類的具體結構決定。於是，模型要引入隨機效應分量，並且不同的聚類有不同的隨機效應分量。設 $y$ 為響應變量， $u$ 為隨機效應， $g$ 為連結函數。在分層廣義線性模型中，需要假設 $y|u$ 和 $u$ 滿足：^[3] $y\mid u\sim \ f(\theta ,\,\phi )$ and $u\sim \ f_{u}(\alpha ).$

線性預測器形式為：

g(E(y))=g(\mu )=\eta =X\beta +v\,

其中， $\mu =E(y)$ ， $\eta =X\beta +v$ ， $v=v(u)$ 為 $u$ 的嚴格單調函數。在分層廣義線性模型中，固定效應為 $\beta$ ，對所有觀測值都相同。隨機分量 $u$ 是不可觀測的，不同聚類對應的隨機分量取值是隨機的。於是，同一聚類的觀測值對應的 $u$ 的取值相同，不同聚類的觀測值對應的 $u$ 的取值也不同。^[2]

可識別性

為了進行參數推斷，有必要保證滿足可識別性（英語：Identifiability）^[4] 。在以上模型中，v是不可識別的，因為

X\beta +v=(X\beta +a)+(v-a)\,

其中 $a$ 為常數。^[3] 要使模型可識別，需要對參數另加約束。約束常加在隨機效應上，比如 $E(v)=0$ 。^[3]

不同分布函數和連結函數的模型

假設不同的分布函數 $y\mid u$ 和 $u$ ，採用不同的連結函數 $g$ 和 ' $v$ ，可以得到不同的模型。另外，廣義線性混合模型（英語：generalized linear mixed model）是分層廣義線性模型的一個特例。在分層廣義線性模型中，隨機效應的分布函數 $u$ 不必要滿足常態分布。如果 $u$ 的分布為常態分布， $v$ 的連結函數為恆等函數，此時的分層廣義線性模型即為廣義線性混合模型。^[3]

$y\mid u$ 和 $u$ 的分布可取為共軛分布，此時分層廣義線性模型有一些特殊的性質，並且易於計算和理解^[3]。比如，如果 $y\mid u$ 的分布為平均值一定的泊松分布， $u$ 的分布為伽瑪分布，並取標準對數連接函數，則此時分層廣義線性模型為泊松共軛分層廣義線性模型。如果 $y\mid u$ 的分布為平均值一定的二項分布， $u$ 的分布為共軛貝塔分布，採用標準羅吉特連接函數，則此時分層廣義線性模型為貝塔共軛模型。另外，廣義線性混合模型其實就是正態共軛分層廣義線性模型。^[3]

常見的模型總結如下：^[5]

Commonly used models
模型名字	y的分布	y 和 u的連接函數	u的分布	u 和 v 的連接函數
正態共軛	常態分布	恆等函數	常態分布	恆等函數
二項共軛	二項分布	羅吉特（英語：logit）	貝塔分布	羅吉特（英語：logit）
泊松共軛	泊松分布	對數	伽瑪分布	對數
伽瑪共軛	伽瑪分布	倒數	逆伽瑪分布	倒數
二項GLMM	二項分布	羅吉特（英語：logit）	常態分布	恆等函數
泊松 GLMM	泊松分布	對數	常態分布	恆等函數
伽瑪 GLMM	伽瑪分布	對數	常態分布	恆等函數

擬合

分層廣義線性模型適用條件是觀測值可歸為不同的聚類。估計函數有兩類：固定效應估計函數和隨機效應估計函數，分別相應於 $\eta =\mathbf {x} {\boldsymbol {\beta }}$ 和 $\mathbf {v(u)}$ 中的參數。有多種方法進行分層廣義線性模型中的參數估計。如果只對固定效應估計函數感興趣，可以採用總體平均模型。如果要推斷個體，就需要估計隨機效應。^[2] 擬合分層廣義線性模型有多種技術。

應用

分層廣義線性模型在實際生活中有諸多應用。

工程

這一模型可用於分析半導體製造中相互關聯的過程形成的負載的層級過程^[6]^[7]。工程師可以應用此模型發現和分析重要的次過程，同時評估這些次過程對最終性能的影響 ^[6]。

商業

市場問題也可以用分層廣義線性模型來分析。研究者應用此模型研究了一國範圍內的消費者，以解決國際市場研究中的嵌套資料結構問題^[8]。

參考文獻

^ Generalized Linear Models. Chapman and Hall/CRC. 1989. ISBN 0-412-31760-5.
^ ^2.0 ^2.1 ^2.2 Agresti, Alan. Categorical Data Analysis. Hoboken, New Jersey: John Wiley & Sons, Inc. 2002. ISBN 0-471-36093-7.
^ ^3.0 ^3.1 ^3.2 ^3.3 ^3.4 ^3.5 Y. Lee and J. A. Nelder. Hierarchical Generalized Linear Models. Journal of the Royal Statistical Society, Series B. 1996,. Vol. 58, No. 4: 619–678. JSTOR 2346105.
^ Allman, Elizabeth S.; Matias; Rhodes; Elizabeth S. Allman, Catherine Matias, and John A. Rhodes. Identifiability of Parameters in Latent Structure Models with Many Observed Variables. The Annals of Statistics. 2009,. Vol. 37, No. 6A, (6A): 3099–3132. Bibcode:2008arXiv0809.5032A. arXiv:0809.5032 . doi:10.1214/09-AOS689.
^ Lars Rönnegård, Xia Shen and Moudud Alam. hglm: A Package for Fitting Hierarchical Generalized Linear Models. The R Journal. Dec 2010,. Vol. 2/2.
^ ^6.0 ^6.1 Naveen Kumar; Christina Mastrangelo; Doug Montgomery. Hierarchical Modeling Using Generalized Linear Models. Quality and Reliability Engineering International. 2011.
^ Chung Kwan Shin; Sang Chan Park. A machine learning approach to yield management in semiconductor manufacturing. International Journal of Production Research. 2000, 38 (17): 4261–4271. doi:10.1080/00207540050205073.
^ Tasoluk, Burcu; Burcu Tasoluk, Cornelia Dröge, Roger J. Calantone. Interpreting interrelations across multiple levels in HGLM models: An application in international marketing research. International Marketing Review. 2011,. Vol. 28 Iss: 1,: pp. 34–56. doi:10.1108/02651331111107099.

[1] Generalized Linear Models. Chapman and Hall/CRC. 1989. ISBN 0-412-31760-5.

[CDA-2] 2.0 ^2.1 ^2.2 Agresti, Alan. Categorical Data Analysis. Hoboken, New Jersey: John Wiley & Sons, Inc. 2002. ISBN 0-471-36093-7.

[paper1996-3] 3.0 ^3.1 ^3.2 ^3.3 ^3.4 ^3.5 Y. Lee and J. A. Nelder. Hierarchical Generalized Linear Models. Journal of the Royal Statistical Society, Series B. 1996,. Vol. 58, No. 4: 619–678. JSTOR 2346105.

[identifiability-4] Allman, Elizabeth S.; Matias; Rhodes; Elizabeth S. Allman, Catherine Matias, and John A. Rhodes. Identifiability of Parameters in Latent Structure Models with Many Observed Variables. The Annals of Statistics. 2009,. Vol. 37, No. 6A, (6A): 3099–3132. Bibcode:2008arXiv0809.5032A. arXiv:0809.5032 . doi:10.1214/09-AOS689.

[5] Lars Rönnegård, Xia Shen and Moudud Alam. hglm: A Package for Fitting Hierarchical Generalized Linear Models. The R Journal. Dec 2010,. Vol. 2/2.

[engg-6] 6.0 ^6.1 Naveen Kumar; Christina Mastrangelo; Doug Montgomery. Hierarchical Modeling Using Generalized Linear Models. Quality and Reliability Engineering International. 2011.

[7] Chung Kwan Shin; Sang Chan Park. A machine learning approach to yield management in semiconductor manufacturing. International Journal of Production Research. 2000, 38 (17): 4261–4271. doi:10.1080/00207540050205073.

[mktresearch-8] Tasoluk, Burcu; Burcu Tasoluk, Cornelia Dröge, Roger J. Calantone. Interpreting interrelations across multiple levels in HGLM models: An application in international marketing research. International Marketing Review. 2011,. Vol. 28 Iss: 1,: pp. 34–56. doi:10.1108/02651331111107099.

[1]

[2]

[3]

[4]

[5]

[6]

[7]

[8]