中央研究院統計科學研究所
第二次客座系列專題演講

Generalized Linear Models, Estimating Functions and Multivariate Extensions

梁 賡 義 教授 主講

Professor Kung-Yee Liang, Department of Biostatistics
School of Hygiene & Public Health, The Johns Hopkins University

摘        要

  This century has witnessed the evolution of human diseases that were mainly infectious to chronic in nature. While still battling against the latter, the human population also took note of the re-emergence of infectious diseases in the last two decades. Looking back, it seems fair to state that statistics, as a discipline, has risen to the occasion and has met the statistical challenges whenever they existed in the past decades. To this end, we start by examining the role statistics and statisticians have played in biomedical research aimed at providing better treatment and understanding on the etiology of diseases. This is followed by a brief review of some guiding statistical principles and concepts, some of which are arguably potentially misused.

  We next turn to the issue of correlated data which has drawn a good deal of attention among the statistical community during the past 15 years. Here the notion of correlated data refers to the situation where the conventional independent assumption among observations may be violated. Examples of these kinds from real studies are provided and characterized in terms of underlying scientific objectives. Before addressing the issue of correlated data analysis, we first review methods that have been instrumental when the independent assumption is indeed valid. This includes as special cases generalized linear models, quasi-likelihood and the estimating function approaches. In so doing, we focus on the statistical reasoning that has led one approach to another and on the theoretical underpinning of each approach. Second, we discuss in detail the analysis of discrete data which are commonly observed in biomedical research. Here, examples (e.g., teratological studies) and issues considered lead naturally to the arena of correlated data analysis.

  Finally, we discuss some recent methodological developments for analyzing correlated data. First, three modeling approaches for correlated data, all of which represent multivariate versions of generalized linear models of different flavor, are introduced and contrasted. The emphasis is on the relative merit of interpretations of regression coefficients from these three models. Second, we discuss the pros and cons of the likelihood and semi-parametric inferential procedures along with their connections. Third, we visit the topic of multivariate survival data analysis, which has lately seen considerable renewed interest. Theoretical development and applications of the pseudo-likelihood method to this topic and others will be discussed.

Throughout, a variety of data examples for which we have been directly or indirectly involved will be used for illustrations. For the sake of time, technical derivations will be kept to a minimum. Rather, we focus on the constant feedback between motivation leading to methodological developments and their theoretical justifications on an intuitive ground, whenever possible.


地點:中央研究院生物醫學科學研究所地下一樓B1C演講廳

88年7月12日上午 8:30 - 9:00 報到登記
上午 9:00 - 9:10 開幕致詞
演講場次日期時間講題
Lecture 188年7月12日(星期一) 09:10 – 10:30 am The Interplay between Statistics and Biomedical Science
Lecture 2 11:00 – 12:30 am Correlated Data and Regression Analysis
Lecture 3 02:00 – 03:30 pm Correlated Data and Regression Analysis
Lecture 4 88年7月13日(星期二) 09:00 – 10:30 am Analysis of Binary Data
Lecture 5 11:00 – 12:30 am Analysis of Polytomous and Count Data
Lecture 6 02:00 – 03:30 pm Multivariate Extension I: Statistical Modeling
Lecture 7 88年7月14日(星期三) 09:00 – 10:30 am Multivariate Extension II: Statistical Inference
Lecture 8 11:00 – 12:30 am Multivariate Extension III: Survival Analysis
Lecture 9 02:00 – 03:30 pm General Discussion


OUTLINES OF THE LECTURE

I. The Interplay between Statistics and Biomedical Science

II & III. Correlated Data and Regression Analysis
IV. Analysis of Binary Data
V. Analysis of Polytomous and Count Data
VI. Multivariate Extension I: Statistical Modeling
VII. Multivariate Extension II: Statistical Inference
VIII. Multivariate Extension III: Survival Analysis