# Pubh 8500 final exam | PUBH 8500 – Advanced Biostatistics | Walden University

Answer the following questions. Copy and paste any required data charts or summaries into this Word document. Use additional space as needed. Be sure to include your name on the document and use the file naming convention. This exam is open book and open notes.

I. Two-way ANOVA (15 points)

A researcher is comparing the effects of 2 different asthma drugs on test performance. The researcher suspects that at least one of the drugs may have a different effect on fresh versus tired test takers. Data was collected on test results achieved by fresh and tired test takers after using each of the 2 drugs. There is concern that the exposure to these drugs could impair more than a person’s ability to take a test and therefore become a Public Health issue. The Research Question is: Does Drug A or Drug B impair test performance in either fresh or tired test takers? Please answer this question using the Final Exam – ANOVA (SPSS document) dataset and the following steps:

1. Conduct two-way ANOVA with interaction and post hoc analysis (as appropriate) using Tukey to correct for multiple comparisons. Provide relevant SPSS output. (10 points)

2. Interpret the analysis results in the context of the research question. Include important statistics from your analysis results to support your conclusion and generalize your results, if appropriate, to the relevant population(s). (5 points)

II. Multiple Linear Regression (25 points)

A health department randomly selected 400 subjects from a local community and monitored their cardiovascular condition. Data from this study are provided in the Final Exam – Linear and Logistic (SPSS document) dataset. The following variables are included in the database: sex, age, BMI, SBP, DBP, serum cholesterol, coronary heart disease, and follow-up.

1. Conduct a multiple linear regression using SPSS. Provide relevant SPSS output and assess the statistical significance of the effect of sex, age, and BMI on systolic blood pressure. (10 points)

2. Explain the practical implications of your finding. In other words, what is the clinical relevance of these results and how can the findings of this study be used in the real world? Include a reference to the R square of the model in your discussion. (5 points)

3. Discuss whether or not there is interaction (effect modification) between sex and age. (Hint: For this part you need to perform independent regression models for each variable looking at the association between sex and systolic blood pressure as well as age and systolic blood pressure. Then you need to perform another model for the combination of variables sex , age and systolic blood pressure. (10 points)

III. Multiple Logistic Regression (30 points)

Use the Final Exam – Linear and Logistic (SPSS document) dataset to assess the impact of sex, age, and BMI on the risk of coronary heart disease.

1. Conduct simple logistic regression of coronary heart disease and sex. (10 points)

2. Conduct a multiple logistic regression using SPSS to address the research question: What is the association between sex and coronary heart disease after controlling for age and BMI? (Hint: Add the variables of age and BMI to the model you used in #1.) (10 points)

3. Discuss how the addition of age and BMI in the model affected the association of sex and coronary heart disease using the Odds Ratios and confidence intervals in your output. (5 points)

4. Assess the statistical significance of the individual risk factors and explain the practical implication of your finding. In other words, what is the clinical significance of these results and how can the findings of this study be used in the real world? (5 points)

IV. Cox Proportional Hazard (30 points)

The Final Exam – Linear and Logistic (SPSS document) dataset, used in problems III and IV, also includes follow-up time (in days) from the beginning of the study to either onset of coronary heart disease or end of the study.  This allows you to also look at the relationship of sex to CHD using survival analysis techniques.

1. Complete a Kaplan-Meier Survival Analysis using Followup as the Time variable, Chdfate as the status variable and Sex as the factor. Produce a plot of the survival function. Discuss whether the survival time appears related to whether the person is male or female based on the survival plot. (10 points)

2. Use Kaplan-Meier in SPSS to test the assumption of proportionality. Create a Hazard plot with time = followed, status=Chdfate, and factor = sex. Interpret the results. (8 points)

3. Conduct a Cox Proportional Hazard regression to compare the time to coronary heart disease event between men and women. Include a Plot of the Hazards function stratified by sex in the output. Interpret the results. (8 points)

4. How does the hazard ratio compare to the odds ratio obtained from the simple logistic regression from the previous problem? Why might they differ?  (4 points)