Task： Choose an existing datasets or create your own data, carry out exploratory data analyses and regression analyses to explain the relationships among the variables involved.
Team members: For this project you may choose to work with 1-2 persons and submit a joint project. If you cannot find a team member, I will assign a teammate to you.
Opt out deadline: 03/08/2018. (If you feel your teammates are being incorporative, you can choose to start your own team as one individual team. However, you CANNOT switch to other teams. You can work on the same project you has been worked on but you are expected to write your own reports and make your own presentation. You must choose to do so voluntarily. Any team CANNOT force a team member out. )
Grading policies: team members will receive the same grade for the project and it is up to you to make sure that the work is shared equitably. The total points of the project is 100 points, which can be divided into three parts:
Initial report (20 pts): due by 03/08/2018;
Presentation (20 pts): each team will give a 30 minutes presentation of the project; (dates to be assigned)
Final report (60 pts): due by the end of the final exam date (TBA);
General guidelines of the project
Identify the problem of interest: choose a data set, describe the data set and identify the problem you are interested in;
Perform preliminary studies of the data: data visualization; check model assumptions, etc
Select most promising predictors: what variables are potentially most useful for your problem;
Choose the best regression model that serves your purpose and justify your choice (e.g. model diagnostics, outliers, normality, model selection criterion, etc…)
Interpret the final regression model;
Discuss your findings: What do the results mean?
Put all your codes in the appendix.
Find your own data set online (e.g. google “regression data set”), you will find plenty;