
<?xml version="1.0" encoding="utf-8"?>
<!-- generator="FeedCreator 1.7.2-ppt DokuWiki" -->
<?xml-stylesheet href="https://www2.math.binghamton.edu/lib/exe/css.php?s=feed" type="text/css"?>
<feed xmlns="http://www.w3.org/2005/Atom">
    <title>Department of Mathematics and Statistics, Binghamton University people:kargin:math457_fall2021</title>
    <subtitle></subtitle>
    <link rel="alternate" type="text/html" href="https://www2.math.binghamton.edu/"/>
    <id>https://www2.math.binghamton.edu/</id>
    <updated>2026-04-09T20:01:24-04:00</updated>
    <generator>FeedCreator 1.7.2-ppt DokuWiki</generator>
<link rel="self" type="application/atom+xml" href="https://www2.math.binghamton.edu/feed.php" />
    <entry>
        <title>Project</title>
        <link rel="alternate" type="text/html" href="https://www2.math.binghamton.edu/p/people/kargin/math457_fall2021/project"/>
        <published>2021-11-29T16:30:17-04:00</published>
        <updated>2021-11-29T16:30:17-04:00</updated>
        <id>https://www2.math.binghamton.edu/p/people/kargin/math457_fall2021/project</id>
        <summary>
&lt;h2 class=&quot;sectionedit1&quot; id=&quot;project&quot;&gt;Project&lt;/h2&gt;
&lt;div class=&quot;level2&quot;&gt;

&lt;/div&gt;
&lt;!-- EDIT1 SECTION &quot;Project&quot; [1-19] --&gt;
&lt;h3 class=&quot;sectionedit2&quot; id=&quot;task&quot;&gt;Task&lt;/h3&gt;
&lt;div class=&quot;level3&quot;&gt;

&lt;p&gt;
Choose an existing dataset or collect your own data, and carry out data analyses to explain the relationships among the variables involved. You can also compare the performance of different statistical 
tools for prediction or classification, using the chosen dataset.
&lt;/p&gt;

&lt;/div&gt;
&lt;!-- EDIT2 SECTION &quot;Task&quot; [20-301] --&gt;
&lt;h3 class=&quot;sectionedit3&quot; id=&quot;teams&quot;&gt;Teams&lt;/h3&gt;
&lt;div class=&quot;level3&quot;&gt;

&lt;p&gt;
For this project you are expected to work with one or two other people and submit a joint report. If you cannot find a team member, one will be assigned to you.
&lt;/p&gt;

&lt;/div&gt;
&lt;!-- EDIT3 SECTION &quot;Teams&quot; [302-469] --&gt;
&lt;h3 class=&quot;sectionedit4&quot; id=&quot;grading_policies&quot;&gt;Grading policies&lt;/h3&gt;
&lt;div class=&quot;level3&quot;&gt;

&lt;p&gt;
Team members will receive the same grade for the project and it is up to you to make sure that the work is shared equitably. 
The project is worth 100 points in total, divided into three parts:
&lt;/p&gt;
&lt;ol&gt;
&lt;li class=&quot;level1&quot;&gt;&lt;div class=&quot;li&quot;&gt; Project Proposal (20 pts).&lt;/div&gt;
&lt;/li&gt;
&lt;li class=&quot;level1&quot;&gt;&lt;div class=&quot;li&quot;&gt; Presentation (40 pts): each team will give a 20-minute presentation of the project (dates to be assigned).&lt;/div&gt;
&lt;/li&gt;
&lt;li class=&quot;level1&quot;&gt;&lt;div class=&quot;li&quot;&gt; Final report (40 pts).&lt;/div&gt;
&lt;/li&gt;
&lt;/ol&gt;

&lt;/div&gt;
&lt;!-- EDIT4 SECTION &quot;Grading policies&quot; [470-881] --&gt;
&lt;h3 class=&quot;sectionedit5&quot; id=&quot;schedule&quot;&gt;Schedule&lt;/h3&gt;
&lt;div class=&quot;level3&quot;&gt;
&lt;ul&gt;
&lt;li class=&quot;level1&quot;&gt;&lt;div class=&quot;li&quot;&gt; Declaration of team members: due by October 18&lt;/div&gt;
&lt;/li&gt;
&lt;li class=&quot;level1&quot;&gt;&lt;div class=&quot;li&quot;&gt; Project proposal: due by November 10&lt;/div&gt;
&lt;/li&gt;
&lt;li class=&quot;level1&quot;&gt;&lt;div class=&quot;li&quot;&gt; Preliminary report: due by December 3&lt;/div&gt;
&lt;/li&gt;
&lt;li class=&quot;level1&quot;&gt;&lt;div class=&quot;li&quot;&gt; Project presentations: December 6, 8, 10&lt;/div&gt;
&lt;/li&gt;
&lt;li class=&quot;level1&quot;&gt;&lt;div class=&quot;li&quot;&gt; Final report: due by December 13&lt;/div&gt;
&lt;/li&gt;
&lt;/ul&gt;

&lt;/div&gt;
&lt;!-- EDIT5 SECTION &quot;Schedule&quot; [882-1118] --&gt;
&lt;h3 class=&quot;sectionedit6&quot; id=&quot;data&quot;&gt;Data&lt;/h3&gt;
&lt;div class=&quot;level3&quot;&gt;

&lt;p&gt;
Find your own dataset online; plenty are available.
&lt;/p&gt;

&lt;p&gt;
Popular collections of publicly-available datasets: 
&lt;/p&gt;
&lt;ul&gt;
&lt;li class=&quot;level1&quot;&gt;&lt;div class=&quot;li&quot;&gt; &lt;a href=&quot;https://archive.ics.uci.edu/ml/index.php&quot; class=&quot;urlextern&quot; title=&quot;https://archive.ics.uci.edu/ml/index.php&quot;&gt; UCI Machine Learning Repository&lt;/a&gt;&lt;/div&gt;
&lt;/li&gt;
&lt;li class=&quot;level1&quot;&gt;&lt;div class=&quot;li&quot;&gt; &lt;a href=&quot;https://www.kaggle.com/datasets&quot; class=&quot;urlextern&quot; title=&quot;https://www.kaggle.com/datasets&quot;&gt; Kaggle &lt;/a&gt; &lt;/div&gt;
&lt;/li&gt;
&lt;li class=&quot;level1&quot;&gt;&lt;div class=&quot;li&quot;&gt; &lt;a href=&quot;https://academictorrents.com/browse.php?cat=6&quot; class=&quot;urlextern&quot; title=&quot;https://academictorrents.com/browse.php?cat=6&quot;&gt; Academic Torrents&lt;/a&gt; (shares large datasets via BitTorrent)&lt;/div&gt;
&lt;/li&gt;
&lt;li class=&quot;level1&quot;&gt;&lt;div class=&quot;li&quot;&gt; &lt;a href=&quot;http://lib.stat.cmu.edu/datasets/&quot; class=&quot;urlextern&quot; title=&quot;http://lib.stat.cmu.edu/datasets/&quot;&gt; StatLib &lt;/a&gt; (This is an older collection of data which is no longer updated.)&lt;/div&gt;
&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;
Some institutional collections of data:
&lt;/p&gt;
&lt;ul&gt;
&lt;li class=&quot;level1&quot;&gt;&lt;div class=&quot;li&quot;&gt; &lt;a href=&quot;https://data.worldbank.org&quot; class=&quot;urlextern&quot; title=&quot;https://data.worldbank.org&quot;&gt; World Bank&lt;/a&gt;&lt;/div&gt;
&lt;/li&gt;
&lt;li class=&quot;level1&quot;&gt;&lt;div class=&quot;li&quot;&gt; &lt;a href=&quot;https://www.who.int/data/collections&quot; class=&quot;urlextern&quot; title=&quot;https://www.who.int/data/collections&quot;&gt; World Health Organization&lt;/a&gt;&lt;/div&gt;
&lt;/li&gt;
&lt;/ul&gt;

&lt;/div&gt;
&lt;!-- EDIT6 SECTION &quot;Data&quot; [1119-1786] --&gt;
&lt;h3 class=&quot;sectionedit7&quot; id=&quot;guidelines&quot;&gt;Guidelines&lt;/h3&gt;
&lt;div class=&quot;level3&quot;&gt;
&lt;ul&gt;
&lt;li class=&quot;level1&quot;&gt;&lt;div class=&quot;li&quot;&gt; The proposal should give information about the team, a description of the data, potential research questions, and possible methods to use. The proposal should not exceed one page. The proposal, preliminary report, and final report should be uploaded via the Google form: &lt;a href=&quot;https://forms.gle/fQoNw817KDPyZzgDA&quot; class=&quot;urlextern&quot; title=&quot;https://forms.gle/fQoNw817KDPyZzgDA&quot;&gt; Google Form for Project files&lt;/a&gt;.&lt;/div&gt;
&lt;/li&gt;
&lt;li class=&quot;level1&quot;&gt;&lt;div class=&quot;li&quot;&gt; The preliminary report is a draft of the final report. &lt;/div&gt;
&lt;/li&gt;
&lt;li class=&quot;level1&quot;&gt;&lt;div class=&quot;li&quot;&gt; The final report should not exceed 6 pages, including figures and tables, and must begin with an appropriate title highlighting your choice of topic and analysis.&lt;/div&gt;
&lt;/li&gt;
&lt;li class=&quot;level1&quot;&gt;&lt;div class=&quot;li&quot;&gt; The final report should include:&lt;/div&gt;
&lt;ul&gt;
&lt;li class=&quot;level2&quot;&gt;&lt;div class=&quot;li&quot;&gt;  Description of research questions / issues. The significance of the problems.&lt;/div&gt;
&lt;/li&gt;
&lt;li class=&quot;level2&quot;&gt;&lt;div class=&quot;li&quot;&gt; Description of the data.&lt;/div&gt;
&lt;/li&gt;
&lt;li class=&quot;level2&quot;&gt;&lt;div class=&quot;li&quot;&gt; Preliminary studies: data visualization, dimension reduction, feature extraction, feature selection etc.&lt;/div&gt;
&lt;/li&gt;
&lt;li class=&quot;level2&quot;&gt;&lt;div class=&quot;li&quot;&gt; Statistical analysis:&lt;/div&gt;
&lt;ol&gt;
&lt;li class=&quot;level3&quot;&gt;&lt;div class=&quot;li&quot;&gt; Methods: what analyses were done and why. If there is any challenge in analysis, describe your approach to tackle the problem.&lt;/div&gt;
&lt;/li&gt;
&lt;li class=&quot;level3&quot;&gt;&lt;div class=&quot;li&quot;&gt; Results: A small number of well-designed and tailored tables and graphics may be appropriate. &lt;/div&gt;
&lt;/li&gt;
&lt;/ol&gt;
&lt;/li&gt;
&lt;li class=&quot;level2&quot;&gt;&lt;div class=&quot;li&quot;&gt; Discussion/Conclusion: Convey your findings to a broad audience. &lt;/div&gt;
&lt;/li&gt;
&lt;li class=&quot;level2&quot;&gt;&lt;div class=&quot;li&quot;&gt; Try to avoid including too much software output. Supporting code can be kept in an appendix or a separate document.&lt;/div&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;li class=&quot;level1&quot;&gt;&lt;div class=&quot;li&quot;&gt; The final report will be evaluated on the basis of the following criteria:&lt;/div&gt;
&lt;ol&gt;
&lt;li class=&quot;level2&quot;&gt;&lt;div class=&quot;li&quot;&gt;How interesting the dataset and the research idea are.&lt;/div&gt;
&lt;/li&gt;
&lt;li class=&quot;level2&quot;&gt;&lt;div class=&quot;li&quot;&gt;How well the dataset was prepared for analysis.&lt;/div&gt;
&lt;/li&gt;
&lt;li class=&quot;level2&quot;&gt;&lt;div class=&quot;li&quot;&gt;Quality of the statistical analysis.&lt;/div&gt;
&lt;/li&gt;
&lt;li class=&quot;level2&quot;&gt;&lt;div class=&quot;li&quot;&gt;Quality of the presentation of the material. [Grammar, orthography, and style matter.]&lt;/div&gt;
&lt;/li&gt;
&lt;/ol&gt;
&lt;/li&gt;
&lt;li class=&quot;level1&quot;&gt;&lt;div class=&quot;li&quot;&gt; The in-class presentation can be given by a single team member. It should include a clear description of the data, the research question(s), and the findings. The evaluation criteria are similar to those for the final report, with emphasis on the quality of the presentation.&lt;/div&gt;
&lt;/li&gt;
&lt;/ul&gt;

&lt;/div&gt;
&lt;!-- EDIT7 SECTION &quot;Guidelines&quot; [1787-] --&gt;</summary>
    </entry>
    <entry>
        <title>people:kargin:math457_fall2021:sampleprojects</title>
        <link rel="alternate" type="text/html" href="https://www2.math.binghamton.edu/p/people/kargin/math457_fall2021/sampleprojects"/>
        <published>2022-02-02T20:57:15-04:00</published>
        <updated>2022-02-02T20:57:15-04:00</updated>
        <id>https://www2.math.binghamton.edu/p/people/kargin/math457_fall2021/sampleprojects</id>
        <summary>
&lt;p&gt;
Best projects:
&lt;/p&gt;
&lt;ul&gt;
&lt;li class=&quot;level1&quot;&gt;&lt;div class=&quot;li&quot;&gt; &lt;a href=&quot;https://www2.math.binghamton.edu/lib/exe/fetch.php/people/kargin/math457_fall2021/stroke_prediction.pdf&quot; class=&quot;media mediafile mf_pdf&quot; title=&quot;people:kargin:math457_fall2021:stroke_prediction.pdf (352.6 KB)&quot;&gt;Stroke Prediction&lt;/a&gt;&lt;/div&gt;
&lt;/li&gt;
&lt;li class=&quot;level1&quot;&gt;&lt;div class=&quot;li&quot;&gt; &lt;a href=&quot;https://www2.math.binghamton.edu/lib/exe/fetch.php/people/kargin/math457_fall2021/mortality_covid19.pdf&quot; class=&quot;media mediafile mf_pdf&quot; title=&quot;people:kargin:math457_fall2021:mortality_covid19.pdf (286.7 KB)&quot;&gt;Predicting the 30-Days Mortality Rate of Patient with Covid-19&lt;/a&gt;&lt;/div&gt;
&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;
Other projects (in no particular order):
&lt;/p&gt;
&lt;ul&gt;
&lt;li class=&quot;level1&quot;&gt;&lt;div class=&quot;li&quot;&gt; &lt;a href=&quot;https://www2.math.binghamton.edu/lib/exe/fetch.php/people/kargin/math457_fall2021/animal_shelters.pdf&quot; class=&quot;media mediafile mf_pdf&quot; title=&quot;people:kargin:math457_fall2021:animal_shelters.pdf (462.8 KB)&quot;&gt; Shelter Animal Outcomes&lt;/a&gt;&lt;/div&gt;
&lt;/li&gt;
&lt;li class=&quot;level1&quot;&gt;&lt;div class=&quot;li&quot;&gt; &lt;a href=&quot;https://www2.math.binghamton.edu/lib/exe/fetch.php/people/kargin/math457_fall2021/student_performance.pdf&quot; class=&quot;media mediafile mf_pdf&quot; title=&quot;people:kargin:math457_fall2021:student_performance.pdf (311.7 KB)&quot;&gt; Student Performance on Exams&lt;/a&gt;&lt;/div&gt;
&lt;/li&gt;
&lt;li class=&quot;level1&quot;&gt;&lt;div class=&quot;li&quot;&gt; &lt;a href=&quot;https://www2.math.binghamton.edu/lib/exe/fetch.php/people/kargin/math457_fall2021/olympic_results.pdf&quot; class=&quot;media mediafile mf_pdf&quot; title=&quot;people:kargin:math457_fall2021:olympic_results.pdf (4.8 MB)&quot;&gt; Analysis of 120 Years of Olympic Results&lt;/a&gt;&lt;/div&gt;
&lt;/li&gt;
&lt;li class=&quot;level1&quot;&gt;&lt;div class=&quot;li&quot;&gt; &lt;a href=&quot;https://www2.math.binghamton.edu/lib/exe/fetch.php/people/kargin/math457_fall2021/prospective_applicants.pdf&quot; class=&quot;media mediafile mf_pdf&quot; title=&quot;people:kargin:math457_fall2021:prospective_applicants.pdf (177.7 KB)&quot;&gt; Statistical Analysis as a Method to Gauge Interest in Prospective Applicants for Data Analytics Jobs&lt;/a&gt;&lt;/div&gt;
&lt;/li&gt;
&lt;li class=&quot;level1&quot;&gt;&lt;div class=&quot;li&quot;&gt; &lt;a href=&quot;https://www2.math.binghamton.edu/lib/exe/fetch.php/people/kargin/math457_fall2021/music_popularity.pdf&quot; class=&quot;media mediafile mf_pdf&quot; title=&quot;people:kargin:math457_fall2021:music_popularity.pdf (201.1 KB)&quot;&gt; Music Popularity&lt;/a&gt;&lt;/div&gt;
&lt;/li&gt;
&lt;li class=&quot;level1&quot;&gt;&lt;div class=&quot;li&quot;&gt; &lt;a href=&quot;https://www2.math.binghamton.edu/lib/exe/fetch.php/people/kargin/math457_fall2021/house_sales.pdf&quot; class=&quot;media mediafile mf_pdf&quot; title=&quot;people:kargin:math457_fall2021:house_sales.pdf (652.3 KB)&quot;&gt; House Sales in Ames, Iowa&lt;/a&gt;&lt;/div&gt;
&lt;/li&gt;
&lt;/ul&gt;
</summary>
    </entry>
</feed>
