
<?xml version="1.0" encoding="utf-8"?>
<!-- generator="FeedCreator 1.7.2-ppt DokuWiki" -->
<?xml-stylesheet href="https://www2.math.binghamton.edu/lib/exe/css.php?s=feed" type="text/css"?>
<feed xmlns="http://www.w3.org/2005/Atom">
    <title>Department of Mathematics and Statistics, Binghamton University people:kargin:math457_fall2023</title>
    <subtitle></subtitle>
    <link rel="alternate" type="text/html" href="https://www2.math.binghamton.edu/"/>
    <id>https://www2.math.binghamton.edu/</id>
    <updated>2026-04-04T23:59:38-04:00</updated>
    <generator>FeedCreator 1.7.2-ppt DokuWiki</generator>
<link rel="self" type="application/atom+xml" href="https://www2.math.binghamton.edu/feed.php" />
    <entry>
        <title>Project</title>
        <link rel="alternate" type="text/html" href="https://www2.math.binghamton.edu/p/people/kargin/math457_fall2023/project"/>
        <published>2024-06-28T11:07:55-04:00</published>
        <updated>2024-06-28T11:07:55-04:00</updated>
        <id>https://www2.math.binghamton.edu/p/people/kargin/math457_fall2023/project</id>
        <summary>
&lt;h2 class=&quot;sectionedit1&quot; id=&quot;project&quot;&gt;Project&lt;/h2&gt;
&lt;div class=&quot;level2&quot;&gt;

&lt;/div&gt;
&lt;!-- EDIT1 SECTION &quot;Project&quot; [1-19] --&gt;
&lt;h3 class=&quot;sectionedit2&quot; id=&quot;task&quot;&gt;Task&lt;/h3&gt;
&lt;div class=&quot;level3&quot;&gt;

&lt;p&gt;
Choose an existing dataset or collect your own data, and carry out data analyses to explain the relationships among the variables involved. You can also compare performance of different statistical 
tools for prediction or classification, using the chosen dataset.
&lt;/p&gt;

&lt;/div&gt;
&lt;!-- EDIT2 SECTION &quot;Task&quot; [20-301] --&gt;
&lt;h3 class=&quot;sectionedit3&quot; id=&quot;teams&quot;&gt;Teams&lt;/h3&gt;
&lt;div class=&quot;level3&quot;&gt;

&lt;p&gt;
For this project you are supposed to work with 1-2 persons and submit a joint report. If you cannot find a team member, they will be assigned to you. (This is not recommended. Try to find a student who is compatible with your work style.)
&lt;/p&gt;

&lt;/div&gt;
&lt;!-- EDIT3 SECTION &quot;Teams&quot; [302-558] --&gt;
&lt;h3 class=&quot;sectionedit4&quot; id=&quot;grading_policies&quot;&gt;Grading policies&lt;/h3&gt;
&lt;div class=&quot;level3&quot;&gt;

&lt;p&gt;
Team members will receive the same grade for the project and it is up to you to make sure that the work is shared equitably. 
The total points of the project is 100 points, which can be divided into three parts:
&lt;/p&gt;
&lt;ol&gt;
&lt;li class=&quot;level1&quot;&gt;&lt;div class=&quot;li&quot;&gt; Project Proposal (20 pts).&lt;/div&gt;
&lt;/li&gt;
&lt;li class=&quot;level1&quot;&gt;&lt;div class=&quot;li&quot;&gt; Presentation (40 pts): each team will give a 20 minutes presentation of the project; (dates to be assigned)&lt;/div&gt;
&lt;/li&gt;
&lt;li class=&quot;level1&quot;&gt;&lt;div class=&quot;li&quot;&gt; Final report (40 pts)&lt;/div&gt;
&lt;/li&gt;
&lt;/ol&gt;

&lt;/div&gt;
&lt;!-- EDIT4 SECTION &quot;Grading policies&quot; [559-970] --&gt;
&lt;h3 class=&quot;sectionedit5&quot; id=&quot;schedule&quot;&gt;Schedule&lt;/h3&gt;
&lt;div class=&quot;level3&quot;&gt;

&lt;p&gt;
See syllabus.
&lt;/p&gt;

&lt;/div&gt;
&lt;!-- EDIT5 SECTION &quot;Schedule&quot; [971-1213] --&gt;
&lt;h3 class=&quot;sectionedit6&quot; id=&quot;methods&quot;&gt;Methods&lt;/h3&gt;
&lt;div class=&quot;level3&quot;&gt;

&lt;p&gt;
You can use methods we covered in class or other statistical learning methods, with preference for more advanced methods. 
&lt;/p&gt;

&lt;/div&gt;
&lt;!-- EDIT6 SECTION &quot;Methods&quot; [1214-1355] --&gt;
&lt;h3 class=&quot;sectionedit7&quot; id=&quot;data&quot;&gt;Data&lt;/h3&gt;
&lt;div class=&quot;level3&quot;&gt;

&lt;p&gt;
Find your own data set online, you will find plenty;
&lt;/p&gt;

&lt;p&gt;
Popular collections of publicly-available datasets: 
&lt;/p&gt;
&lt;ul&gt;
&lt;li class=&quot;level1&quot;&gt;&lt;div class=&quot;li&quot;&gt; &lt;a href=&quot;https://archive.ics.uci.edu/ml/index.php&quot; class=&quot;urlextern&quot; title=&quot;https://archive.ics.uci.edu/ml/index.php&quot;&gt; UCI Machine Learning Repository&lt;/a&gt;&lt;/div&gt;
&lt;/li&gt;
&lt;li class=&quot;level1&quot;&gt;&lt;div class=&quot;li&quot;&gt; &lt;a href=&quot;https://www.kaggle.com/datasets&quot; class=&quot;urlextern&quot; title=&quot;https://www.kaggle.com/datasets&quot;&gt; Kaggle &lt;/a&gt; &lt;/div&gt;
&lt;/li&gt;
&lt;li class=&quot;level1&quot;&gt;&lt;div class=&quot;li&quot;&gt; &lt;a href=&quot;https://academictorrents.com/browse.php?cat=6&quot; class=&quot;urlextern&quot; title=&quot;https://academictorrents.com/browse.php?cat=6&quot;&gt; Academic Torrents&lt;/a&gt; (shares large datasets via bit torrent technology)&lt;/div&gt;
&lt;/li&gt;
&lt;li class=&quot;level1&quot;&gt;&lt;div class=&quot;li&quot;&gt; &lt;a href=&quot;http://lib.stat.cmu.edu/datasets/&quot; class=&quot;urlextern&quot; title=&quot;http://lib.stat.cmu.edu/datasets/&quot;&gt; StatLib &lt;/a&gt; (This is an older collection of data which is no longer updated.)&lt;/div&gt;
&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;
Some institional collections of data:
&lt;/p&gt;
&lt;ul&gt;
&lt;li class=&quot;level1&quot;&gt;&lt;div class=&quot;li&quot;&gt; &lt;a href=&quot;https://data.worldbank.org&quot; class=&quot;urlextern&quot; title=&quot;https://data.worldbank.org&quot;&gt; World Bank&lt;/a&gt;&lt;/div&gt;
&lt;/li&gt;
&lt;li class=&quot;level1&quot;&gt;&lt;div class=&quot;li&quot;&gt; &lt;a href=&quot;https://www.who.int/data/collections&quot; class=&quot;urlextern&quot; title=&quot;https://www.who.int/data/collections&quot;&gt; World Health Organization&lt;/a&gt;&lt;/div&gt;
&lt;/li&gt;
&lt;/ul&gt;

&lt;/div&gt;
&lt;!-- EDIT7 SECTION &quot;Data&quot; [1356-2023] --&gt;
&lt;h3 class=&quot;sectionedit8&quot; id=&quot;guidelines&quot;&gt;Guidelines&lt;/h3&gt;
&lt;div class=&quot;level3&quot;&gt;
&lt;ul&gt;
&lt;li class=&quot;level1&quot;&gt;&lt;div class=&quot;li&quot;&gt; The proposal should give information about the team, description of the data, potential research questions and possible methods to use. The proposal should not exceed one page. The proposal, preliminary  and final reports should be uploaded via Google form: &lt;a href=&quot;https://forms.gle/fQoNw817KDPyZzgDA&quot; class=&quot;urlextern&quot; title=&quot;https://forms.gle/fQoNw817KDPyZzgDA&quot;&gt; Google Form for Project files&lt;/a&gt;.&lt;/div&gt;
&lt;/li&gt;
&lt;li class=&quot;level1&quot;&gt;&lt;div class=&quot;li&quot;&gt; The preliminary report is a draft of the final report. &lt;/div&gt;
&lt;/li&gt;
&lt;li class=&quot;level1&quot;&gt;&lt;div class=&quot;li&quot;&gt; The final report should not exceed 6 pages, including figures and tables, and must begin with an appropriate title highlighting your choice of topic and analysis.&lt;/div&gt;
&lt;/li&gt;
&lt;li class=&quot;level1&quot;&gt;&lt;div class=&quot;li&quot;&gt; The final report should include:&lt;/div&gt;
&lt;ul&gt;
&lt;li class=&quot;level2&quot;&gt;&lt;div class=&quot;li&quot;&gt;  Description of research questions / issues. The significance of the problems.&lt;/div&gt;
&lt;/li&gt;
&lt;li class=&quot;level2&quot;&gt;&lt;div class=&quot;li&quot;&gt; Description of the data.&lt;/div&gt;
&lt;/li&gt;
&lt;li class=&quot;level2&quot;&gt;&lt;div class=&quot;li&quot;&gt; Preliminary studies: data visualization, dimension reduction, feature extraction, feature selection etc.&lt;/div&gt;
&lt;/li&gt;
&lt;li class=&quot;level2&quot;&gt;&lt;div class=&quot;li&quot;&gt; Statistical analysis:&lt;/div&gt;
&lt;ol&gt;
&lt;li class=&quot;level3&quot;&gt;&lt;div class=&quot;li&quot;&gt; Methods: what analyses were done and why. If there is any challenge in analysis, describe your approach to tackle the problem.&lt;/div&gt;
&lt;/li&gt;
&lt;li class=&quot;level3&quot;&gt;&lt;div class=&quot;li&quot;&gt; Results: A small number of well-designed and tailored tables and graphics may be appropriate. &lt;/div&gt;
&lt;/li&gt;
&lt;/ol&gt;
&lt;/li&gt;
&lt;li class=&quot;level2&quot;&gt;&lt;div class=&quot;li&quot;&gt; Discussion/Conclusion: Convey your findings to a broad audience. &lt;/div&gt;
&lt;/li&gt;
&lt;li class=&quot;level2&quot;&gt;&lt;div class=&quot;li&quot;&gt; Try to avoid including too much of the software output. Supporting code can be kept in appendix or a separate document.&lt;/div&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;li class=&quot;level1&quot;&gt;&lt;div class=&quot;li&quot;&gt; The final report will be evaluated on the basis of the following criteria &lt;/div&gt;
&lt;ol&gt;
&lt;li class=&quot;level2&quot;&gt;&lt;div class=&quot;li&quot;&gt;How interesting is the dataset and the research idea. &lt;/div&gt;
&lt;/li&gt;
&lt;li class=&quot;level2&quot;&gt;&lt;div class=&quot;li&quot;&gt;How well the dataset was prepared for analysis. &lt;/div&gt;
&lt;/li&gt;
&lt;li class=&quot;level2&quot;&gt;&lt;div class=&quot;li&quot;&gt;Quality of the statistical analysis &lt;/div&gt;
&lt;/li&gt;
&lt;li class=&quot;level2&quot;&gt;&lt;div class=&quot;li&quot;&gt;Quality of material presentation  [The grammar, orthography and style matter.]&lt;/div&gt;
&lt;/li&gt;
&lt;/ol&gt;
&lt;/li&gt;
&lt;li class=&quot;level1&quot;&gt;&lt;div class=&quot;li&quot;&gt; The presentation in class can be done by a single team-member or by all team members, as you prefer it. It should include clear description of the data, research question(s) and findings. The evaluation criteria are similar to the criteria for the final report, with emphasis on the quality of presentation. &lt;/div&gt;
&lt;/li&gt;
&lt;/ul&gt;

&lt;/div&gt;
&lt;!-- EDIT8 SECTION &quot;Guidelines&quot; [2024-] --&gt;</summary>
    </entry>
    <entry>
        <title>people:kargin:math457_fall2023:sampleprojects</title>
        <link rel="alternate" type="text/html" href="https://www2.math.binghamton.edu/p/people/kargin/math457_fall2023/sampleprojects"/>
        <published>2024-07-12T09:37:50-04:00</published>
        <updated>2024-07-12T09:37:50-04:00</updated>
        <id>https://www2.math.binghamton.edu/p/people/kargin/math457_fall2023/sampleprojects</id>
        <summary>
&lt;p&gt;
These are some projects done in Fall 2023. They are not meant to be a representative sample. Rather, these are projects that were outstanding in one or several aspects (originality of idea, data, methods, presentation). Some of them surprised me a lot.
&lt;/p&gt;
&lt;ul&gt;
&lt;li class=&quot;level1&quot;&gt;&lt;div class=&quot;li&quot;&gt; &lt;a href=&quot;https://www2.math.binghamton.edu/lib/exe/fetch.php/people/kargin/math457_fall2023/math_457_nyc_crime.pdf&quot; class=&quot;media mediafile mf_pdf&quot; title=&quot;people:kargin:math457_fall2023:math_457_nyc_crime.pdf (1.9 MB)&quot;&gt; Geography of NYC Crime&lt;/a&gt;.&lt;/div&gt;
&lt;/li&gt;
&lt;li class=&quot;level1&quot;&gt;&lt;div class=&quot;li&quot;&gt; &lt;a href=&quot;https://www2.math.binghamton.edu/lib/exe/fetch.php/people/kargin/math457_fall2023/math457_feature_engineering.pdf&quot; class=&quot;media mediafile mf_pdf&quot; title=&quot;people:kargin:math457_fall2023:math457_feature_engineering.pdf (3 MB)&quot;&gt; Faster Learning by Feature Engineering with ResNet-50&lt;/a&gt;.&lt;/div&gt;
&lt;/li&gt;
&lt;li class=&quot;level1&quot;&gt;&lt;div class=&quot;li&quot;&gt; &lt;a href=&quot;https://www2.math.binghamton.edu/lib/exe/fetch.php/people/kargin/math457_fall2023/math457_mushrooms.pdf&quot; class=&quot;media mediafile mf_pdf&quot; title=&quot;people:kargin:math457_fall2023:math457_mushrooms.pdf (418.7 KB)&quot;&gt; Classification of Mushroom Edibility: Report&lt;/a&gt;, &lt;a href=&quot;https://www2.math.binghamton.edu/lib/exe/fetch.php/people/kargin/math457_fall2023/math457_mushrooms.pptx.pdf&quot; class=&quot;media mediafile mf_pdf&quot; title=&quot;people:kargin:math457_fall2023:math457_mushrooms.pptx.pdf (8 MB)&quot;&gt; Presentation (converted to pdf)&lt;/a&gt;.&lt;/div&gt;
&lt;/li&gt;
&lt;li class=&quot;level1&quot;&gt;&lt;div class=&quot;li&quot;&gt; &lt;a href=&quot;https://www2.math.binghamton.edu/lib/exe/fetch.php/people/kargin/math457_fall2023/math457_covid_19_in_mexico.pdf&quot; class=&quot;media mediafile mf_pdf&quot; title=&quot;people:kargin:math457_fall2023:math457_covid_19_in_mexico.pdf (856.5 KB)&quot;&gt; Covid-19 Mortality -- Mexican Data: Report&lt;/a&gt;, &lt;a href=&quot;https://www2.math.binghamton.edu/lib/exe/fetch.php/people/kargin/math457_fall2023/math457_covid_19_in_mexico.pptx.pdf&quot; class=&quot;media mediafile mf_pdf&quot; title=&quot;people:kargin:math457_fall2023:math457_covid_19_in_mexico.pptx.pdf (3.8 MB)&quot;&gt; Presentation (converted to pdf)&lt;/a&gt;.&lt;/div&gt;
&lt;/li&gt;
&lt;li class=&quot;level1&quot;&gt;&lt;div class=&quot;li&quot;&gt;  &lt;a href=&quot;https://www2.math.binghamton.edu/lib/exe/fetch.php/people/kargin/math457_fall2023/math457_one_shot.pdf&quot; class=&quot;media mediafile mf_pdf&quot; title=&quot;people:kargin:math457_fall2023:math457_one_shot.pdf (434.8 KB)&quot;&gt; One-shot Learning of Alphabet Characters&lt;/a&gt;.&lt;/div&gt;
&lt;/li&gt;
&lt;/ul&gt;
</summary>
    </entry>
</feed>
