User Tools

Site Tools


seminars:datasci:181030

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

seminars:datasci:181030 [2018/10/24 22:03]
sdang created
seminars:datasci:181030 [2018/10/24 22:03] (current)
sdang
Line 1: Line 1:
 +<WRAP centeralign>##​Data Science Seminar##\\ Hosted by Department of Mathematical Sciences</​WRAP>​
 +
 +  * Date: Tuesday, October 30, 2018
 +  * Time: 12:00pm -- 1:00pm
 +  * Room: WH-100E
 +  * Speaker: ​ Rebecca Nugent (Carnegie Mellon University)
 +  * Title: Before Teaching Data Science, Let’s First Understand How People Do It
 +
 +<WRAP center box 80%>
 +<WRAP centeralign>​**//​Abstract//​**</​WRAP>​
 +Much of the first wave of Data Science programs was built on a foundation of already existing computing-oriented classes; less effort was spent on how people from diverse backgrounds and disciplines approach data science. At Carnegie Mellon, the Department of Statistics & Data Science teaches thousands of students with future degrees ranging from Pre-Med to Rhetoric to Chemistry to Business to Statistics & Machine Learning and are well positioned to tackle this pedagogical challenge. ​ In the last two years, we have begun developing ISLE (Interactive Statistics Learning Environment),​ an interactive platform that removes the computing cognitive load and lets students explore Statistics & Data Science concepts in both structured and unstructured ways. The platform also supports student-driven inquiry and case studies. We track and model every click, word used, and decision made throughout the data analysis pipeline from loading the data to the final written report. The platform is flexible enough to allow adaptation, providing different modes of data analysis instruction,​ active learning opportunities,​ and exercises for different subsets of the population. Students are also able to build their own case studies with little restriction or faculty intervention. The resulting data sets are invaluable in capturing behavioral data science information and generate interesting statistical methodological questions about how to model the learning processes using data of mixed modality (clicks, text, audio, video, etc).  We present some initial methodological work with an emphasis on developing variable selection methods when clustering circular data (text). ​ In short, teaching Data Science while simultaneously learning how we do it.
 +</​WRAP>​