Course SLO | Assessment |
---|---|
After completing this course, students will be able to import, format, and transform common data-set types programmatically and build effective visualizations of data | Homework, Data Package Assignment, and Project |
After completing this course, students will be able to apply appropriate exploration and modeling techniques to learn from data. | Homework and Project |
After completing this course, students will be able to present and communicate results obtained via data analysis in an effective manner. | Homework, Data Package Assignment, and Project |
Syllabus
DS 201 (Q) Introduction to Data Science
Instructor Information
Instructor: Dr. Jason M. Graham
Office: LSC 319A
- Office Hours: Mondays 12:30-1:30, Wednesdays 9:30-10:30 and 12:30-1:30. You may also make an appointment to meet with me outside of scheduled office hours. Appointments are not necessary for regularly scheduled office hours.
Instructor Schedule: View my weekly schedule
Email: jason.graham@scranton.edu
Phone: (570) 941-7491
Course Page: View course page
Course Materials
Required Readings
R for Data Science by Hadley Wickham & Garrett Grolemund, view the free online version of the text.
Data Science: A First Introduction by Tiffany Timbers, Trevor Campbell, and Melissa Lee, view the free online version of the text.
Recommended Readings
Hands-On Programming with R by Garrett Grolemund, view the free online version of the text.
Telling Stories with Data by Rohan Alexander, view the free online version of the text.
Data Visualization A Practical Introduction by Healy, view the free online version of the text.
Statistical Inference via Data Science A ModernDive into R and the Tidyverse by Chester Ismay & Albert Y. Kim, view the free online version of the text.
Further Reading
An Introduction to Statistical Learning 2nd Ed. by James, Witten, Hastie, and Tibshirani, view the free online version of the text.
Hands-On Machine Learning with R, view the free online version of the text.
Links to additional resources related to the course material will be posted on the course website. View the resources link.
Course Information
Course Description
An introduction to basic data science workflow following current best practices. This course will introduce students to computational or algorithmic ways to think about and learn from data. Emphasis will be placed on data visualization, exploratory data analysis, and foundational modeling principles and techniques implemented using an appropriate programming language.
Prerequisites
Math Placement PT score of 14 or higher, or ALEKS score of 76 or higher, or MATH 114, or permission of instructor
Student Learning Objectives and Assessment:
Course Policies and Procedures
Grading
Grade Policy
The overall course grade will be based on (roughly twelve) weekly homework assignment totaling 30% of the overall course grade, a data package assignment totaling 20% of the overall course grade, and a semester project totaling 50% of the overall course grade.
Grade Scale
Letter grades will be assigned based on the following scale:
Use of AI
Artificial intelligence (AI) can be an effective tool in data science. For example, AI-based programming assistants like GitHub Copilot or generative model platforms like ChatGPT now help programmers and developers to write better code in less time. Learning to use AI is essentially becoming a basic skill for the modern data scientist. Because of this, I do not want to completely discourage the use of AI assistance.
However, I ask that you avoid using AI platforms or tools in a manner that is inappropriate in the context of this course. This course teaches a variety of concepts, skills, and critical thinking. Using AI in such a way as to avoid learning, developing skills, or critical thinking is not appropriate. If you find yourself using AI to look up answers, search for complete solutions to problems, or things like this, then your use of AI is not acceptable. It might be helpful to think of AI as an analog to a calculator. If the goal of an assignment is for you to demonstrate that you can do a certain calculation, then using a calculator is not appropriate. On the other hand, if the goal of an assignment is for you to demonstrate that you can solve a problem for which a minor step involves doing a calculation, then using a calculator is okay. AI should be treated analogously.
In particular, it is expected that students will be able to explain independently and in detail what any line of code submitted as part of an assignment this semester does. Also, it is expected that students can explain independently and in detail the solution to any problem submitted as part of an assignment this semester.
If you have any doubts about your use of AI, then either ask the instructor if your use of AI is acceptable or just don’t use AI.
Assignments
Homework Assignments
There will be roughly 12 weekly homework assignments throughout the semester. These assignments with due dates will be posted to the course learning management system. Homework problems will be a mix of hand-written and computer assignments and the problems will relate to the material covered in lectures and readings.
Data Package Assignment
The data package assignment, making up 20% of the course grade asks students to curate a data set according the best practices for reproducible data analyses. A complete data package assignment should consist of
An appropriate raw data set(s) with original source fully documented.
An appropriately cleaned and processed version of the original raw data set. All code or other files used to clean and process the raw data must be included as part of the final data package submission.
An appropriate data sheet to accompany the data package.
A public folder, repository, R package, or other container that can be used to make your data set and associated files and documentation available to others.
Further details on the semester project such as the parameters of the assignment and a grade rubric will be posted on the course learning management system.
Semester Project
The semester project will incorporate all the components of a data analysis covered in the course throughout the semester applied to a data set of your choosing pending approval by the instructor. Various components of the project will be due at different times but you will have the opportunity to revise some components prior to the submission of the final product.
A complete project, counting for 50% of the overall course grade will consist of the following:
A data sheet describing the essential information about your chosen data set.
An initial exploratory data analysis for your chosen data set.
An appropriate analysis of your chosen data set with the goal to address a specific research question.
A project report developed using Quarto. You may view the rendered version of an example report here.
Slides for a presentation summarizing your project. You will not actually present the slides. A rendered version of examples slides may be viewed here.
A GitHub repository containing all code (appropriately documented) written and used in your project. An example project repository may be viewed here.
Your project report and presentation should be written as if it is addressed to a stake holder with some subject matter knowledge in the domain of application but not necessarily with a quantitative or programming background. Further details on the semester project such as the parameters of the assignment and a grade rubric will be posted on the course learning management system.
Course Timeline
Weekly Schedule
Week 1: Introduction to data
Week 2: Basic programming for data science
Week 3: Data wrangling; Project component 1 due
Week 4: Creating data packages
Week 5: Data visualization
Week 6: Exploratory data analysis; Project component 2 due
Week 7: Intermediate programming for data science
Week 8: Introduction to probability for data science
Week 9: Introduction to statistics for data science; Project component 3 due
Week 10: Classification
Week 11: Regression
Week 12: Clustering; Project component 4 due
Week 13: Introduction to machine learning
Week 14: Ethical considerations in data science
Week 15: Putting everything together; Project final version due
Important Dates
Event | Date |
---|---|
Classes begin | 08-28 |
Last day to add classes | 90-01 |
Holiday, no classes | 09-04 |
100% tuition refund | 09-06 |
Drop (no grade) | 09-27 |
Fall break | 10-07 to 10-10 |
Mid-semester | 10-18 |
Withdraw with W | 11-10 |
Thanksgiving break | 11-22 to 11-26 |
Last week | 12-05 to 12-11 |
Finals | 12-12 to 12-16 |
University Resources for Students and Academic Honesty
Students with Disabilities
Reasonable academic accommodations may be provided to students who submit relevant and current documentation of their disability. Students are encouraged to contact the Center for Teaching and Learning Excellence (CTLE) at disabilityservices@scranton.edu or (570) 941-4038 if they have or think they may have a disability and wish to determine eligibility for any accommodations. For more information, please visit http://www.scranton.edu/disabilities.
Writing Center Services
The Writing Center focuses on helping students become better writers. Consultants will work one-on-one with students to discuss students’ work and provide feedback at any stage of the writing process. Scheduling appointments early in the writing progress is encouraged.
To meet with a writing consultant, call (570) 941-6147 to schedule an appointment, or send an email with your available meeting times, the course for which you need assistance, and your phone number to: writing-center@scranton.edu. The Writing Center does offer online appointments for our distance learning students.
Academic Honesty and Integrity
Each student is expected to do their own work. It is also expected that each student respect and abide by the Academic Code of Honesty as set forth in the University of Scranton student handbook. Conduct that violates the Academic Code of Honesty includes plagiarism, duplicate submission of the same work, collusion, providing false information, unauthorized use of computers, theft and destruction of property, and unauthorized possession of tests and other materials. Steps taken in response to suspected violations may include a discussion with the instructor, an informal meeting with the dean of the college, and a hearing before the Academic Dishonesty Hearing Board. Students who are found to have violated the Code will ordinarily be assigned the grade F by the instructor and may face other sanctions. The complete Academic Code of Honesty is located on the University website at https://www.scranton.edu/academics/wml/acad-integ/acad-code-honesty.shtml.
My Reporting Obligation as a Responsible Employee
As a faculty member, I am deeply invested in the well-being of each student I teach. I am here to assist you with your work in this course. Additionally, if you come to me with other non-course-related concerns, I will do my best to help. It is important for you to know that all faculty members are required to report incidents of sexual harassment or sexual misconduct involving students. This means that I cannot keep information about sexual harassment, sexual assault, sexual exploitation, intimate partner violence or stalking confidential if you share that information with me. I will keep the information as private as I can but am required to bring it to the attention of the University’s Title IX Coordinator, Elizabeth M. Garcia, or Deputy Title IX Coordinator, Diana M. Collins, who, in conversation with you, will explain available support, resources, and options. I will not report anything to anybody without first letting you know and discussing choices as to how to proceed. The University’s Counseling Center (570-941-7620) is available to you as a confidential resource; counselors (in the counseling center) do not have an obligation to report to the Title IX Coordinator.
Non-discrimination Statement
The University is committed to providing an educational, residential, and working environment that is free from harassment and discrimination. Members of the University community, applicants for employment or admissions, guests, and visitors have the right to be free from harassment or discrimination based on race, color, religion, ancestry, gender, sex, pregnancy, sexual orientation, gender identity or expression, age, disability, genetic information, national origin, veteran status, or any other status protected by applicable law.
Students who believe they have been subject to harassment or discrimination based on any of the above class of characteristics, or experience sexual harassment, sexual misconduct or gender discrimination should contact Elizabeth M. Garcia, Title IX Coordinator, (570) 941-6645 elizabeth.garcia2@scranton.edu, Deputy Title IX Coordinators Diana M. Collins (570) 941-6645 diana.collins@scranton.edu, or Ms. Lauren Rivera, AVP for Student Life and Dean of Students, at (570)941-7680 lauren.rivera@scranton.edu. The United States Department of Education’s Office for Civil Rights (OCR) enforces Title IX. Information regarding OCR may be found at <www.ed.gov/about/offices/list/ocr/index.html>
The University of Scranton Sexual Harassment and Sexual Misconduct Policy can be found online at https://www.scranton.edu/diversity. All reporting options and resources are available at https://www.scranton.edu/CARE.
About Pronouns
It is easy to make assumptions about an individual’s pronouns, but we try not to! Please tell us in class or via a private email if you would like to let us know what your pronouns are, if/when you would like us (and others) to use them, and certainly feel free to correct us or others if we make a mistake. Using the pronouns that a person has indicated they prefer is considered both professional and polite, and as such we ask that all members of our class use the appropriate pronouns.
If you have questions about this, please feel free to look up more information here (https://www.mypronouns.org/) or email jason.graham@scranton.edu with any questions.
Student Mental Health: Suggestions and Resources
Many students experience mental health challenges at some point in college. Struggles vary and might be related to academics, anxiety, depression, relationships, grief/loss, substance abuse, and other challenges. There are resources to help you and getting help is the smart and courageous thing to do.
Counseling Center (6th Floor O’Hara Hall; 570-941-7620) – Free, confidential individual and group counseling is available on campus.
Teletherapy – For students who wish to access therapy via video, phone, and/or chat, the University offers a teletherapy resource. Please contact the Counseling Center (570-941-7620) to inquire about teletherapy.
Mental Health Screenings – Confidential, online “check up from your neck up” to help you determine if you should connect with a mental health professional.
Dean of Students Office (201 DeNaples Center; 570-941-7680) – Private support and guidance for students navigating personal challenges that may impact success at the University
Final Note
The instructor reserve the right to modify this syllabus; students will immediately be notified of any such changes and an updated syllabus will be made available to the class via the course learning management system.