This course explores essential concepts and techniques in statistical inference and big data analytics. It covers a broad spectrum of topics, including data fundamentals, analysis in high-dimensional spaces, singular value decomposition (SVD), principal component analysis (PCA), random walks, and Markov Chains. Additionally, it includes a special topic on the ethical implications of big data -- security, privacy, and fairness, explored through reading research papers. Students will engage in practical learning through coding assignments and hands-on projects to gain practical experience and exposure to cutting-edge techniques.
Throughout this course, students will:
Basic concepts of linear algebra, calculus, probability theory, and programming skills in Python.
Foundation of Data Science (2018), Avrim Blum, John Hopcroft and Ravindran Kannan Online
Assignments: This course includes 4-5 written assignments to enhance learning. All assignments must be completed individually, as teamwork is not permitted.
Mini Project: The mini project in this course allows students to select from a range of topics for exploration and analysis. Students are permitted to form teams. A report is required, which will encapsulate the project’s methodology, analysis, and individual contributions.
Exams: Exams are conducted in a closed-book format to assess your understanding and integration of the course material. Questions are accessible via Canvas on a phone or laptop The final exam is a comprehensive test covering all topics discussed throughout the semester.
Participation: Participation is assessed through in-class quizzes. Quizzes consist of multiple-choice questions accessible via Canvas on a phone or laptop.