The Data Science Major degree program combines computational and inferential reasoning to draw conclusions based on data about some aspect of the real world. Data scientists come from all walks of life, all areas of study, and all backgrounds. They share an appreciation for the practical use of mathematical and scientific thinking and the power of computing to understand and solve problems for business, research, and societal impact.
The Data Science Major will equip students to draw sound conclusions from data in context, using knowledge of statistical inference, computational processes, data management strategies, domain knowledge, and theory. Students will learn to carry out analyses of data through the full cycle of the investigative process in scientific and practical contexts. Students will gain an understanding of the human and ethical implications of data analytics and integrate that knowledge in designing and carrying out their work.
The Data Science major requirements include DATA C8 and DATA C100, the core lower-division and upper-division elements of the major, along with courses from each of the following requirement groups:
Foundations in Mathematics and Computing
Computational and Inferential Depth
Modeling, Learning and Decision Making
Probability
Human Contexts and Ethics
Domain Emphasis
All students will select a Domain Emphasis, a cluster of one lower division course and two upper division courses, that brings them into the context of a domain and allows them to build bridges with data science.
Declaring the Major
Students can apply to declare the Data Science major after completing all the lower-division prerequisites (see the Major Requirements tab). For applicants with prerequisites in progress, applications will be reviewed after the grades for all prerequisites are available.
It is necessary for applicants to achieve a minimum prerequisite grade point average (GPA) in order to declare the Data Science major. Information on this GPA and the process to apply for admission to the major can be found on the Declaring the Major web page.
Minor Program
The Minor in Data Science at UC Berkeley aims to provide students with practical knowledge of the methods and techniques of data analysis, as well as the ability to think critically about the construction and implications of data analysis and models. The minor will empower students across the wide array of campus disciplines with a working knowledge of statistics, probability, and computation that allow students not just to participate in data science projects, but to design and carry out rigorous computational and inferential analysis for their field of interest. Check the Data Science Minor program website for details.
In addition to the University, campus, and college requirements listed on the College Requirements tab, students must fulfill the below requirements specific to the major program. Please check theData Science program website for updates.
General Guidelines
All courses taken to fulfill the major requirements below must be taken for letter-graded credit.
No more than two upper-division courses can overlap between two majors.
A minimum grade point average (GPA) of 2.0 must be maintained in all courses toward the major, and in all upper-division courses toward the major.
In some cases, students may complete alternative courses to satisfy the above prerequisites. See the lower-division requirements page on the Data Science program website for more details.
Lower Division Requirements
Students will also be required to take one lower division course towards their choice of Domain Emphasis.
Upper Division Requirements
Students will be required to complete 8 unique upper-division courses for a total of 28 or more units from the following requirement categories.
Students will be required to take two upper division courses comprising 7 or more units that provide computational and inferential depth beyond that provided in Data 100 and the lower-division courses.
Course List
Code
Title
Units
Choose two courses comprising 7+ units from the following:
Modern Statistical Prediction and Machine Learning
4
Human Contexts and Ethics
Students will be required to take one course from a curated list of courses that establish a human, social, and ethical context in which data analytics and computational inference play a central role.
The Minor in Data Science at UC Berkeley aims to provide students with practical knowledge of the methods and techniques of data analysis, as well as the ability to think critically about the construction and implications of data analysis and models. The minor will empower students across the wide array of campus disciplines with a working knowledge of statistics, probability, and computation that allow students not just to participate in data science projects, but to design and carry out rigorous computational and inferential analysis for their field of interest.
General Guidelines
All minors must be declared no later than one semester before a student's Expected Graduation Term (EGT). If the semester before EGT is fall or spring, the deadline is the last day of RRR week. If the semester before EGT is summer, the deadline is the final Friday of Summer Sessions. For more information about declaring the minor, view the Data Science minor webpage.
All courses for the minor must be taken for a letter grade.
Students must earn a C- or better in each course, and have a minimum 2.0 GPA in all courses towards the minor.
Students may overlap up to 1 course in the upper division requirements for the Data Science minor with each of their majors (for example, a Computer Science major may count COMPSCI/DATA/STAT C100 toward both their major and the Data Science minor).
A maximum of one course offered by or cross-listed with the student’s major department(s) may count toward the data science minor upper-division requirements, including any overlapping course (for example, if a Computer Science major takes COMPSCI/DATA/STAT C100 toward the Data Science minor, this is the only COMPSCI, ELENG, or EECS course which may count toward the upper-division requirements for the minor).
An upper-division course used to fulfill a lower-division requirement (for example, Stat 134 to fulfill the probability requirement) will not be counted toward the maximum 1 course allowed to overlap with the major, nor will it fulfill one of the four upper division course requirements.
There is no restriction on overlap with another minor.
Courses used to fulfill the minor requirements may be applied toward the Seven-Course Breadth requirement, for Letters & Science students.
All minor requirements must be completed prior to the last day of finals during the semester in which you plan to graduate.
Undergraduate students must fulfill the following requirements in addition to those required by their major program.
For detailed lists of courses that fulfill college requirements, please review the College of Letters & Sciences page in this Guide. For College advising appointments, please visit the L&S Advising Pages.
All students who will enter the University of California as freshmen must demonstrate their command of the English language by fulfilling the Entry Level Writing requirement. Fulfillment of this requirement is also a prerequisite to enrollment in all reading and composition courses at UC Berkeley.
The American History and Institutions requirements are based on the principle that a US resident graduated from an American university, should have an understanding of the history and governmental institutions of the United States.
All undergraduate students at Cal need to take and pass this course in order to graduate. The requirement offers an exciting intellectual environment centered on the study of race, ethnicity and culture of the United States. AC courses offer students opportunities to be part of research-led, highly accomplished teaching environments, grappling with the complexity of American Culture.
College of Letters & Science Essential Skills Requirements
The Quantitative Reasoning requirement is designed to ensure that students graduate with basic understanding and competency in math, statistics, or computer science. The requirement may be satisfied by exam or by taking an approved course.
The Foreign Language requirement may be satisfied by demonstrating proficiency in reading comprehension, writing, and conversation in a foreign language equivalent to the second semester college level, either by passing an exam or by completing approved course work.
In order to provide a solid foundation in reading, writing, and critical thinking the College requires two semesters of lower division work in composition in sequence. Students must complete parts A & B reading and composition courses by the end of their second semester and a second-level course by the end of their fourth semester.
College of Letters & Science 7 Course Breadth Requirements
The undergraduate breadth requirements provide Berkeley students with a rich and varied educational experience outside of their major program. As the foundation of a liberal arts education, breadth courses give students a view into the intellectual life of the University while introducing them to a multitude of perspectives and approaches to research and scholarship. Engaging students in new disciplines and with peers from other majors, the breadth experience strengthens interdisciplinary connections and context that prepares Berkeley graduates to understand and solve the complex issues of their day.
Unit Requirements
120 total units
Of the 120 units, 36 must be upper division units
Of the 36 upper division units, 6 must be taken in courses offered outside your major department
Residence Requirements
For units to be considered in "residence," you must be registered in courses on the Berkeley campus as a student in the College of Letters & Science. Most students automatically fulfill the residence requirement by attending classes here for four years. In general, there is no need to be concerned about this requirement, unless you go abroad for a semester or year or want to take courses at another institution or through UC Extension during your senior year. In these cases, you should make an appointment to meet an adviser to determine how you can meet the Senior Residence Requirement.
Note: Courses taken through UC Extension do not count toward residence.
Senior Residence Requirement
After you become a senior (with 90 semester units earned toward your BA degree), you must complete at least 24 of the remaining 30 units in residence in at least two semesters. To count as residence, a semester must consist of at least 6 passed units. Intercampus Visitor, EAP, and UC Berkeley-Washington Program (UCDC) units are excluded.
You may use a Berkeley Summer Session to satisfy one semester of the Senior Residence requirement, provided that you successfully complete 6 units of course work in the Summer Session and that you have been enrolled previously in the college.
Modified Senior Residence Requirement
Participants in the UC Education Abroad Program (EAP), Berkeley Summer Abroad, or the UC Berkeley Washington Program (UCDC) may meet a Modified Senior Residence requirement by completing 24 (excluding EAP) of their final 60 semester units in residence. At least 12 of these 24 units must be completed after you have completed 90 units.
Upper Division Residence Requirement
You must complete in residence a minimum of 18 units of upper division courses (excluding UCEAP units), 12 of which must satisfy the requirements for your major.
Summary of Modifications
L&S College Requirements: Reading & Composition, Quantitative Reasoning, and Foreign Language, which typically must be satisfied with a letter grade, can be satisfied with a Passed (P) grade during Fall 2020 and Spring 2021 if a student elects to take the course for P/NP. Note: This does not include Entry Level Writing (College Writing R1A).
Requirements within L&S majors and minors can be satisfied with Passed (P) grades during the Fall 2020 and Spring 2021 semesters. This includes prerequisites for majors. Contact your intended or declared major/minor adviser for more details.
Departments may create alternative methods for admitting students into their majors.
L&S students will not be placed on academic probation automatically for taking all of their courses P/NP during Fall 2020 or Spring 2021.
Plans of Study
Sample plans for completing major coursework are included below. These are not comprehensive plans which will reflect the situation of every student. These sample plans are meant only to serve as a baseline guide for structuring a plan of study, and only include the minimum courses for meeting the L&S Data Science major requirements.
Modeling, Learning & Decision Making (e.g. Data 102)
4
Upper-division Domain Emphasis #2
3-4
Human Contexts & Ethics (e.g. Data 104)
3-4
Breadth/Elective
3-4
Breadth/Elective
3-4
9-12
10-12
Total Units: 85-100
For transfer students (two-year plan):
*Note: this sample plan is based on a transfer student who has completed 1 year of calculus, linear algebra and data structures, as well as IGETC/L&S 7-Course Breadth at their previous college or university, which may not reflect the reality for every transfer student. Students should consult with a Data Science Advisor to make an individualized plan based on their specific situation.
Modeling, Learning & Decision Making (e.g. Data 102)
3-4
Upper-division Domain Emphasis #2
3-4
Human Contexts & Ethics (e.g. Data 104)
3-4
Non-major Elective
1-2
Non-major Elective
1-2
10-14
10-14
Total Units: 42-56
Major Map
Major Maps help undergraduate students discover academic, co-curricular, and discovery opportunities at UC Berkeley based on intended major or field of interest. Developed by the Division of Undergraduate Education in collaboration with academic departments, these experience maps will help you:
Explore your major and gain a better understanding of your field of study
Connect with people and programs that inspire and sustain your creativity, drive, curiosity and success
Discover opportunities for independent inquiry, enterprise, and creative expression
Engage locally and globally to broaden your perspectives and change the world
Reflect on your academic career and prepare for life after Berkeley
Use the major map below as a guide to planning your undergraduate journey and designing your own unique Berkeley experience.
Each semester, we recruit dozens of students to participate in our student teams as interns and volunteers. Teams include Communications, Analytics, External Relations, and Curriculum Development. Interested students can email ds-teams@berkeley.edu with questions about the opportunities. Learn more here.
Data Scholars
The Data Scholars program addresses issues of underrepresentation in the data science community by establishing a welcoming, educational, and empowering environment for underrepresented and nontraditional students. The program, which offers specialized tutoring, advising, mentorship, and workshops, is especially suited for students who can bring diverse perspectives to the field of Data Science. Learn more here.
Data Peer Consulting
Students in our consulting network help make data science accessible across the broader campus community by providing technical support and tutoring. Peer consultants are available at Moffitt Library on a drop-in basis. Learn more here.
Data Science Peer Advising
Academic Peer Advisors are available to help fellow students choose classes, explore academic interests, and learn how to declare the Data Science major. The Peer Advising services are available on a drop-in basis at Moffitt Library. Contact the Data Science Peer Advisors at ds-peer-advising@berkeley.edu. Learn more here.
Discovery Research Program
The Data Science Discovery Research program connects undergraduates with hands-on, team-based opportunities to contribute to cutting-edge research projects with graduate and post-doctoral students, community impact groups, entrepreneurial ventures, and educational initiatives across UC Berkeley. Learn more here.
Data Science Nexus
The Data Science Nexus is an alliance of data science student organizations on campus that work together to build community, host industry events, and provide academic support for students. In recognition of the extraordinarily diverse and multi-faceted nature of data science, members of the Nexus come from a variety of domains. Learn more here.
Related Courses
Terms offered: Spring 2021
This course engages students with fundamental questions of justice in relation to data and computing in American society. Data collection, visualization, and analysis have been entangled in the struggle for racial and social justice because they can make injustice visible, imaginable, and thus actionable. Data has also been used to oppress minoritized communities and institutionalize, rationalize, and naturalize systems of racial violence. The course examines key sites of justice involving data (such as citizenship, policing, prisons, environment, and health). Along with critical social science tools, students gain introductory experience and do collaborative and creative projects with data science using real-world data. Data and Justice: Read More [+]
Hours & Format
Fall and/or spring: 15 weeks - 3 hours of lecture and 1.5 hours of discussion per week
Additional Details
Subject/Course Level: Data Science, Undergraduate/Undergraduate
Grading/Final exam status: Letter grade. Alternative to final exam.
Terms offered: Prior to 2007
An introduction to computational thinking and quantitative reasoning, preparing students for further coursework, especially Foundations of Data Science (CS/Info/Stat C8). Emphasizes the use of computation to gain insight about quantitative problems with real data. Expressions, data types, collections, and tables in Python. Programming practices, abstraction, and iteration. Visualizing univariate and bivariate data with bar charts, histograms, plots, and maps. Introduction to statistical concepts including averages and distributions, predicting one variable from another, association and causality, probability and probabilistic simulation. Relationship between numerical functions and graphs. Sampling and introduction to inference. Introduction to Computational Thinking with Data: Read More [+]
Objectives & Outcomes
Course Objectives: C6 also includes quantitative reasoning concepts that aren’t covered in Data 8. These include certain topics in: principles of data visualization; simulation of random processes; and understanding numerical functions through their graphs. This will help prepare students for computational and quantitative courses other than Data 8. C6 takes advantage of the complementarity of computing and quantitative reasoning to enliven abstract ideas and build students’ confidence in their ability to solve real problems with quantitative tools. Students learn computer science concepts and immediately apply them to plot functions, visualize data, and simulate random events.
Foundations of Data Science (CS/Info/Stat C8, a.k.a. Data 8) is an increasingly popular class for entering students at Berkeley. Data 8 builds students’ computing skills in the first month of the semester, and students rely on these skills as the course progresses. For some students, particularly those with little prior exposure to computing, developing these skills benefits from further time and practice. C6 is a rapid introduction to Python programming, visualization, and data analysis, which will prepare students for success in Data 8.
Student Learning Outcomes: Students will be able to perform basic computations in Python, including working with tabular data.
Students will be able to understand basic probabilistic simulations.
Students will be able to understand the syntactic structure of Python code.
Students will be able to use good practices in Python programming.
Students will be able to use visualizations to understand univariate data and to identify associations or causal relationships in bivariate data.
Terms offered: Fall 2021, Summer 2021 8 Week Session, Spring 2021, Fall 2020
Foundations of data science from three perspectives: inferential thinking, computational thinking, and real-world relevance. Given data arising from some real-world phenomenon, how does one analyze that data so as to understand that phenomenon? The course teaches critical concepts and skills in computer programming and statistical inference, in conjunction with hands-on analysis of real-world datasets, including economic data, document collections, geographical data, and social networks. It delves into social and legal issues surrounding data analysis, including issues of privacy and data ownership. Foundations of Data Science: Read More [+]
Rules & Requirements
Prerequisites: This course may be taken on its own, but students are encouraged to take it concurrently with a data science connector course (numbered 88 in a range of departments)
Terms offered: Spring 2021, Fall 2020, Spring 2020
Designed to be taken in conjunction with the Foundations of Data Science (COMPSCI/INFO/STAT C8) course, each connector course will flesh out data science ideas in the context of one particular field. Blending inferential thinking and computational thinking, the course relies on the increasing availability of datasets across a wide range of human endeavor, and students' natural interest in such data, to teach students to work actively with data in a field of their interest and to interpret and critique their analyses of data. Topics vary by field, and several topics will be offered each term. Data Science Connector: Read More [+]
Objectives & Outcomes
Course Objectives: Discuss how to formulate and substantiate an argument with evidence Explain a variety of analytic and visualization techniques Explore approaches to effective communication Explore the challenges with working with primary and secondary data
Student Learning Outcomes: Apply data analysis to evaluate everyday problems
Communicate effectively in written, spoken, and graphical form about specific issues
Interpret statistical results
Know how to locate and use primary data sources
Obtain and/or collect relevant data using specific qualitative and/or quantitative research methods
Understand how to use empirical evidence to evaluate an argument
Rules & Requirements
Prerequisites: Instructors may require students to enroll concurrently or have completed Data 8 (COMPSCI/STAT/INFO C8)
Repeat rules: Course may be repeated for credit without restriction. Students may enroll in multiple sections of this course within the same semester.
Hours & Format
Fall and/or spring: 15 weeks - 2-4 hours of seminar per week
Additional Details
Subject/Course Level: Data Science, Undergraduate/Undergraduate
Grading/Final exam status: Letter grade. Alternative to final exam.
Terms offered: Fall 2021, Spring 2021
This class aims to motivate and illustrate key concepts in economics through a series of exercises and examples that use Python Jupyter notebooks. The class covers concepts from introductory economics, microeconomic theory, econometrics, development economics, environmental economics and public economics. The course provides data science students a pathway to apply Python programming and data science concepts within the discipline of economics. The course will also gives economics students a pathway to apply programming to reinforce fundamental concepts and to advance the level of study in upper division coursework and possible thesis work. Economic Models: Read More [+]
Objectives & Outcomes
Course Objectives: Demonstrate how to construct understanding of concepts in economics by developing and coding examples Illustrate topics in economics through coding applications Motivate basics of econometrics from a data science perspective
Student Learning Outcomes: Programmatically create and interpret graphs of simple equations used in microeconomics
Reason about and solve simple equations used in microeconomics through coding
Understand basic concepts in economics
Rules & Requirements
Prerequisites: You must have taken Data C8 or be concurrently enrolled in Data C8 to take this course. That being said, we are able to make exceptions if you have prior programming or data science experience; please email the course staff if you have any questions. Prior economics knowledge may be helpful but is not necessary
Hours & Format
Fall and/or spring: 15 weeks - 2 hours of lecture per week
Summer: 6 weeks - 5 hours of lecture per week
Additional Details
Subject/Course Level: Data Science, Undergraduate/Undergraduate
Grading/Final exam status: Letter grade. Alternative to final exam.
Terms offered: Fall 2021, Summer 2021 8 Week Session, Spring 2021, Fall 2020, Summer 2020 8 Week Session
In this course, students will explore the data science lifecycle, including question formulation, data collection and cleaning, exploratory data analysis and visualization, statistical inference and prediction, and decision-making. This class will focus on quantitative critical thinking and key principles and techniques needed to carry out this cycle. These include languages for transforming, querying and analyzing data; algorithms for machine learning methods including regression, classification and clustering; principles behind creating informative data visualizations; statistical concepts of measurement error and prediction; and techniques for scalable data processing. Principles & Techniques of Data Science: Read More [+]
Terms offered: Fall 2021, Spring 2021, Fall 2020
This course develops the probabilistic foundations of inference in data science, and builds a comprehensive view of the modeling and decision-making life cycle in data science including its human, social, and ethical implications. Topics include: frequentist and Bayesian decision-making, permutation testing, false discovery rate, probabilistic interpretations of models, Bayesian hierarchical models, basics of experimental design, confidence intervals, causal inference, Thompson sampling, optimal control, Q-learning, differential privacy, clustering algorithms, recommendation systems and an introduction to machine learning tools including decision trees, neural networks and ensemble methods. Data, Inference, and Decisions: Read More [+]
Rules & Requirements
Prerequisites: Mathematics 54 or Mathematics 110 or Statistics 89A or Physics 89 or both of Electrical Engineering and Computer Science 16A and Electrical Engineering and Computer Science 16B; Statistics/Computer Science C100; and any of Electrical Engineering and Computer Science 126, Statistics 140, Statistics 134, Industrial Engineering and Operations Research 172. Statistics 140 or Electrical Engineering and Computer Science 126 are preferred
Credit Restrictions: Students will receive no credit for DATA C102 after completing STAT 102, or DATA 102. A deficient grade in DATA C102 may be removed by taking STAT 102, STAT 102, or DATA 102.
Hours & Format
Fall and/or spring: 15 weeks - 3 hours of lecture, 1 hour of discussion, and 1 hour of laboratory per week
Additional Details
Subject/Course Level: Data Science, Undergraduate/Undergraduate
Grading/Final exam status: Letter grade. Final exam required.
Terms offered: Fall 2021, Spring 2021, Fall 2020, Spring 2020, Fall 2019
This course teaches you to use the tools of applied historical thinking and Science, Technology, and Society (STS) to recognize, analyze, and shape the human contexts and ethics of data. It addresses key topics such as doing ethical data science amid shifting definitions of human subjects, consent, and privacy; the changing relationship between data, democracy, and law; the role of data analytics in how corporations and governments provide public goods such as health and security to citizens; sensors, machine learning and artificial intelligence and changing landscapes of labor, industry, and city life. It prepares you to engage as a knowledgeable and responsible citizen and professional in the varied arenas of our datafied world. Human Contexts and Ethics of Data - DATA/History/STS: Read More [+]
Terms offered: Fall 2021, Spring 2021
This course teaches a broad range of statistical methods that are used to solve data problems. Topics include group comparisons and ANOVA, standard parametric statistical models, multivariate data visualization, multiple linear regression, logistic regression and classification, regression trees and random forests. An important focus of the course is on statistical computing and reproducible statistical analysis. The course and lab include hands-on experience in analyzing real world data from the social, life, and physical sciences. The R statistical language is used.
Terms offered: Fall 2021, Spring 2021
An introduction to probability, emphasizing the combined use of mathematics and programming to solve problems. Random variables, discrete and continuous families of distributions. Bounds and approximations. Dependence, conditioning, Bayes methods. Convergence, Markov chains. Least squares prediction. Random permutations, symmetry, order statistics. Use of numerical computation, graphics, simulation, and computer algebra. Probability for Data Science: Read More [+]
Objectives & Outcomes
Course Objectives: The emphasis on simulation and the bootstrap in Data 8 gives students a concrete sense of randomness and sampling variability. Stat 140 will capitalize on this, abstraction and computation complementing each other throughout.
The syllabus has been designed to maintain a mathematical level at least equal to that in Stat 134. So Stat 140 will start faster than Stat 134 (due to the Data 8 prerequisite), avoid approximations that are unnecessary when SciPy is at hand, and replace some of the routine calculus by symbolic math done in SymPy. This will create time for a unit on the convergence and reversibility of Markov Chains as well as added focus on conditioning and Bayes methods. With about a thousand students a year taking Foundations of Data Science (Stat/CS/Info C8, a.k.a. Data 8), there is considerable demand for follow-on courses that build on the skills acquired in that class. Stat 140 is a probability course for Data 8 graduates who have also had a year of calculus and wish to go deeper into data science.
Student Learning Outcomes: Understand the difference between math and simulation, and appreciate the power of both
Use a variety of approaches to problem solving
Work with probability concepts algebraically, numerically, and graphically
Rules & Requirements
Prerequisites: Statistics/Computer Science/Information C8, or Statistics/Computer Science C100, or both Stat 20 and Computer Science 61A; and one year of calculus at the level of Mathematics 1A-1B or higher. Corequisite: Mathematics 54, Electrical Engineering 16A, Statistics 89A, Mathematics 110 or equivalent linear algebra
Credit Restrictions: Students will receive no credit for STAT C140 after completing STAT 134. A deficient grade in STAT C140 may be removed by taking STAT 134.
Hours & Format
Fall and/or spring: 15 weeks - 3 hours of lecture, 2 hours of discussion, and 1 hour of supplement per week
Additional Details
Subject/Course Level: Data Science, Undergraduate/Undergraduate
Grading/Final exam status: Letter grade. Final exam required.
Terms offered: Fall 2021, Fall 2020, Fall 2019
Data Mining and Analytics introduces students to practical fundamentals of data mining and emerging paradigms of data mining and machine learning with enough theory to aid intuition building. The course is project-oriented, with a project beginning in class every week. The in-class portion of the project is meant to be collaborative and a time for the instructor and GSIs to work closely with project groups to understand the objectives, help work through software logistics, and connect project work to lecture. Lectures will introduce theories, concepts, practical contexts, and algorithms. Students should expect to leave the class with hands-on, contemporary data mining skills they can confidently apply in research and industry. Data Mining and Analytics: Read More [+]
Objectives & Outcomes
Course Objectives: Conduct manual feature engineering (from domain knowledge) vs. machine induced featurization (representation learning) Develop intuition in various machine learning classification algorithms (e.g. decision trees, feed-forward neural networks, recurrent neural networks, skip-grams) and clustering techniques (e.g. k-means, spectral) Foster critical thinking about real-world actionability from analytics Provide an overview of issues in research and practice that will affect the practice of data science in a variety of domains
Student Learning Outcomes: Develop capabilities in a range of data mining techniques
Gain the ability to solve problems in data mining research and practice
Think critically about how to assess analytics
Use data mining and analytics in a domain of application
Rules & Requirements
Prerequisites: Data 100 (COMPSCI/STAT C100) recommended
Credit Restrictions: Students will receive no credit for DATA 144 after completing INFO 154. A deficient grade in DATA 144 may be removed by taking INFO 154.
Hours & Format
Fall and/or spring: 15 weeks - 3 hours of lecture per week
Additional Details
Subject/Course Level: Data Science, Undergraduate/Undergraduate
Grading/Final exam status: Letter grade. Alternative to final exam.
Terms offered: Prior to 2007
The senior honors thesis seminar gives students an opportunity to experience firsthand what it means to do data science research. Over two semesters, students will learn to formulate a research problem, design a research strategy, collect evidence, and write up the findings and analysis. The first semester focuses primarily on the preparation and implementation of a research proposal, as well as data management strategies. During the second semester, we will emphasize analysis and writing. The final result will be a hybrid product with a 20-25 page research paper, with data visualizations and analysis tables, along with a documented data source, annotated code, well documented Github repository, and open science posting of the project. Data Science Honors Thesis Seminar: Read More [+]
Objectives & Outcomes
Course Objectives: Assist students with project organization and management. Convey approaches to effective writing and visual communication. Discuss how to formulate and substantiate an argument with evidence. Explain approaches to designing a research question and project. Explore a variety of analytic and visualization techniques and discuss their appropriateness to different research questions. Identify the challenges in data acquisition and management.
Student Learning Outcomes: Communicate effectively in written, spoken, and graphical form.
Develop an understanding of data availability, constraints, and ethics.
Develop data management skills.
Develop reproducible research and interpret results.
Formulate a proposal for a research project.
Learn how to develop a research question and project.
Understand how to organize empirical work into a written document.
Understand how to use empirical evidence to construct an argument.
Rules & Requirements
Prerequisites: There are no specific prerequisites. Students must be accepted into the data science honors program in order to take this course. Students must complete H195A in order to enroll in H195B
Hours & Format
Fall and/or spring: 15 weeks - 2 hours of seminar per week
Additional Details
Subject/Course Level: Data Science, Undergraduate/Undergraduate
Grading/Final exam status: Letter grade. Alternative to final exam.
Terms offered: Spring 2020
The senior honors thesis seminar gives students an opportunity to experience firsthand what it means to do data science research. Over two semesters, students will learn to formulate a research problem, design a research strategy, collect evidence, and write up the findings and analysis. The first semester focuses primarily on the preparation and implementation of a research proposal, as well as data management strategies. During the second semester, we will emphasize analysis and writing. The final result will be a hybrid product with a 20-25 page research paper, with data visualizations and analysis tables, along with a documented data source, annotated code, well documented Github repository, and open science posting of the project. Data Science Honors Thesis Seminar: Read More [+]
Objectives & Outcomes
Course Objectives: Assist students with project organization and management. Convey approaches to effective writing and visual communication. Discuss how to formulate and substantiate an argument with evidence. Explain approaches to designing a research question and project. Explore a variety of analytic and visualization techniques and discuss their appropriateness to different research questions. Identify the challenges in data acquisition and management.
Student Learning Outcomes: Communicate effectively in written, spoken, and graphical form.
Develop an understanding of data availability, constraints, and ethics.
Develop data management skills.
Develop reproducible research and interpret results.
Formulate a proposal for a research project.
Learn how to develop a research question and project.
Understand how to organize empirical work into a written document.
Understand how to use empirical evidence to construct an argument.
Rules & Requirements
Prerequisites: There are no specific prerequisites. Students must be accepted into the data science honors program in order to take this course. Students must complete H195A in order to enroll in H195B
Hours & Format
Fall and/or spring: 15 weeks - 2 hours of seminar per week
Additional Details
Subject/Course Level: Data Science, Undergraduate/Undergraduate
Grading/Final exam status: Letter grade. Alternative to final exam.
Terms offered: Fall 2019
Students take part in organized individual field sponsored programs with off-campus organizations or tutoring/mentoring relevant to specific aspects and applications of data science on or off campus. Note Summer CPT or OPT students: written report required. Course may not count toward major requirements but will be counted in the cumulative units toward graduation. Field Studies in Data Science: Read More [+]
Rules & Requirements
Prerequisites: Consent of instructor (see department advisor). Upper-division standing
Repeat rules: Course may be repeated for credit with advisor consent.
Hours & Format
Fall and/or spring: 15 weeks - 1-4 hours of fieldwork per week
Summer: 6 weeks - 2.5-10 hours of fieldwork per week 8 weeks - 2-7.5 hours of fieldwork per week 10 weeks - 1.5-6 hours of fieldwork per week
Additional Details
Subject/Course Level: Data Science, Undergraduate/Undergraduate
Grading/Final exam status: Offered for pass/not pass grade only. Alternative to final exam.
Terms offered: Spring 2021
Written proposal must be approved by a faculty sponsor, who will serve as Instructor of Record. Seminars for the group study of selected topics, which will vary from semester to semester. Topics may be initiated by students. Directed Group Studies for Advanced Undergraduates: Read More [+]
Rules & Requirements
Prerequisites: Instructors may require students to enroll concurrently or have completed Data 8 (COMPSCI/STAT/INFO C8). Upper-division standing and consent of instructor
Repeat rules: Course may be repeated for credit without restriction. Students may enroll in multiple sections of this course within the same semester.
Hours & Format
Fall and/or spring: 15 weeks - 1-4 hours of directed group study per week
Additional Details
Subject/Course Level: Data Science, Undergraduate/Undergraduate
Grading/Final exam status: Offered for pass/not pass grade only. Final exam not required.
Terms offered: Prior to 2007
Independent study and research by arrangement with faculty or staff. This course allows students to obtain course credit for participation in undergraduate research. Students may opt either to participate in a semester-long series of workshops which provide a guided research experience with project milestone assignments and regular feedback, or they may opt to work independently with supervision from one faculty research mentor. Supervised Independent Study and Research: Read More [+]
Objectives & Outcomes
Student Learning Outcomes: Develop and refine skills acquired in other courses in a hands-on, self-directed research project.
Identify how to properly manage data and describe best practices in programming and analytics.
Integrate feedback from an instructor into research on a regular basis.
Learn how to structure and complete a research project working independently.
Rules & Requirements
Prerequisites: Instructors may require students to enroll concurrently or have completed Data 8 (COMPSCI/STAT/INFO C8). Upper-division standing and consent of instructor
Repeat rules: Course may be repeated for credit without restriction.
Hours & Format
Fall and/or spring: 15 weeks - 3-12 hours of independent study per week
Summer: 6 weeks - 7.5-30 hours of independent study per week 8 weeks - 5.5-22.5 hours of independent study per week
Additional Details
Subject/Course Level: Data Science, Undergraduate/Undergraduate
Grading/Final exam status: Offered for pass/not pass grade only. Alternative to final exam.
Terms offered: Fall 2021, Spring 2021, Fall 2020, Spring 2020, Fall 2019
Explores the data science lifecycle: question formulation, data collection and cleaning, exploratory, analysis, visualization, statistical inference, prediction, and decision-making. Focuses on quantitative critical thinking and key principles and techniques: languages for transforming, querying and analyzing data; algorithms for machine learning methods: regression, classification and clustering; principles of informative visualization; measurement error and prediction; and techniques for scalable data processing. Research term project. Principles and Techniques of Data Science: Read More [+]
Rules & Requirements
Prerequisites: Computer Science/Information/Statistics C8 or Engineering 7; and either Computer Science 61A or Computer Science 88. Corequisite: Mathematics 54 or Electrical Engineering 16A
Fall and/or spring: 8 weeks - 6 hours of lecture, 2 hours of discussion, and 2 hours of laboratory per week 15 weeks - 3 hours of lecture, 1 hour of discussion, and 1 hour of laboratory per week
Summer: 8 weeks - 6 hours of lecture, 2 hours of discussion, and 2 hours of laboratory per week
Additional Details
Subject/Course Level: Data Science, Undergraduate/Graduate
Grading: Letter grade.
Formerly known as: Statistics C200C/Computer Science C200A
When you print this page, you are actually printing everything within the tabs on the page you are on: this may include all the Related Courses and Faculty, in addition to the Requirements or Overview. If you just want to print information on specific tabs, you're better off downloading a PDF of the page, opening it, and then selecting the pages you really want to print.