What You’ll Learn in the Become a Data Scientist Nanodegree Program

Data Scientist

Estimated 4 months to complete

You’ll master the skills necessary to become a successful Data Scientist. You’ll work on projects designed by industry experts, and learn to run data pipelines, design experiments, build recommendation systems, and deploy solutions to the cloud.

If you already have experience with machine learning, take this program.

Prerequisite knowledge

We recommend that students be familiar with machine learning concepts, such as those taught in the Intro to Machine Learning Nanodegree Program. In addition, students should be familiar with Python programming, probability, and statistics.

The Data Scientist Nanodegree program is an advanced program designed to prepare students for Data Scientist jobs. As such, students should have a high comfort level with a variety of topics before starting the program. In order to successfully complete this program, you should meet the following prerequisites:

Machine Learning

  • Supervised and unsupervised methods equivalent to those taught in the Intro to Machine Learning Nanodegree Program

Programming

  • Python programming, including writing functions, building basic applications, and using common libraries like NumPy and pandas
  • SQL programming, including querying databases and using joins, aggregations, and subqueries
  • Comfort using the terminal and GitHub

Probability and Statistics

  • Descriptive Statistics including calculating measures of center and spread
  • Inferential Statistics including sampling distributions and hypothesis testing

Mathematics

  • Calculus including maximizing and minimizing algebraic functions
  • Linear Algebra including matrix manipulation and multiplication

Data Wrangling and Visualization

  • Accessing database, CSV, and JSON data
  • Data cleaning and transformations using pandas and scikit-learn
  • Data Visualization with matplotlib
  • Exploratory and explanatory data analysis and visualization

Solving Data Science Problems

Learn the data science process, including how to build effective data visualizations, and how to communicate with various stakeholders.

Project – Write a Data Science Blog Post

In this project, you will choose a dataset, identify three questions, and analyze the data to find answers to these questions. You will create a GitHub repository for your project and write a blog post to communicate your findings to the appropriate audience. The project will help you reinforce and extend your knowledge of machine learning, data visualization, and communication.

Software Engineering for Data Scientists

Develop software engineering skills that are essential for data scientists, such as creating unit tests and building classes.
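
To give a flavor of these skills, here is a minimal sketch of a small class with unit tests, written with Python's built-in unittest module. The RunningMean class and its tests are hypothetical illustrations, not material taken from the course.

```python
import unittest


class RunningMean:
    """Tracks the mean of a stream of numbers (hypothetical example class)."""

    def __init__(self):
        self.count = 0
        self.total = 0.0

    def add(self, value):
        """Fold one new observation into the running totals."""
        self.count += 1
        self.total += value

    @property
    def mean(self):
        """Return the mean of all values added so far."""
        if self.count == 0:
            raise ValueError("mean is undefined before any value is added")
        return self.total / self.count


class RunningMeanTest(unittest.TestCase):
    def test_mean_of_three_values(self):
        rm = RunningMean()
        for value in (2, 4, 6):
            rm.add(value)
        self.assertAlmostEqual(rm.mean, 4.0)

    def test_empty_stream_raises(self):
        # Accessing the property with no data should fail loudly.
        with self.assertRaises(ValueError):
            RunningMean().mean
```

Saved to a file, the tests can be run with python -m unittest followed by the file name.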

Data Engineering for Data Scientists

Learn to work with data through the entire data science process, from running pipelines and transforming data to building models and deploying solutions to the cloud.

Project – Build Disaster Response Pipelines with Figure Eight

Figure Eight, a company focused on creating datasets for AI applications, has crowdsourced the tagging and translation of messages to improve disaster relief efforts. In this project, you’ll build a data pipeline to prepare message data from major natural disasters around the world. You’ll build a machine learning pipeline to categorize emergency messages based on the needs communicated by the sender.

Experiment Design and Recommendations

Learn to design experiments and analyze A/B test results. Explore approaches for building recommendation systems.
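
As one illustration of A/B test analysis, the sketch below applies a two-proportion z-test to compare conversion rates between a control group and a variant. The function name and the counts are made up for the example, and this is only one common approach, not necessarily the one taught in the program.

```python
import math


def two_proportion_z_test(conv_a, n_a, conv_b, n_b):
    """Return (z, two-sided p-value) for H0: both groups share one conversion rate."""
    p_a = conv_a / n_a
    p_b = conv_b / n_b
    # Under H0 the best estimate of the shared rate pools both groups.
    pooled = (conv_a + conv_b) / (n_a + n_b)
    std_err = math.sqrt(pooled * (1 - pooled) * (1 / n_a + 1 / n_b))
    z = (p_b - p_a) / std_err
    # Two-sided tail probability of the standard normal distribution.
    p_value = math.erfc(abs(z) / math.sqrt(2))
    return z, p_value


# Hypothetical counts: 200 of 2000 control users convert, 260 of 2000 variant users.
z, p_value = two_proportion_z_test(200, 2000, 260, 2000)
print(f"z = {z:.2f}, p = {p_value:.4f}")
```

A small p-value suggests the difference in conversion rates is unlikely to be due to chance alone, although sample size and practical significance still need to be weighed before acting on the result.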

Project – Design a Recommendation Engine with IBM

IBM has an online data science community where members can post tutorials, notebooks, articles, and datasets. In this project, you will build a recommendation engine that uses user behavior and social network data from IBM Watson Studio’s data platform to surface the content most likely to be relevant to each user.
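
For a sense of what a behavior-based recommender can look like, here is a minimal sketch of user-based collaborative filtering using Jaccard similarity over the sets of articles each user has read. The user names, article titles, and function names are hypothetical; the data is not from IBM, and the project may call for different techniques.

```python
def jaccard(a, b):
    """Similarity of two sets: size of the overlap divided by size of the union."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def recommend(user, interactions, top_n=3):
    """Rank articles the user has not seen by the summed similarity of their readers."""
    seen = interactions[user]
    scores = {}
    for other, articles in interactions.items():
        if other == user:
            continue
        similarity = jaccard(seen, articles)
        if similarity == 0:
            continue  # users with no overlap contribute nothing
        for article in articles - seen:
            scores[article] = scores.get(article, 0.0) + similarity
    # Highest score first; ties are broken alphabetically for stable output.
    ranked = sorted(scores.items(), key=lambda item: (-item[1], item[0]))
    return [article for article, _ in ranked[:top_n]]


# Hypothetical interaction data: user -> set of articles read.
interactions = {
    "ana": {"intro-to-pandas", "ab-testing-101"},
    "ben": {"intro-to-pandas", "ab-testing-101", "svd-explained"},
    "cy": {"intro-to-pandas", "deep-learning-basics"},
    "dee": {"spark-tips"},
}

print(recommend("ana", interactions))
```

Here "ben" overlaps most with "ana", so the article only "ben" has read is ranked first, while "dee" shares nothing with "ana" and has no influence on the result.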

Data Science Projects

Apply what you’ve learned throughout the program to build your own open-ended data science project that demonstrates your abilities as a Data Scientist.

Project – Data Science Capstone Project

You will leverage what you’ve learned throughout the program to build your own data science project. You will define the problem, identify and explore the data, perform your analyses, and present your solution in a blog post or front-end application. This project will serve as a demonstration of your ability as a Data Scientist, and will be an important piece of your job-ready portfolio.

