Lecture 13 - Navigating Data Science Projects Lifecycle

Introduction

This section provides a summary on the holistic procedures of a data science project using R. We will explore the steps necessary to take a data science project from initial concept to deployment and presentation. Each stage will include practical applications using R, providing both theoretical and hands-on learning opportunities.

flowchart TD
  A[1. Formulating a Research Question] --> B[2. Data Understanding]
  B --> C
  subgraph C[Data Preparation]
    direction LR
    C1[3 Data Cleaning] --> C2[4 Feature Engineering]
  end
  C --> D[5. Model Building, Tuning, and Evaluation]
  D --> E[6. Model Deployment and Results Reporting]

Resources:

  1. codeacademy - Introduction to Machine Learning
  2. Wikipedia, The Free Encyclopedia - Cross-industry standard process for data mining.