Code Organization

Learn how we will organize our code.

Organizing our code

Before we dive into coding, it’s important to discuss how to organize the code. Novice programmers tend to write everything in one file, no matter how large the file gets. While this might work, it hurts both readability and maintainability. A typical ML project can require thousands of lines of code, including code for data processing, model training, and other tasks.

Moreover, multiple people often work on different aspects of a project, and some of the code that a data scientist writes may be used in more than one project. In these conditions, it’s useful to organize the code in a way that’s logically consistent and amenable to collaborative development. How can we organize our code to make it logically sound and readable?
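
To make the idea of reusable, logically separated code concrete, here is a minimal sketch in Python. It is an illustration only, not this course's actual code: the function names `clean_records` and `train_model`, the sample data, and the "model" (a simple mean) are all hypothetical stand-ins. The point is that each task lives in its own small function, which could later be moved into its own module and imported by more than one project.

```python
# Hypothetical sketch: each pipeline task is a small, separately testable function
# instead of one long script. None of these names come from the course itself.

def clean_records(records):
    """Data processing step: drop records that contain a missing (None) value."""
    return [r for r in records if None not in r.values()]


def train_model(records):
    """Stand-in for model training: returns the mean of the 'y' field."""
    return sum(r["y"] for r in records) / len(records)


def main():
    # Toy data, invented for this example only.
    raw = [{"x": 1, "y": 2.0}, {"x": None, "y": 3.0}, {"x": 3, "y": 4.0}]
    model = train_model(clean_records(raw))
    print(model)


if __name__ == "__main__":
    main()
```

Because the processing and training steps are separate functions, a teammate can change or test one without reading the other, and either could be reused elsewhere by importing it.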

We already started the process when we decided on our directory structure. Here’s our directory tree.

ml_pipeline_tutorial/ ...
