Managing Datasets
Explore how to create and manage datasets within Azure Machine Learning pipelines. Learn methods to upload data from local files, datastores, cloud storage, or public HTTPS sources and use YAML configuration for dataset creation.
Where is the data?
Data is everywhere. Our data can reside in multiple locations: local files, Azure datastores, or public HTTPS URLs. We can copy the files from multiple locations into Azure as datasets. These datasets can be used during training or when validating the ML models. In this lesson, we will discuss how to create Azure datasets.
Create datasets from the Azure Machine Learning studio
We can go to the homepage of the Azure Machine Learning studio and click “Create dataset.” We’ll get the option to choose the file location. Select the location, and the file gets uploaded to Azure as the dataset at “workspaceblobstore.”
Dataset from a local file using Azure CLI
Datasets can be created ...