Gain insights into Rust for real-world data, delving into data input, storage, analysis, and visualization. Explore web scraping, SQL, NoSQL databases, and machine learning with Rust.

rust-data-0.1.18.tar.gz

rust-basic

rust-basic-live

rust-basic-live-copy

Rust is a fast-growing, modern programming language oriented to system programming that guarantees memory and thread safety. It enables you to eliminate many classes of bugs at compile-time. This course covers all the skills needed to work with real-world data using Rust. 

In this course, you’ll learn all the basics of intermediate Rust programming. You’ll discover and master the five data-related skills: data input, storage, serving data, analyzing data, and data visualization. You’ll learn to ingest data from various formats, like CSV, JSON, web APIs, and web scraping. You’ll also learn to store the data using Redis, an SQL server, or a NoSQL database like MongoDB. Finally, you will learn how to present your data in meaningful visualizations, web maps, and reports. 

After taking this course, you’ll be able to approach data using Rust with confidence. You’ll be able to manage SQLite, ORM SQL, and NoSQL databases in Rust. You’ll learn the basics of web scraping, data analysis, and machine learning in Rust.

Processing Real-world Data Efficiently with Rust

# The basics of web scraping

Web scraping is the art of getting data from web pages. The difference between scraping and polling a web service is that web pages are meant to be seen by humans, while web services are for machines.

How, then, can we teach machines to read data meant for humans?


## The need for hooks

Apart from some applications of artificial intelligence, machines have to be guided to retrieve data meant to be visualized on a page. We need to use some tricks and hooks to allow the program to navigate a page and recognize data.

When we create web pages, we typically define them with parts that are all formatted the same way. For this reason, we usually assign CSS classes for consistency across the whole website.


> **Tip**: Our first hook is to look for CSS classes.

Sometimes web developers assign an element ID to some elements on the page. This is a unique identifier that, if present, is a powerful hook.


> **Tip**: The second hook to consider is the element `id`.


With these two hooks, we can already scrape a lot of different pages.




## A small example

Let’s try to scrape the Hacker News site.

The [Hacker News](https://news.ycombinator.com/) homepage is a collection of articles. Let’s say we want to create a list of all the current article titles.

We need to find the right hooks in the web page’s source code to run our scrapers.

We can see that the site is composed of one large table with some other nested tables. The table that contains the stories is coded as follows:

``` html
<table class=“itemlist” ... >
...
</table>
```

Within that table, some rows are marked with a class called `athing`. These are the rows containing the titles of the stories, located underneath an `<a>` tag.

```html
<tr class=“athing” id="29059499">
...
</tr>
```

The titles themselves are inside a `<td>` marked with the class `title`.


```html
<td class=“title”>
    <a href="..." class=“titlelink”> ... </a>
    ...
</td>
```

**Summary**: 

First, we need to find the rows marked with the class `athing`, which is our hook. Then we need to find the `<a>` tags inside each of these rows to extrapolate their content.

## A small example

Let’s try to scrape the Hacker News site.

The [Hacker News](https://news.ycombinator.com/) homepage is a collection of articles. Let’s say we want to create a list of all the current article titles.

We need to find the right hooks in the web page’s source code to run our scrapers.

We can see that the site is composed of one large table with some other nested tables. The table that contains the stories is coded as follows:

``` html
<table ... >
...
</table>
```

Within that table, some rows are marked with a class called `athing`. These are the rows containing the titles of the stories, located underneath an `<a>` tag.

```html
<tr id="29059499">
...
</tr>
```

The titles themselves are inside a `<td>` marked with the class `title`.


```html
<td>
    <a href="..."> ... </a>
    ...
</td>
```

**Summary**: 

First, we need to find the rows marked with the class `athing`, which is our hook. Then we need to find the `<a>` tags inside each of these rows to extrapolate their content.

Before We Begin

Rust Data Structures

Rust Data Structures

Basics of Functional Programming

Functional Programming in Rust

Data Skill: Input Data

Input Data with Rust

Web Scraping: Getting Products and Prices from an E-commerce Site

Data Skill: Store Data

Storing Data in Rust

Create a Simple Payroll Management System

Data Skill: Serve Data

Serving Data in Rust

Build a Server Backed Up by a Database

Data Skill: Analyze Data

Assessment on Analyze Data

ML Basics: Classification of the Iris Data Set

Machine Learning and the MNIST Digit DataSet

Data Skill: Dataviz and Storytelling

Data Viz and Storytelling in Rust

Create Infographics in Rust

Some Parting Words

The Basics of Web Scraping

The basics of web scraping

The need for hooks

A small example