A Beginner’s Guide to TensorFlow: Building Machine Learning Model

Home/

Blog/

Machine Learning/

7 mins read

Jan 15, 2024

Content

Getting started with TensorFlow

Building an ML model

Creating a simple neural network

Assigning weights and bias

Applying the activation function

Defining the loss function and optimizer

Training the model

Validating the model

Next steps

The “divide and conquer” rule can make solving big and complicated tasks a breeze. Imagine we have a big puzzle and access to good friends who are willing to help. Each friend can solve a small part of the puzzle, and these small parts are combined once finished. This approach solves the problem in smaller steps to obtain the final result. In terms of machine learning, solving a large task is quite a resource-consuming and time-taking task. TensorFlow breaks a complex machine learning task into smaller tasks with each task processed in parallel. This makes the overall process efficient.

TensorFlow is an open-source framework for implementing machine learning models. Additionally, the advent of deep learning-based neural networks has made TensorFlow more in demand in various domains like natural language processing, image, and audio recognition, etc. If you’re new to TensorFlow and want to train and evaluate ML and DNN models, you’re in the right place. This blog provides a step-by-step guide to TensorFlow on building machine learning (ML) and deep neural network (DNN) models using this powerful framework.

Getting started with TensorFlow#

A tensor is a multi-dimensional data structure representing the input, output, and intermediate data in TensorFlow computations. In TensorFlow, a graph represents a computation with nodes and edges. A node represents operations, and edges represent the data flow between these operations.

Building an ML model#

A machine learning model consists of a neural network with interconnected nodes organized into layers. Each node takes input from multiple nodes from the previous layer. Let’s take an example of a simple graph with an “add” operation as shown below:

Creating a simple neural network#

Let’s build a simple neural network to implement an XOR logic gate with the following structure:

An input layer with two nodes representing the two inputs of the XOR gate.
A hidden layer with two nodes. This layer is called the hidden layer since it is not directly observable or accessible in terms of the input or output of the network.
An output layer with one node representing the network output.

The input-output relation of an XOR logic gate is depicted in the table below:

# Define placeholders for input and output
x = tf.placeholder(tf.float32, shape=[None, input_dim], name='x')
y = tf.placeholder(tf.float32, shape=[None, output_dim], name='y')
# Define variables for weights and biases of the hidden layer
W_hidden = tf.Variable(tf.random_normal([input_dim, hidden_dim]), name='W_hidden')
b_hidden = tf.Variable(tf.zeros([hidden_dim]), name='b_hidden')
# Define variables for weights and biases of the output layer
W_output = tf.Variable(tf.random_normal([hidden_dim, output_dim]), name='W_output')
b_output = tf.Variable(tf.zeros([output_dim]), name='b_output')

As an example, the output of the first node in the hidden layer will be calculated as follows:

\text{hidden \ 1} = \text{input\ 1} \times w1 + \text{input\ 2} \times w2 + b_1

Applying the activation function#

An activation function is applied to the weighted sum of inputs and biases of the node to produce an output of the node. An XOR logic gate is a non-linear operation, and we must introduce non-linearity into the model to make the network learn a non-linear XOR logic. Hence, we use an activation function to introduce non-linearity.

For the hidden layer, we will use a sigmoid function as an activation function that provides a smooth and continuous non-linear transformation, and it is mathematically written as follows:

S(\text{hidden \ 1}) = \frac{1}{1 + e^{-(\text{hidden \ 1})}}

For the output layer, we perform a linear transformation of the hidden layer outputs, weights, and bias as follows:

\text{output} = \text{hidden\ 1} \times w5 + \text{hidden\ 2} \times w6 + b_3

Let’s apply these two activation functions:

Line 4: Applies the sigmoid activation function to the nodes in the hidden layer.
Line 6: Applies the linear transformation to calculate the network output.

Defining the loss function and optimizer#

Now that we have completed one pass of the network, we need to check if the network’s output matches the desired output. To do so, we define a loss function. A loss function measures the dissimilarity between the predicted and expected output. This helps us to quantify the model’s performance. We will use a mean squared error (MSE) as a loss function that calculates the average squared difference between the predicted and expected output values.

Based on the output of the loss function, we now need to update the model’s parameters, like weights and biases. We will use gradient descent optimization algorithms to find the optimum values of weights and biases. A gradient descent tunes the parameters in the direction of the steepest descent of the loss function.

Let’s implement the loss function and the optimization algorithm into our machine learning model:

Line 2: Defines the loss function to reduce the mean squared error between the output_layer and y. Here y is the expected output and output_layer is the predicted output.
Lines 5–6: Defines the optimization algorithm with parameter learning_rate. This defines the step size taken during each parameter update. A larger value of learning_rate can result in faster convergence but also risks overshooting the optimal value. Similarly, a smaller value of learning_rate results in slow convergence but is expected to provide a more precise result.

Training the model#

Now that everything is in place, it’s time to train our model using the training date defined in x_train and y_train:

# Step 1: Prepare the training data
x_train = np.array([[0, 0], [0, 1], [1, 0], [1, 1]], dtype=np.float32)  # Input features
y_train = np.array([[0], [1], [1], [0]], dtype=np.float32)  # Target outputs
# Step 5: Train the model
num_epochs = 1000
batch_size = 4
with tf.Session() as sess:
    sess.run(tf.global_variables_initializer())  # Initialize variables
    
    for epoch in range(num_epochs):
        # Generate random mini-batches
        indices = np.random.choice(len(x_train), batch_size, replace=False)
        x_batch = x_train[indices]
        y_batch = y_train[indices]
        
        # Run optimization operation
        _, current_loss = sess.run([train_op, loss], feed_dict={x: x_batch, y: y_batch})

The predicted output shows that our neural network has converged to the expected output of an XOR logic gate. The first and last predicted outputs are close to 0 and the second and third predicted outputs are close to 1.

Next steps#

This blog has briefly introduced TensorFlow and how we can build a machine learning model using tensors.

We encourage you to explore more activation and loss functions and practice building more complex machine learning models. You can also check out the following courses on Educative to learn machine learning:

Applied Machine Learning: Industry Case Study with TensorFlow

Applied Machine Learning: Industry Case Study with TensorFlow

In this course, you'll work on an industry-level machine learning project based on predicting weekly retail sales given different factors. You will learn the most efficient techniques used to train and evaluate scalable machine learning models. After completing this course, you will be able to take on industry-level machine learning projects, from data analysis to creating efficient models and providing results and insights. The code for this course is built around the TensorFlow framework, which is one of the premier frameworks for industry machine learning, and the Python pandas library for data analysis. Basic knowledge of Python and TensorFlow are prerequisites. To get some experience with TensorFlow, try our course: Machine Learning for Software Engineers. This course was created by AdaptiLab, a company specializing in evaluating, sourcing, and upskilling enterprise machine learning talent. It is built in collaboration with industry machine learning experts from Google, Microsoft, Amazon, and Apple.

3hrs

Intermediate

16 Challenges

2 Quizzes

Become a Machine Learning Engineer

Start your journey to becoming a machine learning engineer by mastering the fundamentals of coding with Python. Learn machine learning techniques, data manipulation, and visualization. As you progress, you'll explore object-oriented programming and the machine learning process, gaining hands-on experience with machine learning algorithms and tools like scikit-learn. Tackle practical projects, including predicting auto insurance payments and customer segmentation using K-means clustering. Finally, explore the deep learning models with convolutional neural networks and apply your skills to an AI-powered image colorization project.

105hrs

Beginner

12 Challenges

28 Quizzes

Machine Learning with NumPy, pandas, scikit-learn, and More

If you're a software engineer looking to add machine learning to your skillset, this is the place to start. This course will teach you to write useful code and create impactful machine learning applications immediately. From the start, you'll be given all the tools that you need to create industry-level machine learning projects. Rather than reading through dense theory, you’ll learn practical skills and gain actionable insights. Topics covered include data analysis/visualization, feature engineering, supervised learning, unsupervised learning, and deep learning. All of these topics are taught using industry-standard frameworks: NumPy, pandas, scikit-learn, XGBoost, TensorFlow, and Keras. Basic knowledge of Python is a prerequisite to this course. This course was created by AdaptiLab, a company specializing in evaluating, sourcing, and upskilling enterprise machine learning talent. It is built in collaboration with industry machine learning experts from Google, Microsoft, Amazon, and Apple.

15hrs

Intermediate

115 Challenges

8 Quizzes

Frequently Asked Questions

How do you make a ML model using TensorFlow?

Install TensorFlow: pip install tensorflow.
Import TensorFlow: import tensorflow as tf.
Load and preprocess data: Normalizing data, extracting features, and splitting it into training and testing sets.
Build the ML model: Define the architecture of your model. model = tf.keras.Sequential([ tf.keras.layers.Dense(128, activation=‘relu’, input_shape=(input_dim,)), tf.keras.layers.Dropout(0.2), tf.keras.layers.Dense(10, activation=‘softmax’) )]
Compile the model: Configure the optimizer, loss function, and metrics. model.compile(optimizer=‘adam’, loss=‘categorical_crossentropy’, metrics=[‘accuracy’])
Train the model: Train the model on training data. model.fit(X_train, y_train, epochs=10, batch_size=32, validation_data=(X_val, y_val))
Hyperparameter tuning: Adjust the number of epochs, batch size, learning rate, and other hyperparameters to improve the model.
Evaluate the Model: Once training is complete, evaluate the model. test_loss, test_accuracy = model.evaluate(X_test, y_test)
Make predictions: Use the trained model to make predictions on new data. predictions = model.predict(new_data)

Written By:

Najeeb Ul Hassan

New on Educative

Learn to Code

Learn any Language as a beginner

Develop a human edge in an AI powered world and learn to code with AI from our beginner friendly catalog

🎁 G i v e a w a y

30 Days of Code

Complete Educative’s daily coding challenge every day in September, and win exciting Prizes.

Free Resources

blog

Demystifying Fuzzy Inference Systems

blog

Introduction to convolutional neural networks (CNN)

blog

Bagging vs. Boosting in machine learning

A Beginner’s Guide to TensorFlow: Building Machine Learning Model

Getting started with TensorFlow#

Building an ML model#

Creating a simple neural network#

XOR Gate: Input-Output Relation

Assigning weights and bias#

Applying the activation function#

Defining the loss function and optimizer#

Training the model#

Validating the model#

Next steps#

Frequently Asked Questions

How do you make a ML model using TensorFlow?