Vector Space in Data Science
Learn how vector spaces are relevant in data science.
From abstract to real vector space
All data (text, images, videos, sound) is made up of data points or samples, which can be represented as column vectors having real numbers. That said, we already know that all d-dimensional vectors compose the vector space over the field . So, we can say that is the most popular vector space used in data science because all data is represented using vectors that span . Here, represents the dimension of the vector space, .
Image is a point in space
Consider a matrix of intensities of a grayscale image shown in the figure below. Each intensity value can be considered a feature of the image, so we can say that it has a total of 16 features. We can represent these features as a column vector in by flattening the image, say, row-wise. Even if we don’t flatten the image, it’s still a vector/point in the space of matrices denoted by .
Get hands-on with 1400+ tech skills courses.