Linear regression with several unknowns

So far, we’ve worked with examples of linear regression consisting of only one unknown. However, systems from real-life situations are typically complex, with several unknowns. In this lesson, we’ll work with a more involved example of a real data setA. Tsanas, A. Xifara: ‘Accurate quantitative estimation of energy performance of residential buildings using statistical machine learning tools’, Energy and Buildings, Vol. 49, pp. 560-567, 2012.

Context

The task at hand is to find a relationship that determines the amount of heating or cooling required by a building, given different features of the building like roof area, wall area, and glazing.

The data set consists of 88 different features, namely X1 to X8, of 768768 different buildings, along with their heating and cooling loads, Y1 and Y2, respectively. Below is a representation of the first five data records and the header information.

Get hands-on with 1400+ tech skills courses.