Skip to content

Commit 236c4e4

Browse files
committed
Playground and readme file completed
1 parent b4e2c87 commit 236c4e4

File tree

5 files changed

+185
-11
lines changed

5 files changed

+185
-11
lines changed

Linear Regression/Images/graph1.png

22.4 KB
Loading

Linear Regression/Images/graph2.png

42.5 KB
Loading

Linear Regression/Images/graph3.png

30.7 KB
Loading

Linear Regression/LinearRegression.playground/Contents.swift

+37-10
Original file line numberDiff line numberDiff line change
@@ -4,20 +4,47 @@ import Foundation
44

55
let carAge: [Double] = [10, 8, 3, 3, 2, 1]
66
let carPrice: [Double] = [500, 400, 7000, 8500, 11000, 10500]
7-
var intercept = 10000.0
8-
var slope = -1000.0
9-
let alpha = 0.0001
10-
let numberOfCarAdvertsWeSaw = carPrice.count
7+
var intercept = 0.0
8+
var slope = 0.0
119

12-
func h(carAge: Double) -> Double {
10+
func predictedCarPrice(carAge: Double) -> Double {
1311
return intercept + slope * carAge
1412
}
1513

16-
for n in 1...10000 {
17-
for i in 1...numberOfCarAdvertsWeSaw {
18-
intercept += alpha * (carPrice[i-1] - h(carAge[i-1])) * 1
19-
slope += alpha * (carPrice[i-1] - h(carAge[i-1])) * carAge[i-1]
14+
// An iterative approach
15+
16+
let numberOfCarAdvertsWeSaw = carPrice.count - 1
17+
let numberOfIterations = 100
18+
let alpha = 0.0001
19+
20+
for n in 1...numberOfIterations {
21+
for i in 0...numberOfCarAdvertsWeSaw {
22+
let difference = carPrice[i] - predictedCarPrice(carAge[i])
23+
intercept += alpha * difference
24+
slope += alpha * difference * carAge[i]
2025
}
2126
}
2227

23-
print("A car age 4 years is predicted to be worth £\(Int(h(4)))")
28+
print("A car age of 4 years is predicted to be worth £\(Int(predictedCarPrice(4)))")
29+
30+
// A closed form solution
31+
32+
func average(input: [Double]) -> Double {
33+
return input.reduce(0, combine: +) / Double(input.count)
34+
}
35+
36+
func multiply(input1: [Double], _ input2: [Double]) -> [Double] {
37+
return input1.enumerate().map({ (index, element) in return element*input2[index] })
38+
}
39+
40+
func linearRegression(xVariable: [Double], _ yVariable: [Double]) -> (Double -> Double) {
41+
let sum1 = average(multiply(xVariable, yVariable)) - average(xVariable) * average(yVariable)
42+
let sum2 = average(multiply(xVariable, xVariable)) - pow(average(xVariable), 2)
43+
let slope = sum1 / sum2
44+
let intercept = average(yVariable) - slope * average(xVariable)
45+
return { intercept + slope * $0 }
46+
}
47+
48+
let result = linearRegression(carAge, carPrice)(4)
49+
50+
print("A car of age 4 years is predicted to be worth £\(Int(result))")

Linear Regression/README.markdown

+148-1
Original file line numberDiff line numberDiff line change
@@ -1,5 +1,4 @@
11
# Linear Regression
2-
## First draft version of this document
32

43
Linear regression is a technique for creating a model of the relationship between two (or more) variable quantities.
54

@@ -16,4 +15,152 @@ Age (in years)| Price (in £)
1615

1716
Our car is 4 years old. How can we set a price for our car based on the data in this table?
1817

18+
Let's start by looking at the data plotted out:
19+
20+
![graph1](Images/graph1.png)
21+
22+
We could imagine a straight line drawn through the points on this graph. It's not (in this case) going to go exactly through every point, but we could place the line so that it goes as close to all the points as possible.
23+
24+
To say this in another way, we want to make the distance from the line to each point as small as possible. This is most often done by minimising the square of the distance from the line to each point.
25+
26+
We can describe the straight line in terms of two variables:
27+
28+
1. The point at which it crosses the y-axis i.e. the predicted price of a brand new car. This is the *intercept*.
29+
2. The *slope* of the line - i.e. for every year of age, how much does the price change.
30+
31+
This is the equation for our line:
32+
33+
$$
34+
carPrice = slope \times carAge + intercept
35+
$$
36+
37+
How can we find the best values for the intercept and the slope? Let's look at two different ways to do this.
38+
39+
## An iterative approach
40+
One approach is to start with some arbitrary values for the intercept and the slope. We work out what small changes we make to these values to move our line closer to the data points. Then we repeat this multiple times. Eventually our line will approach the optimum position.
41+
42+
First let's set up our data structures. We will use two Swift arrays for the car age and the car price:
43+
44+
```swift
45+
let carAge: [Double] = [10, 8, 3, 3, 2, 1]
46+
let carPrice: [Double] = [500, 400, 7000, 8500, 11000, 10500]
47+
```
48+
49+
This is how we can represent our straight line:
50+
51+
```swift
52+
var intercept = 0.0
53+
var slope = 0.0
54+
func predictedCarPrice(carAge: Double) -> Double {
55+
return intercept + slope * carAge
56+
}
57+
```
58+
Now for the code which will perform the iterations:
59+
60+
```swift
61+
let numberOfCarAdvertsWeSaw = carPrice.count - 1
62+
let iterations = 2000
63+
let alpha = 0.0001
64+
65+
for n in 1...iterations {
66+
for i in 0...numberOfCarAdvertsWeSaw {
67+
let difference = carPrice[i] - predictedCarPrice(carAge[i])
68+
intercept += alpha * difference
69+
slope += alpha * difference * carAge[i]
70+
}
71+
}
72+
```
73+
74+
```alpha``` is a factor that determines how much closer we move to the correct solution with each iteration. If this factor is too large then our program will not converge on the correct solution.
75+
76+
The program loops through each data point (each car age and car price). For each data point it adjusts the intercept and the slope to bring them closer to the correct values. The equations used in the code to adjust the intercept and the slope are based on moving in the direction of the maximal reduction of these variables. This is a *gradient descent*.
77+
78+
We want to minimse the square of the distance between the line and the points. Let's define a function J which represents this distance - for simplicity we consider only one point here:
79+
80+
$$
81+
J \propto ((slope.carAge+intercept) - carPrice))^2
82+
$$
83+
84+
In order to move in the direction of maximal reduction, we take the partial derivative of this function with respect to the slope:
85+
86+
$$
87+
\frac{\partial J}{\partial (slope)} \propto (slope.carAge+intercept) - carPrice).carAge
88+
$$
89+
90+
And similarly for the intercept:
91+
92+
$$
93+
\frac{\partial J}{\partial (intercept)} \propto
94+
(slope.carAge+intercept) - carPrice)
95+
$$
96+
97+
We multiply these derivatives by our factor alpha and then use them to adjust the values of slope and intercept on each iteration.
98+
99+
Looking at the code, it intuitively makes sense - the larger the difference between the current predicted car Price and the actual car price, and the larger the value of ```alpha```, the greater the adjustments to the intercept and the slope.
100+
101+
It can take a lot of iterations to approach the ideal values. Let's look at how the intercept and slope change as we increase the number of iterations:
102+
103+
Iterations | Intercept | Slope | Predicted value of a 4 year old car
104+
:---------:|:---------:|:-----:|:------------------------:
105+
0 | 0 | 0 | 0
106+
2000 | 4112 | -113 | 3659
107+
6000 | 8564 | -764 | 5507
108+
10000 | 10517 | -1049 | 6318
109+
14000 | 11374 | -1175 | 6673
110+
18000 | 11750 | -1230 | 6829
111+
112+
Here is the same data shown as a graph. Each of the blue lines on the graph represents a row in the table above.
113+
114+
![graph2](Images/graph2.png)
115+
116+
After 18,000 iterations it looks as if the line is getting closer to what we would expect (just by looking) to be the correct line of best fit. Also, each additional 2,000 iterations has less and less effect on the final result - the values of the intercept and the slope are converging on the correct values.
117+
118+
##A closed form solution
119+
120+
There is another way we can calculate the line of best fit, without having to do multiple iterations. We can solve the equations describing the least squares minimisation and just work out the intercept and slope directly.
121+
122+
First we need some helper functions. This one calculates the average (the mean) of an array of Doubles:
123+
124+
```swift
125+
func average(input: [Double]) -> Double {
126+
return input.reduce(0, combine: +) / Double(input.count)
127+
}
128+
```
129+
We are using the ```reduce``` Swift function to sum up all the elements of the array, and then divide that by the number of elements. This gives us the mean value.
130+
131+
We also need to be able to multiply each element in an array by the corresponding element in another array, to create a new array. Here is a function which will do this:
132+
133+
```swift
134+
func multiply(input1: [Double], _ input2: [Double]) -> [Double] {
135+
return input1.enumerate().map({ (index, element) in return element*input2[index] })
136+
}
137+
```
138+
139+
We are using the ```map``` function to multiply each element.
140+
141+
Finally, the function which fits the line to the data:
142+
143+
```swift
144+
func linearRegression(xVariable: [Double], _ yVariable: [Double]) -> (Double -> Double) {
145+
let sum1 = average(multiply(xVariable, yVariable)) - average(xVariable) * average(yVariable)
146+
let sum2 = average(multiply(xVariable, xVariable)) - pow(average(xVariable), 2)
147+
let slope = sum1 / sum2
148+
let intercept = average(yVariable) - slope * average(xVariable)
149+
return { intercept + slope * $0 }
150+
}
151+
```
152+
This function takes as arguments two arrays of Doubles, and returns a function which is the line of best fit. The formulas to calculate the slope and the intercept can be derived from our definition of the function J. Let's see how the output from this line fits our data:
153+
154+
![graph3](Images/graph3.png)
155+
156+
Using this line, we would predict a price for our 4 year old car of £6952.
157+
158+
159+
##Summary
160+
We've seen two different ways to implement a simple linear regression in Swift. An obvious question is: why bother with the iterative approach at all?
161+
162+
Well, the line we've found doesn't fit the data perfectly. For one thing, the graph includes some negative values at high car ages! Possibly we would have to pay someone to tow away a very old car... but really these negative values just show that we have not modelled the real life situation very accurately. The relationship between the car age and the car price is not linear but instead is some other function. We also know that a car's price is not just related to its age but also other factors such as the make, model and engine size of the car. We would need to use additional variables to describe these other factors.
163+
164+
It turns out that in some of these more complicated models, the iterative approach is the only viable or efficient approach. This can also occur when the arrays of data are very large and may be sparsely populated with data values.
165+
19166
*Written for Swift Algorithm Club by James Harrop*

0 commit comments

Comments
 (0)