You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
throughout this lesson. See [Setup](/sql-ecology-lesson/setup/) for
27
-
instructions on how to download the data, and also how to install and open
28
-
SQLite Manager.
23
+
See [Setup](../setup/) for
24
+
instructions on how to download the data, and also how to install jupyter notebook
29
25
30
26
## What is SQL?
31
27
@@ -36,7 +32,7 @@ These queries can allow you to perform a number of actions such as: insert, upda
36
32
37
33
> ## Think about this situation
38
34
>
39
-
> Imagine if you are a owner of a convenience store, and you are trying to record your soda purchase record <br>
35
+
> Imagine if you are a owner of a convenience store, and you are trying to record your soda purchase. <br>
40
36
> In each invoice, it contains the following information: <br>
41
37
> Invoice id, Date, Category, Soda name, Volume, Cost, Retail Price, Vendor, Number of bottle purchased <br>
42
38
> How would you store the data?
@@ -46,19 +42,20 @@ These queries can allow you to perform a number of actions such as: insert, upda
46
42
If you store all these invoice information in one Excel file,
47
43
What problem could raise from this approach?
48
44
<b>Data redundency:</b> <br>
49
-

45
+
{:width="700px"}
50
46
<br><br>
51
47
Imagine if you consistantly purchased some Big Dog Cola from LCDM Beverage vendor every day for 5 days,
52
48
Notice these columns: Category, Soda_name, Volume, Cost, Retail_Price, Vendor, Vendor number
53
49
With tranditional file approach, you have to record exact same information in these columns 5 times.
54
50
<b>Data inconsistancy:</b> <br>
55
-

51
+
{:width="700px"}
56
52
<br><br>
57
53
Imagine if the vendor changed its phone number. Then multiple changes has to be made.
58
54
There are only 5 rows so it might be easy to change everything. If the data size gets large, mistakes are likely to occur.
59
55
60
56
If you thought about storing these information in few different Excel files, <b>great idea! You are on the right track </b><br>
61
-
However, if you want information from all files at the same time, how do you combine them? If each file contains thoudans of rows, Ah...
57
+
However, if you want information from all files at the same time, how do you combine them? If each file contains thoudans of rows, Ah... <br>
58
+
The largest table that Excel can handle is 1,048,576 * 16,384. Excel can be very slow when working with large number of data. Moreover, in the real word, you can easily get 1 million rows of data...
0 commit comments