Differentiate between wide and tall data formats?

Differentiate between wide and tall data formats?

In a wide data format, you would have a Data Set with:

  1. One Single row for EVERY data point
  2. Multiple columns to represent different attributes of a data point

For the use case of classroom, this is what a wide data set could look like:

Image for post

Wide Data Set

As you can see above, each student has a single row, with different columns showing their scores in different subjects.

Suppose you want to consolidate all the marks into a single column. For this purpose you would need a Tall Data Set, which would look like this:

Image for post

Tall Data Set