Billboard 100 Music Performance
There are many reasons why a song may be on the Billboard 100 from popularity of the artist to the sound of the music.
Today it is harder than ever to keep track of the reasons behind a song’s rise to the top 100. With streaming services like Spotify, Pandora, Google Music, etc. increasing in popularity, the process has gotten harder.
What if there was a better way to predict a songs success based on its attributes? This is what we will be looking at using songs that entered Billboard 100 in the year 2000. You can download the data used by clicking here.
As a standard for using secondary data (data that was not collected by me), I will need to make a set of assumptions which comes with the risk of inaccurate analysis.
I will be making the following assumptions about this data:
- The data is accurate.
- The weeks are counted beginning the first day that the song entered the Billboard 100
- Lengths of the songs are displayed as minutes, seconds, milliseconds, –
- Example: 3,38,00 AM would mean that the song is 3 minutes 38 seconds long
- The rank of the song on any given week is correct.
- An asterisk means that the song did not chart for that week
This is what the data looks like. There are columns for the year (2000), name of the artist (last name first), name of song, length of song, genre, date entered Billboard 100, date peaked (highest charting), and the weeks with song rank (lower is better).
| year | artist.inverted | track | time | genre | date.entered | date.peaked | x1st.week | x2nd.week | x3rd.week | ... | x75th.week | x76th.week |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 2000 | Destiny's Child | Independent Women Part I | 3,38,00 AM | Rock | September 23, 2000 | November 18, 2000 | 78 | 63 | 49 | ... | * | * |
| 2000 | Santana | Maria, Maria | 4,18,00 AM | Rock | February 12, 2000 | April 8, 2000 | 15 | 8 | 6 | ... | * | * |
| 2000 | Savage Garden | I Knew I Loved You | 4,07,00 AM | Rock | October 23, 1999 | January 29, 2000 | 71 | 48 | 43 | ... | * | * |
| 2000 | Madonna | Music | 3,45,00 AM | Rock | August 12, 2000 | September 16, 2000 | 41 | 23 | 18 | ... | * | * |
| 2000 | Aguilera, Christina | Come On Over Baby (All I Want Is You) | 3,38,00 AM | Rock | August 5, 2000 | October 14, 2000 | 57 | 47 | 45 | ... | * | * |
The first thing that peaked my curiosity was the song length. Knowing that the songs that chart on the Billboard 100 are often played on the radio, I concluded that songs that are shorter in length would be more popular. With that in mind I decided to explore this part of the data.
My hypothesis was that if a song is shorter than 4 minutes, it will remain on the billboard 100 for a longer time than a song that is 4 minutes or longer.
This will make the following my null and alternate hypothesis. H0 = There is no relationship between song length and number of weeks on Billboard H1 = There is a negative relationship between song length and number of weeks on Billboard
Visualizing the data
Looking at histograms of how many weeks songs stayed on the billboard, we can see that many of them only stayed for 20 weeks. Both for songs under and over 4 minutes in length.
Songs that are four minutes or less by week

Songs that are four minutes or more by week

The songs had a nice distribution, many being between the 3-4 minute mark.

The following scatterplot (with a regression line) demonstrates the relationship between song length and weeks on Billboard. It is easy to see that there is no relationship between these two variables.

Statistical Testing
One thing that could quickly confirm what our scatterplot showed was doing a t-test and looking at the p-value.
T statistic: 0.0489501719137
P-value: 0.461026146868
With a P-Value of 46.1% we can safely say that there is no relationship between the length of a song and its prolonged success. As a result, we have failed to reject our null hypothesis.
One more look
Before giving up on song length, I wanted to look at another relationship. I chose to look at how successful a song performed, it’s best rank on the Billboard 100, within all the weeks it was on the chart.
I created another scatterplot and just like before, there doesn’t seem to be any relationship between dong length and its popularity.

Wrap Up
Our visualizations and statistical tests show that there is no relationship between song length and number of weeks a song charts or how far it climbs on the Billboard 100. As a result, we failed to reject our null (H0) hypothesis. There are many other angles we can look at this problem in the future.
Some examples include;
- relationship between genre and best position on the chart,
- relationship between time of year and position on the chart,
- relationship between genre and length of time on the Billboard 100
Please leave any comments or questions you may have below!