Analyzing the Epidemiological Outbreak of COVID-19: A Visual Exploratory Data Analysis (EDA) Approach

This data visualization repository was developed solely for the research article named here. It is for visualization purposes only, and all visualizations are built from data sources provided by different organizations. We present an approach to visualize and analyze the data recorded between 22 January 2020 and 16 February 2020.

Visual Exploratory Data Analysis of COVID-19, caused by SARS-CoV-2

Samrat Kumar Dey, Md. Mahbubur Rahman, Umme Raihan Siddiqi, and Arpita Howlader

In [7]:
   Province/State  Country/Region       Lat      Long  1/22/20  1/23/20  1/24/20  1/25/20  1/26/20  1/27/20  ...  2/7/20  2/8/20  2/9/20  2/10/20  2/11/20  2/12/20  2/13/20  2/14/20  2/15/20  2/16/20
0           Anhui  Mainland China  31.82571  117.2264        1        9       15       39       60       70  ...     665     733     779      830      860      889      910      934      950      962
1         Beijing  Mainland China  40.18238  116.4142       14       22       36       41       68       80  ...     297     315     326      337      342      352      366      372      375      380
2       Chongqing  Mainland China  30.05718  107.8740        6        9       27       57       75      110  ...     426     428     468      486      505      518      529      537      544      551
3          Fujian  Mainland China  26.07783  117.9895        1        5       10       18       35       59  ...     224     239     250      261      267      272      279      281      285      287
4           Gansu  Mainland China  36.06110  103.8343        0        2        2        4        7       14  ...      67      79      83       83       86       87       90       90       90       90

5 rows × 30 columns

In [8]:
Index(['Province/State', 'Country/Region', 'Lat', 'Long', '1/22/20', '1/23/20',
       '1/24/20', '1/25/20', '1/26/20', '1/27/20', '1/28/20', '1/29/20',
       '1/30/20', '1/31/20', '2/01/20', '2/02/20', '2/03/20', '2/04/20',
       '2/05/20', '2/06/20', '2/07/20', '2/08/20', '2/09/20', '2/10/20',
       '2/11/20', '2/12/20', '2/13/20', '2/14/20', '2/15/20', '2/16/20'],
      dtype='object')

In [11]:
   Province/State  Country/Region       Lat      Long  1/22/20  1/23/20  1/24/20  1/25/20  1/26/20  1/27/20  ...  2/07/20  2/08/20  2/09/20  2/10/20  2/11/20  2/12/20  2/13/20  2/14/20  2/15/20  2/16/20
0           Anhui  Mainland China  31.82571  117.2264      0.0      0.0      0.0      0.0      0.0      0.0  ...     47.0     59.0     72.0     88.0    105.0    127.0    157.0      193      221      255
1         Beijing  Mainland China  40.18238  116.4142      0.0      0.0      1.0      2.0      2.0      2.0  ...     33.0     34.0     37.0     44.0     48.0     56.0     69.0       80       98      108
2       Chongqing  Mainland China  30.05718  107.8740      0.0      0.0      0.0      0.0      0.0      0.0  ...     31.0     39.0     51.0     66.0     79.0    102.0    128.0      152      184      207
3          Fujian  Mainland China  26.07783  117.9895      0.0      0.0      0.0      0.0      0.0      0.0  ...     20.0     24.0     35.0     39.0     45.0     53.0     57.0       63       71       82
4           Gansu  Mainland China  36.06110  103.8343      0.0      0.0      0.0      0.0      0.0      0.0  ...      9.0     12.0     16.0     17.0     24.0     31.0     39.0       39       49       54

5 rows × 30 columns

In [12]:
   Province/State  Country/Region       Lat      Long     Date  Confirmed  Deaths  Recovered
0           Anhui  Mainland China  31.82571  117.2264  1/22/20          1       0        0.0
1         Beijing  Mainland China  40.18238  116.4142  1/22/20         14       0        0.0
2       Chongqing  Mainland China  30.05718  107.8740  1/22/20          6       0        0.0
3          Fujian  Mainland China  26.07783  117.9895  1/22/20          1       0        0.0
4           Gansu  Mainland China  36.06110  103.8343  1/22/20          0       0        0.0
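
The long-format table above is the kind of result a wide-to-long reshape produces. The following is a minimal sketch, not the notebook's own code: it assumes pandas and uses `melt` and `merge`, and the names `to_long`, `confirmed_wide`, and `recovered_wide` are hypothetical. The sample values are the 1/22/20 and 1/23/20 entries for Anhui and Beijing from the wide tables above; the deaths table is left out because its wide form is not shown.

```python
import pandas as pd

# Columns that identify a place; every other column is a date.
id_cols = ["Province/State", "Country/Region", "Lat", "Long"]

# Two-row stand-ins for the wide tables (values copied from the outputs above).
places = {
    "Province/State": ["Anhui", "Beijing"],
    "Country/Region": ["Mainland China", "Mainland China"],
    "Lat": [31.82571, 40.18238],
    "Long": [117.2264, 116.4142],
}
confirmed_wide = pd.DataFrame({**places, "1/22/20": [1, 14], "1/23/20": [9, 22]})
recovered_wide = pd.DataFrame({**places, "1/22/20": [0.0, 0.0], "1/23/20": [0.0, 0.0]})


def to_long(wide, value_name):
    """Turn a one-column-per-date table into one row per (place, date)."""
    return wide.melt(id_vars=id_cols, var_name="Date", value_name=value_name)


# Reshape each measure separately, then join them on place and date.
long_df = to_long(confirmed_wide, "Confirmed").merge(
    to_long(recovered_wide, "Recovered"), on=id_cols + ["Date"]
)
print(long_df)
```

Reshaping each measure on its own and joining afterwards keeps the step easy to check: every wide table must contribute exactly one row per place and date.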

Data Cleaning and Preprocessing

In [14]:
   Province/State  Country/Region       Lat      Long        Date  Confirmed  Deaths  Recovered  Deaths to Confirmed  Recovered to Confirmed
0           Anhui           China  31.82571  117.2264  2020-01-22          1       0        0.0                  0.0                     0.0
1         Beijing           China  40.18238  116.4142  2020-01-22         14       0        0.0                  0.0                     0.0
2       Chongqing           China  30.05718  107.8740  2020-01-22          6       0        0.0                  0.0                     0.0
3          Fujian           China  26.07783  117.9895  2020-01-22          1       0        0.0                  0.0                     0.0
4           Gansu           China  36.06110  103.8343  2020-01-22          0       0        0.0                  NaN                     NaN
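
Compared with the previous table, the cleaned output shows three changes: "Mainland China" became "China", the dates are parsed, and two ratio columns were added. The sketch below reproduces those steps under the assumption that pandas was used; it is an illustration, not the authors' code. It also shows why Gansu has `NaN` ratios: with zero confirmed cases the ratio is 0/0, which pandas reports as missing.

```python
import pandas as pd

# Three rows from the long-format table above (1/22/20 values).
df = pd.DataFrame({
    "Province/State": ["Anhui", "Beijing", "Gansu"],
    "Country/Region": ["Mainland China"] * 3,
    "Date": ["1/22/20"] * 3,
    "Confirmed": [1, 14, 0],
    "Deaths": [0, 0, 0],
    "Recovered": [0.0, 0.0, 0.0],
})

# Use one consistent country name.
df["Country/Region"] = df["Country/Region"].replace("Mainland China", "China")

# Parse month/day/two-digit-year strings into real timestamps.
df["Date"] = pd.to_datetime(df["Date"], format="%m/%d/%y")

# Element-wise ratios; a row with 0 confirmed cases gives 0/0, i.e. NaN.
df["Deaths to Confirmed"] = df["Deaths"] / df["Confirmed"]
df["Recovered to Confirmed"] = df["Recovered"] / df["Confirmed"]

print(df)
```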

Exploratory Data Analysis (EDA)

Countries with most reported cases (as of 16 February 2020)

  • Far more cases are reported in Mainland China than in the rest of the world
  • The next few countries are in fact neighbours of China

Number of Countries/Regions to which COVID-19 spread

In [18]:

Provinces in China with most reported cases

In [19]:
Province/State Confirmed
0 Hubei 58182
1 Guangdong 1316
2 Henan 1231
3 Zhejiang 1167
4 Hunan 1004
  • Even within China, most of the reported cases come from a single province, Hubei.
  • This is no surprise, because Hubei's capital is Wuhan, where the first cases were reported.
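
A ranking like the one above can be sketched as follows, assuming the counts are cumulative so that only the latest date is used rather than a sum over all dates. The frame name `china` is hypothetical, and the sample rows are the 2/15/20 and 2/16/20 values for three provinces from the first table, so the output differs from the full ranking.

```python
import pandas as pd

# Long-format sample: (province, date, cumulative confirmed cases).
rows = [
    ("Anhui", "2020-02-15", 950), ("Anhui", "2020-02-16", 962),
    ("Beijing", "2020-02-15", 375), ("Beijing", "2020-02-16", 380),
    ("Chongqing", "2020-02-15", 544), ("Chongqing", "2020-02-16", 551),
]
china = pd.DataFrame(rows, columns=["Province/State", "Date", "Confirmed"])
china["Date"] = pd.to_datetime(china["Date"])

# Keep only the most recent date, because the counts are cumulative.
latest = china[china["Date"] == china["Date"].max()]

# One row per province, largest count first.
top = (
    latest.groupby("Province/State", as_index=False)["Confirmed"].sum()
    .sort_values("Confirmed", ascending=False)
    .reset_index(drop=True)
)
print(top)
```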

Number of Province/State in China to which COVID-19 spread

In [20]:

Countries with deaths reported

In [21]:
Country/Region Deaths
0 China 1765
1 Taiwan 1
2 France 1
3 Japan 1
4 Hong Kong 1
5 Philippines 1
  • Outside China, few deaths due to COVID-19 have been reported
  • Only 5 deaths are reported outside mainland China, one in each of the other countries/regions listed above (3 if Taiwan and Hong Kong are counted with China)
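
The deaths table above amounts to a filter on the latest per-country totals. This is a hedged sketch with a hypothetical frame named `latest`; the death counts are taken from tables in this notebook, and Italy and India are included only to show rows that the filter drops.

```python
import pandas as pd

# Latest totals per country/region (deaths only).
latest = pd.DataFrame({
    "Country/Region": ["China", "Taiwan", "France", "Japan", "Hong Kong",
                       "Philippines", "Italy", "India"],
    "Deaths": [1765, 1, 1, 1, 1, 1, 0, 0],
})

# Keep rows with at least one death, largest count first.
with_deaths = (
    latest[latest["Deaths"] > 0]
    .sort_values("Deaths", ascending=False, kind="stable")
    .reset_index(drop=True)
)
print(with_deaths)
```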

Deaths to recovered cases

In [23]:
Deaths Recovered
25 1770 10865
  • At this point, recovered cases outnumber deaths

Map View of Countries with Confirmed Cases and Deaths Reported

In [24]:

Visual Exploratory Data Analysis (V-EDA)

In [25]:

V-EDA of COVID-19 Spread over Time in China and Outside China

In [26]:

Number of Countries/Regions to which COVID-19 spread over time

Countries with no recovered cases

In [39]:
Country/Region Confirmed Deaths Recovered
0 Italy 3 0 0
1 Belgium 1 0 0
2 Egypt 1 0 0
3 Sweden 1 0 0

Countries with no active cases remaining

In [40]:
Country/Region Confirmed Deaths Recovered
0 India 3 0 3
1 Russia 2 0 2
2 Spain 2 0 2
3 Cambodia 1 0 1
4 Finland 1 0 1
5 Nepal 1 0 1
6 Sri Lanka 1 0 1

Countries with all the cases recovered

In [41]:
Country/Region Confirmed Recovered
0 India 3 3
1 Russia 2 2
2 Spain 2 2
3 Cambodia 1 1
4 Finland 1 1
5 Nepal 1 1
6 Sri Lanka 1 1
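
The recovery tables above can be read as row filters on the latest per-country totals. The sketch below is one possible way to express them in pandas, not the notebook's own code; the frame name `latest` is hypothetical and the rows are copied from the outputs above.

```python
import pandas as pd

# Latest totals for a few of the countries listed above.
latest = pd.DataFrame({
    "Country/Region": ["Italy", "Belgium", "Egypt", "Sweden", "India", "Russia", "Spain"],
    "Confirmed": [3, 1, 1, 1, 3, 2, 2],
    "Deaths": [0, 0, 0, 0, 0, 0, 0],
    "Recovered": [0, 0, 0, 0, 3, 2, 2],
})

# Countries where nobody has recovered yet.
no_recovered = latest[latest["Recovered"] == 0]

# Countries where every confirmed case has recovered.
all_recovered = latest[latest["Confirmed"] == latest["Recovered"]]

print(no_recovered[["Country/Region", "Confirmed", "Deaths", "Recovered"]])
print(all_recovered[["Country/Region", "Confirmed", "Recovered"]])
```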

Hubei-Other provinces-World

In [87]:
In [27]:

Although COVID-19 spread to all the provinces of China quickly and early, the number of countries to which COVID-19 has spread has not increased after the first few weeks.
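
One way to check how quickly the outbreak reached new places is to count, for each date, the places with at least one confirmed case. The sketch below does this for five provinces, using the 1/22/20 to 1/24/20 values from the first table; the same pattern would apply to the `Country/Region` column. It assumes pandas, and the names `wide`, `long_df`, and `affected` are hypothetical.

```python
import pandas as pd

# Confirmed cases for five provinces on the first three dates.
wide = pd.DataFrame({
    "Province/State": ["Anhui", "Beijing", "Chongqing", "Fujian", "Gansu"],
    "1/22/20": [1, 14, 6, 1, 0],
    "1/23/20": [9, 22, 9, 5, 2],
    "1/24/20": [15, 36, 27, 10, 2],
})

# One row per (province, date), with parsed dates so they sort correctly.
long_df = wide.melt(id_vars="Province/State", var_name="Date", value_name="Confirmed")
long_df["Date"] = pd.to_datetime(long_df["Date"], format="%m/%d/%y")

# Count distinct provinces with at least one confirmed case on each date.
affected = (
    long_df[long_df["Confirmed"] > 0]
    .groupby("Date")["Province/State"]
    .nunique()
)
print(affected)
```

In this sample Gansu reports its first cases on 1/23/20, so the count rises from 4 to 5 and then stays flat.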

Number of Cases in China and Outside China

In [28]:

Number of Confirmed, Deaths and Recovered Cases

In [29]:
In [30]:

Number of Confirmed, Deaths and Recovered Cases Outside China

In [31]:
In [32]:

Tree Map Views of different cases in Chinese provinces and outside China

In [33]:
In [34]:

Samrat Kumar Dey and Md. Mahbubur Rahman