(Global)
The total CRD cases due to the COVID-19 pandemic in India during the PLD, LD, and UL phases are shown in Table 2 and Figure 3 . Figure 2 shows the spatio-temporal distribution of confirmed cases in the states of India. In the PLD period, the highest case numbers were reported in KL followed by MH, KA, Telangana (TG), and UP. The numbers of confirmed cases increased ( Figure 2 ) in almost all states during the LD phases. The top five states (in chronological order) with the highest numbers of confirmed cases in each of the LD phases were: (1) LD1.0: MH with 2537 cases and acted like a epicenter, followed by DL, TN, Rajasthan (RJ), and Madhya Pradesh (MP); (2) LD2.0: MH, Gujarat (GJ), DL, MP, and UP; (3) LD3.0: MH, TN, GJ, DL, and RJ; and (4) LD4.0: MH, TN, DL, GJ, and RJ.
Confirmed COVID-19 cases during PLD, LD, and UL phases.
COVID-19 cases in India during LD and UL phases: (a) cumulative cases, (b) daily cases, and (c) deaths.
A rapid spatial spread of confirmed COVID-19 cases was observed in MH, TN, and DL during UL1.0 ( Figure 2 (b)), and it continued in other states such as KL, AP, and KA in the south, Punjab (PB) and UP in the north, and WB, Odisha (OD), and Bihar (BR) in the east until UL4.0 ( Figure 2 (c), (d), and (e)). Figure 2 (f) shows that slight decreases in the numbers of confirmed cases occurred during UL5.0 in all states. Infections increased rapidly due to the relaxation of interstate movements and the opening of market shops, business offices, industries, religious places, etc. Rapid increases in COVID-19 cases were observed in areas with high population densities, poor testing facilities without frequent and adequate testing, and changes in human behavior due to various reasons. The top five states with the maximum numbers of cases in all UL phases were: (1) UL1.0: MH, TN, DL, GJ, and UP; (2) UL2.0: MH, TN, AP, KA, and UP; (3) UL3.0: MH, AP, KA, TN, and UP; (4) UL4.0: MH, KA, AP, TN, and UP; and (5) UL5.0: MH, KL, KA, AP, and TN. MH was the epicenter for COVID-19 infections in India and the first reported confirmed cases occurred in this state during all LD and UL phases. Due to the efforts of the GoI, many testing sites were distributed evenly across states irrespective of the population density and healthcare facilities, and thus the infection rates and rate of spread could have been lower in the Indian states, as reflected in UL5.0.
Figure 3 shows the spread of COVID-19 where the infection rate was lower during the LD phases (LD1.0–LD4.0) but it increased significantly after the start of the UL phase on June 1, 2020. Overall mobility was restricted during the LD phases, before it was relaxed in a phased manner during the UL period. Thus, the measures implemented in different regions during LD to restrict movement helped to limit the infection rates. Figure 3 (a), (b), and (c) show the cumulative (confirmed, recovered, and active) cases, daily confirmed and recovered cases, and deaths (cumulative and daily) in India, respectively. The daily reported cases peaked during the middle of UL4.0 phase and reached their lowest level in UL5.0 phase.
The total confirmed COVID-19 cases in India were compared with those in the most affected countries throughout the world as well as in SAARC countries. Figure 4 (a) shows that India was the third most badly affected country in the word after the USA and Brazil during the UL2.0 and UL3.0 periods, and the second most badly affected after the USA during UL4.0 and UL5.0. India was the most badly affected among the SAARC countries during the entire study period ( Figure 4 (b)).
Cumulative confirmed cases in the most badly affected countries: (a) selected countries, and (b) SAARC countries.
The infection spread rate was not uniform throughout India during the entire pandemic period. The maximum infection rates were concentrated around high population density areas. Figure 5 (a) shows a population map of the districts of India, with the very high population areas in MH, WB, AP, DL, UP, and BR. Confirmed COVID-19 cases were classified at the end of each phase based on the percentage contribution of each district in the whole country. The totals of the red dots in Figure 5 (b)–(f) represent 80% of the confirmed cases in a district, which indicate that uniform government control was not required in all districts. In order to save resources and control the spread of the pandemic in a timely manner, the government focused on the locations that comprised 80% of the cases. At the end of UL1.0, the number of these locations was low with 74 districts but it subsequently increased at the end of UL2.0–123 districts, and then to 168 locations in UL3.0, 187 districts in UL4.0, and 189 districts in UL5.0, as shown in Figure 7 (a). Figure 5 indicates that the distribution of the districts that contributed 80% of the cases was greater in the most populous regions of the country.
Classifications and distributions of cumulative confirmed COVID-19 cases, where the total of the red dots represents 80% of the cases in a district.
Contributions of top districts to COVID-19 confirmed, recovered, death, and active cases over 15-day intervals: (a), (c), (e), and (g) number of districts that accounted for 80% of total CRDA cases, respectively; (b), (d), (f), and (h) percentage contributions of top districts to CRDA cases, respectively.
The results also showed that only a few districts accounted for most of the total confirmed cases in India. The percentage contributions of the top 25 districts, top 26–50 districts, and remaining 677 districts in different time periods are shown in Figure 6 . On June 1, 2020 immediately after the LD4.0 period, 70.99% of the confirmed cases occurred in the top 25 districts, 6.38% of confirmed cases in the top 26–50 districts, and only 22.63% of the confirmed cases in the remaining 677 districts. After the UL started, the contributions of the top 25 districts decreased, whereas those of the top 26–50 districts increased slowly and those of the remaining districts increased rapidly, thereby clearly demonstrating that the spread of COVID-19 cases increased during the UL period, as shown in Figure 6 , Figure 7 (b).
Percentage contributions of top 25 districts, top 26–50 districts, and remaining districts to confirmed cases.
Figure 7 shows the number of districts that accounted for 80% of CRDA cases, as well as the percentage contributions of the top 25 districts and top 26–50 districts in India to CRDA cases at 15-day intervals. The number of districts that accounted for 80% of CRD cases increased in the UL phases ( Figure 7 (a), (c), and (e)), whereas the number of active cases increased until the middle of UL4.0 and then decreased ( Figure 7 (g)). The percentage contributions of CRD cases ( Figure 7 (b), (d), and (f)) by the top 25 districts and top 26–50 districts were highest in UL1.0 before they then decreased, thereby demonstrating the greater spatial spread of CRD cases in the remaining districts. The contribution of the top 25 districts to active cases ( Figure 7 (h)) was around 74% in UL1.0 but it then decreased to 41% at the end of UL4.0, before increasing again in UL5.0. The decrease in the number of districts that accounted for 80% of active cases and the increase in the contribution from the top 25 districts at the end of UL5.0 show that the recovery rate was very high in other districts.
Throughout the LD and UL phases, the maximum numbers of cases were reported for only a few major cities in India. Figure 8 shows the confirmed, death, and recovered cases at the start of UL1.0 and during each UL phase in major Indian cities. During UL5.0, Bengaluru Urban had the highest number of confirmed cases ( Figure 8 (a)) and highest number of recovered cases ( Figure 8 (c)), whereas Mumbai had the lowest maximum number of COVID-19 infected persons ( Figure 8 (b)).
COVID-19 cases in selected districts of India during UL phases: (a) confirmed cases, (b) deaths, and (c) recovered cases.
Figure 9 shows the numbers of COVID-19 CRDA cases in the top 25 districts at the end of UL5.0. DL had the most confirmed cases followed by Bengaluru Urban, Pune, Mumbai, Thane, and Chennai.
Top 25 districts in India in terms of confirmed COVID-19 cases by October 31, 2020: (a) confirmed and recovered cases, and (b) active cases and deaths.
The COVID-19 CRD cases in Indian states during the PLD, LD, and UL phases, as well as the CRDA cases at the end of UL5.0 are shown in Figure 10 . The total numbers of CRD cases were highest in MH during the PLD, LD, and UL phases, mainly because major cities such as Mumbai, Pune, Thane, Nagpur, and Nashik are located in this state. At the end of UL5.0, MH still had the most CRDA cases compared with other states, as shown in Figure 10 (d) and (e).
COVID-19 cases in Indian states: (a) confirmed cases during PLD, LD, and UL phases; (b) recovered cases during PLD, LD, and UL phases; (c) deaths during PLD, LD, and UL phases, (d) confirmed and recovered cases by October 31, 2020; and (e) active cases and deaths by October 31, 2020.
In this study, geospatial analysis was conducted to assess COVID-19 CRDA cases in India during the PLD, LD, and UL phases. Analyses were performed at the country, state, and district levels, and Indian COVID-19 cases were also compared with those in the other most infected countries throughout the world. The death rate in India was lower than the global average throughout the pandemic period considered. India was the third most badly affected country throughout the world after the USA and Brazil during the UL2.0 and UL3.0 periods, and the second most badly affected after the USA during UL4.0 and UL5.0. India was the most badly affected of SAARC countries from the start of the pandemic until UL5.0. In India, the highest positivity rate was 8.47% in UL2.0, the highest death rate was 3.27% in LD1.0, and the highest recovery rate was 92.04% in UL5.0. The total daily numbers of confirmed and recovered cases in India peaked during the middle of UL4.0, before subsequently decreasing until the end of UL5.0.
The spatial distributions of confirmed cases in Indian states were mapped during the PLD, LD, and UL phases. The numbers of confirmed cases increased in all states from PLD to UL4.0, but decreased slightly in UL5.0. The maximum number of confirmed cases during PLD occurred in KL, and the highest numbers of confirmed cases during all LD and UL phases were recorded in MH. Major increases in COVID-19 cases during UL1.0–UL4.0 were observed in the west–central part comprising MH, southern states of KL, TN, AP, and KA, northern states of Delhi, PB, and UP, and eastern states of WB, OD, and BR.
The observed infection spread rate was not uniform throughout the country during the entire pandemic period considered. Classifications of confirmed cases at the end of each UL phase based on the percentage contribution of each district in the whole country showed that 80% of the confirmed cases occurred in a few high population areas, i.e., MH, AP, TN, KA, KL, GJ, DL, UP, BR, WB, and OD. This spatial distribution could facilitate decision making when controlling the spread of COVID-19. The results showed that among 727 districts in India, 80% of the confirmed cases occurred in only 74, 123, 168, 187, and 189 districts at the end of each UL period, respectively. UL1.0: June 1-30, 2020, UL2.0: July 1-31, 2020, UL3.0: August 1-31, 2020, UL4.0: September 1-30, 2020, UL5.0: October 1-31, 2020. In addition, the contributions of the top 25 districts, top 26–50 districts, and remaining districts to CRDA cases were assessed. The top 25 districts accounted for 70.99%, 69.38%, 54.87%, 44.23%, 40.48%, and 38.96% of cases at the start of UL1.0 and at the ends of UL1.0–UL5.0, respectively, and the top 26–50 districts accounted for 6.38%, 6.76%, 11.23%, 12.98%, 13.40%, and 13.61%. Clearly, the contributions of the top 25 districts decreased when UL started, whereas those of the top 26–50 districts increased slowly and those of the remaining districts increased rapidly, thereby indicating the greater spread of COVID-19 cases in India during the UL phases. Moreover, the top 25 districts accounted for around 74% of the active cases in UL1.0, before decreasing to 41% at the end of UL4.0 and then increasing again in UL5.0. By October 31, 2020, DL had the highest number of confirmed cases in India followed by Bengaluru Urban, Pune, Mumbai, Thane, and Chennai.
This is to inform you that the proposed reviewers have no conflicts of interest to declare and they agreed to review this manuscript. The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
No funding source was associated with this COVID-19 related study.
The instructions to the authors have been read and accepted by all the authors of this manuscript. All of the authors have seen the manuscript and agreed to the submission of this version. The research described in this paper is manuscript and it has not been published or submitted for publication in any other journal simultaneously. We also agree that if this paper is accepted for publication, the paper will not be published elsewhere in the same form in English or any other language without the written consent of the copyright holder.
Two case studies on Nizamuddin and Dharavi
BMC Public Health volume 21 , Article number: 631 ( 2021 ) Cite this article
11k Accesses
11 Citations
9 Altmetric
Metrics details
In this study we cluster the districts of India in terms of the spread of COVID-19 and related variables such as population density and the number of specialty hospitals. Simulation using a compartment model is used to provide insight into differences in response to public health interventions. Two case studies of interest from Nizamuddin and Dharavi provide contrasting pictures of the success in curbing spread.
A cluster analysis of the worst affected districts in India provides insight about the similarities between them. The effects of public health interventions in flattening the curve in their respective states is studied using the individual contact SEIQHRF model, a stochastic individual compartment model which simulates disease prevalence in the susceptible, infected, recovered and fatal compartments.
The clustering of hotspot districts provide homogeneous groups that can be discriminated in terms of number of cases and related covariates. The cluster analysis reveal that the distribution of number of COVID-19 hospitals in the districts does not correlate with the distribution of confirmed COVID-19 cases. From the SEIQHRF model for Nizamuddin we observe in the second phase the number of infected individuals had seen a multitudinous increase in the states where Nizamuddin attendees returned, increasing the risk of the disease spread. However, the simulations reveal that implementing administrative interventions, flatten the curve. In Dharavi, through tracing, tracking, testing and treating, massive breakout of COVID-19 was brought under control.
The cluster analysis performed on the districts reveal homogeneous groups of districts that can be ranked based on the burden placed on the healthcare system in terms of number of confirmed cases, population density and number of hospitals dedicated to COVID-19 treatment. The study rounds up with two important case studies on Nizamuddin basti and Dharavi to illustrate the growth curve of COVID-19 in two very densely populated regions in India. In the case of Nizamuddin, the study showed that there was a manifold increase in the risk of infection. In contrast it is seen that there was a rapid decline in the number of cases in Dharavi within a span of about one month.
Peer Review reports
The Corona Virus Disease 2019 (COVID-19) caused by the novel corona virus SARSCoV-2, started at Wuhan in the Hubei province of China and has spread with great speed around the world. More than a year has passed since and the virus is still wreaking havoc in many nations including especially the US and India. The objectives of the analysis presented in this paper are as follows:
In this study we cluster the districts of India in terms of the spread of COVID-19 and related variables such as population density and the number of specialty hospitals.
Simulation using a compartment model is used to provide insight into differences in response to public health interventions.
Two case studies of interest from Nizamuddin and Dharavi provide contrasting pictures of the success in curbing spread of COVID-19.
As COVID-19 pandemic aggressively spreads across continents, vigorous public health responses are now being put in place in all the countries hit by the virus. Articles are being added to the expanding body of work on COVID-19 in f l a t t e n i n g t h e c u r v e . Many recent research papers have discussed the assessment of these containment policies through prediction of the path of COVID-19 cases in India, for different scenarios [ 1 – 5 ]. Some recent papers have discussed in detail the outbreak of the disease across the globe. The main underlying theme of most of these studies is to track and predict the path of the virus spread in the nation. Researchers have toiled to predict the time it will take the outbreak to subside. As a result different groups of researchers have made use of various modelling techniques for prediction and have come up with many interesting findings. For example, [ 6 ] have revealed that a reduction in the contact rate between uninfected and infected individuals by quarantining the susceptible individuals, can in effect reduce the basic reproduction number. Now, with the advent of the COVID-19 outbreak, governments all over the world have started enforcing interventions like increased quarantining, self-isolation, increased hospital facilities etc. In this paper we have made an endeavour to follow the path of the disease and predict the prevalent number of susceptible, infected, recovered, and fatal cases for the Indian population cohort. This is done through some simulations with the timely government interventions being put in place. Essentially the simulation helps us predict prevalence of the infections, numbers recovered and fatal cases through simulations using the SEIQHRF epidemic model, by moderating the parameters of the model according to the government interventions and emergency policies.
In this study we have performed a cluster analysis on the worst hit districts of India. With variables like number of confirmed COVID-19 cases, number of COVID-19 hospitals, and population density of the respective districts we have tried to classify different districts in India according to their homogeneity in nature due to these factors. The main objective of clustering in this study is to improve monitoring of the affected areas which will be useful in understanding seriousness of the spread of novel coronavirus (COVID-19) to revamp government policies, decisions, medical facilities (ventilators, testing kits, masks etc.), treatment etc. This in turn will help to reduce number of infected and deceased individuals. A similar clustering approach is used in a study carried out by [ 7 ].
In addition to clustering, a scenario analysis of the respective areas facilitate following the growth curve in the respective states. For that purpose, individuals have been divided into various groups, such as, susceptible, exposed, infected, infectious (but self isolated), hospitalized, recovered and dead (death but not hospitalized, from COVID-19). Various parameters used in the probabilistic model are, the number of per day exposure events ( act ) between infectious and susceptible individuals; chance of iinfection being spread at each exposure event between the infectious and susceptible; rate at which the symptomatic self isolate themselves; the per day rate of symptomatic people requiring hospitalisation; per day rate at which people requiring hospitalisation recover; and per day fatality rate for people needing hospitalisation but could not because the full capacity of the hospitals was occupied. Each of these groups of individuals are divided into these various compartments, and the transition rates of individuals into and out of these compartments are also taken into consideration in the model. The purpose of this method used is to look at different intervention experiments and follow the prevalence of COVID-19 (in terms of number of persons) with each passing day since the beginning of the epidemic. The research has been carried out for some of the Indian states where we find the maximum number of districts with high number of confirmed COVID-19 cases. The act parameter in this study, is adjusted for each state based on its population density. Thus, we follow the prevalence curve of the cohorts from the chosen states and evaluate the effectiveness of the interventions imposed by the government to curb the COVID-19 growth.
The rest of the paper is organized as follows: “ Methods ” section describes the SEIQHRF epidemic model and explains the parameters involved and explains the two clustering techniques used to classify districts. “ Results ” section reports the district-wise cluster analysis results. “ Discussion ” section discusses the case of Nizamuddin basti in Delhi,where a religious gathering exposed thousands to this virus and caused a second wave of infection in states when Nizamuddin attendees returned home. Here we discuss the results from fitting the SEIQHRF model. This section also discusses the case of Dharavi, one of Asia’s largest slums with very high population density. “ Conclusion ” section enumerates the findings of this study and concluding remarks.
Two of the most commonly used algorithms in clustering are the hierarchical and the k-means algorithm. In this study we have used both these techniques of clustering technique due to their easily interpretable visualization and intuitive interpretation. One of the most relevant characteristics of hierarchical clustering is the fact that the user does not need to provide the number of clusters beforehand. Based on the variable upon which clustering is done, the observations are clustered into optimum number of groups. However a disadvantage of the method is that it has a high computational complexity when the number of observations to classify is very high. Thus using it on all 640 districts in India is quite cumbersome. So we have chosen the top 50 worst affected districts with maximum number of confirmed COVID-19 cases. The methodology used to build the hierarchical clustering, as also used in [ 8 ], is the following:
Pairwise distance between all the hotspot districts is calculated using D i ( S A , S B ), for a particular value of the parameter α . This distance matrix is symmetrical and has a null diagonal, which will be important to analyze the similarity between the different districts.
Searching through the distance matrix, we select the two most similar two districts.
These two districts are joined then to produce a new group that now has at least two districts.
The distance matrix is then updated by calculating the distances between the new cluster and all other clusters.
Step 2 is repeated until all districts belong to a group.
The k-means algorithm is one of the most popular clustering technique that divides the data samples into pre-defined distinct subgroups, where each data point belongs to just one group. It keeps data points within a cluster similar to each other and also tries to maximize the heterogeneity between two clusters. A cluster refers to a set of information aggregated together owing to certain similarities. Unlike hierarchical clustering in k-means the number of cluster is prefixed. The optimum number of clusters, say k , is obtained by using an elbow plot. From the dataset the method identifies k centroids, and then assigns every data point to the cluster it is closest to, keeping the centroids as small as possible. The clustering stops when either:
The centroids have more or less become constant and there is no change in their values.
The number of iterations defined has been reached.
In our study, data on confirmed COVID-19 cases, population density and number of COVID-19 hospitals in each districts is used to get the set of centroids. Although before the k -means clustering is performed the variables were standardized so that they are all on the same scale.
One of the popular models used for predicting the course of an epidemic such as COVID-19, is the Susceptible-Infectious-Recovered (SIR) model ([ 9 – 11 ]), [ 12 ]. In this model, the population is segregated into three separate compartments, i.e. Susceptible (S), Infectious (I) and Removal (R), where transition rates between the compartments is defined beforehand. Figure 1 is a representation of the three compartments in SIR model. In order to describe the per day rate at which individuals move in or out of the compartments, two equations are used. It is by solving these equations that we get the prevalence in each compartment.
SIR model flowchart- This figure represents the different compartments of the model and the arrows represent the direction of flow of individuals from one to the other. The SIR model predicts the prevalence of susceptible, infected and recovered individuals
The SEIR model, a variant of the SIR model, is one which includes a new compartment called Exposed (E), [ 13 – 15 ]. However, more recently an extension of the SEIR model has been proposed by [ 4 , 16 ], where apart from the three compartments, Susceptible (S), Infectious (I), Exposed (E) and Removal (R), two other compartments are included. These compartments are Quarantine (Q) and Hospitalization (H). The compartment H is supposed to take into account the healthcare capability of a state/district. In this new model the Removal (R) stage is divided into Recovery (denoted by R) compartment and fatality (denoted by F) compartment. The dynamic nature of the model is due to the fact that the number of individuals in each compartment may change over time. The flowchart showing the various compartments in the model and the arrows depict the direction of movement of individuals into and out of them is shown in Fig. 2 [ 16 ]. The following Table 1 , provides a description of individual compartments.
SEIQHRF model flowchart-This figure represents the various compartments and the arrows represent the direction of movement of individuals from one to the other
With the introduction of more compartments into the SIR and SEIR models, the systems of equations that should be solved to get the prevalence, become very complicated. Thus, in case of SEIQHRF, probabilistic compartment models are developed. Probabilistic models help simulate individuals in a population, which allows them to move between compartments and make it easy to specify various probabilities of passing on infection between the individuals in the different compartments, such as moving from infectious to susceptible. The probabilistic model can be further improved to adequately model real-life scenarios. We have thus used the model to explore the effects numerous public health interventions and policies through scenario analyses, [ 5 ].
The extensions of SEIR model adds a number of compartments in the analysis. The E compartment is for the individuals exposed to the virus and infected with it, but are still asymptomatic with the potential of becoming infectious. This is also true for the traditional SEIR models, but the normal assumption is that the asymptomatic infected do not spread the infection to others. But that condition was relaxed in [ 16 ] due to increasing evidence in support that most infections are being spread during the incubation days when the carrier is asymptomatic. This evidence also supports the flowchart in Fig. 2 , where there is a movement of individuals from the I infected symptomatic compartment to Q , quarantine compartment. But the individuals in the E compartment only move to the Q compartment when they have gone through compartment I i.e. only when they have shown some symptoms of the infection.
Similarly, the H compartment represents those that need hospitalisation. There’s a parameter h o s p . c a p that specifies the hospital capacity, and it is possible to indicate increased mortality rates for those who need hospitalisation in the H compartment but cannot get such care because the total h o s p . c a p has been exhausted. Related to this, the F compartment shows the case fatalities; i.e. the deaths in COVID-19 cases due to the virus. There are also some parameters for background death rates due to other causes among the susceptible (defined as the crude death rate for India). Also note that case fatalities are restricted only to occur among those in the H compartment, that is, those requiring hospitalisation (irrespective of whether they can get it or not). A similar method for modelling spread of COVID-19 has been used by [ 5 ].
In the present study we make an endeavour to fit the SEIQHRF model mentioned in Fig. 2 to COVID-19 outbreak data for some states in India where COVID-19 has been rampant. The purpose of this model is two-fold. First, the resulting simulations are used to study in detail the impact of lockdown on the epidemic. Second, we perform a what -if analysis for strategies to impose and relax lockdown at the most opportune time.
The timeline used for this study is the beginning of the second phase of lockdown in India, primarily because the spread of cases from the Nizamuddin cluster became prominent towards the beginning of April and the first case in Dharavi slum was also reported on the 1st of April. Thus using 15th April, the start of lockdown phase 2 in India as the starting point of this study. We obtain the best fit to the observed cases and fatality data of COVID-19 by using the following set of parameters, which are represented in Table 2 .
The starting day for the simulation was taken as 15th April 2020 (when second phase of lockdown began). The initial susceptible population in Nizamuddin Basti was taken as 12,000 (it has a floating population of 2000-5000 per day and a population density approx 70,000 k m 2 .) (Source: Aga Khan Developmental Network). Dharavi has a population of about 1,000,000. With a population density of over 277,136 per k m 2 . As the incubation period of COVID-19 is about 5 days, the number of infected cases (I compartment) on 15th April was assumed to be the number of new cases 5 days thereafter, i.e. on 20th April. It is especially interesting that the E compartment of asymptomatic infectious individuals, who are accountable for increased case numbers even after improvement in healthcare facilities for symptomatic cases.
With the recent updates on COVID-19 cases in India, it has become evident that some parts of the country are more affected than others. For instance, Maharashtra, Tamil Nadu, Delhi, Gujarat, Telangana, Karnataka and Uttar Pradesh are on the top of the list of most affected states in India. At a more disaggregate level, few districts of these states are more badly affected. The common trend in COVID-19 infection is that mostly the urban regions have been more affected than their rural counterparts. This is primarily due to the fact that the virus was introduced by travellers from abroad. The variables used in this study are described in detail in Table 3 below. The summary measures of population density and number of COVID-19 hospitals reveal a large variation between the districts, the former ranging from 161 to 27730 persons per sq.km. whereas the latter ranges from no hospitals to 62 hospitals in a district.( https://covidindia.org , https://www.census2011.co.in/district.php ). These variations necessitate performing a cluster analysis to optimally classify the districts into homogeneous groups. Both hierarchical clustering and k-means clustering methods are done using the confirmed cases, population density and number of COVID-19 hospitals.
If we focus on the distribution of COVID-19 cases across states, the metropolitan areas are most affected in most of them. For Maharashtra (Fig. 3 ), the district marked in red is in the extreme west coastal region, Mumbai city and Mumbai Suburban. Similarly, in Gujarat, the most affected district is Ahmedabad. In Uttar Pradesh and Tamil Nadu the main hotspot districts are Agra and Chennai respectively. In West Bengal the main focus is on three districts, i.e. Kolkata, Howrah and North 24 Parganas (as shown by the red zones in the state map).
Distribution of COVID-19 cases in districts of selected states. These maps were created by the authors using R, packages ggmap , maps , maptools and rgeos
The population density map shows a similar pattern across districts in the states, refer Fig. 4 . However, the distribution of number of COVID-19 hospitals in the districts vary from the distribution of confirmed COVID-19 cases, refer Fig. 5 . The distribution of hospitals is much less skewed than the population density and COVID-19 cases. The state government and the central has been proactively trying to augment hospital facilities. The almost even distribution of the COVID-19 special hospitals in districts is a result of those initiatives.
District-wise population density in selected states. These maps were created by the authors using R, packages ggmap , maps , maptools and rgeos
Distribution of COVID-19 hospitals in districts of selected states. These maps were created by the authors using R, packages ggmap , maps , maptools and rgeos
The k-means clustering leads to three homogeneous clusters. The first of these, which include Mumbai, Thane and Chennai, have large number of confirmed COVID19 cases, number of hospitals as well as high population density. The second cluster has the lowest number of confirmed cases, COVID-19 hospitals and population density among the three. It includes cities like Kolkata, West Delhi, Central Delhi and Hyderabad. Unlike the first two clusters, cluster 3 has very high population density, but low number of COVID-19 hospitals and confirmed COVID-19 cases. Table 4 presents the centroids of each cluster, the central tendency values for each variable in each cluster. A diagrammatic representation is shown in Fig. 6 .
Visualization of three clusters generated from k-means clustering
Thus it is observed that the burden on the healthcare system is maximum in the cluster one districts, while with their high density but low confirmed cases cluster three comprises of districts which have been effective in controlling the disease.
The hierarchical clustering (as shown in the dendrogram below in Fig. 7 ), done to show the closeness of two districts in terms of the above three parameters, corroborates our findings from the k-means clustering.
The dendrogram above is a result of hierarchical clustering. Based on the number of active positive cases of COVID-19, fifty worst hit districts of India were used to create clusters of homogeneous districts. Six clusters were created, demarcated in the diagram with the boxes around clusters of districts that had similar number of active positive COVID-19 cases
In the wake of the global crisis due to COVID-19, almost all affected countries, have seen an exponential growth in the number of confirmed case count. In most of such countries, doctors discovered a gaggle of individuals who got infected at one place and mostly at around the same time. Such groups are termed clusters . In India one such case has been identified at the Nizamuddin basti in Delhi with its highly dense population. There have been numerous studies over the years, [ 17 ], that describes this area of Delhi as one of the more populous with almost 1500 households and a large floating population as well. Another study [ 18 ] discusses the development of the highly populated Nizamuddin basti around Hazrat Nizamuddin Auliya mosque. It is this locality that has been identified as a massive cluster , after a religious congregation held in mid-March led to COVID-19 spread among the attendees; at least 130 cases have been identified as having originated from this cluster.
As a special case study, we’ve used the SEIQHRF model to simulate the spread of the virus through infected individuals from the Nizamuddin cluster . A stochastic individual contact model (ICM) is employed to simulate the baseline projections of the timeline of establishment period of the virus, illness duration and survival time of the case fatalities. In the five panels of Fig. 8 we provide general statistical properties of the model in relation to some of the major parameters estimated.
The incubation period for the virus has a median of about 10 days and in few cases could reach 20 days or more.
Nizamuddin tablighi cohort duration frequency distribution
For some, isolation started much later, could be as long as 10 days. However, many people took less than a couple of days before they started to isolate themselves because of some COVID-19 like symptoms.
Illness duration seems span around 25 days. Although this may vary based on the individuals comorbid conditions.
Hospital care duration is about 10 to 15 days mostly, which seems to match actual observations.
Survival time of case fatalities is seen to be mostly between two to five days. However few fatal cases have had a survival time as high as 20 days.
The next plot shows the prevalence of COVID-19 among the Nizamuddin population in the various compartments of the SEIQHRF model. Prevalence is the number of people in each compartment at each point in time (each day). Given the dense nature of the population in the Nizamuddin basti the act parameter (average number of exposure events between infectious individuals and susceptible individuals) for this area is quite high making it a hotbed for the virus spread within the community. From Fig. 9 we can see how peaked the distribution of exposed individuals is in the neighbourhood. Also due to the delayed signs of this viral infection among individuals, the ones living in these densely populated areas unknowingly pass on the virus to others. The prevalence of cases where self-isolation by individuals is undertaken has a comparatively low peak, thus isolating oneself when symptoms are visible reduce the risk for others. The model estimate that from 15,000 susceptible people, almost as many as 15000 other people are potentially exposed during the 3 days event, and from this almost 15000, about 4000 are potentially infected and require hospitalization. However, due to the delayed signs of the infection causes possible delays in the people getting to the hospital for checking, which causes about 20 days delay in the infected person to be hospitalized. Furthermore, the number of people who went to self-isolation is predicted to be very small.
Prevalence simulation of COVID-19
In Fig. 10 we have isolated the infected, hospitalized and fatal compartments to look at them more closely. It shows that of the almost more than 4000 required hospitalisation, although with a delay of almost 15 days. The model used here predicts the prevalence in each compartment; shows the number of individuals in each compartment who have the condition during the time period under study. As we have stated before, sometimes the delayed signs of the infection cause a lag between the time infection is identified and it is serious enough to require hospitalisation. The time period of this study (second phase of lockdown in India) thus includes individuals for whom the infection was identified earlier than the present time but it aggravated enough to require hospitalisation much later. Thus making the predicted number larger than the size of the infected cohort. There were more than 2500 people infected. However, the number of fatal cases remained low, around 150.
Prevalence simulation COVID-19 confirmed cases
Apart from the baseline scenario, we have tried to follow the path of prevalence in each compartment after the implementation of several interventions.
Some of these administrative interventions are implemented in the simulation of the Nizamuddin cluster and represented in Fig. 11 a. A side-by-side comparison of similar implementation in Delhi is in Fig. 11 b. A list of interventions considered in this study is;
I 1 : Increase quarantining of symptomatic individuals during the first 15 days after the beginning of second phase of lockdown, i.e. 15th April 2020.
Comparison of prevalence between Nizamuddin cohort and Delhi
I 2 : Starting Social Distancing from 15th April, 2020: The Government of India announced a second phase of lockdown from 15th April, 2020 till 9th May, 2020 as an intervention to stop the COVID-19 pandemic. Hence, as an image of Social Distancing, we gradually reduced the frequency of exposure events from 20 to 10.
I 3 : Having introduced further social distancing from 15th April, 2020, we also introduced another intervention by increasing the hospital bed capacity to triple the initial number considered during the first 15 days of lockdown phase 2.
I 4 : Imposing more severe social distancing by the end of second phase of lockdown, thereby further reducing the number of exposure events from 10 to 5.
I 5 : Imposing a combination of increased social distancing and quarantining during the first 15 days of lockdown 2.
We will thereafter refer to the interventions as I i for i =1,2,3,4,5.
The same set of interventions have been used for other states as well, where attendees from Nizamuddin have returned after the convention.
The baseline plot shows that the number of infected/infectious individuals in the Nizamuddin cluster is close to 10,000, almost double of that in Delhi. This can mostly be s result of the very fact that in an exceedingly dense geographic area there is a larger propensity for people to mingle with more number of individuals, thus increasing the prevalnce of the spread of COVID-19 manifold.
In both Nizamuddin cluster and Delhi, the curve of infected/infectious individuals have flattened with implementation of the above interventions. Among the administrative mediation, increasing social distancing as well as self-isolation starting on day 15, after the beginning of the disease outburst, produces the best results in flattening the disease spread curve. However, a comparison shows that the Nizamuddin cluster increased the number of confirmed cases and also the number requiring hospitalisation. The numbers have almost doubled from that of Delhi.
The Nizamuddin cluster was discovered in mid-March and at least 130 cases in India have been identified as having originated from this cluster. There was then a second wave of infectious individuals identified in several other states that were linked to the Nizamuddin cluster. Tamil Nadu, Delhi, Telengana, Gujarat, Maharashtra are a some of such states.
In each of the states above (Tamil Nadu, Delhi, Telengana) the number of infected individuals linked to Nizamuddin cluster are 72, 24 and 6 respectively. The comparison in sets of Figs. 12 , 13 , and 14 show that in the second phase the number of infected individuals had seen a multitudinous increase. Thereby, increasing the danger of disease spread within the respective states. However, the simulations reveal that the administrative interventions, if implemented strictly, flatten the curve of disease spread. Specifically, the best results are seen if social distancing is strictly practised by day 15 since the disease outbreak.
Comparison of baseline prevalence with those with interventions for Delhi
Comparison of baseline prevalence with those with interventions for Tamil Nadu
Comparison of baseline prevalence with those with interventions for Telangana
In Table 5 first panel, the maximum predicted numbers in each compartment is presented for Nizamuddin basti right after the Tablighi Jamaat congregation (from SEIQHRF model fitting, also shown in Fig. 11 a). With the choice of parameters defined in “ Choice of parameters ” section, the baseline model is fitted, but I 1 through I 5 are the various interventions applied and the corresponding parameters have been modified accordingly. For instance, in the baseline case the projected number of infected cases was maximum around 5000, with the implementation of I 1 i.e. increased advocacy of quarantining the maximum projected individuals infected increased even more (around 7500). However, for I 2 through I 5 this number decreased. Thus, social distancing, augmentation of hospital capacity and severe quarantining implemented together prove to be useful in flattening the curve. The second and the third panel are predictions from Delhi during the first phase of lockdown, starting point is 24th March 2020. The third panel shows the predicted number of individuals in each compartment after the attendees of Nizamuddin meeting dispersed. A comparison of infected individuals increased manifold between the first lockdown and after Nizamuddin congregation. The number of infected before Tablighi Jammat (1000) was the lowest for intervention I 5 i.e. increased social distancing and quarantining. However, for the Nizamuddin cohort a severe imposition of social distancing worked better than any other intervention.
Similar to Table 5 we have also tabulated the maximum predicted number of individuals in each of compartment for Tamil Nadu and Telengana for both before and after phases of the Nizamuddin meeting, results shown in Tables 6 and 7 . In case of Tamil Nadu (Table 6 ), a combination of social distancing and quarantining during the first lockdown phase visibly lowers the number infected (1000), however, after the attendees from Nizamuddin cohort returned to the various states, Tamil Nadu and Telengana were one of the largest clusters of COVID-19 infections. In the second phase therefore increasing the hospital capacity helped in flattening the curve of infected persons. With the augmented hospital capacity proves to be an effective intervention. For Telengana (Table 7 ), in both phases 1 and 2, i.e. before and after the Nizamuddin cohort returned to the state, augmented hospital capacity seems like the best among the five interventions. Commensurate with the number of susceptible people in the states we also look at the number of fatalities and the ones requiring hospitalisation. But, the records of the number of days of hospitalization are not publicly available by each patient, thus making if unreliable to infer much from the simulation on how long the period would be. We also cannot claim the fatality rates with too much certainty because there are many unknown parameters involved and not much data is available for the co-morbidity conditions of the patients either.
Dharavi is considered Asia’s largest slum, one of the most densely populated areas in the world (with 2.6 lakh people per sq. km) and, now, also marked as a containment zone for COVID-19. When driving through the lanes and by-lanes of Dharavi, thronging with people, it is evident that the one necessary norm to prevent the spread of COVID-19, social distancing, is practically impossible to implement here. Most of the houses in the area are merely 10X10 feet, it is a challenge to keep people confined to such small area. In addition, there is a major problem with the common toilets that inhabitants have to use, which makes the containment of the virus an impossible task. Although the first case of COVID-19 in India was reported on the 30th of January, by middle of March only mere eight positive cases of COVID-19 were identified in Mumbai. All of these cases had a travel history abroad. However, by the end of March, the number of positive cases started growing exponentially. In Dharavi the first case was observed on the first of April, [ 19 ]. The index patient was a 56 year-old garment shop owner complained of high fever and cough. When his symptoms worsened he was admitted to the hospital where he succumbed to the disease before civic officials could talk to him about the people he might have come in contact with. So the officials began their contact-tracing exercise. It came to light during their investigation that a few days before the garment shop owner started showing the signs of COVID-19 infection, he had hosted a party with some people in his house, and all these guests were attendees of the religious event that took place in mid-March in Nizamuddin area of Delhi.
Other than the constraints of space that make the inhabitants more exposed to the virus, most COVID-19 positive cases in Dharavi have been found to be asymptomatic. The silent carriers living in the area were unaware of how many people they were infecting with the virus. A simulation run using the SEIQHRF model produces the number of infected/ asymptomatic, infectious, self-isolated, hospitalised and fatal cases in Dharavi (Fig. 15 ).
Prevalence plots in various compartments for Dharavi
Due to the very high act parameter value for Dharavi the SEIQHRF model algorithm is used to simulate the scenario for a period of 30 days. The results illustrate the predicted course of COVID-19 and the impacts of various experimental interventions. Dharavi has especially been a challenge for the Maharashtra government, slowly turning out to be the COVID-19 capital of the state. From Fig. 15 it is seen that the prevalence rate among the residents was very high since the beginning of the outbreak. The lack of space to practice social distancing and the use of common toilets, exposed them more to the risk of acquiring the virus. In the baseline model the number of infectious/asuymptomatic patients is as high as almost 10,000. However, measures like increased social distancing and self-quarantine is expected to help in flattening the curve to some extent. From the panel on the left we can see that with increase in social distancing and quarantining the number of infected/asymptomatic has come down by almost 2500. There is also a delay in the prevalence number reaching it’s peak, which is almost 10 to 15 days. Also the panel on the right reveals predicted number of people requiring hospitalization. Dharavi alone predicts a requirement of 200 hospital beds for hospitalization. As, of 11th July 2020 the total number of positive cases are 2359. The fatality rate in Dharavi increased from about 3% to 4.1% over a period of two weeks between May 5th and May 20th.
Amidst all the challenges, Dharavi has emerged as one of the few successes in containing the spread of COVID-19 with 1952 recovered cases of its 2359 confirmed and only a handful 166 remaining active. According to the Brihanmumbai Municipal Corporation officials, the steps taken in Dharavi can be defined by four T’s- tracing, tracking, testing and treating. Almost as many as 47500 houses were screened, while 14970 people were screened in mobile vans. In the study done by [ 20 ], the author states that the WHO has also acknowledged Dharavi’s success in controlling the spread of COVID-19 and mentioned that it should be seen as an example across the world. The WHO has also commended the Dharavi model which is mostly based on community engagement, testing, tracing, isolating, and treating in order to break the chain of transmission. Quarantine facilities were ramped up with the use of schools, marriage halls and sports complexes. The strict implementation of lockdown and an endeavour to treat all COVID-19 patients in Dharavi itself proved successful in containing the infection. A proof that these interventions were fruitful is the fact that while in April, the doubling rate in Dharavi was 18 days, it gradually improved to 43 days in May and slowed down to 108 and 430 days in June and July respectively. In a recent press meet officials from WHO cited Dharavi as one of the few cases around the world where a massive breakout of the infection could still be brought under control, [ 21 ]. In [ 20 ] the author reports that he municipal administration of Mumbai very aptly called the Dharavi model as “chasing the virus” rather than waiting for people to report it. According to [ 22 ] under the able leadership of the Mumbai municipality, Brihanmumbai Municipal Corporation (BMC), private medical practitioners, social activists, community leaders, and non-governmental organisations were roped in to battle the pandemic on a war footing. Measures were taken to give older people special attention. Most symptomatic cases were treated at community centres, except when critical and time was invested in trust building efforts to gather public support for containment of the disease. In this study the authors report that unlike the traditional approach of passive screening as undertaken elsewhere in India, the civic authorities of Brihanmumbai Municipal corporation adopted a proactive screening strategy in Dharavi and went door-to-door looking for COVID-19 suspects. Which supports the claims made by the municipal administration of Mumbai, that they were chasing the virus [ 20 ].
The various scenarios analysed in this paper introduce the various interventions undertaken and compare the course of COVID-19 under each of them with a baseline model. To summarise the outcomes some important observations made are as follows: In the baseline model, the results are from our model fitting on a hypothetical population of 1000 people. The findings are summarized as follows;
Time taken for the COVID-19 pandemic to abate is about two months.
As the population density of Delhi is very high, many people are infected, although asymptomatic.
We see typical exponential behaviour, although these are prevalence numbers, not incidence. The prevalence counts tends to start with an exponential growth; however they then diminish.
The number of people who require hospital facilities is not too large.
Although the growth rate is much lower, the number of cases in the fatality compartment is increasing consistently.
But the various interventions have varied effect on the spread of COVID-19. In most of the states in India the best results are seen in implementation of early social distancing and quarantine of the infectious individuals. In states where a lot of attendees from Nizamuddin had returned, extra care was taken to isolate them and that to some extent alleviated the risk of infection spread. We see an exponential behaviour in the growth of prevalence in the panels, although these are not incidence. Prevalence starts with an exponential growth but then diminishes. That’s what we are seeing here, so that’s a good sign.
In case of Nizamuddin, the highest frequency is seen between day 5 and 7, the individuals for whom the disease turned fatal mostly survived for five to seven days. The attendees of the gathering in Nizamuddin then exposed some of the states/UTs, like, Delhi, Tamil Nadu and Telangana to a second wave of COVID-19 infection. A side-by-side comparison of prevalence in these states during the first and second wave show a significant growth in number of COVID-19 positive cases.
Dharavi is one of the unlikely success stories in India. In spite of being one of the largest slum areas of Asia, the Brihanmumbai Municipal Corporation has managed to control COVID-19 spread in an area merely 2.1 k m 2 housing almost a population of 10,00,000. The four T’s- tracing, tracking, testing and treating have worked wonders for the slum and when large metropolises around the world like New York, Los Angeles, London reeling under the COVID-19 attack, a slum in Mumbai India has managed to reduce the reproduction number to a record low. In comparison to Dharavi, Mumbai a city which Dharavi is a part of, has suffered more severe fate at the hands of the deadly virus with almost 2.61 lakh active cases as of 6th November 2020.
The main limitation of this study is that the COVID-19 scenario is still very dynamic and evolving with each passing day. Predicting the true path that the virus is going to take, is difficult if the parameters in the model are incorrectly specified. And with the ever changing COVID-19 landscape, it is one of the major challenges in model fitting. With new strains of the virus emerging, the unavailability of disaggregate level data of the confirmed cases makes modelling a challenge. There is also lack of available data on co-morbid conditions of the patients hospitalised with COVID-19 along with lack of data on actual hospitalisation. This adds to the already existing impediments in the way of modelling the path of the virus spread. Availability of such data would enable a more robust forecast of the number of confirmed COVID-19 cases. The data would also help in early detection of COVID-19 clusters which would be helpful for executing public health policies. Also the government has been trying to implement new policies to curb the spread. During the time-line of this study we have only looked at the first three months of the pandemic outbreak in India. As an extension of this study we would like to tap into the increasing data on COVID-19 and follow the path of the virus in the more recent unlock phases in India. Also a longitudinal analysis of COVID-19 data from the worst hit districts and following them over the months can give us more useful insights.
The top 50 districts of India where most severe outbreak of COVID-19 was seen are analysed through cluster analysis using their respective population densities and number of COVID-19 hospitals. The results helped identify the cities where similar pattern of disease outbreak was seen as well as the ones with homogeneity in terms of COVID-19 hospitals and population density. An in depth look at the various mediations undertaken to curb the growth of COVID-19 in India have shown different levels of improvement in flattening the disease curve. As the attendees of Nizamuddin congregation returned to their home states, like Tamil Nadu, Telangana, and other parts of Delhi the second wave of surge in confirmed case numbers is seen. A comparison between the first and second wave of COVID-19 infection in select states also support this finding. A further study of Dharavi, one of the success stories of the biggest slums in Asia, is undertaken and the strategy of 4 T’s-tracing, tracking, testing and treating has shown to reduce the spread of the disease dramatically.
The datasets used and/or analysed during the current study are simulated based on the epidemiology model used in this study. The data used and prevalence plots generated thereafter have been simulated and created using the R programming language, packages EpiModel , and incidence .
Corona Virus Disease 2019
Susceptible-Infectious-Recovered model
Susceptible-Exposed-Infectious-Recovered model
Susceptible-Exposed-Infectious-Quarantine-Hospitalised-Recovered-Fatal model
Chatterjee K, Chatterjee K, Kumar A, Shankar S. Healthcare impact of covid-19 epidemic in india: A stochastic mathematical model. Med J Armed Forces India. 2020.
Das S, Ghosh P, Sen B, Mukhopadhyay I. Critical community size for covid-19–a model based approach to provide a rationale behind the lockdown. arXiv preprint arXiv:2004.03126. 2020.
Das S. Prediction of covid-19 disease progression in india: Under the effect of national lockdown. arXiv preprint arXiv:2004.03147. 2020.
Ghosh P, Basheer S, Paul S, Chakrabarti P, Sarkar J. Increased detection coupled with social distancing and health capacity planning reduce the burden of covid-19 cases and fatalities: A proof of concept study using a stochastic computational simulation model. medRxiv. 2020.
Sengupta P, Ganguli B, Chatterjee A, SenRoy S, Chatterjee M. Covid-19 epidemic modelling and the effect of public health interventions in india-seiqhrf model. medRxiv. 2020.
Sarkar K, Khajanchi S, Nieto JJ. Modeling and forecasting the covid-19 pandemic in india. Chaos, Solitons Fractals. 2020; 139:110049.
Article Google Scholar
Kumar S. Monitoring novel corona virus (covid-19) infections in india by cluster analysis. Ann Data Sci. 2020; 7:417–25.
Rojas I, Rojas F, Valenzuela O. Estimation of covid-19 dynamics in the different states of the united states using time-series clustering. medRxiv. 2020.
Smith D, Moore L. The sir model for spread of disease: The differential equation model. Loci.(originally Convergence.) 2004. https://www.maa.org/press/periodicals/loci/joma/the-sir-model-for-spread-of-disease-the-differential-equation-model .
Allen LJ, Brauer F, Van den Driessche P, Wu J. Mathematical Epidemiology, vol. 1945: Springer; 2008.
Korobeinikov A. Global properties of sir and seir epidemic models with multiple parallel infectious stages. Bull Math Biol. 2009; 71(1):75–83.
Churches T. Tim Churches Health Data Science Blog: Modelling the effects of public health interventions on COVID-19 transmission using R - part 1. 2020. https://timchurches.github.io/blog/posts/2020-03-10-modelling-the-effects-of-public-health-interventions-on-covid-19-transmission-part-1/ .
Stehlé J, Voirin N, Barrat A, Cattuto C, Colizza V, Isella L, Régis C, Pinton J-F, Khanafer N, Van den Broeck W, et al.Simulation of an seir infectious disease model on the dynamic contact network of conference attendees. BMC Med. 2011; 9(1):87.
Li MY, Graef JR, Wang L, Karsai J. Global dynamics of a seir model with varying total population size. Math Biosci. 1999; 160(2):191–213.
Article CAS Google Scholar
Röst G. Seir epidemiological model with varying infectivity and infinite delay. Math Biosci Eng. 2008; 5(2):389–402.
Churches T. Tim Churches Health Data Science Blog: Modelling the effects of public health interventions on COVID-19 transmission using R - part 2. 2020. https://timchurches.github.io/blog/posts/2020-03-18-modelling-the-effects-of-public-health-interventions-on-covid-19-transmission-part-2/ .
CHAUHAN E. Challenges to community participation in heritage tourism development: Case studies of shahjahanabad and nizamuddin basti in new delhi, india. WIT Trans Ecol Environ. 2020; 248:225–33.
Snyder M. Where delhi is still quite far: Hazrat nizamuddin auliya and the making of nizamuddin basti. Columbia Undergrad J South Asian Stud. 2010:1–29.
Times, Economic. Asia’s largest slum Dharavi reports first Covid-19 casualty.[Internet]. 2020 [cited 2nd April, 2020]. 2020.
Golechha M. Covid-19 containment in Asia’s largest urban slum Dharavi-Mumbai, India: Lessons for policymakers globally. J Urban Health. 2020; 97(6):796–801.
India.com(a). Dharavi Flattens Its COVID Curve: After WHO’s Praise, BMC Official’s Insight on Efforts, Approach & Challenges For Curbing Spread.[Internet]. 2020 [cited 11th July, 2020]. 2020.
Kumar A, Pareek V, Narayan RK, Kant K, Kapoor C. Covid-19 in india: Dharavi’s success story. BMJ. 2020; 370.
Download references
Not applicable.
This study is under the “LISA 2020- Building Data Science Capacity” project funded by USAID and University of Colorado Boulder. The funder of the study sponsor had no role whatsoever in study design; in the collection, analysis, and interpretation of data; in the writing of the report.
Authors and affiliations.
International Management Institute, Kolkata, India
Pooja Sengupta
Department of Statistics, University of Calcutta, Kolkata, India
Bhaswati Ganguli, Sugata SenRoy & Aditya Chatterjee
You can also search for this author in PubMed Google Scholar
BG, SSR, AC, and PSG co-conceived the study. BG and PSG led the data analysis. PSG wrote the first draft of the manuscript, and all the authors provided crucial input on several iterations of the manuscript. SSR, BG, and AC provided useful insights into designing the simulation study. All authors read and approved the final version of the manuscript.
Correspondence to Pooja Sengupta .
Ethics approval and consent to participate.
Not applicable. No live subjects were used for this study.
Competing interests.
The authors declare that they have no competing interests.
Publisher’s note.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ . The Creative Commons Public Domain Dedication waiver ( http://creativecommons.org/publicdomain/zero/1.0/ ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
Reprints and permissions
Cite this article.
Sengupta, P., Ganguli, B., SenRoy, S. et al. An analysis of COVID-19 clusters in India. BMC Public Health 21 , 631 (2021). https://doi.org/10.1186/s12889-021-10491-8
Download citation
Received : 01 September 2020
Accepted : 23 February 2021
Published : 31 March 2021
DOI : https://doi.org/10.1186/s12889-021-10491-8
Anyone you share the following link with will be able to read this content:
Sorry, a shareable link is not currently available for this article.
Provided by the Springer Nature SharedIt content-sharing initiative
ISSN: 1471-2458
IMAGES
VIDEO
COMMENTS
The highly infectious coronavirus disease (COVID-19) was first detected in Wuhan, China in December 2019 and subsequently spread to 212 countries and territories around the world, infecting millions of people. In India, a large country of about 1.3 billion people, the disease was first detected on January 30, 2020, in a student returning from ...
The actual death toll from COVID-19 is likely to be higher than the number of confirmed deaths - this is due to limited testing and challenges in the attribution of the cause of death. The difference between confirmed deaths and actual deaths varies by country. How COVID-19 deaths are determined and recorded may differ between countries.
Visualizing data plays a pivotal role in portraying important scientific information. Hence, visualization techniques aid in displaying relevant graphical interpretations from the varied structures of data, which is found otherwise. In this paper, we explore the COVID-19 pandemic influence trends in the subcontinent of India in the context of how far the infection rate spiked in the year 2020 ...
At least 1 in 2,574 residents have died from the coronavirus, a total of 530,779 deaths. May 2021 was the month with the highest average cases and deaths in India.
Various statistical and mathematical analysis and studies are ongoing to forecast the future trend of COVID-19 in India, and models are being developed to predict the future situation, known as N-days in forecasting. ... A residual plot is a graphical representation that shows the relationship between a given independent variable and the ...
The WHO India Weekly COVID-19 Situational Report provides a comprehensive summary of the COVID-19 situation in India. The report provides an epidemiological overview of India, highlights WHO India operational updates on risk communication and community engagement, infection prevention and control, clinical management, operation support and logistics.
Pandemic statistics on this page are no longer being updated. There have been 43,710,027 infections and 525,604 coronavirus-related deaths reported in the country since the pandemic began.
In this letter, the concept of SIR and fractal interpolation models is applied to predict the number of positive cases in India by approximating the epidemic curve, where the epidemic curve denotes the two-dimensional graphical representation of COVID-19-positive cases in which the abscissa denotes the time, while the ordinate provides the ...
The Indian Council of Medical Research (ICMR) governs classification of a COVID-19 related death in India. ICMR in turn follows WHO guidelines, recording COVID-19 deaths as U07.1 as per International Classification of Diseases. [1] The National Centre for Disease Informatics and Research, ICMR, released a document titled "Guidance for appropriate recording of COVID-19 related deaths in India".
COVID-19 Results BriefingIndiaJanuary 13, 2022This document contains summary information on the latest proj. ctions from the IHME model on COVID-19 in India. The model was run on Janu. ry 12, 2022, with data through January 10, 2022.In 2021, India had a dramatic rise in COVID-19 cas.
The table below shows the 7-day moving average of Daily Covid-19 cases, Tests conducted and Positivity Rate (daily cases/tests) for the last 30 days for each state. Total Tests and Tests/Million population is also shown for each state. The chart below show the trajectory of the pandemic in each select city from April end, 2020 with a 7-day ...
Cumulative number of samples tested for the coronavirus (COVID-19) across India from April 2020 to October 2023 (in 1,000s) Basic Statistic Number of COVID-19 tests in India 2021, by state
COVID-19 Results BriefingIndiaJune 9, 2022This document contains summary information on the latest projecti. ns from the IHME model on COVID-19 in India. The model was run on. une 8, 2022, with data through June 6, 2022.The Omicron variant surge that hit India in January-February 2022 has subsided, with a generally decreasing trend in the ...
India is among the top three countries in the world both in COVID-19 case and death counts. With the pandemic far from over, timely, transparent, and accessible reporting of COVID-19 data continues to be critical for India's pandemic efforts. We systematically analyze the quality of reporting of COVID-19 data in over one hundred government platforms (web and mobile) from India. Our analyses ...
A graphical representation of the data will relay an informative delineation of the COVID-19 pandemic trends through the various states in India. Preliminary screening of COVID-19 conducted by the Central Indian Ministry of Health included visitors from China and subsequently expanded to include other international travelers.
Visualizing section: This section contains a graphical representation of the way COVID-19 has been progressing in India. There are namely four graphs on the main page: ... Real-time status of COVID-19 in India: This graph contains the history of the entire COVID-19 cases reported in India to date, it includes total cases, total recovered cases ...
of the governments of various countries to the COVID-19 situation. India is one of the very few countries which was awarded a score of 100 early on during the situation. Figure 9: Stringency scores as of 1 st week of April 2020 Despite the turn of events in Italy, till date, their measures have not been as stringent as those taken by India.
The COVID-19 pandemic in India is a part of the worldwide pandemic of coronavirus disease 2019 (COVID-19) caused by severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2).As of 10 September 2024, according to Indian government figures, India has the second-highest number of confirmed cases in the world (after the United States of America) with 45,041,748 [4] reported cases of COVID-19 ...
COVID-19 situation in India during PLD, LD and UL phases. The total CRD cases due to the COVID-19 pandemic in India during the PLD, LD, and UL phases are shown in Table 2 and Figure 3. Figure 2 shows the spatio-temporal distribution of confirmed cases in the states of India. In the PLD period, the highest case numbers were reported in KL ...
In this study we cluster the districts of India in terms of the spread of COVID-19 and related variables such as population density and the number of specialty hospitals. Simulation using a compartment model is used to provide insight into differences in response to public health interventions. Two case studies of interest from Nizamuddin and Dharavi provide contrasting pictures of the success ...
Abstract. COVID-19 epidemic is declared as the public health emergency of international concern by the World Health Organisation in the last week of March 2020. This disease originated from China in December 2019 has already caused havoc around the world, including India. The first case in India was reported on 30th January 2020, with the cases ...
conomic impact of the Covid-19 pandemic in India. Even prior to the pandemic, the Indian econ-omy was marked by a slowdown of economic growth. and record increases in unemployment and poverty. Thus, India's capacity to deal with a new cr. sis was weak when the pandemic hit in March 2020. The economic crisis after March 2.
impact and drivers of poverty and inequality in India during the pandemic. We have three main findings. First, India has made significant progress in reducing poverty in recent decades, but the economic downturn associated with the COVID-19 pandemic is estimated to have temporarily increased poverty and inequality.