Skip to main content

Analyze

NIEHS Dashboard Data Sources

tschuetz

GitHub Repository

“To empower additional modeling efforts, the complete time series of all daily PVI scores and data are available at https://github.com/COVID19PVI/data. “

12 Key Indicators

“[The authors] assembled U.S. county- and state-level datasets into 12 key indicators across four major domains: current infection rates (infection prevalence, rate of increase), baseline population concentration (daytime density/traffic, residential density), current interventions (social distancing, testing rates), and health and environmental vulnerabilities (susceptible populations, air pollution, age distribution, comorbidities, health disparities, and hospital beds).”

Three types of modeling

“Our modeling efforts directly address the discussion in [6], by contextualizing factors such as racial differences with corrections for socioeconomic factors, health resource allocation, and co-morbidities, plus highlighting place- based risks and resource deficits that might explain spatial distributions. Specifically, three types of modeling efforts were performed and are regularly updated. First, epidemiological modeling on cumulative case- and death-related outcomes provides insights into the epidemiology of the pandemic. Second, dynamic time-dependent modeling provides similar outcome estimates as national-level models, but with county-level resolution. Finally, a Bayesian machine learning approach provides data-driven, short-term forecasts. “

Blackness and PM 2.5

“With respect to factors affecting COVID-19 related mortality, we find that the proportion of Black residents and the PM2.5 index of small-particulate air pollution are the most significant predictors among those included, reinforcing conclusions from previous reports[7]. An increase of one percentage point of Black residents is associated with a 3.3% increase in the COVID-19 death rate. The effect of a 1 g/m3 increase in PM2.5 is associated with an approximately 16% increase in the COVID-19 death rate, a value at the high end of a previously reported confidence interval from a report in late April 2020[7] when deaths had reached 38% of the current total.”

Machine learning and prediction

“To accurately predict future cases and mortality, it is necessary to account for the fluid nature of the data. Accordingly, we developed a Bayesian spatiotemporal random-effects model that jointly describes the log-observed and log-death counts to build local forecasts. Log-observed cases for a given day are predicted using known covariates (e.g., population density, social distancing metrics), a spatiotemporal random-effect smoothing component, and the time- weighted average number of cases for these counts. This smoothed time-weighted average is related to a Euler approximation of a differential equation; it provides modeling flexibility while approximating potential mechanistic models of disease spread. The smoothed case estimates are used in a similar spatiotemporal model predicting future log-death counts based on a geometric mean estimate of the estimated number of observed cases for the previous seven days as well as the other data streams. The resulting county-level predictions and corresponding confidence intervals are shown (Fig. 1)."

Source: https://www.researchgate.net/publication/343642027_The_COVID-19_Pandemi…

US NIEHS Dashboard Creators and Curators

tschuetz

Skylar W. Marvel1, John S. House2, Matthew Wheeler2, Kuncheng Song1, Yihui Zhou1, Fred A. Wright1,3, Weihsueh A. Chiu4, Ivan Rusyn4, Alison Motsinger-Reif2*, David M. Reif1*

Affiliations:

1 Bioinformatics Research Center, Department of Biological Sciences, North Carolina State University, Raleigh, NC 27695, USA.

2 Biostatistics and Computational Biology Branch, National Institute of Environmental Health Sciences, Research Triangle Park, NC, 27709, USA.

3 Department of Statistics, North Carolina State University, Raleigh, NC 27695, USA

4 Veterinary Integrative Biosciences, College of Veterinary Medicine and Biomedical Sciences, Texas A&M University, College Station, TX 77845, USA.

US NIEHS Dashboard Types of Data

tschuetz

“Data sources in the current model (version 11.2.1) include the Social Vulnerability Index (SVI) of the Centers for Disease Control and Prevention (CDC) for emergency response and hazard mitigation planning (Horney et al. 2017), testing rates from the COVID Tracking Project (Atlantic Monthly Group 2020), social distancing metrics from mobile device data ( https://www.unacast.com/covid19/social-distancing-scoreboard), and dynamic measures of disease spread and case numbers ( https://usafacts.org/issues/coronavirus/). Methodological details concerning the integration of data streams—plus the complete, daily time series of all source data since February 2020 and resultant PVI scores—are maintained on the public Github project page (COVID19PVI 2020). Over this period, the PVI has been strongly associated with key vulnerability-related outcome metrics (by rank-correlation), with updates of its performance assessment posted with model updates alongside data at the Github project page (COVID19PVI 2020).”

Source: https://ehp.niehs.nih.gov/doi/10.1289/EHP8690

US NIEHS Dashboard Motivations

tschuetz

Empowering local actoors

“We present the PVI Dashboard as a dynamic container for contextualizing these disparities. It is a modular tool that will evolve to incorporate new data sources and analytics as they emerge (e.g., concurrent flu infections, school and business reopening statistics, heterogeneous public health practices). This flexibility positions it well as a resource for integrated prioritization of eventual vaccine distribution and monitoring its local impact. The PVI Dashboard can empower local and state officials to take informed action to combat the pandemic by communicating interactive, visual profiles of vulnerability atop an underlying statistical framework that enables the comparison of counties and the evaluation of the PVI’s component data.”

US NIEHS Dashboard Visualization

tschuetz

Built with toxicology knowledge

“The software used to generate PVI scores and profiles from these data is freely available at https://toxpi.org

General visualization capabilities

“The interactive visualization within the PVI Dashboard is intended to communicate factors underlying vulnerability and empower community action [...] The visualization and quantification of county-level vulnerability indicators are displayed by a radar chart, where each of the 12 indicators comprises a “slice” of the overall PVI profile. On loading, the Dashboard displays the top 250 PVI profiles (by rank) for the current day. The data, PVI scores, and predictions are updated daily, and users can scroll through historical PVI and county outcome data. Individual profiles are an interactive map layer with numerous display options/filters that include sorting by overall score, filtering by combinations of slice scores, clustering by profile similarity (i.e. vulnerability “shape”), and searching for counties by name or state (Additional functionality is detailed in the Supplement). User selection of any county overlays the summary Scorecard and populates surrounding panels with county- specific information (Figure 1). The scrollable panels at left include plots of vulnerability drivers relative to the nation-wide distribution across all U.S. counties, with the location of the selected county delineated. The panels across the bottom of the Dashboard report cumulative county numbers of cases and deaths; timelines of cumulative cases, deaths, PVI score, and PVI rank; daily changes in cases and deaths for the most recent 14-day period (commonly used in reopening guidelines[6]; and predicted cases and deaths for a 7-day forecast horizon.”

Visualizing comparison and "peer counties"

“the multi-criteria filtering capabilities in the Dashboard were used to find a “peer county” for comparison. “

Source: https://ehp.niehs.nih.gov/doi/10.1289/EHP8690 and https://www.researchgate.net/publication/343642027_The_COVID-19_Pandemi…

NIEHS Dashboard Data Sources

tschuetz

GitHub Repository

“To empower additional modeling efforts, the complete time series of all daily PVI scores and data are available at https://github.com/COVID19PVI/data. “

12 Key Indicators

“[The authors] assembled U.S. county- and state-level datasets into 12 key indicators across four major domains: current infection rates (infection prevalence, rate of increase), baseline population concentration (daytime density/traffic, residential density), current interventions (social distancing, testing rates), and health and environmental vulnerabilities (susceptible populations, air pollution, age distribution, comorbidities, health disparities, and hospital beds).”

Three types of modeling

“Our modeling efforts directly address the discussion in [6], by contextualizing factors such as racial differences with corrections for socioeconomic factors, health resource allocation, and co-morbidities, plus highlighting place- based risks and resource deficits that might explain spatial distributions. Specifically, three types of modeling efforts were performed and are regularly updated. First, epidemiological modeling on cumulative case- and death-related outcomes provides insights into the epidemiology of the pandemic. Second, dynamic time-dependent modeling provides similar outcome estimates as national-level models, but with county-level resolution. Finally, a Bayesian machine learning approach provides data-driven, short-term forecasts. “

Blackness and PM 2.5

“With respect to factors affecting COVID-19 related mortality, we find that the proportion of Black residents and the PM2.5 index of small-particulate air pollution are the most significant predictors among those included, reinforcing conclusions from previous reports[7]. An increase of one percentage point of Black residents is associated with a 3.3% increase in the COVID-19 death rate. The effect of a 1 g/m3 increase in PM2.5 is associated with an approximately 16% increase in the COVID-19 death rate, a value at the high end of a previously reported confidence interval from a report in late April 2020[7] when deaths had reached 38% of the current total.”

Machine learning and prediction

“To accurately predict future cases and mortality, it is necessary to account for the fluid nature of the data. Accordingly, we developed a Bayesian spatiotemporal random-effects model that jointly describes the log-observed and log-death counts to build local forecasts. Log-observed cases for a given day are predicted using known covariates (e.g., population density, social distancing metrics), a spatiotemporal random-effect smoothing component, and the time- weighted average number of cases for these counts. This smoothed time-weighted average is related to a Euler approximation of a differential equation; it provides modeling flexibility while approximating potential mechanistic models of disease spread. The smoothed case estimates are used in a similar spatiotemporal model predicting future log-death counts based on a geometric mean estimate of the estimated number of observed cases for the previous seven days as well as the other data streams. The resulting county-level predictions and corresponding confidence intervals are shown (Fig. 1)."

Source: https://www.researchgate.net/publication/343642027_The_COVID-19_Pandemi…

US NIEHS Dashboard Creators and Curators

tschuetz

Skylar W. Marvel1, John S. House2, Matthew Wheeler2, Kuncheng Song1, Yihui Zhou1, Fred A. Wright1,3, Weihsueh A. Chiu4, Ivan Rusyn4, Alison Motsinger-Reif2*, David M. Reif1*

Affiliations:

1 Bioinformatics Research Center, Department of Biological Sciences, North Carolina State University, Raleigh, NC 27695, USA.

2 Biostatistics and Computational Biology Branch, National Institute of Environmental Health Sciences, Research Triangle Park, NC, 27709, USA.

3 Department of Statistics, North Carolina State University, Raleigh, NC 27695, USA

4 Veterinary Integrative Biosciences, College of Veterinary Medicine and Biomedical Sciences, Texas A&M University, College Station, TX 77845, USA.

Safe Side Off the Fence

EfeCengiz

The documentary is missing because the documentary is as safe as the fence it mocks in its title.
In the beginning we are asked to bear witness to the construction and use of the most devastation weapon of indiscriminate death the world has ever seen, and all the harm the construction of such a tool, yet its construction and its use is justified near instantaneously by repeating the same old propaganda.
In continuation, we are asked to bear witness to the continuous production of similar weapons and the devastation caused by the mishandling of the waste that accumulated in their production, yet why such a production took place is not only left unquestioned, but simple hints of cold war propaganda is left in their places for safekeeping.
In the end, we are asked to bear witness to a sombre victory, same spectres of patriotism and nation-of-God watching over our shoulder, yet how the pitiful situation of being forced to celebrate even such a small victory is never explored.
To sum up, we are shown people, good people, who struggle against the symptoms of a disease, yet this disease itself never named, nor challenged. It could not have been challenged, as it would force a complete change in their discourse.

If we sincerely would like to critique how the bodies of these workers were made disposable; used, harmed, dislocated and discharged as deemed necessary; if we wish to explore this topic as the necropolitical issue it is, we cannot stop halfway through. This inability to stop chasing connections, relationalities wherever it fits our ideology, is not a call for “objectivism”, it’s a call to respect the term of Anthropocene with all its rhizomatic connections.

An investigation of nuclear waste, that does not factor the use of its product, the socio-political effects of said product, and the historical conditions that even led to the possibility of producing it in such ways and such quantities, are of no use for us.  It cannot penetrate the barrier of capitalist realism. If it could, at least a single mention of workers unions would have existed. Instead, it has confessionals by atomic weapons lawyers whose heart goes out to these workers.
An America that refuse to face up to the fact that it is what it is by the great necropolitical project it led for hundreds of years, I struggle to accumulate sympathy for, what I can easily accumulate is rage however, which this documentary is missing..
Wish the documentary would have at least attempted to say something radical, instead of praising these disposable bodies for being patriotic about it. There are lives who never had false fences built as idols for safety, the collective idols of old America, the patriotic nation under God were built upon their broken bodies, what would you ask of them?

A complex set of data to understand and use.

lclplanche

One of the reasons for the specific nature of data and knowledge management in this context is the economic necessity and attractiveness of stable, high paying employment. In terms of the beginning of the accumulation of local knowledge regarding the risks to which the workers and the neighbors were being exposed to, this clearly played a role. For fear of losing their good paying jobs, and due to the military nature of their occupation, workers never told anything about their jobs to their families, or didn't ask questions that could have led to uncomfortable answers. This dynamic continued later, as we can see by the testimony of the worker who worked on the clean-up of the Weldon Springs site. The Priest also notes that in the neighborhood, people were wary of information leaking, as it might depreciate their property values.

Something else which we can observe is that, on top of the economic necessity for preserving one's job, there is also a sentiment of pride in doing one's work properly. A worker recalls that the relationship that the workers had to having to wear blue (and reduce your actions because you were contaminated) was that it was just part the job, and that they had a job to do. After the Weldon springs plant closed, there was a liberation of voices, and it was easier to report health concerns. The sentiment of pride in doing ones work properly is completed by a sentiment of patriotism. The same worker, Mr Schneider, said: "We have to believe what our government tells us, what the heck, uh. Best country in the world, I still think it is." Another example of the relationship between the job and the risk is the testimony of the clean-up worker who said that they shut of their Geiger counters, because they were "just going nuts". Here we can see that when the risk is too high, it becomes less visible, less understandable, because it is inescapable. Another reason for the difficulty of accumulating and sharing information, at least until the 1990s, is the priority of beating the communists. The discourse of emergency and national priority is not conducive to asking questions (as we can observe today in different ways).

The closing of the Weldon Springs plant coincided with the rise of environmental concerns in the USA and the change in environmental perspective had an impact on the categorization of places such as the Weldon Springs one, which became a Superfund site. This required a change in management at the department of energy because they started needing to have conversations and interactions with the public. This did not solve all the knowledge management problems however, because the measures put in place to deal with the injustices were insufficient compared to the nature of the events that had unfolded.

This is for multiple reasons. The first the nature of the risk means that the production of knowledge and regulations was complicated by a lack of understanding of the different medical pathways, conditions, and interactions which lead to the development of health problems. The number of people affected is also quite small, so the statistics may not appear to be significant. The second is the complexity of the accumulation of data in order to gain reparation and recognition, something which led to a movement to make the process more collective, in order to support the data finding and management process and make the knowledge of the administrative procedures consolidated. Finally, there were instances where the records of employee exposure were falsified, which meant that the access to this information was impossible.