Skip to content

alt

NOAA – Volcanoes

Introduction

NCEI archives and assimilates tsunami, earthquake and volcano data to support research, planning, response and mitigation. Long-term data, including photographs, can be used to establish the history of natural hazard occurrences and help mitigate against future events. The NCEI NOAA Volcanic Eruption Database is a global listing of over 500 significant eruptions which includes information on the latitude, longitude, elevation, type of volcano, and last known eruption. A significant eruption is classified as one that meets at least one of the following criteria: caused fatalities, caused moderate damage (approximately $1 million or more), with a Volcanic Explosivity Index (VEI) of 6 or larger, caused a tsunami, or was associated with a major earthquake. For a complete list of current and past activity for all volcanoes on the planet active during the last 10,000 years

Source: NOAA

Tags: Climate and Environment, Disasters, Time-series, Risk, Daily

Modules

Scrapping:

Below is the API endpoints and the parameters that we need to pass to get the data.

https://www.ngdc.noaa.gov/hazel/hazard-service/api/v1/{dataset}?maxYear={end_year}&minYear={start_year}

Geocoder:

Coordinates are added to the metadata for the country. Region and region code are also appended. Geocoder library is used for getting coordinates. We also have a separate JSON file for country’s coordinates to avoid calling third party library to make geocoding process more efficient and faster.

Standardization:

Additional information like sample frequency, units, source and description are included in the metadata. Function for fetching ISO country code and appending it is present in standardization. Predefined domain and subdomain are added in this step.

Cleaning:

Duplicate and additional columns are removed from the data. Location names are rectified and country names are formatted correctly.

Metadata

Metadata Attributes

Attributes Descriptions
timestamp standard timestamp used for the timeseries , tsunami was observed
id Id used to connect timeseries data to the metadata
map_coordinates Latitude and Longitude of the station location (geojson format
country The country where the tsunami effects were observed.
country_code ISO 3-letter country code
description description of the dataset
domain Predefined domain by Taiyo.
name name of the data
original_id in this case we create our own original id using {city_measure_indicator}
region region for a country according to World Bank Standards}
region_code region code for a region according to World Bank Standards.
sample_frequency frequency in which data gets updated on the source
sub_domain Predefined subdomain by Taiyo.
time_of_sampling time of data collection
date_of_sampling date of data collection
timezone Timezone for the time and date
units Type of value stored in timeseries
measure type of measure (min, max, median)
url url for the each of the datasets.
latest_timestamp_id mongoDB id for latest timestamp in the timeseries
income_level region code for a region according to World Bank Standards.
publish
area area where Volcanoes effects were observered
damageAmountOrder For those events not offering an exact number of houses damaged, the following four-level scale was used to classify the damage and was listed in the Houses Destroyed column. If the actual number of houses destroyed was listed, a descriptor was also added for search purposes.
housesDestroyedAmountOrder
deaths Whenever possible, numbers of deaths are listed.
deathsAmountOrder When a description was found in the historical literature instead of an actual number of deaths, this value was coded and listed in the Deaths column. If the actual number of deaths was listed, a descriptor was also added for search purposes; 0 None; 1 Few (~1 to 50 deaths); 2 Some (~51 to 100 deaths); 3 Many (~101 to 1000 deaths); 4 Very many (over 1000 deaths)
injuries Whenever possible, numbers of injuries from the runup are listed.
injuriesAmountOrder When a description was found in the historical literature instead of an actual number of injuries, this value was coded and listed in the Injuries column. If the actual number of injuries was listed, a descriptor was also added for search purposes. 0 None;1 Few (~1 to 50 injuries); 2 Some(~51 to 100 injuries); 3 Many (~101 to 1000 injuries); 4 Very many (over 1000 injuries)
housesDestroyed Whenever possible, numbers of houses destroyed are listed.
housesDamagedAmountOrder Valid values: 0 to 4 For those events not offering an exact number of houses damaged, the following four-level scale was used to classify the damage and was listed in the Houses Destroyed column. If the actual number of houses destroyed was listed, a descriptor was also added for search purpos
damageMillionsDollars The value in the Damage column should be multiplied by 1,000,000 to obtain the actual dollar amount.
sourceEventValidity Validity of the actual tsunami occurrence is indicated by a numerical rating of the reports of that event: -1 erroneous entry, 0 event that only caused a seiche or disturbance in an inland river, 1 very doubtful tsunami, 2 questionable tsunami, 3 probable tsunami, 4 definite tsunami
sourceCauseCode The source of the tsunami: 0 Unknown 1 Earthquake, 2 Questionable Earthquake, 3 Earthquake and Landslide, 4 Volcano and Earthquake, 5 Volcano, Earthquake, and Landslide; 6 Volcano; 7 Volcano and Landslide; 8 Landslide; 9 Meteorological; 10 Explosion; 11 Astronomical Tide
tsunamiEventId Event Id of Tsunami linked to this volcano erruption
earthquakeEventId Event Id of Earthqauke linked to this volcano erruption
doubtful Doubtful values; n Runup entry was not doubtful; y Runup entry was doubtful;m The waves likely had a meteorologic source, and thus were not true tsunami waves
volcanoLocationId Location Id of Volcano
name The name of the volcano as listed by the Smithsonian Institution, Global Volcanism Program.
location location of the volcano
elevation elevation of Volcanoes in meters (m)
morphology
agent
deathsTotal Total Deaths
deathsAmountOrderTotal
damageMillionsDollarsTotal
damageAmountOrderTotal
housesDestroyedTotal
housesDestroyedAmountOrderTotal
significant
status status of Volcano
timeErupt erruption Time
vei A widely used classification scheme to describe the size of explosive eruptions. It is based principally on the erupted mass or volume of a deposit Newhall and Self, 1982. Historic eruptions that were definitely explosive, but carry no other descriptive information are assigned a default VEI of 2
injuriesTotal Total Injuries
injuriesAmountOrderTotal
missingAmountOrder
missingAmountOrderTotal
missing Whenever possible, numbers of injuries listed
missingTotal Toatl missing

Data Flow

The above data pipeline runs on Argo and it will be executed on a periodic frequency.

DAGs:

  • NOAA-Volcanoes: Total No of DAGs file is 1

Taiyo Data Format

Entity NOAA-Volcanoes
Frequency Even Based
Updated On 09-06-2022 UTC 12:14:16 PM
- -
Coverage covering all the countries with the
Uncertainties -
## Scope for Improvement

Following can be improved in the next version of the data product:

  • Every time Argo Workflow run, we overwrite existing data on the S3 bucket.
  • In future we might want to improve it to only scrap the data that we don’t already have.
  • https://www.ngdc.noaa.gov/hazard/tsu.shtml
  • https://www.ngdc.noaa.gov/hazel/view/hazards/tsunami/runup-search