NOAA – Volcanoes
Introduction
NCEI archives and assimilates tsunami, earthquake and volcano data to support research, planning, response and mitigation. Long-term data, including photographs, can be used to establish the history of natural hazard occurrences and help mitigate against future events. The NCEI NOAA Volcanic Eruption Database is a global listing of over 500 significant eruptions which includes information on the latitude, longitude, elevation, type of volcano, and last known eruption. A significant eruption is classified as one that meets at least one of the following criteria: caused fatalities, caused moderate damage (approximately $1 million or more), with a Volcanic Explosivity Index (VEI) of 6 or larger, caused a tsunami, or was associated with a major earthquake. For a complete list of current and past activity for all volcanoes on the planet active during the last 10,000 years
Source: NOAA
Tags: Climate and Environment, Disasters, Time-series, Risk, Daily
Modules
Scrapping:
Below is the API endpoints and the parameters that we need to pass to get the data.
https://www.ngdc.noaa.gov/hazel/hazard-service/api/v1/{dataset}?maxYear={end_year}&minYear={start_year}
Geocoder:
Coordinates are added to the metadata for the country. Region and region code are also appended. Geocoder library is used for getting coordinates. We also have a separate JSON file for country’s coordinates to avoid calling third party library to make geocoding process more efficient and faster.
Standardization:
Additional information like sample frequency, units, source and description are included in the metadata. Function for fetching ISO country code and appending it is present in standardization. Predefined domain and subdomain are added in this step.
Cleaning:
Duplicate and additional columns are removed from the data. Location names are rectified and country names are formatted correctly.
Metadata
Metadata Attributes
Attributes | Descriptions |
---|---|
timestamp | standard timestamp used for the timeseries , tsunami was observed |
id | Id used to connect timeseries data to the metadata |
map_coordinates | Latitude and Longitude of the station location (geojson format |
country | The country where the tsunami effects were observed. |
country_code | ISO 3-letter country code |
description | description of the dataset |
domain | Predefined domain by Taiyo. |
name | name of the data |
original_id | in this case we create our own original id using {city_measure_indicator} |
region | region for a country according to World Bank Standards} |
region_code | region code for a region according to World Bank Standards. |
sample_frequency | frequency in which data gets updated on the source |
sub_domain | Predefined subdomain by Taiyo. |
time_of_sampling | time of data collection |
date_of_sampling | date of data collection |
timezone | Timezone for the time and date |
units | Type of value stored in timeseries |
measure | type of measure (min, max, median) |
url | url for the each of the datasets. |
latest_timestamp_id | mongoDB id for latest timestamp in the timeseries |
income_level | region code for a region according to World Bank Standards. |
publish | |
area | area where Volcanoes effects were observered |
damageAmountOrder | For those events not offering an exact number of houses damaged, the following four-level scale was used to classify the damage and was listed in the Houses Destroyed column. If the actual number of houses destroyed was listed, a descriptor was also added for search purposes. |
housesDestroyedAmountOrder | |
deaths | Whenever possible, numbers of deaths are listed. |
deathsAmountOrder | When a description was found in the historical literature instead of an actual number of deaths, this value was coded and listed in the Deaths column. If the actual number of deaths was listed, a descriptor was also added for search purposes; 0 None; 1 Few (~1 to 50 deaths); 2 Some (~51 to 100 deaths); 3 Many (~101 to 1000 deaths); 4 Very many (over 1000 deaths) |
injuries | Whenever possible, numbers of injuries from the runup are listed. |
injuriesAmountOrder | When a description was found in the historical literature instead of an actual number of injuries, this value was coded and listed in the Injuries column. If the actual number of injuries was listed, a descriptor was also added for search purposes. 0 None;1 Few (~1 to 50 injuries); 2 Some(~51 to 100 injuries); 3 Many (~101 to 1000 injuries); 4 Very many (over 1000 injuries) |
housesDestroyed | Whenever possible, numbers of houses destroyed are listed. |
housesDamagedAmountOrder | Valid values: 0 to 4 For those events not offering an exact number of houses damaged, the following four-level scale was used to classify the damage and was listed in the Houses Destroyed column. If the actual number of houses destroyed was listed, a descriptor was also added for search purpos |
damageMillionsDollars | The value in the Damage column should be multiplied by 1,000,000 to obtain the actual dollar amount. |
sourceEventValidity | Validity of the actual tsunami occurrence is indicated by a numerical rating of the reports of that event: -1 erroneous entry, 0 event that only caused a seiche or disturbance in an inland river, 1 very doubtful tsunami, 2 questionable tsunami, 3 probable tsunami, 4 definite tsunami |
sourceCauseCode | The source of the tsunami: 0 Unknown 1 Earthquake, 2 Questionable Earthquake, 3 Earthquake and Landslide, 4 Volcano and Earthquake, 5 Volcano, Earthquake, and Landslide; 6 Volcano; 7 Volcano and Landslide; 8 Landslide; 9 Meteorological; 10 Explosion; 11 Astronomical Tide |
tsunamiEventId | Event Id of Tsunami linked to this volcano erruption |
earthquakeEventId | Event Id of Earthqauke linked to this volcano erruption |
doubtful | Doubtful values; n Runup entry was not doubtful; y Runup entry was doubtful;m The waves likely had a meteorologic source, and thus were not true tsunami waves |
volcanoLocationId | Location Id of Volcano |
name | The name of the volcano as listed by the Smithsonian Institution, Global Volcanism Program. |
location | location of the volcano |
elevation | elevation of Volcanoes in meters (m) |
morphology | |
agent | |
deathsTotal | Total Deaths |
deathsAmountOrderTotal | |
damageMillionsDollarsTotal | |
damageAmountOrderTotal | |
housesDestroyedTotal | |
housesDestroyedAmountOrderTotal | |
significant | |
status | status of Volcano |
timeErupt | erruption Time |
vei | A widely used classification scheme to describe the size of explosive eruptions. It is based principally on the erupted mass or volume of a deposit Newhall and Self, 1982. Historic eruptions that were definitely explosive, but carry no other descriptive information are assigned a default VEI of 2 |
injuriesTotal | Total Injuries |
injuriesAmountOrderTotal | |
missingAmountOrder | |
missingAmountOrderTotal | |
missing | Whenever possible, numbers of injuries listed |
missingTotal | Toatl missing |
Data Flow
The above data pipeline runs on Argo and it will be executed on a periodic frequency.
DAGs:
- NOAA-Volcanoes: Total No of DAGs file is 1
Taiyo Data Format
Entity | NOAA-Volcanoes |
---|---|
Frequency | Even Based |
Updated On | 09-06-2022 UTC 12:14:16 PM |
- | - |
Coverage | covering all the countries with the |
Uncertainties | - |
## Scope for Improvement |
Following can be improved in the next version of the data product:
- Every time Argo Workflow run, we overwrite existing data on the S3 bucket.
- In future we might want to improve it to only scrap the data that we don’t already have.
Useful Links
- https://www.ngdc.noaa.gov/hazard/tsu.shtml
- https://www.ngdc.noaa.gov/hazel/view/hazards/tsunami/runup-search