Weather data analysis and visualization – Big data tutorial Part 2/9 – Dataset

Big data analysis tutorial: Weather changes in the Carpathian Basin from 1900 to 2014 – Part 2/9

Preparation – Dataset

Weather data from NOAA (National Climatic Data Center) is accessible through a toolset that lets you select your area of interest interactively on a map.

A custom dataset can be downloaded with NOAA's map tool:

http://www.ncdc.noaa.gov/cdo-web/datasets

The detailed dataset used in this experiment can be downloaded here: Weather.zip

The experiment dataset covers the following weather stations of the Carpathian Basin, with their locations and daily historical data from 1 January 1900 to 14 February 2014:

  • GHCND:UPM00033397 – SAMBOR, UP
  • GHCND:UPM00033631 – UZHHOROD, UP
  • GHCND:ROM00015085 – BISTRITA, RO
  • GHCND:UPM00033634 – BEREGOVO, UP
  • GHCND:LOE00105562 – HURBANOVO, LO
  • GHCND:UPM00033398 – DROHOBYCH, UP
  • GHCND:UPM00033633 – MEJGOR E, UP
  • GHCND:ROM00015280 – VARFU OMUL, RO
  • GHCND:ROE00100898 – BAIA MARE, RO
  • GHCND:UPM00033638 – KHUST, UP
  • GHCND:SIE00115196 – MURSKA SOBOTA RAKICAN, SI
  • GHCND:ROE00108890 – CARANSEBES, RO
  • GHCND:ROE00108891 – CEAHLAU TOACA, RO
  • GHCND:UPM00033524 – DOLINA, UP
  • GHCND:SIE00115096 – VELIKI DOLENCI, SI
  • GHCND:ROE00100903 – DROBETA TURNU S., RO
  • GHCND:ROE00100902 – CLUJ NAPOCA, RO
  • GHCND:LO000011934 – POPRAD TATRY, LO
  • GHCND:ROE00100904 – TG JIU, RO
  • GHCND:UPM00033645 – YAREMCHA, UP
  • GHCND:RIE00111909 – NOVI SAD, RB
  • GHCND:AUW00034165 – VIENNA, AU
  • GHCND:HU000012942 – PECS POGANY, HU
  • GHCND:ROE00108899 – RAMNICU VALCEA, RO
  • GHCND:ROE00100829 – ARAD, RO
  • GHCND:ROE00108898 – OCNA SUGATAG, RO
  • GHCND:HRE00105203 – OSIJEK, HR
  • GHCND:ROE00108894 – DEVA, RO
  • GHCND:UPM00033647 – RAKHOV, UP
  • GHCND:ROE00108897 – MIERCUREA CIUC, RO
  • GHCND:UPM00033646 – POGEGEVSKAYA, UP
  • GHCND:UPM00033518 – NIGNIY STUDENIY, UP
  • GHCND:ROE00108901 – SIBIU, RO
  • GHCND:UPM00033514 – VELIKIY BEREZNY, UP
  • GHCND:LOE00116344 – KOSICE, LO
  • GHCND:UPM00033515 – PLAY, UP
  • GHCND:UPM00033516 – SLAVSKO, UP
  • GHCND:UPM00033657 – SELIATYN, UP
  • GHCND:UPM00033517 – NIGNIE VOROTA, UP
  • GHCND:AU000005901 – WIEN, AU
  • GHCND:UPM00033511 – TURKA, UP
  • GHCND:RIE00100818 – BELGRADE OBSERVATORY, RB
  • GHCND:UPM00033513 – STPIY, UP
  • GHCND:UPM00033651 – KOLOMYIA, UP
  • GHCND:LOE00116364 – ORAVSKA LESNA, LO

Each row of the experiment dataset has the following format:

STATION,STATION_NAME,ELEVATION,LATITUDE,LONGITUDE,DATE,PRCP,SNWD,SNOW,TMAX,TMIN,WESD

Where:

  • PRCP – Precipitation (tenths of mm)
  • SNWD – Snow depth (mm)
  • SNOW – Snowfall (mm)
  • TMAX – Maximum temperature (tenths of degrees C)
  • TMIN – Minimum temperature (tenths of degrees C)
  • WESD – Water equivalent of snow on the ground (tenths of mm)
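
Because several fields are stored as integer tenths, raw values have to be rescaled before they can be read as millimetres or degrees Celsius. The following is a minimal Python sketch of that conversion, not part of the original toolchain: the names DIVISOR, MISSING and to_physical are hypothetical helpers introduced here for illustration, and -9999 is treated as the missing-value marker that the dataset uses.

```python
# Divisors derived from the unit list: raw value / divisor = physical unit.
# DIVISOR, MISSING and to_physical are illustrative names, not part of the dataset.
DIVISOR = {
    "PRCP": 10,  # tenths of mm -> mm
    "SNWD": 1,   # already in mm
    "SNOW": 1,   # already in mm
    "TMAX": 10,  # tenths of degrees C -> degrees C
    "TMIN": 10,  # tenths of degrees C -> degrees C
    "WESD": 10,  # tenths of mm -> mm
}

MISSING = -9999  # marker the dataset uses when no measurement is available


def to_physical(field, raw):
    """Convert a raw integer measurement to its physical unit.

    Returns None when the raw value is the missing-data marker.
    """
    if raw == MISSING:
        return None
    return raw / DIVISOR[field]


if __name__ == "__main__":
    print(to_physical("TMAX", 24))     # raw 24 is 2.4 degrees C
    print(to_physical("WESD", -9999))  # missing measurement
```

Checking for the marker before dividing matters: otherwise a missing temperature would silently turn into -999.9 degrees C and distort any later aggregate.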

Remark: -9999 means that no measurement is available. Rows in which every measurement is -9999 are not included in the CSV.

Example from the CSV:

GHCND:ROE00108901,SIBIU RO,444,45.8,24.15,19610101,0,-9999,-9999,24,-14,-9999
GHCND:ROE00108901,SIBIU RO,444,45.8,24.15,19610102,0,-9999,-9999,53,-21,-9999
GHCND:ROE00108901,SIBIU RO,444,45.8,24.15,19610103,0,-9999,-9999,102,-20,-9999
GHCND:ROE00108901,SIBIU RO,444,45.8,24.15,19610104,0,-9999,-9999,124,-18,-9999
GHCND:ROE00108901,SIBIU RO,444,45.8,24.15,19610105,0,-9999,-9999,132,44,-9999
GHCND:ROE00108901,SIBIU RO,444,45.8,24.15,19610106,0,-9999,-9999,60,-13,-9999
GHCND:ROE00108901,SIBIU RO,444,45.8,24.15,19610107,0,-9999,-9999,67,-16,-9999
GHCND:ROE00108901,SIBIU RO,444,45.8,24.15,19610108,0,-9999,-9999,8,-46,-9999
GHCND:ROE00108901,SIBIU RO,444,45.8,24.15,19610109,0,-9999,-9999,-8,-55,-9999
GHCND:ROE00108901,SIBIU RO,444,45.8,24.15,19610110,15,-9999,-9999,6,-18,-9999
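
To show how rows like these could be loaded, here is a hedged Python sketch that uses only the standard library. It assumes the rows carry no header line (if the downloaded file starts with one, skip it first), that DATE is always in YYYYMMDD form as in the sample, and that the measurement columns are integers. The names COLUMNS, MEASUREMENTS, SAMPLE and parse_rows are hypothetical and exist only for this illustration.

```python
import csv
import io
from datetime import datetime

# Column order as given in the row format of the dataset.
COLUMNS = ["STATION", "STATION_NAME", "ELEVATION", "LATITUDE", "LONGITUDE",
           "DATE", "PRCP", "SNWD", "SNOW", "TMAX", "TMIN", "WESD"]
MEASUREMENTS = ("PRCP", "SNWD", "SNOW", "TMAX", "TMIN", "WESD")

# Three rows copied from the example above.
SAMPLE = """\
GHCND:ROE00108901,SIBIU RO,444,45.8,24.15,19610101,0,-9999,-9999,24,-14,-9999
GHCND:ROE00108901,SIBIU RO,444,45.8,24.15,19610109,0,-9999,-9999,-8,-55,-9999
GHCND:ROE00108901,SIBIU RO,444,45.8,24.15,19610110,15,-9999,-9999,6,-18,-9999
"""


def parse_rows(text):
    """Yield one dict per CSV row, with typed fields and None for -9999."""
    reader = csv.DictReader(io.StringIO(text), fieldnames=COLUMNS)
    for row in reader:
        record = {
            "station": row["STATION"],
            "name": row["STATION_NAME"],
            "elevation": float(row["ELEVATION"]),
            "latitude": float(row["LATITUDE"]),
            "longitude": float(row["LONGITUDE"]),
            "date": datetime.strptime(row["DATE"], "%Y%m%d").date(),
        }
        for field in MEASUREMENTS:
            value = int(row[field])
            # Keep the raw integer units; only replace the missing marker.
            record[field] = None if value == -9999 else value
        yield record


records = list(parse_rows(SAMPLE))

if __name__ == "__main__":
    for rec in records:
        print(rec["date"], rec["TMAX"], rec["TMIN"], rec["SNWD"])
```

The measurements are deliberately left in their raw integer units here, so that the parsing step stays separate from any unit conversion done later in the analysis.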