Imported the SQLite Wildfires dataset from Kaggle into R Studio. The Wildfires dataset is a spatial database of wildfires that occurred in the United States from 1992 to 2015 generated to support the national Fire Program Analysis (FPA) system. It is made up of roughly 1.8 million wildfires.
Created a new data frame in R by selecting most relevant columns, reducing the size from 1.4 GB to 134 MB.
Columns selected included(summary): Day of Year Discovered, Year Discovered, State, Cause of the Fire, Latitude, Longitude, Estimated Size of the Fire
Exported data frame to a csv.
Imported csv into BigQuery cloud UI.
Added new columns for Month of the Year, and Year/Month Combined. Ran SQL queries to populate new columns.
Pulled monthly Temperature and Drought data from NOAA for the years included in the Wildfire dataset, by state.
Imported each state table into Excel as combined data sheet for cleaning.
Imported combined states table into BigQuery.
Cloned BigQuery fires table to create a fires_climate table and added fields for Drought Severity, Precipitation, Temperature values.
Used query language to populate new fires_climate columns with data from the combined states table by state and YearMonth.
Linked result to Google's Data Studio for some exploratory visual analysis.
Looked at acres burned by state, year, month, and cause of fire..