We're Team Arson and we're using the power of predictive modeling to combat wildfires.

Inspiration

There have been many wildfires in California in recent years, and many of the most recent ones have been uncontained. The government does not have the capacity to fight so many wildfires at once, so it has to pick and choose which fires to bring under control. That choice should be based on wildfire and wind data in order to minimize the damage caused. We should also prioritize fires that can spread across many counties or that have a high chance of spreading further.

What it does

Our project consists of a web app with an interactive map. We model the wildfire as an MDP (Markov Decision Process) and determine how at risk each county is based on the fire location(s).

How we built it

We split the project into two main parts: the web app and the AI.

Website

Artificial Intelligence

  • Represent the wildfire as an MDP (Markov Decision Process)
    • States: Counties
    • Actions: Traversing Counties
    • Probability distribution: generated from wind data
    • Transition Model: generated from wind data
    • Reward function: Uniform for every county burned (prone to change if scaled up)
  • Use the Bellman equation to iterate through counties and propagate the fire
    • Utility values ranging between 0 and 1 represent how at risk a county is
    • Run until utility values reach equilibrium or until 100 iterations are run
    • Gamma = 0.8
  • Represent the map as a graph
    • Counties are nodes
    • Wind speeds are edges
    • Assign each county a risk (for the reward function)
    • Spawn fires on specific counties
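
The steps above can be sketched in a few lines of Python. This is only an illustration of one plausible reading of the model, not our actual implementation. It uses a Bellman-style expectation backup (no max over actions, which is a simplification), pins burning counties at utility 1, and turns the wind speeds on a county's incoming edges into probabilities by normalizing them. The county names, the wind speeds, the normalization scheme, the convergence tolerance, and the function names are all assumptions made for this example. Only gamma = 0.8 and the 100-iteration cap come from the description above.

```python
GAMMA = 0.8       # discount factor, as stated above
MAX_ITERS = 100   # iteration cap, as stated above
TOL = 1e-6        # equilibrium threshold (assumed; not specified above)


def transition_probs(wind):
    """Turn wind speeds into probabilities over each county's incoming edges.

    `wind` maps (source, target) -> wind speed blowing from source to target.
    Returns target -> {source: probability}. The normalization is an assumption.
    """
    incoming = {}
    for (src, dst), speed in wind.items():
        incoming.setdefault(dst, {})[src] = speed
    probs = {}
    for dst, sources in incoming.items():
        total = sum(sources.values())
        if total > 0:
            probs[dst] = {src: s / total for src, s in sources.items()}
    return probs


def county_risk(counties, wind, burning,
                gamma=GAMMA, max_iters=MAX_ITERS, tol=TOL):
    """Iterate utilities until they stop changing or max_iters is reached."""
    probs = transition_probs(wind)
    # Burning counties start (and stay) at utility 1; all others start at 0.
    utility = {c: (1.0 if c in burning else 0.0) for c in counties}
    for _ in range(max_iters):
        new_utility = {}
        for c in counties:
            if c in burning:
                new_utility[c] = 1.0
            else:
                # Discounted expected utility of the upwind neighbors.
                new_utility[c] = gamma * sum(
                    p * utility[src] for src, p in probs.get(c, {}).items()
                )
        delta = max((abs(new_utility[c] - utility[c]) for c in counties),
                    default=0.0)
        utility = new_utility
        if delta < tol:
            break
    return utility


# Hypothetical four-county map: nodes are counties, edges carry wind speeds.
counties = ["A", "B", "C", "D"]
wind = {("A", "B"): 10.0, ("B", "C"): 5.0, ("A", "C"): 5.0, ("C", "D"): 8.0}
risk = county_risk(counties, wind, burning={"A"})
```

With a fire spawned on the hypothetical county A, the utilities fall off with distance downwind: B comes out higher than C, and C higher than D. Because the incoming probabilities sum to at most 1 and gamma is below 1, every utility stays between 0 and 1.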

Challenges we ran into

Our project has a fairly large scope. We needed to develop a model and integrate it with a web app, which required extensive knowledge of AWS utilities and crisp communication between team members. The machine learning portion was difficult because we had to decide what type of model to use for the wildfire and how to assign reward and utility values.

Accomplishments that we're proud of

We were able to integrate the web app with the model really quickly. This was surprising, since connecting the pieces together usually surfaces a lot of bugs. It was also Austin, Raaj, and Romuz's first hackathon, and this was a fairly complex project compared to a standard web app.

What we learned

This hackathon was a first for many of us. This was the first time any of us had implemented a machine learning model and connected it to a web app.

This was my first time at a hackathon and I couldn't have asked for better teammates than Jerry, Raaj, and Romuz. I learned so much over the last two days about machine learning, data science, React, and working as a team to help tackle some of California's greatest challenges. - Austin Rivard

As a first-year student, I learned a lot of new skills while working with this team. I was happy to be a member of such an agile team. I learned numerous new concepts, such as working with AWS, writing algorithms, and graph data structures. - Romuz Abdulhamidov

What's next for Arson

  • Scale up to all of California to generate a better map during wildfire season
  • Generate more accurate reward values for each county burned
  • Incorporate type 2 rewards based on R(state, action)
    • Wildfire gets bigger as it burns more land
    • Wildfire gets smaller in the presence of firefighters
  • Automatically train and deploy models by integrating real-time data for wind and wildfires

Owner
Jerry Lee
software engineer