The repo for mlbtradetrees.com. Analyze any trade in baseball history!

Last update: Nov 20, 2022

Related tags

Data Analysis BaseballTradeTrees

Overview

MLB Trade Trees

2.0.0 Release: November 24, 2021

www.mlbtradetrees.com allows you to view the trade tree of any player in MLB history.

What is a trade tree?

A trade tree will show you the complete details of a trade made by a team. Let's use Hall Of Fame candidate Cliff Lee for some examples, as he was traded multiple times throughout his career..

Here is the simplest form of his tree:

Cliff Lee was traded to the Mariners in 2009, and the Phillies received 3 players in return. All players the Phillies received in return either retired or became free agents, ending the tree with them.

Let's take a look at a more complicated example:

We can see the Mariners traded away Cliff Lee in 2010, receiving 4 players in return. 2 Players' lines end due to free agency and being picked up on waivers. 2 players' lines continue due to being traded away the next year. Some of those players' lines end however some continue to be traded away, so the tree grows. The tree finally ends in 2014 due to the final player hitting free agency.

Some of these trees can get pretty massive, spanning decades and dozens of trades. An example is Harry Simpson.

The Database

The transaction, team and player databases are thanks to Retrosheet. I will only update transactions when they update the database.

I have made some adjustments to the database that allows the search to go more smoothly:

Transaction database (data/sorted_transactions_final.csv)

Nan players involved in trades were changed to "PTBNL/Cash" (player to be named later). Most of the time you see this in a tree, it is a cash transaction.
Transactions of players that were released or granted free agency, then signed back with the team as their next transaction were deleted as it caused trees to end prematurely.
Franchise tags were added to the database to ensure that a team name change doesn't end a tree.

Team database (data/teams.csv)

All teams in the database received a franchise tag if they are part of the same franchise. They received a unique franchise code if they are an independant team.

Player database (data/teams.csv)

Nothing changed, just made a copy with the full name to easily get the user input. (static/css/searchable_players.csv)

Installing Locally

If you want to run the website locally:

install flask
install pandas
install JSGlue (allows Jinja to work in a js file)

Run server.py

What am I working on?

Updated Nov. 24 2021

Some players don't display properly due to having very old teams not listed in the teams database. Usually these are players before 1920. I just need to update the transactions database to find all teams without the franchise tag.
Adding stat support with pybaseball. I'd like to add total war contributed by players in a trade on the tree.
Searching for and filtering trees based on team, year, players in a tree, length of trees, etc.
Various UI enhancements, like clickable nodes to get a player's tree, collapsable nodes for easier readability.

The repo for mlbtradetrees.com. Analyze any trade in baseball history!

Related tags

Overview

MLB Trade Trees

2.0.0 Release: November 24, 2021

www.mlbtradetrees.com allows you to view the trade tree of any player in MLB history.

What is a trade tree?

The Database

Transaction database (data/sorted_transactions_final.csv)

Team database (data/teams.csv)

Player database (data/teams.csv)

Installing Locally

What am I working on?

Updated Nov. 24 2021

Owner

InDels analysis of CRISPR lines by NGS amplicon sequencing technology for a multicopy gene family.

A distributed block-based data storage and compute engine

MidTerm Project for the Data Analysis FT Bootcamp, Adam Tycner and Florent ZAHOUI

Elasticsearch tool for easily collecting and batch inserting Python data and pandas DataFrames

Aggregating gridded data (xarray) to polygons

Analysis scripts for QG equations

Parses data out of your Google Takeout (History, Activity, Youtube, Locations, etc...)

PySpark bindings for H3, a hierarchical hexagonal geospatial indexing system

Stitch together Nanopore tiled amplicon data without polishing a reference

PyClustering is a Python, C++ data mining library.

DaDRA (day-druh) is a Python library for Data-Driven Reachability Analysis.

This is a repo documenting the best practices in PySpark.

WaveFake: A Data Set to Facilitate Audio DeepFake Detection

Retail-Sim is python package to easily create synthetic dataset of retaile store.

Two phase pipeline + StreamlitTwo phase pipeline + Streamlit

Hidden Markov Models in Python, with scikit-learn like API

Office365 (Microsoft365) audit log analysis tool

talkbox is a scikit for signal/speech processing, to extend scipy capabilities in that domain.

Sentiment analysis on streaming twitter data using Spark Structured Streaming & Python

The lastest all in one bombing tool coded in python uses tbomb api