Haphazard scripts for scraping bitcoin/bitcoin data from GitHub

Last update: Oct 12, 2022

Related tags

Web Crawling bitcoin-github-scrape

Overview

This is a quick-and-dirty tool for scraping pull request and comment data from the bitcoin/bitcoin repository on GitHub.

Each output/<pr number> folder contains:

comments.json: an aggregated list of both issue and review comments, in GitHub's original format
commits.json: a list of commit objects corresponding to the PR, in GitHub's original format
pr.json: the pull request object, in GitHub's original format
comments_abbrev.csv: abbreviated representation of each comment in CSV format
pr_abbrev.csv: abbreviated representation of the PR in CSV format
done: a sentinel file recording the datetime at which the PR data was retrieved
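
As a rough illustration of how the folder layout above might be consumed, here is a minimal Python sketch. The function name load_pr_folder and the "complete" key are hypothetical and not part of the tool. It assumes only that the three .json files hold valid JSON and that a done file exists once scraping has finished.

```python
import json
from pathlib import Path


def load_pr_folder(pr_dir):
    """Load the raw GitHub JSON for one scraped PR folder.

    Hypothetical helper: the file names come from the list above,
    everything else is an assumption made for illustration.
    """
    pr_dir = Path(pr_dir)
    data = {}
    # pr.json, commits.json and comments.json keep GitHub's original format.
    for name in ("pr", "commits", "comments"):
        with open(pr_dir / f"{name}.json", encoding="utf-8") as f:
            data[name] = json.load(f)
    # The done sentinel marks a folder whose data was fully retrieved.
    data["complete"] = (pr_dir / "done").exists()
    return data
```

Calling load_pr_folder on one of the output/<pr number> folders would return a dict with the keys pr, commits, comments and complete.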

Limitations

Open PRs (or any PRs that are expected to be updated) are not handled properly yet: once the done sentinel is created, the data for that PR is never refreshed. This could be fixed by comparing the PR's timestamps against the done sentinel and overwriting stale data.
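
The timestamp comparison suggested above could look roughly like the following Python sketch. It is not the tool's implementation. The function name is_stale is hypothetical, and because the format of the datetime stored in done is not documented here, the sketch uses the sentinel file's modification time as a stand-in. The remote_updated_at argument is assumed to be a freshly fetched updated_at value from GitHub's pull request object, an ISO 8601 string in UTC.

```python
from datetime import datetime, timezone
from pathlib import Path


def is_stale(done_path, remote_updated_at):
    """Return True if the PR should be scraped again.

    Assumption: the sentinel's mtime approximates the retrieval time,
    since the format of the datetime stored inside it is not known here.
    """
    done_path = Path(done_path)
    if not done_path.exists():
        # Never fully scraped, so there is nothing to compare against.
        return True
    fetched_at = datetime.fromtimestamp(done_path.stat().st_mtime, tz=timezone.utc)
    # GitHub timestamps look like 2022-10-12T00:00:00Z; normalise the
    # trailing Z so older Python versions can parse it.
    updated_at = datetime.fromisoformat(remote_updated_at.replace("Z", "+00:00"))
    return updated_at > fetched_at
```

A scraper could call this before skipping a folder and, when it returns True, overwrite the folder's files and rewrite the sentinel.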


Owner

James O'Beirne
