😷 Dowload dos documentos da CPI da Pandemia

Overview

Documentos da CPI da Pandemia

A CPI da Pandemia recebeu milhares de documentos pΓΊblicos, todos disponibilizados no site do Senado Federal.

Mas como clicar um por um leva tempo, automatizamos o download e descompactaΓ§Γ£o de todos esses arquivos, facilitando assim nΓ£o sΓ³ o acesso, mas tambΓ©m buscas nos arquivos com ferramentas como Evernote, Spotlight, etc.

Avisos importantes

Nome dos arquivos

Para sincronizar esses arquivos na nuvem e evitar erros no sistema de arquivos, todos os nomes de arquivos foram normalizados retirando acentuaΓ§Γ£o e caracteres especiais.

Por exemplo, um arquivo chamado OfΓ­cio.text Γ© renomeado para Oficio.txt.

Erros

Algumas links para baixar os documentos pΓΊblicos nΓ£o funcionam. Mesmo com estratΓ©gias de repetir a tentativa em caso de erro, pode ser que nem todos os arquivos listados estejam, de fato, disponΓ­vel.

Links que nΓ£o puderem ser baixados sΓ£o listados no arquivo erros.txt.

SΓ³ quero baixar os arquivos

O resultado estΓ‘ disponibilizado nesse diretΓ³rio no Dropbox, e vocΓͺ pode baixar tudo com um clique.

Vou tentar manter esse diretΓ³rio atualizado executando esse programa cerca de 3x semana.

Sou hacker e quero mais

VocΓͺ tambΓ©m pode baixar tudo direto do Senado Federal, instalando esse pacote e digitando apenas um comando.

UtilizaΓ§Γ£o com docker

Requer Docker:

$ docker build -t docs-cpi-pandemia .
$ docker run -it -v $PWD/data:/data docs-cpi-pandemia

Os arquivos serΓ£o baixados em um diretΓ³rio data/ dentro da pasta onde vocΓͺ executou esse comando.

InstalaΓ§Γ£o sem docker

Requer Python 3.9 e Poetry:

$ poetry install

Uso

$ poetry run python -m cpi_pandemia

Existem opΓ§Γ΅es que podem ser configuradas, as instruΓ§Γ΅es e valores padrΓ΅es podem ser vistos adicionando --help ao final do comando.

Owner
Eduardo Cuducos
Sociologist, geek, bread baker, and sports fan. Co-founder of OperaΓ§Γ£o Serenata de Amor.
Eduardo Cuducos
𝐴 π‘‘π‘’π‘™π‘’π‘”π‘Ÿπ‘Žπ‘š π‘π‘œπ‘‘ π‘‘β„Žπ‘Žπ‘‘ π‘π‘Žπ‘› π‘‘π‘œπ‘€π‘›π‘™π‘œπ‘Žπ‘‘ π‘£π‘–π‘‘π‘’π‘œ π‘Žπ‘›π‘‘ π‘Žπ‘’π‘‘π‘–π‘œ π‘“π‘Ÿπ‘œπ‘š π‘¦π‘œπ‘’π‘‘π‘’π‘π‘’ π‘Žπ‘›π‘‘ π‘£π‘–π‘‘π‘’π‘œ 𝑀𝑒𝑏𝑠𝑖𝑑𝑒𝑠 π‘žπ‘’π‘–π‘π‘˜π‘™π‘¦

𝐴 π‘‘π‘’π‘™π‘’π‘”π‘Ÿπ‘Žπ‘š π‘π‘œπ‘‘ π‘‘β„Žπ‘Žπ‘‘ π‘π‘Žπ‘› π‘‘π‘œπ‘€π‘›π‘™π‘œπ‘Žπ‘‘ π‘£π‘–π‘‘π‘’π‘œ π‘Žπ‘›π‘‘ π‘Žπ‘’π‘‘π‘–π‘œ π‘“π‘Ÿπ‘œπ‘š π‘¦π‘œπ‘’π‘‘π‘’π‘π‘’ π‘Žπ‘›π‘‘ π‘£π‘–π‘‘π‘’π‘œ 𝑀𝑒𝑏𝑠𝑖𝑑𝑒𝑠 π‘žπ‘’π‘–π‘π‘˜π‘™π‘¦

SOCIAL MECHANIC 2 Aug 04, 2022
Fetch McDonald invoices from mailbox and merge them to one PDF file.

concatenate Fetch McDonald invoices from mailbox and merge them to one PDF file. Description This script will fetch all McDonald invoice pdfs from a p

3 Oct 06, 2022
Bulk Downloader for Reddit

saveddit is a bulk media downloader for reddit pip3 install saveddit Setting up authorization Register an application with Reddit Write down your clie

Pranav 136 Jan 03, 2023
AI Dungeon Catalog Archive Toolkit

AI Dungeon Content Archive Toolkit (AID CAT) AID CAT is a command-line utility that will allow you to download JSON backups of: Your private and publi

Mimi 31 Oct 26, 2022
Smule Video Downloader

Smule Video Downloader Using Requests,Re & Urllib Installation - apt install git (for vps) or pkg install git (for termux)

Hansen Gianto 4 Aug 31, 2022
A Simple YouTube Video Downloader With Python

Simple YouTube Video Downloader Simple YouTube Video Downloader is an open source project with a very simple UI that tries to speed up the process of

Brian Han 2 Jan 03, 2022
A simple GUI video downloader built off of the python module 'yt-dlp'

Simple-Youtube-DL-Gui Supported Operating Systems Windows 7 (x64), Windows 8 (x64), and Windows 10 (x64) How to use Main Gui Extract program from arch

12 Dec 30, 2022
Automatically download and crop key information from the arxiv daily paper. (cpu version)

Automatically download and crop key information from the arxiv daily paper. (cpu version)

HeoLis 4 Jul 30, 2022
A Unit3D Mass Release Downloader

Unit3DMassDL A Unit3D Mass Release Downloader. Currently supports Aither. Installation Ensure Python 3 is installed in your system. Run the following

2 Apr 11, 2022
Download h3t4y for later read

h3nt4y_dl Download h3nt4y for later read TαΊ£i h3nt4y về đọc thΓ΄i nΓ o cΓ‘c bαΊ‘n Ζ‘iiiiiiii! (TαΊ£i tα»« h**taivn nhΓ©) Usage: python get_that_ht4i.py New versio

1 Dec 03, 2021
A Python package for downloading / archiving all available episodes from a podcast RSS feed.

allcasts πŸ“» πŸ—ƒ A Python package for downloading all available episodes from a podcast RSS feed. Useful for making private archives of your favourite p

Lewis Gentle 5 Nov 20, 2022
Using Youtube downloader is the fast and easy way to download and save any YouTube video.

Youtube video downloader using Django Using Django as a backend along with pytube module to create Youtbue Video Downloader. https://yt-videos-downloa

Suman Raj Khanal 10 Jun 18, 2022
Fully Automated YouTube Channel ▢️with Added Extra Features.

Fully Automated Youtube Channel β–’β–ˆβ–€β–€β–ˆ β–ˆβ–€β–€β–ˆ β–€β–€β–ˆβ–€β–€ β–€β–€β–ˆβ–€β–€ β–ˆβ–‘β–‘β–ˆ β–ˆβ–€β–€β–„ β–ˆβ–€β–€ β–ˆβ–€β–€β–ˆ β–’β–ˆβ–€β–€β–„ β–ˆβ–‘β–‘β–ˆ β–‘β–‘β–ˆβ–‘β–‘ β–‘β–’β–ˆβ–‘β–‘ β–ˆβ–‘β–‘β–ˆ β–ˆβ–€β–€β–„ β–ˆβ–€β–€ β–ˆβ–„β–„β–€ β–’β–ˆβ–„β–„β–ˆ β–€β–€β–€β–€ β–‘β–‘β–€β–‘β–‘ β–‘β–’β–ˆβ–‘β–‘ β–‘β–€β–€β–€ β–€β–€β–€β–‘

sam-sepiol 249 Jan 02, 2023
Python code to crawl computer vision papers from top CV conferences. Currently it supports CVPR, ICCV, ECCV, NeurIPS, ICML, ICLR, SIGGRAPH

Python code to crawl computer vision papers from top CV conferences. Currently it supports CVPR, ICCV, ECCV, NeurIPS, ICML, ICLR, SIGGRAPH. It leverages selenium, a website testing framework to crawl

Xiaoyang Huang 39 Nov 21, 2022
YouTube to MP3 or 4, you get to choose...

UTubeToMP YouTube to MP3 or 4, you get to choose... If you don't wanna git clone andor dont wanna install python. Here: Repl.it Instructions: Pretty s

1 Jan 29, 2022
Download videos and audio with a graphical interface in python

Youtube-Downloader Download videos and audio with a graphical interface in python Windows To run windows using Command Prompt python main.py linux To

2 Jan 07, 2022
Youtube videos and channels scraper python wrapper!

YouTubeCrawle Wrapper for python Why This wrapper? This is wrapper is not limited to videos only it can scrape both channel and videos seperately ;D

Kei 16 Aug 08, 2022
A python module to download ISO Standards

ISO Standards Downloader A python module to download ISO Standards from https://standards.iso.org/iso-iec/ Report Bug Β· Request Feature Table of conte

Daniel 1 Dec 29, 2021
A simple contents download module using url for python

A simple contents download module using url for python

Fayas Noushad 16 Oct 20, 2022
Download all games from a public Itch.io Game Jam

Itch Jam Downloader Downloads all games from a public Itch.io Game Jam. What you'll need: Python 3.8+ pip install -r requirements.txt For site mirrori

Dragoon Aethis 19 Dec 07, 2022