Scraping the data from each page of biocides listed on the BAuA website into a CSV file

Last update: Nov 30, 2021

Overview

Baua Biocides Scraper

Scrapes the data from each page of biocides listed on the BAuA website (https://www.baua.de/DE/Biozid-Meldeverordnung/Offen/offen.html) into a CSV file.
A Windows standalone client is available in the dist folder.

About the project

What's the problem?

The BAuA website contains a lot of useful data for the biocides domain, but it only lets you search product by product. With over 80,000 products listed, finding and collecting information is not easy.

The idea

Make the data easier to work with by providing a CSV file containing all the data scraped from the BAuA website.

How does it work?

The user starts the program.
The program extracts data from the BAuA website.
A CSV file containing the data is created.
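
The three steps above can be sketched as follows. This is a minimal illustration, not the project's actual code: `extract_products`, `write_csv`, the field names, and the sample values are all hypothetical, and the extraction step is stubbed out so the sketch runs without network access. The real program performs that step against the BAuA site with the scraping stack listed under "Build with".

```python
import csv
from typing import Iterable, Iterator


def extract_products() -> Iterator[dict[str, str]]:
    """Stand-in for the scraping step: yields one dict per product page.

    The field names and values are placeholders for illustration only;
    they are not taken from the BAuA website.
    """
    yield {"registration_number": "N-00001", "product_name": "Example product A"}
    yield {"registration_number": "N-00002", "product_name": "Example product B"}


def write_csv(rows: Iterable[dict[str, str]], path: str) -> int:
    """Write the rows to a CSV file and return how many rows were written."""
    rows = list(rows)
    if not rows:
        # Nothing scraped: do not create an empty file.
        return 0
    with open(path, "w", newline="", encoding="utf-8") as handle:
        # Column order follows the keys of the first row.
        writer = csv.DictWriter(handle, fieldnames=list(rows[0].keys()))
        writer.writeheader()
        writer.writerows(rows)
    return len(rows)
```

Calling `write_csv(extract_products(), "biocides.csv")` would create the output file with one header line and one line per product.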

Roadmap

This project was created in response to a request and is not intended to evolve. Nevertheless, you can fork the project, improve it yourself, and propose your changes via pull requests, or make a suggestion via the project issues.

Built with

Programming language: Python 3.10.0
Scraping framework: Scrapy 2.5.1
HTTP library: Requests 2.26.0
Standalone builder: PyInstaller 4.7

Demo

You can use the Windows standalone client in the dist folder.

Version management

We use semantic versioning, that is, a version number of the form MAJOR.MINOR.CORRECTIVE:

the MAJOR version number changes when there are non-backward-compatible changes,
the MINOR version number changes when there are backward-compatible feature additions,
the CORRECTIVE version number changes when there are backward-compatible bug fixes.

See the project tags for the available versions. For more info: semver.org
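
As an illustration of the numbering rule above, here is a small sketch of how a version string could be incremented. The `bump` helper is hypothetical and not part of the project; it only assumes the three-number scheme described in this section, with an optional leading "v" as used in the release tag.

```python
def bump(version: str, part: str) -> str:
    """Return the version string with the given part incremented.

    part is "major", "minor", or "corrective". Lower-order numbers are
    reset to 0 whenever a higher-order number increases.
    """
    # Accept tags such as "v0.1.0" as well as plain "0.1.0".
    major, minor, corrective = (int(piece) for piece in version.lstrip("v").split("."))
    if part == "major":
        return f"{major + 1}.0.0"
    if part == "minor":
        return f"{major}.{minor + 1}.0"
    if part == "corrective":
        return f"{major}.{minor}.{corrective + 1}"
    raise ValueError(f"unknown part: {part!r}")
```

For example, a backward-compatible bug fix on 0.1.0 gives `bump("0.1.0", "corrective")`, which returns "0.1.1", while a breaking change gives `bump("0.1.0", "major")`, which returns "1.0.0".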

Authors

Eric De Maria - Numio - Initial work

License

This project is licensed under the GNU GPL 3 license - See the LICENSE file for more details.

Releases(v0.1.0)

v0.1.0(Nov 30, 2021)

The Windows standalone client for the first public version of Baua Biocides Scraper.
Baua_Biocides_Scraper_Windows.zip(16.02 MB)

Owner

Eric DE MARIA
