Python Project on Pro Data Analysis Track

Last update: Nov 10, 2021

Related tags

Data Analysis Udacity-BikeShare-Project

Overview

Udacity-BikeShare-Project:

Basic Data Exploration with pandas on Bikeshare Data

A basic Udacity project that uses the pandas library in Python to explore Udacity's bikeshare data.

Project Overview:

This project uses the pandas library and simple statistical methods to perform a rudimentary analysis of bikeshare data from three major U.S. cities (Chicago, Washington, and New York City) and to display information such as the most popular days and the most common stations.
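As a rough illustration of the kind of statistic described above, here is a minimal sketch that finds the most common day of the week and the most common start station with pandas. The column names 'Start Time' and 'Start Station', the function name most_common, and the sample rows are assumptions made for this example; they are not taken from bikeshare.py.

```python
import pandas as pd


def most_common(df: pd.DataFrame) -> dict:
    """Return the most frequent weekday and start station (hypothetical helper)."""
    # Assumed column names; adjust them to match the actual CSV headers.
    start = pd.to_datetime(df["Start Time"])
    return {
        "day": start.dt.day_name().mode()[0],
        "start_station": df["Start Station"].mode()[0],
    }


# Made-up sample rows, used only to exercise the helper.
sample = pd.DataFrame(
    {
        "Start Time": [
            "2017-01-02 09:00:00",
            "2017-01-09 10:30:00",
            "2017-01-03 08:15:00",
        ],
        "Start Station": ["A St", "B St", "A St"],
    }
)
result = most_common(sample)
print(result)
```

Using mode() keeps the sketch short; when several values tie, it returns them in sorted order and the sketch simply takes the first.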

Running the program:

To run the program, enter 'python bikeshare.py' in your terminal. I use Anaconda's command prompt on a Windows 10 machine.

Program Details:

The program takes user input for the city (e.g. Chicago), the month to view data for (e.g. January; an 'all' option is also available), and the day to view data for (e.g. Monday; an 'all' option is also available).
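One way to collect these three inputs is a small validation loop that keeps asking until the answer is an allowed value. The sketch below is an assumption about how this could look, not the code in bikeshare.py: the names ask and get_filters, the prompt texts, and the choice to accept all twelve months are hypothetical. The read parameter exists only so the function can be exercised without a keyboard.

```python
import calendar

CITIES = ("chicago", "new york city", "washington")
# Assumption: every calendar month is accepted; the real data may cover fewer.
MONTHS = ("all",) + tuple(m.lower() for m in calendar.month_name if m)
DAYS = ("all",) + tuple(d.lower() for d in calendar.day_name)


def ask(prompt, allowed, read=input):
    """Keep prompting until the answer, case-insensitively, is in `allowed`."""
    while True:
        answer = read(prompt).strip().lower()
        if answer in allowed:
            return answer
        print(f"Please choose one of: {', '.join(allowed)}")


def get_filters(read=input):
    """Return (city, month, day) as lowercase strings."""
    city = ask("City: ", CITIES, read)
    month = ask("Month (or 'all'): ", MONTHS, read)
    day = ask("Day (or 'all'): ", DAYS, read)
    return city, month, day
```

Calling get_filters() with no arguments reads from the terminal; passing a callable as read makes it easy to feed in scripted answers.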

After receiving this input, the program asks whether the user wants to view the raw data (initially 5 rows). It then prints statistics for the selected filters, such as the most popular days and the most common stations.
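The raw-data prompt could be handled by paging through the DataFrame 5 rows at a time. This is a hedged sketch under assumptions: the function names iter_raw_rows and show_raw_data, the prompt wording, and the "yes" keyword are hypothetical, and the behaviour of showing further 5-row chunks on request is inferred from the word "initially".

```python
import pandas as pd


def iter_raw_rows(df, chunk_size=5):
    """Yield consecutive slices of `df`, each with at most `chunk_size` rows."""
    for start in range(0, len(df), chunk_size):
        yield df.iloc[start:start + chunk_size]


def show_raw_data(df, read=input, chunk_size=5):
    """Print chunks while the user answers 'yes'; return the number of rows shown."""
    shown = 0
    for chunk in iter_raw_rows(df, chunk_size):
        answer = read(f"View {chunk_size} rows of raw data? (yes/no): ")
        if answer.strip().lower() != "yes":
            break
        print(chunk)
        shown += len(chunk)
    return shown
```

The read parameter defaults to the built-in input, so show_raw_data(df) works interactively, while a scripted callable can stand in for the user in tests.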

Requirements:

Language: Python 3.6 or above

Libraries: pandas, numpy, time

Project Data:

chicago.csv - Stored in the data folder, the chicago.csv file is the dataset containing all bikeshare information for the city of Chicago provided by Udacity.

new_york_city.csv - Dataset containing all bikeshare information for the city of New York provided by Udacity.

washington.csv - Dataset containing all bikeshare information for the city of Washington provided by Udacity. Note: This does not include the 'Gender' or 'Birth Year' data.
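Because washington.csv lacks the 'Gender' and 'Birth Year' columns, any user statistics need to check for those columns before using them. The sketch below shows one possible guard; the function name user_stats, the returned keys, and the sample rows are assumptions for illustration and are not taken from bikeshare.py.

```python
import pandas as pd


def user_stats(df: pd.DataFrame) -> dict:
    """Collect gender and birth-year statistics only when the columns exist."""
    stats = {}
    if "Gender" in df.columns:
        stats["gender_counts"] = df["Gender"].value_counts().to_dict()
    if "Birth Year" in df.columns:
        stats["earliest_birth_year"] = int(df["Birth Year"].min())
    return stats


# Made-up rows: one frame with the optional columns, one without them.
with_columns = pd.DataFrame(
    {"Gender": ["Male", "Female", "Male"], "Birth Year": [1990.0, 1985.0, 1992.0]}
)
without_columns = pd.DataFrame({"Trip Duration": [300, 450]})

print(user_stats(with_columns))
print(user_stats(without_columns))
```

Checking df.columns avoids a KeyError on the Washington data and lets the same function serve all three cities.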

Built with:

IDE: PyCharm

Python 3.9 - The language used to develop this.

pandas - One of the libraries used for this.

numpy - One of the libraries used for this.

time - A Python standard-library module used for this.

Author:

Belal Mohammed Ali

NANO Degree Program from FWD Initiative:

Date of Project Submission:

--Date created: 10/10/2021

--Date last modified: 3/19/2021

Owner

Belal Mohammed
