Learn machine learning the fun way, with Oracle and RedBull Racing

Last update: Oct 24, 2022

Related tags

Overview

Red Bull Racing Analytics Hands-On Labs

Introduction

Are you interested in learning machine learning (ML)? How about doing this in the context of the exciting world of F1 racing?! Get your ML skills bootstrapped here with Oracle and Red Bull Racing!

This tutorial teaches ML analytics with a series of hands-on labs (HOLs) using the Data Science service in Oracle Cloud Infrastructure.

You'll learn how to get data from some public data sources, then how to analyze this data using some of the latest ML techniques. In the process you'll build ML models and test them out in a predictor app.

Getting Started

There is some infrastructure that must be deployed before you can enjoy this tutorial. See the Terraform documentation for more information.

After the OCI infrastructure is deployed, proceed with the beginner's tutorial to start through the ML labs.

Prerequisites

You must have an OCI account. Click here to create a new cloud account.

This solution is designed to work with several OCI services, allowing you to quickly be up-and-running:

There are required OCI resources (see the Terraform documentation for more information) that are needed for this tutorial.

Notes/Issues

None at this time.

URLs

Oracle and Red Bull partnership announcement

Contributing

This project is open source. Please submit your contributions by forking this repository and submitting a pull request! Oracle appreciates any contributions that are made by the open source community.

License

Licensed under the Universal Permissive License (UPL), Version 1.0.

See LICENSE for more details.

Comments

Refactored Terraform code
Compatible with ORM, Cloud Shell and Terraform CLI

Updated README to include instructions for all three methods

Refactored, removing unnecessary resources (Vault, public Subnet, etc.).

Added a nerd knob so that it could use an existing Group (rather than create a new one)

Fixed ORM RegEx filters to allow dashes (-) and underscores (_), for the names
opened by timclegg 2
Issue with hands on lab guide - launchapp.sh missing

https://github.com/oracle-devrel/redbull-analytics-hol/tree/main/beginners#beginners-hands-on-lab

In Starting The Web Application it reads:

cd /home/opc/redbull-analytics-hol/beginners/web ./launchapp.sh start

However is launchapp.sh is missing, for example

(redbullenv) cd /home/opc/redbull-analytics-hol/beginners/web (redbullenv) ./launchapp.sh start bash: ./launchapp.sh: No such file or directory

opened by raekins 1
fix: Updating schema.yaml syntax

Making the variable notation follow what the doc syntax shows (https://docs.oracle.com/en-us/iaas/Content/ResourceManager/Concepts/terraformconfigresourcemanager_topic-schema.htm)

opened by timclegg 1
Exploratory Data Analysis Merge Issue

Hello I have been encountering an issue while running the lab. The Jupyter notebook 03.f1_analysis_EDA.ipynb has the following issue on cell number 5:

ValueError Traceback (most recent call last) in ----> 1 df1 = pd.merge(races,results,how='inner',on=['raceId']) 2 df2 = pd.merge(df1,quali,how='inner',on=['raceId','driverId','constructorId']) 3 df3 = pd.merge(df2,drivers,how='inner',on=['driverId']) 4 df4 = pd.merge(df3,constructors,how='inner',on=['constructorId']) 5 df5 = pd.merge(df4,circuit,how='inner',on=['circuitId'])

~/redbullenv/lib64/python3.6/site-packages/pandas/core/reshape/merge.py in merge(left, right, how, on, left_on, right_on, left_index, right_index, sort, suffixes, copy, indicator, validate) 85 copy=copy, 86 indicator=indicator, ---> 87 validate=validate, 88 ) 89 return op.get_result()

~/redbullenv/lib64/python3.6/site-packages/pandas/core/reshape/merge.py in init(self, left, right, how, on, left_on, right_on, axis, left_index, right_index, sort, suffixes, copy, indicator, validate) 654 # validate the merge keys dtypes. We may need to coerce 655 # to avoid incompatible dtypes --> 656 self._maybe_coerce_merge_keys() 657 658 # If argument passed to validate,

~/redbullenv/lib64/python3.6/site-packages/pandas/core/reshape/merge.py in _maybe_coerce_merge_keys(self) 1163 inferred_right in string_types and inferred_left not in string_types 1164 ): -> 1165 raise ValueError(msg) 1166 1167 # datetimelikes must match exactly

ValueError: You are trying to merge on object and int64 columns. If you wish to proceed you should use pd.concat

I’m using an oracle automatic deployment provided by oracle as part of their environment. I do not have a lot of experience with Python but one possible ible solution is to read the numeric values form the csv file as integer or float but I’m almost certain the solution might be a little more elaborated than that 😉. Anyway thanks for your time. I’m really excited to test your solution and finish the lab. Thanks again.

opened by yankodavila 2
Has the PAR for the stack deploy image expired.

Cannot deploy stack as getting PAR expired message.

2021/11/07 10:50:11[TERRAFORM_CONSOLE] [INFO] Error Message: work request did not succeed, workId: ocid1.coreservicesworkrequest.oc1.eu-amsterdam-1.abqw2ljrwz2n7qqj7ghdwtnlrqol355oumc7a6coushvgdrebskspaewh7ea, entity: image, action: CREATED. Message: Import image not found: PAR is invalid (maybe is expired or deleted), please check.

PAR in stack file is https://objectstorage.eu-frankfurt-1.oraclecloud.com/p/khhPjc_IMuyBOMfZUcJajIzCpoZ5aC-D7VMCU__GVZRlIQueXLIIcaaqLOZIuT1a/n/emeasespainsandbox/b/publichol/o/redbullhol-20210809-1523

opened by Mel-A-M 1

Releases(v0.1.8)

v0.1.8(Feb 18, 2022)

Optimized the models generation for Quickstarts Full Changelog: https://github.com/oracle-devrel/redbull-analytics-hol/compare/v0.1.7...v0.1.8
Source code(tar.gz)
Source code(zip)
redbull-analytics-hol-latest.zip(20.78 KB)
v0.1.7(Feb 17, 2022)

add quickstart configuration by @snafuz in https://github.com/oracle-devrel/redbull-analytics-hol/pull/43

Full Changelog: https://github.com/oracle-devrel/redbull-analytics-hol/compare/v0.1.6...v0.1.7
Source code(tar.gz)
Source code(zip)
redbull-analytics-hol-latest.zip(17.20 KB)
v0.1.6(Feb 17, 2022)
What's Changed

add quickstart configuration by @snafuz in https://github.com/oracle-devrel/redbull-analytics-hol/pull/43

Full Changelog: https://github.com/oracle-devrel/redbull-analytics-hol/compare/v0.1.5...v0.1.6
Source code(tar.gz)
Source code(zip)
redbull-analytics-hol-latest.zip(17.20 KB)
v0.1.5(Feb 16, 2022)
What's Changed

Livelabs02162022 by @jasperan in https://github.com/oracle-devrel/redbull-analytics-hol/pull/41

fix: updated Alyssa Cotton's changes by @jasperan in https://github.com/oracle-devrel/redbull-analytics-hol/pull/42

New Contributors

@jasperan made their first contribution in https://github.com/oracle-devrel/redbull-analytics-hol/pull/41

Full Changelog: https://github.com/oracle-devrel/redbull-analytics-hol/compare/v0.1.4...v0.1.5
Source code(tar.gz)
Source code(zip)
redbull-analytics-hol-latest.zip(11.33 KB)
v0.1.4(Jan 25, 2022)
What's Changed

Update Port for Jupyter Lab. Changed with last Stack script by @operard in https://github.com/oracle-devrel/redbull-analytics-hol/pull/38

automatically set the latest Oracle Linux 7.9 image build number as default OS image by @snafuz in https://github.com/oracle-devrel/redbull-analytics-hol/pull/40

Full Changelog: https://github.com/oracle-devrel/redbull-analytics-hol/compare/v0.1.3...v0.1.4
Source code(tar.gz)
Source code(zip)
redbull-analytics-hol-latest.zip(11.33 KB)
v0.1.3(Nov 10, 2021)
What's Changed

fix: ORM zip file not being generated properly

Fixed it so that ORM can be used to deploy the lab.

Full Changelog: https://github.com/oracle-devrel/redbull-analytics-hol/compare/v0.1.2...v0.1.3
Source code(tar.gz)
Source code(zip)
redbull-analytics-hol-latest.zip(11.21 KB)
v0.1.0(Nov 9, 2021)
The lab has been refactored to not use a custom compute image, but rather to build out the compute instance.

What's Changed

feat: removing custom image usage by @timclegg in https://github.com/oracle-devrel/redbull-analytics-hol/pull/34

Full Changelog: https://github.com/oracle-devrel/redbull-analytics-hol/compare/v0.0.12...v0.1.0
Source code(tar.gz)
Source code(zip)
redbull-analytics-hol-latest.zip(8.62 KB)
v0.0.12(Sep 6, 2021)

Redbull HOL Beginner Extension Period to access Image
Source code(tar.gz)
Source code(zip)
redbull-analytics-hol-latest.zip(9.01 KB)
v0.0.11(Aug 10, 2021)

Source code(tar.gz)
Source code(zip)
redbull-analytics-hol-latest.zip(8.06 KB)
v0.0.10(Aug 10, 2021)

The SSH public key is optional, but present in the ORM dialog. Happy deploying!
Source code(tar.gz)
Source code(zip)
redbull-analytics-hol-latest.zip(8.06 KB)
v0.0.9(Aug 9, 2021)

The SSH key isn't directly needed for the hands-on lab, so making this optional. Also some doc updates.
Source code(tar.gz)
Source code(zip)
redbull-analytics-hol-latest.zip(7.83 KB)
v0.0.8(Aug 9, 2021)

Updated docs and a bug in the deployment.
Source code(tar.gz)
Source code(zip)
redbull-analytics-hol-latest.zip(7.83 KB)
v0.0.7(Aug 6, 2021)

This release has a refactored "one-click" (or really close to it!) hands-on lab.
Source code(tar.gz)
Source code(zip)
redbull-analytics-hol-latest.zip(7.82 KB)
v0.0.6(Aug 4, 2021)

This repo now can build its own ZIP files for ORM deployments. These are automatically built and stored in the release (as it's made).
Source code(tar.gz)
Source code(zip)
redbull-analytics-hol-latest.zip(8.19 KB)
v0.0.5(Jul 28, 2021)

Fixing situations where the group name and/or dynamic group name creation would fail, if it already existed. This might occur in situations where the HoL would be deployed more than once in the same tenancy. This eliminates the potential for collision with the same group names being used.
Source code(tar.gz)
Source code(zip)
redbull-analytics-hol-latest.zip(7.40 KB)
v0.0.4(Jul 23, 2021)

Source code(tar.gz)
Source code(zip)
redbull-analytics-hol-latest.zip(10.23 KB)
v0.0.3(Jul 15, 2021)

Fixed home region detection.
Source code(tar.gz)
Source code(zip)
redbull-analytics-hol-latest.zip(7.26 KB)
v0.2(Jul 14, 2021)
This release makes it easier to deploy the infrastructure, whether using ORM, Cloud Shell or Terraform CLI.

Added DevRel defined tags (and ignored the default tags)

Compatible with ORM, Cloud Shell and Terraform CLI

Updated README to include instructions for all three methods

Refactored, removing unnecessary resources (Vault, public Subnet, etc.).

Added a nerd knob so that it could use an existing Group (rather than create a new one)

Fixed ORM RegEx filters to allow dashes (-) and underscores (_), for the names

Source code(tar.gz)
Source code(zip)
redbull-analytics-hol-latest.zip(7.19 KB)
v0.1(Jun 21, 2021)

This release includes the beginner series of tutorials, along with the Terraform stack to create the required OCI resources.
Source code(tar.gz)
Source code(zip)
redbull-analytics-hol-latest.zip(9.24 KB)

Owner

Oracle DevRel

GitHub Repository

Probabilistic reasoning and statistical analysis in TensorFlow

TensorFlow Probability TensorFlow Probability is a library for probabilistic reasoning and statistical analysis in TensorFlow. As part of the TensorFl

3.8k Jan 05, 2023

Python Package for DataHerb: create, search, and load datasets.

The Python Package for DataHerb A DataHerb Core Service to Create and Load Datasets.

4 Feb 11, 2022

Randomisation-based inference in Python based on data resampling and permutation.

67 Dec 27, 2022

A probabilistic programming library for Bayesian deep learning, generative models, based on Tensorflow

ZhuSuan is a Python probabilistic programming library for Bayesian deep learning, which conjoins the complimentary advantages of Bayesian methods and

2.2k Dec 28, 2022

MetPy is a collection of tools in Python for reading, visualizing and performing calculations with weather data.

MetPy MetPy is a collection of tools in Python for reading, visualizing and performing calculations with weather data. MetPy follows semantic versioni

971 Dec 25, 2022

Picka: A Python module for data generation and randomization.

Picka: A Python module for data generation and randomization. Author: Anthony Long Version: 1.0.1 - Fixed the broken image stuff. Whoops What is Picka

108 Nov 30, 2021

Implementation in Python of the reliability measures such as Omega.

reliabiliPy Summary Simple implementation in Python of the [reliability](https://en.wikipedia.org/wiki/Reliability_(statistics) measures for surveys:

2 Apr 27, 2022

Statistical & Probabilistic Analysis of Store Sales, University Survey, & Manufacturing data

Statistical_Modelling Statistical & Probabilistic Analysis of Store Sales, University Survey, & Manufacturing data Statistical Methods for Decision Ma

1 Jan 27, 2022

Stock Analysis dashboard Using Streamlit and Python

StDashApp Stock Analysis Dashboard Using Streamlit and Python If you found the content useful and want to support my work, you can buy me a coffee! Th

27 Dec 09, 2022

A library to create multi-page Streamlit applications with ease.

107 Jan 04, 2023

We're Team Arson and we're using the power of predictive modeling to combat wildfires.

We're Team Arson and we're using the power of predictive modeling to combat wildfires. Arson Map Inspiration There’s been a lot of wildfires in Califo

3 Oct 17, 2021

Advanced Pandas Vault — Utilities, Functions and Snippets (by @firmai).

PandasVault ⁠— Advanced Pandas Functions and Code Snippets The only Pandas utility package you would ever need. It has no exotic external dependencies

374 Jan 07, 2023

🌍 Create 3d-printable STLs from satellite elevation data 🌏

mapa 🌍 Create 3d-printable STLs from satellite elevation data Installation pip install mapa Usage mapa uses numpy and numba under the hood to crunch

13 Dec 15, 2022

A stock analysis app with streamlit

StockAnalysisApp A stock analysis app with streamlit. You select the ticker of the stock and the app makes a series of analysis by using the price cha

50 Nov 27, 2022

Projects that implement various aspects of Data Engineering.

DATAWAREHOUSE ON AWS The purpose of this project is to build a datawarehouse to accomodate data of active user activity for music streaming applicatio

2 Oct 14, 2021

Manage large and heterogeneous data spaces on the file system.

signac - simple data management The signac framework helps users manage and scale file-based workflows, facilitating data reuse, sharing, and reproduc

109 Dec 14, 2022

An implementation of the largeVis algorithm for visualizing large, high-dimensional datasets, for R

largeVis This is an implementation of the largeVis algorithm described in (https://arxiv.org/abs/1602.00370). It also incorporates: A very fast algori

336 May 25, 2022

Demonstrate the breadth and depth of your data science skills by earning all of the Databricks Data Scientist credentials

Data Scientist Learning Plan Demonstrate the breadth and depth of your data science skills by earning all of the Databricks Data Scientist credentials

27 Nov 01, 2022

Useful tool for inserting DataFrames into the Excel sheet.

PyCellFrame Insert Pandas DataFrames into the Excel sheet with a bunch of conditions Install pip install pycellframe Usage Examples Let's suppose that

1 Feb 16, 2022

Toolchest provides APIs for scientific and bioinformatic data analysis.

Toolchest Python Client Toolchest provides APIs for scientific and bioinformatic data analysis. It allows you to abstract away the costliness of runni

11 Jun 30, 2022

Learn machine learning the fun way, with Oracle and RedBull Racing

Related tags

Overview

Red Bull Racing Analytics Hands-On Labs

Introduction

Getting Started

Prerequisites

Notes/Issues

URLs

Contributing

License

Comments

Refactored Terraform code

Issue with hands on lab guide - launchapp.sh missing

fix: Updating schema.yaml syntax

Exploratory Data Analysis Merge Issue

Has the PAR for the stack deploy image expired.

Releases(v0.1.8)

v0.1.8(Feb 18, 2022)

v0.1.7(Feb 17, 2022)

v0.1.6(Feb 17, 2022)

What's Changed

v0.1.5(Feb 16, 2022)

What's Changed

New Contributors

v0.1.4(Jan 25, 2022)

What's Changed

v0.1.3(Nov 10, 2021)

What's Changed

v0.1.0(Nov 9, 2021)

What's Changed

v0.0.12(Sep 6, 2021)

v0.0.11(Aug 10, 2021)

v0.0.10(Aug 10, 2021)

v0.0.9(Aug 9, 2021)

v0.0.8(Aug 9, 2021)

v0.0.7(Aug 6, 2021)

v0.0.6(Aug 4, 2021)

v0.0.5(Jul 28, 2021)

v0.0.4(Jul 23, 2021)

v0.0.3(Jul 15, 2021)

v0.2(Jul 14, 2021)

v0.1(Jun 21, 2021)

Owner

Oracle DevRel

Probabilistic reasoning and statistical analysis in TensorFlow

Python Package for DataHerb: create, search, and load datasets.

Randomisation-based inference in Python based on data resampling and permutation.

A probabilistic programming library for Bayesian deep learning, generative models, based on Tensorflow

MetPy is a collection of tools in Python for reading, visualizing and performing calculations with weather data.

Picka: A Python module for data generation and randomization.

Implementation in Python of the reliability measures such as Omega.

Statistical & Probabilistic Analysis of Store Sales, University Survey, & Manufacturing data

Stock Analysis dashboard Using Streamlit and Python

A library to create multi-page Streamlit applications with ease.

We're Team Arson and we're using the power of predictive modeling to combat wildfires.

Advanced Pandas Vault — Utilities, Functions and Snippets (by @firmai).

🌍 Create 3d-printable STLs from satellite elevation data 🌏

A stock analysis app with streamlit

Projects that implement various aspects of Data Engineering.

Manage large and heterogeneous data spaces on the file system.

An implementation of the largeVis algorithm for visualizing large, high-dimensional datasets, for R

Demonstrate the breadth and depth of your data science skills by earning all of the Databricks Data Scientist credentials

Useful tool for inserting DataFrames into the Excel sheet.

Toolchest provides APIs for scientific and bioinformatic data analysis.