A repository for learning computer vision, building robotics projects that use computer vision as a perception tool, and developing advanced controllers for the robots of the future.

Last update: Nov 03, 2022

Overview

THE COMPUTER VISION DOJO

This repository was created to learn computer vision and explore its applications in robotics and smart systems.

SOFTWARE DEPENDENCIES 💻

PYTHON DEPENDENCIES

Python
Python is a programming language that lets you work quickly and integrate systems more effectively.
OpenCV
OpenCV (Open Source Computer Vision Library) is a library of programming functions mainly aimed at real-time computer vision.
NumPy
NumPy is a general-purpose array-processing package. It provides a high-performance multidimensional array object and tools for working with these arrays. It is the fundamental package for scientific computing with Python.
Matplotlib
Matplotlib is a comprehensive library for creating static, animated, and interactive visualizations in Python.
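
A quick way to confirm that the Python dependencies listed above are installed is to check whether each one is importable. The following is a minimal sketch, not part of the repository: the helper name `missing_dependencies` is hypothetical, and the import names (`cv2` for OpenCV, `numpy`, `matplotlib`) are the usual module names for these packages, assumed here rather than taken from the repository.

```python
import importlib.util

# Assumed import names: OpenCV's Python bindings are imported as "cv2".
PYTHON_DEPENDENCIES = {
    "OpenCV": "cv2",
    "NumPy": "numpy",
    "Matplotlib": "matplotlib",
}


def missing_dependencies(dependencies):
    """Return the display names whose import module cannot be found."""
    return [
        name
        for name, module in dependencies.items()
        if importlib.util.find_spec(module) is None
    ]


if __name__ == "__main__":
    missing = missing_dependencies(PYTHON_DEPENDENCIES)
    if missing:
        print("Missing packages:", ", ".join(missing))
    else:
        print("All Python dependencies are importable.")
```

The check only looks up each module without importing it, so it stays fast even when a heavy package such as OpenCV is present.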

C++ DEPENDENCIES

Microsoft C++ Build Tools
The Microsoft C++ Build Tools provide MSVC toolsets through a scriptable, standalone installer that does not require Visual Studio. They are recommended if you build C++ libraries and applications targeting Windows from the command line (for example, as part of a continuous integration workflow). They include the tools shipped in Visual Studio 2015 Update 3, Visual Studio 2017 version 15.9, and all major updates to Visual Studio 2019 (v16.x).
OpenCV
OpenCV (Open Source Computer Vision Library) is a library of programming functions mainly aimed at real-time computer vision.
CMake
CMake is an open-source, cross-platform family of tools designed to build, test and package software.
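
The command-line tools above can be checked the same way before attempting a build. This is a rough sketch under stated assumptions: the helper `locate_tools` is hypothetical, and the executable names (`cmake` for CMake, `cl` for the MSVC compiler) are assumed; on Windows, `cl` is typically only on the PATH inside a developer command prompt.

```python
import shutil

# Assumed executable names: "cmake" for CMake, "cl" for the MSVC compiler.
CPP_TOOLS = ["cmake", "cl"]


def locate_tools(tools):
    """Map each tool name to its full path, or None if it is not on PATH."""
    return {tool: shutil.which(tool) for tool in tools}


if __name__ == "__main__":
    for tool, path in locate_tools(CPP_TOOLS).items():
        print(f"{tool}: {path or 'not found on PATH'}")
```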

AUTHOR

Elkin Javier Guerra Galeano

Student of Mechatronics Engineering at EIA University, excited about integrating software and hardware systems.
He is curious about control theory and about implementing robotics solutions with different mathematical designs.
He is skilled at solving problems in real-life applications and is passionate about building knowledge through a theory-and-practice approach.
