A course on deep learning for computer vision and graphics, co-developed by YSDA and Skoltech.

Last update: Jan 02, 2023

Overview

Deep Vision and Graphics

This repo supplements the course "Deep Vision and Graphics" taught at YSDA in fall 2021. The course is the successor to the "Deep Learning" course taught at YSDA in 2015-2021. The new course focuses more on applications of deep learning to computer vision.

Lecture and seminar materials for each week are in the ./week* folders. Homework assignments are in the ./homework* folders.

General info

Telegram chat room (Russian).
YSDA deadlines & admin stuff can be found at the YSDA LMS (YSDA students only).
For technical issues, ideas, bugs in course materials, or contribution ideas, open an issue.

Syllabus

week01 Intro, recap of neural network basics, optimization, backprop, biological networks
week02 Images, linear filtering, convolutional networks, batchnorms, augmentations
week03 ConvNet architectures and how to find them, sparse convolutions in 3D, ConvNets for videos, transfer learning
week04 Dense prediction: semantic segmentation, superresolution/image synthesis, perceptual losses
week05 Non-convolutional architectures: transformers (some recap of their use in NLP), mixers, FFT convolutions
week06 Visualizing and understanding deep architectures, adversarial examples
week07 Object detection, instance/panoptic segmentation, 2D/3D human pose estimation
week08 Representation learning: face recognition, verification tasks, self-supervised learning, image captioning
week09 Latent models (GLO, AEs, flow models, diffusion models, VQ-VAE, generative transformers, CLIP, DALL-E)
week10 Generative adversarial networks
week11 Shape and motion estimation: spatial transformers, optical flow, stereo, monodepth, point cloud generation, implicit and semi-implicit shape representations
week12 New view synthesis: multi-plane images, neural radiance fields, mesh-based and point-based representations for NVS, neural renderers

Contributors & course staff

Course materials and teaching by

Victor Lempitsky - all main track lectures
Victor Yurchenko - seminars, homeworks, admin stuff
Fedor Ratnikov - seminars, homeworks, admin stuff
To be continued

Owner

Yandex School of Data Analysis
