E2e music remastering system - End-to-end Music Remastering System Using Self-supervised and Adversarial Training

Last update: Dec 15, 2022

Overview

End-to-end Music Remastering System

This repository includes source code and pre-trained models of the work End-to-end Music Remastering System Using Self-supervised and Adversarial Training by Junghyun Koo, Seungryeol Paik, and Kyogu Lee.

We provide inference code of the proposed system, which targets to alter the mastering style of a song to desired reference track.

Pre-trained Models

Model	Number of Epochs Trained	Details
Music Effects Encoder	1000	Trained with MTG-Jamendo Dataset
Mastering Cloner	1000	Trained with the above pre-trained Music Effects Encoder and Projection Discriminator

Inference

To run the inference code,

Download pre-trained models above and place them under the folder named 'model_checkpoints' (default)
Prepare input and reference tracks under the folder named 'inference_samples' (default).
Target files should be organized as follow:

    "path_to_data_directory"/"song_name_#1"/input.wav
    "path_to_data_directory"/"song_name_#1"/reference.wav
    ...
    "path_to_data_directory"/"song_name_#n"/input.wav
    "path_to_data_directory"/"song_name_#n"/reference.wav

Run 'inference.py'

python inference.py \
    --ckpt_dir "path_to_checkpoint_directory" \
    --data_dir_test "path_to_directory_containing_inference_samples"

Outputs will be stored under the folder 'inference_samples' (default)

Note: The system accepts WAV files of stereo-channeled, 44.1kHZ, and 16-bit rate. Target files shold be named "input.wav" and "reference.wav".

Configurations of each sub-networks

A detailed configuration of each sub-networks can also be found at

Self_Supervised_Music_Remastering_System/configs.yaml

E2e music remastering system - End-to-end Music Remastering System Using Self-supervised and Adversarial Training

Related tags

Overview

End-to-end Music Remastering System

Pre-trained Models

Inference

Configurations of each sub-networks

Owner

Junghyun (Tony) Koo

Official code for ICCV2021 paper "M3D-VTON: A Monocular-to-3D Virtual Try-on Network"

PaddleRobotics is an open-source algorithm library for robots based on Paddle, including open-source parts such as human-robot interaction, complex motion control, environment perception, SLAM positioning, and navigation.

Turning SymPy expressions into PyTorch modules.

HMLET (Hybrid-Method-of-Linear-and-non-linEar-collaborative-filTering-method)

Harmonic Memory Networks for Graph Completion

This is a official repository of SimViT.

Face recognition project by matching the features extracted using SIFT.

Codebase for the solution that won first place and was awarded the most human-like agent in the 2021 NeurIPS Competition MineRL BASALT Challenge.

An image processing project uses Viola-jones technique to detect faces and then use SIFT algorithm for recognition.

PyTorch implementation of "Efficient Neural Architecture Search via Parameters Sharing"

Real-CUGAN - Real Cascade U-Nets for Anime Image Super Resolution

[ICLR2021] Unlearnable Examples: Making Personal Data Unexploitable

[NeurIPS 2021 Spotlight] Aligning Pretraining for Detection via Object-Level Contrastive Learning

Pytorch code for "State-only Imitation with Transition Dynamics Mismatch" (ICLR 2020)

Global-Local Context Network for Person Search

Code release for NeuS

Rate-limit-semaphore - Semaphore implementation with rate limit restriction for async-style (any core)

An Open-Source Tool for Automatic Disease Diagnosis..

机器学习、深度学习、自然语言处理等人工智能基础知识总结。

[ICLR 2021] "Neural Architecture Search on ImageNet in Four GPU Hours: A Theoretically Inspired Perspective" by Wuyang Chen, Xinyu Gong, Zhangyang Wang