Python tools for the corpus analysis of popular music.

Last update: Aug 20, 2022

Overview

CATCHY

Corpus Analysis Tools for Computational Hook discovery

Python tools for the corpus analysis of popular music recordings. The tools can be used separately or together: for example, you can supply your own psychoacoustic features and still use the other modules. Note that using all of the scripts assumes that the audio files come pre-segmented (e.g., into structural sections).

The base feature modules' requirements include Matlab, Librosa and VAMP.

Structure

Extracting catchy features from a folder of files involves three steps (see the eurovision_demo.ipynb IPython notebook for a more detailed demo):

Base feature extraction

Here, basic, familiar feature time series are extracted. The toolbox currently implements (wrappers for) MFCC, chroma, melody and perceptual feature extraction. (Rhythm features are under development in the branch rhythm.) This part of the toolbox relies on a lot of external code, but it is also easy to work around: if you want to use other features, just save them to a set of csv files (one per song section, see below) in some folder (one per feature).
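
As a minimal sketch of that workaround, the snippet below writes one csv file per song section into a folder named after the feature. The helper names `save_feature` and `load_feature`, the section id, and the one-row-per-frame layout are assumptions made for illustration; check the demo notebook for the naming and layout the other modules actually expect.

```python
import csv
import os


def save_feature(feature_dir, section_id, frames):
    """Write one feature time series to <feature_dir>/<section_id>.csv.

    Hypothetical helper, not part of the toolbox. `frames` is a list of
    rows, one per time frame, each a list of feature values.
    """
    os.makedirs(feature_dir, exist_ok=True)
    path = os.path.join(feature_dir, section_id + '.csv')
    with open(path, 'w', newline='') as f:
        csv.writer(f).writerows(frames)
    return path


def load_feature(feature_dir, section_id):
    """Read a feature time series back as a list of rows of floats."""
    path = os.path.join(feature_dir, section_id + '.csv')
    with open(path, newline='') as f:
        return [[float(x) for x in row] for row in csv.reader(f)]
```

Calling `save_feature('my_feature', 'song1_section2', frames)` once per section would then give one folder per feature with one file per section, which is the arrangement described above.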

Pitch (and rhythm) descriptor extraction

This part computes mid-level pitch descriptors from the chroma and/or melody information computed in step one. It is essentially an implementation of several kinds of audio bigram descriptors. See also [1] and [2].
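
To give a feel for the bigram idea, here is a deliberately simplified sketch that counts how often pitch class i is followed by pitch class j in a chroma time series. It is not the toolbox's implementation, nor the exact formulation in [1]; the function name `chroma_bigram`, the `lag` parameter and the normalisation are assumptions chosen for illustration.

```python
def chroma_bigram(chroma, lag=1):
    """Accumulate a pitch-class co-occurrence matrix from chroma frames.

    `chroma` is a list of frames, each a list of non-negative
    activations (typically 12 per frame). Entry [i][j] sums
    chroma[t][i] * chroma[t + lag][j] over all t; the matrix is then
    normalised to sum to 1 (left as all zeros if nothing co-occurs).
    """
    n_bins = len(chroma[0])
    bigram = [[0.0] * n_bins for _ in range(n_bins)]
    for current, following in zip(chroma, chroma[lag:]):
        for i, a in enumerate(current):
            if a == 0:
                continue  # inactive pitch class contributes nothing
            for j, b in enumerate(following):
                bigram[i][j] += a * b
    total = sum(sum(row) for row in bigram)
    if total > 0:
        bigram = [[v / total for v in row] for row in bigram]
    return bigram


def one_hot(pitch_class, n_bins=12):
    """Idealised chroma frame with a single active pitch class."""
    return [1.0 if k == pitch_class else 0.0 for k in range(n_bins)]


# Toy melody C, E, G, C: three transitions, each with weight 1/3.
frames = [one_hot(0), one_hot(4), one_hot(7), one_hot(0)]
bigram = chroma_bigram(frames)
```

In this toy example only the entries for C to E, E to G and G to C are non-zero. Real chroma frames are not one-hot, so every pair of active pitch classes contributes in proportion to the product of its activations.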

Feature transforms

Computes 'first-order' and 'second-order' aggregates of any of the features computed in steps 1 and 2. See [2].
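
The sketch below illustrates one possible reading of these two levels: a first-order aggregate summarises a feature within a single section, and a second-order aggregate describes how that summary compares with the rest of the corpus. That reading, the function names, the toy data and the percentile-rank measure are all assumptions made for illustration; [2] gives the definitions the toolbox actually uses.

```python
from statistics import mean


def first_order(series):
    """Summarise one section's 1-D feature time series by its mean."""
    return mean(series)


def second_order(values):
    """Map each section's summary to its percentile rank in the corpus.

    Returns, for each value, the fraction of corpus values that are
    less than or equal to it (tied values share the same rank).
    """
    n = len(values)
    return [sum(other <= v for other in values) / n for v in values]


# Hypothetical corpus: section id -> some 1-D feature time series.
corpus = {
    'song1_verse': [1.0, 2.0, 3.0],
    'song1_chorus': [4.0, 5.0, 6.0],
    'song2_verse': [2.0, 2.0, 2.0],
    'song2_chorus': [7.0, 8.0, 9.0],
}
section_ids = sorted(corpus)
means = [first_order(corpus[s]) for s in section_ids]
ranks = dict(zip(section_ids, second_order(means)))
```

Here `means` holds one number per section, while `ranks` can only be computed once the whole corpus has been processed, which is why the two kinds of aggregate are separate passes in this sketch.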

The above three steps correspond to the three columns in the diagram below.

Known issues:

I/O is currently very conservative: you may have to do your own mkdirs when writing features.
Matlab path handling hasn't been checked on machines other than mine.
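
For the first issue, one workaround is to create the output folders yourself before writing any features. A minimal sketch using only the standard library; the function name `ensure_feature_dirs` and the folder names in the usage note are placeholders, not paths the toolbox prescribes.

```python
import os


def ensure_feature_dirs(base_dir, feature_names):
    """Create one output folder per feature and return the paths."""
    paths = []
    for name in feature_names:
        path = os.path.join(base_dir, name)
        os.makedirs(path, exist_ok=True)  # no error if it already exists
        paths.append(path)
    return paths
```

For example, `ensure_feature_dirs('features', ['mfcc', 'chroma'])` can be called before the extraction step and is safe to call again on later runs.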

Hopefully these will be addressed soon.

License

Matlab scripts are released under the GNU Public License; for everything else, see LICENSE.

If you use this, feel free to refer to [2].

[1] Van Balen, J., Wiering, F., & Veltkamp, R. (2015). Audio Bigrams as a Unifying Model of Pitch-based Song Description. In Proc. 11th International Symposium on Computer Music Multidisciplinary Research (CMMR). Plymouth, United Kingdom.

[2] Van Balen, J., Burgoyne, J. A., Bountouridis, D., Müllensiefen, D., & Veltkamp, R. (2015). Corpus Analysis Tools for Computational Hook Discovery. In Proc. 16th International Society for Music Information Retrieval Conference (pp. 227–233). Malaga, Spain.

Home page: http://www.github.com/jvbalen/catchy

Owner

Jan VB
