High-resolution guitar transcription via domain adaptation

Queen Mary University of London

High-resolution guitar transcription turns any recording of solo guitar into MIDI, without the need for special equipment or clean recording conditions.

Artist: Walter Rodriguez Jr. Video source

Abstract

We present High-resolution guitar transcription, a method for training a guitar transcription model with excellent performance on real-world recordings. We use a domain adaptation approach to train a model on a small dataset of high-quality solo guitar transcriptions, based on the "High-resolution piano transcription" model by Kong et al..

Building on the work of Maman and Bermano, we align existing guitar transcriptions to the model activations of the piano transcription model. We then use these aligned transcriptions to train a new model, which is able to transcribe the entire GuitarSet in a zero shot setting with state-of-the-art accuracy.

Alignments

Using our alignment method, we take a transcribed score and match it to the audio recording with high accuracy.

Here we show a piece from our training data. The original audio is from "Johnny Smith - Autumn Nocturne" and the video shows the aligned transcription. Note how the fine-alignment process recovers the micro-timing variations of chord onsets, despite these notes appearing in the same time instant in the original score.

The source data was obtained from professional transcriber, François Leduc. The GuitarPro files are commercially available from his website and for ease of reproduction are listed as follows:

Training Split

  • Earl Klugh - Michelle
  • George Benson - Tenderly
  • Johnny Smith - Autumn Nocturne
  • Kenny Burrell - But Not For Me
  • Paulinho Nogueira - Traverssia
  • Pat Metheny - Medley
  • Joe Pass - Bluesette
  • Earl Klugh - Ding Dong The Witch Is Dead
  • Harry Leahey - Embraceable You
  • Earl Klugh - April Fools
  • Chet Atkins - Bye Bye Blackbird
  • Paulinho Nogueira - JoaÃÉo e Maria
  • Joe Pass - Stardust
  • Ralph Towner - Here There and Everywhere
  • Charlie Byrd - God Rest Ye Merry Gentlemen
  • Laurindo Almeida - Nocturne
  • Joe Pass - All The Things You Are
  • Cal Collins - The Nearness Of You
  • George Benson - My One and Only Love
  • Jimmy Raney - The End Of A Love Affair
  • Joe Pass - Autumn Leaves
  • Earl Klugh - Emily
  • Earl Klugh - Alice In Wonderland
  • Earl Klugh - Winter Wonderland
  • Ted Greene - Someday my Prince Will Come (Baroque variation)
  • Earl Klugh - Someday My Prince Will Come
  • Lenny Breau - It Could Happen to You
  • Earl Klugh - Smoke Gets In Your Eyes
  • Earl Klugh - Moon River
  • Paulinho Nogueira - Carolina
  • Ralph Towner - Fall In Love Too Easily
  • Pasquale Grasso - Have You Met Miss Jones
  • Pasquale Grasso - Over The Rainbow
  • Wes Montgomery - While We're Young
  • Luiz Bonfa - Tenderly
  • Earl Klugh - Lullaby of Birdland
  • Earl Klugh - The Night Has A Thousand Eyes
  • Jonathan Kreisberg - I Thought About You
  • Lenny Breau - Warm up and Improvisation on Autumn Leaves
  • Stochelo Rosenberg - Improvisation from Neune Campsite Holland
  • Earl Klugh - Our Love Is Here To Stay
  • Johnny Smith - The Maid With Flaxen Hair
  • George Van - Eps Scott's Lullabye
  • Tuck Andress - Winter Wonderland
  • Jonathan Kreisberg - Have You Met Mrs' Jones
  • Al Viola - Cheek to Cheek
  • Lenny Breau - Autumn Leaves
  • Luiz Bonfa - Night and Day
  • Tuck Andress - Silent Night
  • Joe Pass - Someone To Watch Over Me
  • Charlie Byrd - It Came Upon the Midnight Clear
  • Earl Klugh - You Make Me Feel So Young
  • Oscar Peterson - Little Girl Blue
  • Lenny Breau - Emily
  • Lenny Breau - But Beautiful
  • Earl Klugh - If I Fell
  • Jonathan Kreisberg - Canto De Ossanha
  • Chet Atkins - Struttin
  • Russel Malone - Softly and Tenderly
  • Julian Lage - Autumn Leaves
  • Joe Pass - Li'l Darlin 'live'
  • Jimmy Ponder - A Tribute To Rose

Validation split

  • Joe Pass - They Can't Take That Away From Me
  • Johnny Smith - The Boy Next Door
  • Kenny Burrell - People
  • Earl Klugh - The Summer Know
  • Earl Klugh - Tenderly
  • Luiz Bonfa - Quebra Mar (The Seawall)
  • Joe Pass - Night and Day
  • Toninho Horta - Moon River

Test split

  • Paulinho Nogueira - Bachianinha no1
  • Joe Pass - White Christmas
  • Martin Taylor - Danny Boy
  • Earl Klugh - Angelina
  • Martin Taylor - I Remember Clifford
  • Tony Mottola - Yesterday Yesterdays
  • Tuck Andress - Coventry Carol, What child is this
  • Stephen D -. Anderson Disney Medley
  • Peter Leitch - Quasimodo

Transcription Performance

Due to diverse training conditions, we are able to transcribe different types of guitar. In all of the following examples the original audio can be heard on the left channel, while the transcribed audio (synthesised as piano) can be heard on the right channel.


Electric Guitar

Artist: Julian Lage Video source

Steel String Acoustic Guitar

Artist: Sierra Hull Video source

Fingerstyle Acoustic Guitar

Artist: Clive Carroll Video source

Acoustic Archtop Guitar

Artist: Jonathan Stout Video source

Electric Archtop Guitar

Artist: RUE Video source

We also include some examples of our model applied to settings that are very different from the training data (out-of-distribution).


Out of distribution: Rock Guitar

Artist: Tina Šetkić Video source

Out of distribution: Harp

Artist: Amy Turk Video source

Out of distribution: Mandolin

Artist: Sierra Hull Video source

Out of distribution: Violin

Artist: Sarah Kim Video source

Other Notes and Credits

As mentioned in the paper, we identified two alignments in GuitarSet where all the notes appeared to be aligned to the second note onset, instead of the first note onset. This introduces a constant delay in the annotations rendering them inaccurate. The files are:

  • 04_BN3-154-E_comp (off by + 0.409s)
  • 04_Jazz1-200-B_comp (off by + 0.309s)
This issue is being tracked in the GuitarSet repo here.

This work was supported by the UKRI Centre for Doctoral Training in Artificial Intelligence and Music. XR is supported by UK Research and Innovation [grant number EP/S022694/1].

MIDI visualisations were created using ffmpeg, sfizz, Salamander Piano samples and MIDI Visualizer.

Related Links

Automatic Music Transcription is a large field, with lots of excellent work in recent years.

High-Resolution Piano Transcription with Pedals by Regressing Onset and Offset Times introduces the piano transcription model which we build on. They demonstrate that their model is robust to mis-aligned labels, due to a regression technique inspired by YOLOV3.

Unaligned Supervision For Automatic Music Transcription in The Wild provided inspiration for our alignment approach where a score is aligned to model activations from an existing transcription model.

Our work is specific to the guitar, but several other audio-to-MIDI solutions are available for other instruments.

BibTeX

TBC