We present High-resolution guitar transcription, a method for training a guitar transcription model with excellent performance on real-world recordings. We use a domain adaptation approach to train a model, based on the "High-resolution piano transcription" model of Kong et al., on a small dataset of high-quality solo guitar transcriptions.
Building on the work of Maman and Bermano, we align existing guitar transcriptions to the activations of the piano transcription model. We then use these aligned transcriptions to train a new model, which transcribes the entire GuitarSet in a zero-shot setting with state-of-the-art accuracy.
Using our alignment method, we match a transcribed score to the corresponding audio recording with high accuracy.
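As a rough illustration of the idea (not the exact implementation), the sketch below warps a frame-level onset roll derived from the score onto onset activations produced by a pre-trained transcription model. The use of librosa's DTW, the function name, and the hop size are all assumptions made for this example; the actual method also refines individual note onsets after the coarse alignment.

```python
import librosa

def align_score_to_activations(score_roll, activations, hop_seconds=0.01):
    """Warp score frames onto activation frames with dynamic time warping.

    score_roll:   (n_pitches, n_score_frames) binary onset roll built from the score.
    activations:  (n_pitches, n_audio_frames) onset activations from a pre-trained model.
    Returns a dict mapping score frame index -> estimated time in the audio (seconds).
    """
    # Cumulative cost matrix and warping path between the two frame sequences.
    _, wp = librosa.sequence.dtw(X=score_roll, Y=activations, metric='euclidean')
    score_to_time = {}
    for score_frame, audio_frame in wp[::-1]:  # the path is returned end-to-start
        # Keep the earliest audio frame matched to each score frame.
        score_to_time.setdefault(int(score_frame), audio_frame * hop_seconds)
    return score_to_time
```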
Here we show a piece from our training data. The original audio is from "Johnny Smith - Autumn Nocturne" and the video shows the aligned transcription. Note how the fine-alignment process recovers the micro-timing variations of chord onsets, even though these notes are notated at the same time instant in the original score.
The source data was obtained from the professional transcriber François Leduc. The GuitarPro files are commercially available from his website and, for ease of reproduction, are listed as follows:
Because the training data covers diverse recording conditions, our model can transcribe different types of guitar. In all of the following examples, the original audio can be heard on the left channel, while the transcribed audio (synthesised as piano) can be heard on the right channel.
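For readers who want to assemble similar comparison clips, the following is a minimal sketch (not the exact pipeline used for these examples); it assumes the transcription has been exported as a MIDI file and that pretty_midi's FluidSynth rendering is available with a suitable soundfont.

```python
import numpy as np
import librosa
import pretty_midi
import soundfile as sf

def make_comparison_clip(audio_path, midi_path, out_path, sr=44100):
    """Write a stereo file: original recording left, synthesised transcription right."""
    original, _ = librosa.load(audio_path, sr=sr, mono=True)
    synth = pretty_midi.PrettyMIDI(midi_path).fluidsynth(fs=sr)  # requires a soundfont
    # Zero-pad the shorter signal so both channels have the same length.
    n = max(len(original), len(synth))
    original = np.pad(original, (0, n - len(original)))
    synth = np.pad(synth, (0, n - len(synth)))
    stereo = np.stack([original, synth], axis=1)  # (n_samples, 2)
    sf.write(out_path, stereo, sr)
```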
We also include some examples of our model applied to settings that are very different from the training data (out-of-distribution).
As mentioned in the paper, we identified two alignments in GuitarSet where all the notes appeared to be aligned to the second note onset instead of the first. This introduces a constant delay in the annotations, rendering them inaccurate; a sketch of how such a delay can be compensated follows the list. The files are:
04_BN3-154-E_comp (off by +0.409s)
04_Jazz1-200-B_comp (off by +0.309s)
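A minimal sketch of how such a constant delay could be compensated (a hypothetical helper, not necessarily the processing applied for our reported results), assuming the note annotations have already been read out of the GuitarSet JAMS files as (onset, offset, pitch) tuples:

```python
# Constant delays (in seconds) observed in the two affected GuitarSet annotations.
ANNOTATION_DELAYS = {
    "04_BN3-154-E_comp": 0.409,
    "04_Jazz1-200-B_comp": 0.309,
}

def correct_annotation(notes, track_id):
    """Shift every note earlier by the known constant delay for this track.

    notes: list of (onset_seconds, offset_seconds, midi_pitch) tuples.
    """
    delay = ANNOTATION_DELAYS.get(track_id, 0.0)
    return [(max(0.0, onset - delay), max(0.0, offset - delay), pitch)
            for onset, offset, pitch in notes]
```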
This work was supported by the UKRI Centre for Doctoral Training in Artificial Intelligence and Music. XR is supported by UK Research and Innovation [grant number EP/S022694/1].
MIDI visualisations were created using ffmpeg, sfizz, Salamander Piano samples and MIDI Visualizer.
Automatic Music Transcription is a large field, with much excellent work in recent years; we highlight the works most relevant to ours below.
High-Resolution Piano Transcription with Pedals by Regressing Onset and Offset Times introduces the piano transcription model on which we build. Its authors demonstrate that the model is robust to misaligned labels thanks to an onset/offset time regression technique inspired by YOLOv3.
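Our paraphrase of that idea, as a hedged sketch (the exact target definition and hyperparameters in the original paper may differ): rather than a hard 0/1 onset label per frame, frames near an onset receive a value that encodes how far the frame centre lies from the precise onset time, so the training signal degrades gracefully when labels are slightly misaligned.

```python
import numpy as np

def onset_regression_targets(onset_times, n_frames, hop_seconds=0.01, neighbourhood=5):
    """Soft onset targets for one pitch: a linear decay around each true onset.

    Frames within `neighbourhood` frames of an onset get a target that falls off
    linearly with the distance between the frame centre and the onset time;
    all other frames stay at zero.
    """
    targets = np.zeros(n_frames)
    frame_centres = np.arange(n_frames) * hop_seconds
    for onset in onset_times:
        centre = int(round(onset / hop_seconds))
        lo = max(0, centre - neighbourhood)
        hi = min(n_frames, centre + neighbourhood + 1)
        for i in range(lo, hi):
            decay = 1.0 - abs(frame_centres[i] - onset) / (neighbourhood * hop_seconds)
            targets[i] = max(targets[i], max(0.0, decay))
    return targets
```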
Unaligned Supervision For Automatic Music Transcription in The Wild provided inspiration for our alignment approach, in which a score is aligned to the activations of an existing transcription model.
Our work is specific to the guitar, but audio-to-MIDI solutions are available for several other instruments.
TBC