A Universal Music Translation Network

A waveNet autoencoder is enhanced with a class-predictor, and a switched decoder conditioned on the predicted class.

Any source can be encoded to a latent, and then have its class manually specified, in order to translate the audio to another class.
piano to horn section, for instance.

[arxiv:1805.07848]