For accurate listening, please play the audio samples through speakers, headphones, or earphones in a quiet environment.
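Sample Index | Song A | | Song B | Style Transfer | Converted Output |
---|---|---|---|---|---|
#1 | "Young Lust" by Pink Floyd | → | "If You're Too Shy" by The 1975 | A→B | |
#1 | "Young Lust" by Pink Floyd | ← | "If You're Too Shy" by The 1975 | B→A | |
#2 | "Everything Goes On" by Porter Robinson | → | "I'm in Love With You" by The 1975 | A→B | |
#2 | "Everything Goes On" by Porter Robinson | ← | "I'm in Love With You" by The 1975 | B→A | |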
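Conversion Type | Sample Index | Initial Mix (x) | Target Style Mix (ref) | MixFXcloner w/ MEE | MixFXcloner w/ Φ∅ | MixFXcloner w/ Φnorm | MixFXcloner w/ Φp.s. | Ground Truth (gt) |
---|---|---|---|---|---|---|---|---|
Multitrack conversion | #1 | | | | | | | |
Multitrack conversion | #2 | | | | | | | |
Multitrack conversion | #3 | | | | | | | |
Multitrack conversion | #4 | | | | | | | |
Single stem conversion | drums | | | | | | | |
Single stem conversion | bass | | | | | | | |
Single stem conversion | vocals | | | | | | | |
Single stem conversion | other | | | | | | | |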
Note that this is not a fair comparison: DeepAFx-ST converts the EQ and compression style at the mixture level, while MixFXcloner performs stem-wise conversion of the general mixing style.
Style | Input | Reference | DeepAFx-ST | MixFXcloner |
---|---|---|---|---|
Neutral to Warm | ||||
Telephone to Neutral | ||||
Bright to Broadcast | ||||
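To make the distinction in the note above concrete, the following toy sketch contrasts mixture-level and stem-wise processing. It is not the DeepAFx-ST or MixFXcloner implementation: the function names (`mix`, `mixture_level`, `stem_wise`, `gain`) are hypothetical, and a plain gain stands in for the actual audio effects.

```python
from typing import Callable, Dict, List

Signal = List[float]
Effect = Callable[[Signal], Signal]


def mix(stems: Dict[str, Signal]) -> Signal:
    """Sum all stems sample by sample into one mixture."""
    return [sum(samples) for samples in zip(*stems.values())]


def gain(g: float) -> Effect:
    """Toy effect (assumption): scale every sample by g."""
    return lambda signal: [g * s for s in signal]


def mixture_level(stems: Dict[str, Signal], fx: Effect) -> Signal:
    """Mix first, then apply a single effect to the summed mixture."""
    return fx(mix(stems))


def stem_wise(stems: Dict[str, Signal], fx_per_stem: Dict[str, Effect]) -> Signal:
    """Apply a separate effect to each stem, then mix the results."""
    processed = {name: fx_per_stem[name](sig) for name, sig in stems.items()}
    return mix(processed)


# Two toy stems, two samples each
stems = {"drums": [1.0, 0.0], "bass": [0.0, 1.0]}

mixture_out = mixture_level(stems, gain(0.5))
stem_out = stem_wise(stems, {"drums": gain(2.0), "bass": gain(0.5)})
```

In this toy setting, the stem-wise path can change the balance between drums and bass, whereas a single gain applied to the mixture scales both equally. This is one reason the two systems are not directly comparable.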
This demonstration is not included in the paper, but it suggests the potential for controllable style transfer through the latent space.
Input: | Reference A | → | Reference B |
---|---|---|---|
Target Style Mix | - | ||
Interpolation Output | |||
Individual Output | - |
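One way to picture the interpolation between Reference A and Reference B is a linear blend of their style embeddings. This page does not state how the interpolation is computed, so the sketch below is an assumption: `interpolate_embeddings` is a hypothetical helper, and the 4-dimensional vectors are toy values standing in for encoder outputs.

```python
from typing import List, Sequence


def interpolate_embeddings(
    z_a: Sequence[float], z_b: Sequence[float], alpha: float
) -> List[float]:
    """Linearly blend two embeddings: alpha=0 returns z_a, alpha=1 returns z_b."""
    if len(z_a) != len(z_b):
        raise ValueError("embeddings must have the same dimensionality")
    if not 0.0 <= alpha <= 1.0:
        raise ValueError("alpha must lie in [0, 1]")
    return [(1.0 - alpha) * a + alpha * b for a, b in zip(z_a, z_b)]


# Toy embeddings (assumption) for Reference A and Reference B
z_ref_a = [0.0, 1.0, -2.0, 4.0]
z_ref_b = [2.0, 1.0, 2.0, 0.0]

# Sweep from A to B in five evenly spaced steps
path = [interpolate_embeddings(z_ref_a, z_ref_b, step / 4) for step in range(5)]
```

Each vector in `path` would then condition the converter in place of a single reference embedding, giving outputs that move gradually from the style of A toward the style of B.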