Please use devices such as speakers, headphones, and earphones in a quiet environment to analyze the sound source.
| Sample Index |
Song A | Song B | Style Transfer |
Converted Output | |
|---|---|---|---|---|---|
| #1 | "Young Lust" by Pink Floyd | → | "If You're Too Shy" by The 1975 | A→B | |
| ← | B→A | ||||
| #2 | "Everything Goes On" by Porter Robinson | → | "I'm in Love With You" by The 1975 | A→B | |
| ← | B→A |
Please use devices such as speakers, headphones, and earphones in a quiet environment to analyze the sound source.
| Conversion Type |
Sample Index |
Initial Mix (x) |
Target Style Mix (ref) |
MixFXcloner w/ MEE |
MixFXcloner w/ Φ∅ |
MixFXcloner w/ Φnorm |
MixFXcloner w/ Φp.s. |
Ground Truth (gt) |
|---|---|---|---|---|---|---|---|---|
| Multitrack conversion |
#1 | |||||||
| #2 | ||||||||
| #3 | ||||||||
| #4 | ||||||||
| Single stem conversion |
drums | |||||||
| bass | ||||||||
| vocals | ||||||||
| other |
Note that this is not a fair comparison since DeepAFx-ST converts the EQ and compression style as a mixture level, while MixFXcloner performs stem-wise conversion on the general mixing style.
| Style | Input | Reference | DeepAFx-ST | MixFXcloner |
|---|---|---|---|---|
| Neutral to Warm | ||||
| Telephone to Neutral | ||||
| Bright to Broadcast |
This demonstration is not mentioned in the paper, but this example implies a potential for controllable style transfer using latent space.
| Input: | Reference A | → | Reference B |
|---|---|---|---|
| Target Style Mix | - | ||
| Interpolation Output | |||
| Individual Output | - | ||