The task is to transform an audio clip so that it sounds as if it were recorded in a target environment. Given an image of the target environment and the waveform of the original audio, the goal is to re-synthesize the audio so that it matches the acoustics of the target room, as suggested by the room's apparent geometry and materials.
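To make "matching the acoustics of a room" concrete, the following is a minimal sketch of the classical signal-processing view, in which a room is summarized by a room impulse response (RIR) and reverberant audio is obtained by convolving dry audio with it. This is not the approach described above, which must infer the acoustics from an image rather than from a known RIR. The function names `synthetic_rir` and `apply_room`, the exponentially decaying noise model, and the `rt60_s` parameter are all illustrative assumptions.

```python
import numpy as np


def synthetic_rir(rt60_s, sample_rate=16000, seed=0):
    """Toy RIR (assumption): white noise under an exponential decay envelope.

    rt60_s is the time in seconds for the envelope to fall by 60 dB.
    """
    rng = np.random.default_rng(seed)
    n = int(rt60_s * sample_rate)
    t = np.arange(n) / sample_rate
    # Amplitude drops by a factor of 1000 (60 dB) at t = rt60_s.
    envelope = np.exp(-np.log(1000.0) * t / rt60_s)
    rir = rng.standard_normal(n) * envelope
    rir[0] = 1.0  # stand-in for the direct sound path
    return rir / np.max(np.abs(rir))


def apply_room(dry, rir):
    """Convolve dry audio with an RIR and peak-normalize the result."""
    wet = np.convolve(dry, rir)
    peak = np.max(np.abs(wet))
    return wet / peak if peak > 0 else wet


if __name__ == "__main__":
    sr = 16000
    # One second of a 440 Hz tone as a placeholder for the source audio.
    dry = np.sin(2 * np.pi * 440 * np.arange(sr) / sr)
    small_room = apply_room(dry, synthetic_rir(0.2, sr))
    large_hall = apply_room(dry, synthetic_rir(1.5, sr))
    print(len(dry), len(small_room), len(large_hall))
```

In this sketch a longer `rt60_s` produces a longer reverberant tail, which is the kind of property a visually guided model would have to estimate from cues such as room size and surface materials.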