This is deterministic “data-as-sound” and “sound-as-data.” It’s not semantic conversion (no AI describing images).
Best results: small images, short audio.
Estimated WAV duration
—
Estimated WAV size
—
Loaded image
—
Loaded audio
—
Pro tip: encode an image -> WAV, then decode that WAV back -> image.