Zoom here & Zoom there: Ambisonics

Ever since we started this digital semester as part of the MCT program, we knew we were in for a ride. With each of us sitting at our homes in different parts of the world, riding on the Zoom ferris wheel, resisting that urge to have snacks midway the meeting (sometimes giving up and having them), we kept trying to make the experience more immersive. With our first class in HighFidelity we tried to host ourselves in a more natural way, as it created a spatial map of our voices making it easier to follow each other. After trying multiple software to make the jamming experience more than just an exchange of audio, we tried to do a similar thing during the workshop in Ambisonics, using Zoom as the exchange platform.

Workshop Task:

The task seemed easy: integrate an ambisonic “effect” within Zoom, to simulate the organic way of having conversations. With the voice of each member coming from a different direction and fooling the HRTF into believing that our course mates are sitting right beside with their faces getting displayed in Zoom it was a pretty weird but an interesting experience. Put simply, the aim was to explore a more natural way of communicating online, wherein it feels like all the members are in the same room talking.

Setup:

All members of Team A lie at the Windows end of the spectrum and spoiler alert: this side of the spectrum isn’t as smooth as it could (should?) - we had our fair share of fails and wins while setting up the Ambisonical Zoom. So, let’s start detailing: to integrate ambisonics into the otherwise flat and mono sound of Zoom you need an external software which does this, i.e. maps the input audio into the spatial audio field and then passes it to a binaural decoder and to Zoom via a routing driver.

The first part of this signal flow is achieved by using Reaper as the host DAW and IEM plugins, to manipulate the audio into an ambisonic sound. The input audio from the microphone is received in as an audio track in Reaper, and is processed by the RoomEncoder plugin. This audio is sent to the master track in Reaper which has the Binaural decoder by IEM loaded and then outputted to Zoom via the audio driver Virtual Cable by VB Audio. A scheme of the process can be seen below.

Ambisonics Signal Flow

Ambisonics plugins in Reaper

Don’t forget to select Cable Output driver as the input device in Zoom’s audio settings.

Zoom Setting for Ambisonics

Our experience:

Windows has always struggled with multiple software that use a single audio device. Since we all had different sound devices, a mix of internal and external sound cards, we faced different issues and ultimately were unable to get this system working between the three of us. For example, for one of us, initialising the soundcard in Reaper would cause all Zoom audio to be lost, and this could not be rectified without a restart, making it difficult to communicate with other teammates while trying to get the system working.

Some of our classmates kept their ambisonics settings activated when returning to the main communication room in Zoom, and, after several months of hearing only single channel/centred voices, it was a nice surprise to hear some of the voices from different directions. In real life it is never the case that the exact same sound waves hit both eardrums, even if the sound comes from straight in front or back of you. Because of this, the unnatural way of listening while in a Zoom meeting needs more focus, so, even slightly offset audio felt better and required less mental effort.

Conclusion:

It would be great to have this ambisonic feature as a built-in of Zoom, or as a standalone program that does all the processing for you. Having to get the soundcard to play nice with Reaper, with the virtual audio device (VB-CABLE Virtual Audio Device) and with Zoom, as well as setting up the correct ambisonic routings in Reaper for the master tracks and input tracks is definitely not an user-friendly solution. It would also be interesting to try other solutions like Røde’s soundfield VST, which requires less configuration and doesn’t need a DAW that supports multiple audio channels per track. Overall, it was a nice and fruitful experience that helped understand the theory behind ambisonics.