Apple Glass could make AR believable with accurate 3D audio


Apple’s AR headset or “Apple Glass” smart glasses may offer realistic sound effects to go with its digital imagery laid over a real-world view, by accurately reproducing a sound in such a way that it matches where the sound would come from in 3D space.

Gaming and virtual reality systems all offer some level of audio feedback to the user to accompany visuals. In the case of augmented reality, the experience of seeing a digitally-created item in the real-world view could easily be accentuated by a sound effect when it is interacted with in some way.

While audio can help sell the illusion of video, there are some limitations. A typical stereo headphone setup may provide some limited left and right differentiation at a basic level, while virtual sound systems are able to fake 3D positions by altering the properties of the sound effect to mimic distortions based on its positioning.

For example, a sound played behind the user’s head could be muffled, while one from the front could be much clearer. However, while this can be useful, the effect isn’t truly believable as differences in physiological elements like ears, the head, and body can alter what a person expects to hear from a noise in a specific position.

Each HRTF varies on a number of factors, so as to match differences between people in terms of spatial hearing expectations. While it is relatively time-consuming or impractical to create a HRTF for an individual user using consumer electronics with any real level of accuracy, there is the possibility of using either a generic “average” HRTF, or selecting a HRTF that is closest to the user’s needs.

An example ratings matrix for a group of users experiencing different HRTF settings.

An example ratings matrix for a group of users experiencing different HRTF settings.

The HRTF is then used as part of a binaural simulation digital signal processing algorithm to reproduce an audio recording as binaural sound, which in turn is played through headphones.

The patent centers around the idea of selecting the nearest HRTF to what is needed, by whittling down the database to a select few. The pruning process can include taking away options that are far from basic characteristics of the user, such as height, gender, race, and age.

Cutting down the database to a smaller number increases the chance of the most-correct HRTF being used, the patent advises. After the initial settings, the system could then narrow down its choice to the most generic from the reduced list, to select the most appropriate HRTF for use.

The system would search for the best match for a listener's HRTF, not their specific settings.

The system would search for the best match for a listener’s HRTF, not their specific settings.

It is also suggested that, by grouping users, the mass usage of different HRTFs could be monitored to help improve selection. For example, knowing that users aged 20-25 prefer HRTFs with certain properties will mean the system would err towards those particular filters if it encounters someone matching those settings in the future.

The filing lists its inventors as Darius A. Satongar, Jonathan D. Sheaffer, Martin E. Johnson, and Peter V. Jupin.

Apple files numerous patent applications on a weekly basis. While the existence of a filing indicates areas of interest for the company’s research and development efforts, they do not guarantee the concept will be used in a future product or service.

Apple’s audio processing has resulted in products like the HomePod, as well as its pass-through noise cancellation feature in the AirPods Pro, but it is still continuing to develop more sound-related ideas.

In December 2019, it was granted a patent for a “System to move a virtual sound away from a listener using a crosstalk canceller,” which performs a similar spatial audio trick in making a sound appear to emanate from a different place than the device’s speakers.

A related patent called “Display Devices with Multimodal Audio” suggests a headset could have earpieces that adjust to suit the type of audio being presented.



Source link

Be the first to comment

Leave a Reply

Your email address will not be published.


*



The reCAPTCHA verification period has expired. Please reload the page.