Select and Configure your Encoding Preset
- Small: Applies a subtle spatial effect, providing a closer, more intimate audio environment.
- Medium: Offers a balanced spatial effect, suitable for most audio content and providing a natural sense of space.
- Large: Maximizes the spatial effect, creating a more expansive and immersive sound environment, ideal for highly immersive content.
The Configurations Menu, along with its various parameters, Management Buttons, and Preset Drop-Down Menu, offers extensive customization and control over the encoding process. It allows for fine-tuning of encoding settings to align with the desired output format, including options for selecting the appropriate Encoder Type, Output File Type, Profile, Frame Rate, and more. Creating and saving custom configurations can be particularly beneficial for specific use cases, such as encoding for music or film profiles, ensuring optimal quality and the correct file format for the final encoded output. The ability to save and manage presets further enhances workflow efficiency by enabling quick application of tailored configurations, eliminating the need to manually adjust settings with each new project.
When the large “ADD” button is pressed, the current configuration, along with the selected sources from the Source Table, creates new encoding sessions in the Encoding Queue List.
Figure 3.2 – 1 | Configurations Menu Overview
1. Management Buttons
Figure 3.2.1 – 1 | Management Buttons of Configurations
The Configurations Management Buttons enable the ability to save, edit, and delete encoding presets.
• Save Button: By pressing the Save Button on the top right, you can save a particular configuration in a preset.
After clicking the Save Button, you can choose whether to create a new configuration preset or override an existing one.
It’s important to note that the Save Button will only be enabled once changes have been made to one or more parameter values in the current preset.
• Cancel Button: The Cancel Button drops all the current changes.
• “-” Button: The “-” Button allows for the deletion of the currently selected configuration..
2. Preset Drop Down Menu
Figure 3.2.2 – 1 | Preset Drop Down Menu of Configurations
The Preset Drop-Down Menu allows for the selection of the desired configuration preset.
• “Custom” Status: The Custom status appears if the current configuration is not stored as a preset.
It’s important to note that it is possible to apply a custom configuration to a source and add the corresponding session to the Queue List without the need to save the configuration as a preset.
3. Encoder Types
There are currently three Encoder Types in the Auro Encoder Service:
3.1 Auro-Codec Encoder
Figure 3.2.3.1 – 1 | Auro-Codec Configuration Parameters
The renowned Auro-Codec enables distribution of immersive mixes via a single file, supporting 2D + 3D formats in a single carrier. Uncompressed PCM quality is available for 5.1 Surround and 7.1 mixes, and systems equipped with an Auro-Decoder can playback the original AURO-3D® mix. The Auro-Codec parameters provide the ability to configure and define specific behaviors to be applied to the currently selected sources:
• Encoder Type: The Encoder Type Drop-Down Menu provides options to select the desired encoder. Each selection has its own set of parameters.
• Output Channel Order: The Output Channel Order Drop Down Menu is only visible when the Output File Type “multi-channel WAV” is selected. This parameter determines the organization of audio channels within the resulting interleaved WAV file.
The Channel Order within a multi-channel WAV-file has an important impact on the correct channel mapping during playback. It should be chosen carefully depending on the target playback system or device, which can expect or support different channel orders.
When the Encoder Type is set to “Conversion”, you can choose between ITU and SMPTE channel orders. For a multi-channel WAV file in Auro 13.1 format, the ITU and SMPTE channel orders are as follows:
When the Encoder Type “Auro-Codec” is selected, the Output Channel Order is automatically set to the “ITU” channel order. This is because Auro-Codec encoded content must follow the ITU channel order to ensure proper playback on most devices.
For both 5.1-based and 7.1-based Auro-Codec encoded surround carriers, the ITU channel order is as follows:
• ITU Channel Order for 5.1-based Auro-Codec encoded surround carriers:
L, R, C, LFE, Ls, Rs.
• ITU Channel Order for 7.1-based Auro-Codec encoded surround carriers:
L, R, C, LFE, Lb, Rb, Ls, Rs.
(Note that the channel orders of all other formats with fewer channels can be easily deducted from the example provided above by simply omitting the unused channels from the list, as they follow the same channel order sequence).
• Output File Type Drop Down Menu: This feature allows you to select the desired format for the output file.
There are four available options:
• multi channel WAV: Select this option if you want the output file to be in a multi-channel WAV format, which supports multiple audio channels in a single file.
• mono channel WAV: Choose this option if you prefer the output file to be in a mono-channel WAV format, where each audio channel is saved as a separate file.
• FLAC: Select this option if you want the output file to be in FLAC format, which is a lossless audio compression format known for its high-quality audio reproduction.
• FLAC + WAV: Choose this option if you want to generate both FLAC and WAV versions of the output file, with FLAC being a lossless compressed format and WAV being an uncompressed format.
Remarks about the FLAC Export Option:
The FLAC export option allows users to encode audio files in the FLAC format, which is a lossless audio compression format known for its high audio quality and small file size. However, in order for the FLAC export option to work, the “flac” Application must be installed on the user’s system. If the “flac” Application is not installed, the FLAC export option will not function properly.
Checking for FLAC Application Existence:
To determine if the “flac” Application is installed on your system, you can use the following method:
On macOS:
Open a terminal window.
Type “flac -v” in the terminal window and press Enter.
If the “flac” Application is installed, the version information for “flac” will be displayed in the terminal window, confirming its existence on your system.
On Windows:
Open a command prompt window.
Type “flac -v” in the command prompt window and press Enter.
If the “flac” Application is installed, the version information for “flac” will be displayed in the command prompt window, confirming its existence on your system.
Note: If the “flac” Application is not installed on your system, you will need to install it in order to use the FLAC export option provided by the 3D Audio Encoder software. You can download and install the “flac” Application from the official FLAC website or through package managers available on your operating system.
Flac version 1.3.4 is recommended
How to Install the FLAC Application:
Click HERE to learn how to enable the FLAC Export Options in the AURO-3D® Encoder by installing the FLAC Application.
• Profile Drop Down Menu: When the Encoder Type is set to Auro-Codec, you are presented with the option to select either the Music or the Movie profile.
The Movie profile provides the ability to specify a specific fps (frames per second) value for the encoding process, while the Music profile grants access to additional Pre-Roll parameters to ensure proper encoding at the start time of the resulting encoded file.
• Frame Rate Drop Down Menu: Provides the option to select different frame rates for encoding Auro-Codec encoded content in the Movie profile. There are eight available values to choose from:
• 23.976 fps
• 24 fps
• 25 fps
• 29.97 fps
• 30 fps
• 48 fps
• 60 fps
• 72 fps
• Low Pass Filter Drop Down Menu: The Low-pass filter dropdown menu allows you to apply a low-pass filter to the output of the mix. It offers cut-off frequencies of 0 Hz, 18 kHz, 19 kHz, and 20 kHz to choose from. This filter is disabled by default and set to 0.
• Downmix Gains Drop Down Menu: Allows you to switch between Source or Standard mode for the downmix behaviour. In Standard mode, no dynamic treatments will be applied to the encoded version of the file, while in Source mode, the Source Config parameters stored in the source itself will be applied.
• Pre roll Button: When activated, a pre-roll of silence will be included at the beginning of the encoded file. The length of this early silence can be adjusted using the Pre roll Time parameter. Many playback systems require around 300 to 500 ms to start decoding encoded content, and this pre-roll ensures smooth decoding performance for the starting samples of the encoded file. By including a period of silence before the actual audio content begins, potential decoding issues or audio artifacts that may occur during the initial playback can be mitigated, resulting in a more seamless listening experience for the end-user.
• Pre roll Time Parameter: The Pre roll Time parameter allows to specify the duration of the Pre roll in milliseconds. This parameter determines the length of the silence that will be included at the beginning of the encoded file when the Pre roll Button is activated.
• Loudness Standard: Choosing a Loudness standard lets provide feedback on the current compliance of the encoding in the Statistics Panel. If the measured Loudness value is too loud, the value will become red, indicating that the the current encoding is not compliant with the chosen standard. This applies to all Loudness measurements (PLI, LRA, TP, etc.) for both input and output files of each encoding.
• Normalize Loudness: When activated, the encoding process will automatically adjust the levels of the encoded file to ensure compliance with the selected standard. This implies increasing or decreasing the levels to reach the target levels. The result of the normalization process can be consulted in the Statistic Panels. Note that this feature acts as a simple trim adjustment, and not as a dynamic tool. For example, if the integrated value appears to be too low, but the true peak value appears to be too high, the normalization process will simply decrease all levels evenly to make the encoded file compliant, and will not selectively increase the PLI level and decrease the TP one.
• Measured Output Loudness: This option serves as an additional quality control step after the encoding process. When activated, the ‘Measured Output Loudness’ method will reevaluate the Loudness levels of the resulting encoded file, allowing the user to verify if the levels align with the expected requirements. It is important to understand that without this option, the loudness output levels are calculated by the software and not measured. Measuring the levels after the encoding process by activating the ‘Measured Output Loudness’ option is a more accurate method, but it also extends the duration of the encoding process compared to the usual calculation method. As usual, the result of the extra measurement can be consulted in the Statistic Panels.
3.2 Auro-CX Encoder
Figure 3.2.3.2 – 1 | Auro-CX Configuration Parameters
Auro-CX is a next-generation audio codec specifically designed for high-quality streaming of immersive and interactive audio. It builds on the foundational features of Next Generation Audio (NGA) codecs by providing scalable audio coding from lossless to low-bitrate perceptual coding. Auro-CX supports multiple audio formats, including channel-based, object-based, and scene-based audio, making it versatile for home entertainment, broadcast, and more.
Some of its key features include dynamic object rendering, support for personalized and interactive audio, and integration with advanced Auro-3D technologies such as binaural rendering (Auro-Headphones) and adaptive bit-rate streaming. Additionally, it supports a wide range of formats, including stereo, surround, and immersive speaker configurations, up to Auro 13.1. With its advanced bitstream handling and scalability, Auro-CX is designed to provide high-quality sound while maintaining flexibility for various streaming environments, making it a robust codec for future multimedia applications.
⚠️ Important Limitation in the Auro Encoder Service:
Although the current version of the Auro-3D Encoder Service allows for the creation, retrieval, and modification of object elements within Sources, it is crucial to note that all encoders, at this stage, output only pure channel-based content. Object-based audio is not yet supported in the final output, but future updates aim to address this limitation and enable full object-based audio encoding.
The Auro-CX parameters provide the ability to configure and define specific behaviors to be applied to the currently selected sources:
• Encoder Type: The Encoder Type Drop-Down Menu provides options to select the desired encoder. Each selection has its own set of parameters.
• Output File Type Drop Down Menu: This feature allows you to select the desired format for the output file.
There are two available options:
• m4a: A compressed audio format commonly used for storing high-quality audio in a small file size. The m4a format is well-suited for efficient distribution of audio content and supports all Auro-CX encoding modes. It is ideal for delivering high-quality immersive audio in a compact, widely compatible format.
• HLS: The HTTP Live Streaming (HLS) format generates fragmented files for adaptive streaming over the web. This format breaks the audio into small segments, which can dynamically adjust to varying network conditions, making it perfect for delivering immersive audio in streaming environments. HLS provides seamless, scalable playback on a range of devices, ensuring an optimal listening experience even under changing bandwidth constraints.
• Profile Drop Down Menu: When the Encoder Type is set to Auro-Codec, you are presented with the option to select either the Music or the Movie profile.
The Movie profile provides the ability to specify a specific fps (frames per second) value for the encoding process, while the Music profile grants access to additional Pre-Roll parameters to ensure proper encoding at the start time of the resulting encoded file.
• Frame Rate Drop Down Menu: Provides the option to select different frame rates for encoding Auro-Codec encoded content in the Movie profile. There are eight available values to choose from:
• 23.976 fps
• 24 fps
• 25 fps
• 29.97 fps
• 30 fps
• 48 fps
• 60 fps
• 72 fps
• Downmix Gains Drop Down Menu: Allows you to switch between Source or Standard mode for the downmix behaviour. In Standard mode, no dynamic treatments will be applied to the encoded version of the file, while in Source mode, the Source Config parameters stored in the source itself will be applied.
• Encoder Mode: The Encoder Mode defines the type of compression or encoding applied to the audio content. The Auro-CX encoder offers four different modes to accommodate various use cases and quality requirements:
• Lossless: This mode preserves the original audio quality without any compression loss. It is ideal for high-fidelity use cases where maintaining the full dynamic range and detail of the source audio is crucial, such as in professional or archival environments.
• AAC (Auro-Codec Legacy): This mode ensures backward compatibility with older Auro-Codec encodings. Using AAC (Advanced Audio Codec), this mode provides good sound quality with moderate compression, making it a reliable choice for traditional media delivery with Auro-Codec compatibility.
• Transparent: Offers a balance between compression and audio quality. Transparent mode uses advanced compression techniques to reduce file size while maintaining perceptual audio quality that is indistinguishable from the original to most listeners. It is ideal for high-quality streaming scenarios where file size and bandwidth efficiency are important.
• Lossy: This mode applies maximum compression, sacrificing some audio quality to achieve a smaller file size. It is suitable for environments where low bandwidth or storage constraints are critical, such as mobile streaming or low-data streaming services.
• Quality Drop Down Menu: This menu allows the selection of the desired quality level when using the “Transparent” or “Lossy” Encoder Modes. The selected quality level affects the balance between file size and audio quality. The Quality Drop Down Menu is only available when “Transparent” or “Lossy” modes are selected:
• Low: Provides maximum compression, resulting in smaller file sizes at the expense of some audio quality. This option is ideal for low-bandwidth environments or when storage efficiency is a priority.
• Medium: Balances compression and audio quality, offering a middle ground between file size and perceptual audio fidelity. This is suitable for general streaming purposes where a compromise between quality and file size is acceptable.
• High: Minimally compresses the audio, preserving nearly all perceptual quality while slightly reducing file size. This is best for situations where audio quality is important, but some compression is still necessary to manage bandwidth or storage.
• Auro-Matic: Auro-Matic is the proprietary upmixing algorithm used to enhance non-immersive audio content and create a more immersive listening experience when played back through an Auro-3D configuration. This section allows you to configure the Auro-Matic settings, either by using predefined presets or by customizing the effect. The strength of the effect can also be adjusted.
• Settings: You can choose between two settings: Default or Custom. The default option applies a preset configuration, while the custom setting allows for more personalized control over the parameters.
• Auro-Matic Preset Drop Down Menu: This menu provides a choice of five presets that tailor the Auro-Matic upmixing effect to the type of content. The available presets are:
• Small: Applies a subtle upmixing effect, ideal for music or other audio where minimal enhancement is required.
• Medium: Provides a moderate upmixing effect, suitable for content where more pronounced immersion is desired.
• Large: Maximizes the upmixing effect, creating an expansive 3D audio environment.
• Movie: Optimized for movies, this preset enhances the cinematic experience by immersing the listener in a 3D audio space.
• Speech: Designed for dialog-heavy content, this preset optimizes clarity and spatial positioning for speech-based audio.
• Strength: This slider adjusts how strongly the Auro-Matic effect is applied, with values ranging from 0 to 15. A higher value increases the intensity of the upmixing effect.
• Auro-Headphones: Auro-Headphones is a binaural audio rendering technology that provides an immersive spatial audio experience through standard stereo headphones. This feature allows the configuration of how audio is processed for headphone playback, ensuring that listeners experience 3D sound without requiring a multi-speaker setup. You can customize the settings or use default presets.
• Settings: Choose between Default or Custom settings. The default configuration applies a preset, while the custom option allows users to adjust specific parameters for the desired headphone experience.
• Preset Drop Down Menu: The Auro-Headphones preset menu provides three options for adjusting the spatial effect to the listener’s preference:
• LFE Level: This parameter controls the amount of Low-Frequency Effects (LFE) incorporated into the binaural mix. Adjusting the LFE level influences the bass and sub-bass frequencies, which are important for creating a fuller, more impactful listening experience.
• Loudness Standard: Choosing a Loudness standard lets provide feedback on the current compliance of the encoding in the Statistics Panel. If the measured Loudness value is too loud, the value will become red, indicating that the the current encoding is not compliant with the chosen standard. This applies to all Loudness measurements (PLI, LRA, TP, etc.) for both input and output files of each encoding.
• Normalize Loudness: When activated, the encoding process will automatically adjust the levels of the encoded file to ensure compliance with the selected standard. This implies increasing or decreasing the levels to reach the target levels. The result of the normalization process can be consulted in the Statistic Panels. Note that this feature acts as a simple trim adjustment, and not as a dynamic tool. For example, if the integrated value appears to be too low, but the true peak value appears to be too high, the normalization process will simply decrease all levels evenly to make the encoded file compliant, and will not selectively increase the PLI level and decrease the TP one.
3.3 Convert Encoder
Figure 3.2.3.3 – 1 | Convert Configuration Parameters
The Convert Encoder allows conversion of an input file format into a different output file format by specifying the Output File Type value. For example, converting an ADM file into a multi-channel WAV file (objects will be printed into the channel-based bed), or converting a multi-channel WAV file into multi-mono WAV files (all channels will be extracted from the original interleaved file), and so on. The Convert parameters provide the ability to configure and define specific behaviors to be applied to the currently selected sources:
• Encoder Type: The Encoder Type Drop-Down Menu provides options to select the desired encoder. Each selection has its own set of parameters.
• Output Channel Order: The Output Channel Order Drop Down Menu is only visible when the Output File Type “multi-channel WAV” is selected. This parameter determines the organization of audio channels within the resulting interleaved WAV file.
The Channel Order within a multi-channel WAV-file has an important impact on the correct channel mapping during playback. It should be chosen carefully depending on the target playback system or device, which can expect or support different channel orders.
When the Encoder Type is set to “Conversion”, you can choose between ITU and SMPTE channel orders. For a multi-channel WAV file in Auro 13.1 format, the ITU and SMPTE channel orders are as follows:
When the Encoder Type “Auro-Codec” is selected, the Output Channel Order is automatically set to the “ITU” channel order. This is because Auro-Codec encoded content must follow the ITU channel order to ensure proper playback on most devices.
For both 5.1-based and 7.1-based Auro-Codec encoded surround carriers, the ITU channel order is as follows:
• ITU Channel Order for 5.1-based Auro-Codec encoded surround carriers:
L, R, C, LFE, Ls, Rs.
• ITU Channel Order for 7.1-based Auro-Codec encoded surround carriers:
L, R, C, LFE, Lb, Rb, Ls, Rs.
(Note that the channel orders of all other formats with fewer channels can be easily deducted from the example provided above by simply omitting the unused channels from the list, as they follow the same channel order sequence).
• Output File Type Drop Down Menu: This feature allows you to select the desired format for the output file.
There are four available options:
• multi channel WAV: Select this option if you want the output file to be in a multi-channel WAV format, which supports multiple audio channels in a single file.
• mono channel WAV: Choose this option if you prefer the output file to be in a mono-channel WAV format, where each audio channel is saved as a separate file.
• Loudness Standard: Choosing a Loudness standard lets provide feedback on the current compliance of the encoding in the Statistics Panel. If the measured Loudness value is too loud, the value will become red, indicating that the the current encoding is not compliant with the chosen standard. This applies to all Loudness measurements (PLI, LRA, TP, etc.) for both input and output files of each encoding.
• Normalize Loudness: When activated, the encoding process will automatically adjust the levels of the encoded file to ensure compliance with the selected standard. This implies increasing or decreasing the levels to reach the target levels. The result of the normalization process can be consulted in the Statistic Panels. Note that this feature acts as a simple trim adjustment, and not as a dynamic tool. For example, if the integrated value appears to be too low, but the true peak value appears to be too high, the normalization process will simply decrease all levels evenly to make the encoded file compliant, and will not selectively increase the PLI level and decrease the TP one.
• Measured Output Loudness: This option serves as an additional quality control step after the encoding process. When activated, the ‘Measured Output Loudness’ method will reevaluate the Loudness levels of the resulting encoded file, allowing the user to verify if the levels align with the expected requirements. It is important to understand that without this option, the loudness output levels are calculated by the software and not measured. Measuring the levels after the encoding process by activating the ‘Measured Output Loudness’ option is a more accurate method, but it also extends the duration of the encoding process compared to the usual calculation method. As usual, the result of the extra measurement can be consulted in the Statistic Panels.