Katana VentraIP

Opus (audio format)

Opus is a lossy audio coding format developed by the Xiph.Org Foundation and standardized by the Internet Engineering Task Force, designed to efficiently code speech and general audio in a single format, while remaining low-latency enough for real-time interactive communication and low-complexity enough for low-end embedded processors.[4][5] Opus replaces both Vorbis and Speex for new applications, and several blind listening tests have ranked it higher-quality than any other standard audio format at any given bitrate until transparency is reached, including MP3, AAC, and HE-AAC.[6][7]

Filename extension

audio/ogg[2]
audio/opus (RTP)[3]

IETF codec working group

September 11, 2012 (2012-09-11)

August 26, 2012 (2012-08-26)

1.5.2 / April 12, 2024 (2024-04-12)

C89

Opus combines the speech-oriented LPC-based SILK algorithm and the lower-latency MDCT-based CELT algorithm, switching between or combining them as needed for maximal efficiency.[4] Bitrate, audio bandwidth, complexity, and algorithm can all be adjusted seamlessly in each frame. Opus has the low algorithmic delay (26.5 ms by default)[8] necessary for use as part of a real-time communication link, networked music performances, and live lip sync; by trading off quality or bitrate, the delay can be reduced down to 5 ms. Its delay is exceptionally low compared to competing codecs, which require well over 100 ms, yet Opus performs very competitively with these formats in terms of quality per bitrate.[9]


As an open format standardized through RFC 6716, a reference implementation called libopus is available under the New BSD License. The reference has both fixed-point and floating-point optimizations for low- and high-end devices, with SIMD optimizations on platforms that support them. All known software patents that cover Opus are licensed under royalty-free terms.[10] Opus is widely used as a voice over IP (VoIP) codec in applications such as Discord,[11] WhatsApp,[12][13][14] and the PlayStation 4.[15]

Improvements to voice activity detection (VAD) and speech/music classification using a recurrent neural network (RNN)

Support for ambisonics coding using channel mapping families 2 and 3

Improvements to stereo speech coding at low bitrate

Using wideband speech encoding down to 9 kbit/s (mediumband is no longer used)

Making it possible to use SILK down to bitrates around 5 kbit/s

Minor quality improvement on tones

Enabling the spec fixes in RFC 8251 by default

Security/hardening improvements

Patent claims[edit]

As an open standard, the algorithms are openly documented, and a reference implementation (including the source code) is published. Broadcom and the Xiph.Org Foundation own software patents on some of the CELT algorithms, and Skype Technologies/Microsoft own some on the SILK algorithms; each offers a royalty-free perpetual license for use with Opus, reserving only the right to make use of their patents to defend against infringement suits of third parties. Qualcomm, Huawei, France Telecom, and Ericsson have claimed that their patents may apply, which Xiph's legal counsel denies, and none have pursued any legal action.[10][134] The Opus license automatically and retroactively terminates for any entity that attempts to file a patent suit.

Official website

Opus on Hydrogenaudio Knowledgebase

Comparison of audio coding formats

Streaming media

xHE-AAC