Digital audio representation in C++

03 Oct 2023

Digital audio processing is a fundamental aspect of many modern applications, ranging from multimedia systems to music production software. Understanding how digital audio is represented in C++ can pave the way for building powerful audio applications. In this blog post, we will explore the basics of digital audio representation in C++.

Sample Rate and Bit Depth

Digital audio is a discrete representation of continuous analog audio signals. It consists of a sequence of samples, where each sample represents the amplitude of the audio signal at a specific point in time. Two crucial parameters that define digital audio representation are sample rate and bit depth.

Sample Rate: The sample rate measures the number of samples per second. Common sample rates include 44.1 kHz (CD quality) and 48 kHz (DVD quality). Higher sample rates provide better audio fidelity but come at the cost of increased memory usage and processing power.

Bit Depth: Bit depth represents the number of bits used to encode each sample. The bit depth determines the dynamic range and the precision of digital audio. Common bit depths include 16-bit (CD quality) and 24-bit (studio quality). Higher bit depths offer better audio resolution and reduce quantization noise.

PCM Representation

In C++, digital audio is commonly represented using Pulse Code Modulation (PCM). PCM is a method for digitally representing analog signals by taking regular samples of the signal’s amplitude. Each sample is then quantized and represented as a binary number.

Let’s take a look at a simple example of PCM representation in C++:

#include <iostream>
#include <cstdint>

int main() {
    // Audio parameters
    int sampleRate = 44100;
    int bitDepth = 16;

    // Sample calculation
    int channels = 2; // stereo audio
    int durationSeconds = 5;
    int numSamples = sampleRate * durationSeconds * channels;

    // Allocate memory for audio buffer
    int16_t* audioBuffer = new int16_t[numSamples];

    // Generate audio data
    for (int i = 0; i < numSamples; i++) {
        // Generate a sine wave
        double amplitude = 0.5; // 50% volume
        double frequency = 440; // 440 Hz (A4)
        double time = static_cast<double>(i) / sampleRate;
        double sampleValue = amplitude * std::sin(2 * M_PI * frequency * time);

        // Convert to PCM format
        int16_t pcmValue = static_cast<int16_t>(sampleValue * std::numeric_limits<int16_t>::max());

        // Store sample in buffer
        audioBuffer[i] = pcmValue;
    }

    // Process or save audio buffer

    // Clean up
    delete[] audioBuffer;

    return 0;
}

In the code snippet above, we allocate memory for an audio buffer to store the generated audio data. We then loop through each sample and generate a sine wave based on the desired frequency and amplitude. Finally, we convert the floating-point sample value to PCM format using the int16_t data type and store it in the audio buffer.

Conclusion

Understanding the basics of digital audio representation in C++ is essential for developing audio-based applications. By grasping concepts like sample rate, bit depth, and PCM representation, you can manipulate, process, and generate digital audio using the power of C++. So, go ahead and explore the fascinating world of digital audio in your next project!

#programming #audioengineering