Click here to Skip to main content
15,898,010 members
Articles / Programming Languages / C#
Article

A full-duplex audio player in C# using the waveIn/waveOut APIs

Rate me:
Please Sign up or sign in to vote.
4.75/5 (113 votes)
31 Aug 20033 min read 2.2M   22.3K   262   362
An article on low-level audio capture and playback using the waveIn/waveOut APIs through P/Invoke in C#.

Sample Image - cswavrec.gif

Introduction

As I already mentioned in my article A low-level audio player in C#, there are no built-in classes in the .NET framework for dealing with sound. This holds true not only for audio playback, but also for audio capture.

It should be noted, though, that the Managed DirectX 9 SDK does include classes for high-level and low-level audio manipulation. However, sometimes you don’t want your application to depend on the full DX 9 runtime, just to do basic sound playback and capture, and there are also some areas where Managed DirectSound doesn’t help at all (for example, multi-channel sound playback and capture).

Nevertheless, I strongly recommend you to use Managed DirectSound for sound playback and capture unless you have a good reason for not doing so.

This article describes a sample application that uses the waveIn and waveOut APIs in C# through P/Invoke to capture an audio signal from the sound card’s input, and play it back (almost) at the same time.

Using the code

The sample code reuses the WaveOutPlayer class from my article A low-level audio player in C#. The new classes in this sample are WaveInRecorder and FifoStream.

The FifoStream class extends System.IO.Stream to implement a FIFO (first-in first-out) of bytes. The overridden Write method adds data to the FIFO’s tail, and the Read method peeks and removes data from the FIFO’s head. The Length property returns the amount of buffered data at any time. Calling Flush will clear all pending data.

The WaveInRecorder class is analogous to the WaveOutPlayer class. In fact, if you look at the source files, you’ll notice that the implementations of these classes are very similar. As with WaveOutPlayer, the interface of this class has been reduced to the strict minimum.

Creating an instance of WaveInRecorder will cause the system to start recording immediately. Here’s the code that creates the WaveOutPlayer and WaveInRecorder instances.

C#
private void Start()
{
    Stop();
    try
    {
        WaveLib.WaveFormat fmt = new WaveLib.WaveFormat(44100, 16, 2);
        m_Player = new WaveLib.WaveOutPlayer(-1, fmt, 16384, 3, 
                        new WaveLib.BufferFillEventHandler(Filler));
        m_Recorder = new WaveLib.WaveInRecorder(-1, fmt, 16384, 3, 
                        new WaveLib.BufferDoneEventHandler(DataArrived));
    }
    catch
    {
        Stop();
        throw;
    }
}

The WaveInRecorder constructor takes five parameters. Except for the last parameter, their meaning is the same as in WaveOutPlayer.

The first parameter is the ID of the wave input device that you want to use. The value -1 represents the default system device, but if your system has more than one sound card, then you can pass any number from 0 to the number of installed sound cards minus one, to select a particular device.

The second parameter is the format of the audio samples.

The third and forth parameters are the size of the internal wave buffers and the number of buffers to allocate. You should set these to reasonable values. Smaller buffers will give you less latency, but the captured audio may have gaps on it if your computer is not fast enough.

The fifth and last parameter is a delegate that will be called periodically as internal audio buffers are full of captured data. In the sample application we just write the captured data to the FIFO, like this:

C#
private void DataArrived(IntPtr data, int size)
{
    if (m_RecBuffer == null || m_RecBuffer.Length < size)
        m_RecBuffer = new byte[size];
    System.Runtime.InteropServices.Marshal.Copy(data, m_RecBuffer, 0, size);
    m_Fifo.Write(m_RecBuffer, 0, m_RecBuffer.Length);
}

Similarly, the Filler method is called every time the player needs more data. Our implementation just reads the data from the FIFO, as shown below:

C#
private void Filler(IntPtr data, int size)
{
    if (m_PlayBuffer == null || m_PlayBuffer.Length < size)
        m_PlayBuffer = new byte[size];
    if (m_Fifo.Length >= size)
        m_Fifo.Read(m_PlayBuffer, 0, size);
    else
        for (int i = 0; i < m_PlayBuffer.Length; i++)
            m_PlayBuffer[i] = 0;
    System.Runtime.InteropServices.Marshal.Copy(m_PlayBuffer, 
                                                 0, data, size);
}

Note that we declared the temporary buffers m_RecBuffer and m_PlayBuffer as member fields in order to improve performance by saving some garbage collections.

To stop streaming, just call Dispose on the player and capture objects. We also need to flush the FIFO so that the next time Start is called there is no residual data to play.

C#
private void Stop()
{
    if (m_Player != null)
        try
        {
            m_Player.Dispose();
        }
        finally
        {
            m_Player = null;
        }
    if (m_Recorder != null)
        try
        {
            m_Recorder.Dispose();
        }
        finally
        {
            m_Recorder = null;
        }
    m_Fifo.Flush(); // clear all pending data
}

Conclusion

This sample demonstrates how to combine the waveIn and waveOut APIs in C#. As an exercise, you may want to combine this code with the audio effect framework in the article Programming Audio Effects in C#, to apply effects to a live audio input in real-time, although latency may be an issue for certain applications.

License

This article has no explicit license attached to it but may contain usage terms in the article text or the download files themselves. If in doubt please contact the author via the discussion board below.

A list of licenses authors might use can be found here


Written By
Web Developer
Luxembourg Luxembourg
Ianier Munoz lives in France and works as a senior consultant and analyst for an international consulting firm. His specialty is in multimedia applications, and he has authored some popular software, such as American DJ's Pro-Mix, Chronotron and Adapt-X.

Comments and Discussions

 
GeneralDoes not work in x64 Pin
yasaralper29-Aug-10 2:15
yasaralper29-Aug-10 2:15 
GeneralRe: Does not work in x64 Pin
NAMEJ12-Mar-11 0:30
NAMEJ12-Mar-11 0:30 
GeneralRe: Does not work in x64 Pin
yasaralper13-Mar-11 10:36
yasaralper13-Mar-11 10:36 
GeneralRe: Does not work in x64 Pin
Roey C19-Jan-12 7:57
Roey C19-Jan-12 7:57 
GeneralRe: Does not work in x64 - example is in error Pin
jcameron2330-Nov-12 13:45
jcameron2330-Nov-12 13:45 
Generaltwo USB-soundcards cause problems Pin
Member 402255423-Aug-10 20:40
Member 402255423-Aug-10 20:40 
QuestionI need digitized data [modified] Pin
rrsharif16-Aug-10 22:54
rrsharif16-Aug-10 22:54 
GeneralC# saving Wav Pin
Fahed Al Daye1-Jun-10 6:56
Fahed Al Daye1-Jun-10 6:56 
I am making an editor for the engine that will run both for Windows and xBox 360, I am using the XNA technology for the engine as it is the best for this stuff. However, the editor does not require the use of the XNA technology. How do I load sound in the form, have a button that plays the sound, stop the sound and then save the sound in my own .bin file?

Let us say the sound.bin have two sound files stored in them, not just a little c:\location\sound.wav string that directs it to the sound, but the actual two sounds are stored in the file. How do you do that? And Using the reverse method of saving these sounds to load them and store them in the array, array[0]=holds sound 1, array[1]=holds sound 2, then using the function I would be able to play that sound, how do I do that?

1) Click on the Add Sound button, a windows dialog box comes up. I select the .wav file I want to add.
2) When I select the file it stores it in an array and puts in the listbox: sound 0
3) I click Add Sound button again, a window dialog box comes up. I select the .wav file I want to add.
4) When I select the file it stores it in an array and puts in the listbox: sound 1
5) When I select the item in the listbox.selectIndex = 0, and press the Play button it plays the sound 0 from the Array[0] index.
6) When I select the item in the listbox.selectindex =1, and press the Play button it plays the sound 1 from the Array[1] index.
7) When I press the OK button in the Window, before the Window performs the action this.Close();, it saves the information in the Array in a file called test.bin.
8) The information that is stored in the Array is the actual sound, which is sound 0 and sound 1.
9) When the form loads, it executes the LoadSound() function I make and what it does it looks at the test.bin file and fills the Array[0] and Array[1] with Sound 0 and Sound 1

So what happens is that I toke an actual normal file0.wav and file1.wav and saved them both as a single file called test.bin. The test.bin holds sound0 and sound 1 or whatever sound name you called it with the actual file0.wav and file1.wav. When you open test.bin you see gibberish because it is two .wav files saved together as test.bin. The test.bin can hold as many .wav files you want. So let us say the first file0.wav is 12 KB and the second file1.wav is 12 KB, the test.bin should be 24 KB plus whatever bytes or KBs extra about the information of the sound. Can anyone help to the right direction how to achieve this? Thanks in advance.
Questionis it possible to duplicate channels ? Pin
srdusad13-May-10 19:43
srdusad13-May-10 19:43 
Questionquestion Pin
fribe20-Apr-10 5:23
fribe20-Apr-10 5:23 
Generaldelay Pin
jachuuu22-Mar-10 3:18
jachuuu22-Mar-10 3:18 
GeneralRe: delay Pin
k6343002745-Apr-11 16:18
k6343002745-Apr-11 16:18 
Generalvs 2010... Pin
grummbunger12-Mar-10 13:29
grummbunger12-Mar-10 13:29 
Generalecho Pin
xiscoverso9-Jan-10 1:05
xiscoverso9-Jan-10 1:05 
GeneralRe: echo Pin
alex let12-Feb-10 6:13
alex let12-Feb-10 6:13 
GeneralDoesn't work if running on a 64 bit application. Pin
coder25519-Dec-09 9:25
coder25519-Dec-09 9:25 
GeneralRe: Doesn't work if running on a 64 bit application. Pin
me200724-Feb-10 0:38
me200724-Feb-10 0:38 
QuestionWhy is the size fixed to 16384? Pin
ralphigo27-Oct-09 3:31
ralphigo27-Oct-09 3:31 
Generala better mothod Pin
mutasimmo16-Oct-09 23:15
mutasimmo16-Oct-09 23:15 
QuestionHow to Save the voice record file in the client system Using MCI in c# .net 2.0 web application Pin
bruze21-Sep-09 9:59
bruze21-Sep-09 9:59 
GeneralProblem using wave API's [modified] Pin
Member 211727530-Jun-09 3:26
Member 211727530-Jun-09 3:26 
GeneralRe: Problem using wave API's Pin
Member 21172751-Jul-09 1:31
Member 21172751-Jul-09 1:31 
QuestionWill it work on Vista ? Pin
Jack987316-Jun-09 1:39
Jack987316-Jun-09 1:39 
GeneralWhy not use the Queue object Pin
Member 474314211-Jun-09 9:48
Member 474314211-Jun-09 9:48 
GeneralYet another latency question.. Pin
MrGALit9-May-09 9:22
MrGALit9-May-09 9:22 

General General    News News    Suggestion Suggestion    Question Question    Bug Bug    Answer Answer    Joke Joke    Praise Praise    Rant Rant    Admin Admin   

Use Ctrl+Left/Right to switch messages, Ctrl+Up/Down to switch threads, Ctrl+Shift+Left/Right to switch pages.