# latency and fuzzy echos in mic recording

17 replies to this topic

### #1rouncer

Senior Member

• Members
• 2722 posts

Posted 12 May 2012 - 01:41 PM

this code snippet takes capture from the mic and then pumps it to the output stream.
it has a little latency, and it has lots of bugs in the output.
if i switch it round and capture sound the frame before i write it, it cleans up but then theres even more latency, i need less latency as possible so i can use the computer as an amp for my guitar or voice... is this possible with direct sound?


void write_plays(void)
{

LPVOID lpvPtr1;
DWORD dwBytes1;
LPVOID lpvPtr2;
DWORD dwBytes2;

LPVOID clpvPtr1;
DWORD cdwBytes1;
LPVOID clpvPtr2;
DWORD cdwBytes2;

static int offset=0;
static int last_clock=timeGetTime();
int clock=timeGetTime();

int tp=clock-last_clock;

int write=(int)((float)tp/1000.0f*44100.0f);

static int cwp=0;

HRESULT hr;

DWORD cp,rp;

capture->GetCurrentPosition(&cp,&rp);

DWORD locksize;

if(rp>cp)
{
locksize=(capture_size-rp)+cp;
}
else
{
locksize=cp-rp;
}

rwp=cwp;

hr=capture->Lock(rp, locksize, &clpvPtr1, &cdwBytes1, &clpvPtr2, &cdwBytes2, 0);
if(hr==S_OK)
{
SHORT* us=(SHORT*)clpvPtr1;
SHORT* us2=(SHORT*)clpvPtr2;
DWORD  b=(cdwBytes1>>1);
DWORD  b2=(cdwBytes2>>1);

int j;
for(j=0;j<locksize;j++)
{
int wab;
if(j*2<(signed)B)
{
wab=us[j*2+0];
wab+=us[j*2+1];
wab/=2;
capture_copy[cwp]=(short)wab;
cwp++;
if(cwp==1000000) cwp=0;
}
else if((j*2-B)<b2)
{
wab=us2[(j*2-B)+0];
wab+=us2[(j*2-B)+1];
wab/=2;
capture_copy[cwp]=(short)wab;
cwp++;
if(cwp==1000000) cwp=0;
}

}
capture->Unlock(clpvPtr1, cdwBytes1, clpvPtr2, cdwBytes2);
}

DWORD pc,wc;

primary->GetCurrentPosition(&pc,&wc);

// Obtain write pointer.
hr=primary->Lock(wc%primary_size, primary_size, &lpvPtr1, &dwBytes1, &lpvPtr2, &dwBytes2, 0);
if(hr==S_OK)
{
SHORT* us=(SHORT*)lpvPtr1;
SHORT* us2=(SHORT*)lpvPtr2;
DWORD  b=(dwBytes1>>1);
DWORD  b2=(dwBytes2>>1);

int j;
for(j=0;j<write;j++)
{
int wa=0;

if(wa>32766) wa=32766;
if(wa<-32766) wa=-32766;
short wab=(short)wa;

ram_copy[ram_write]=wab;

ram_write++;

if(ram_write==100000000) ram_write=0;

//   short wab=saw;

if(j*2<(signed)B)
{
us[j*2+0]=wab;
us[j*2+1]=wab;
}
else if((j*2-B)<b2)
{
us2[(j*2-B)+0]=wab;
us2[(j*2-B)+1]=wab;
}
}
primary->Unlock(lpvPtr1, dwBytes1, lpvPtr2, dwBytes2);
}

offset+=write;
last_clock=clock;
}



you used to be able to fit a game on a disk, then you used to be able to fit a game on a cd, then you used to be able to fit a game on a dvd, now you can barely fit one on your harddrive.

### #2}:+()___ (Smile)

Member

• Members
• 169 posts

Posted 12 May 2012 - 02:31 PM

I thied the same thing years ago and eventually give up: Windows simply don't have realtime sound API. There is some hacks used by JACK, but I dunno about sound cards coverage.
Sorry my broken english!

### #3rouncer

Senior Member

• Members
• 2722 posts

Posted 12 May 2012 - 03:47 PM

damno... yeh i cleared it up, but its a SECOND late, its bad latency... im not giving up yet tho, if i got to maybe a third of a second somehow...
you used to be able to fit a game on a disk, then you used to be able to fit a game on a cd, then you used to be able to fit a game on a dvd, now you can barely fit one on your harddrive.

### #4}:+()___ (Smile)

Member

• Members
• 169 posts

Posted 12 May 2012 - 04:20 PM

Human ear can hear delays of several tens of msec. 1/3 of second is as bad as 1. Try JACK API, if it works it'll be much better.
Sorry my broken english!

### #5Reedbeta

DevMaster Staff

• 5308 posts
• LocationSanta Clara, CA

Posted 12 May 2012 - 05:34 PM

I've been using VSTHost with ASIO4ALL for this. The ASIO driver decreases the latency quite a lot (although it's still noticable), and should work with most any Windows setup; you can use the free VST SDK to build your own audio processing plugins, though I haven't done this.
reedbeta.com - developer blog, OpenGL demos, and other projects

### #6rouncer

Senior Member

• Members
• 2722 posts

Posted 12 May 2012 - 05:57 PM

working with direct sound a bit more, i think i should be able to get it to 10ms, im pretty sure, ive just got to work out how the thing works... ill be back if im successful.
you used to be able to fit a game on a disk, then you used to be able to fit a game on a cd, then you used to be able to fit a game on a dvd, now you can barely fit one on your harddrive.

### #7rouncer

Senior Member

• Members
• 2722 posts

Posted 26 May 2012 - 10:03 AM

naught to my victory, but what the hell is ASIO, do you need a special sound card for it? my computer doesnt seem to work with fruity loops ASIO4ALL.
you used to be able to fit a game on a disk, then you used to be able to fit a game on a cd, then you used to be able to fit a game on a dvd, now you can barely fit one on your harddrive.

### #8rouncer

Senior Member

• Members
• 2722 posts

Posted 26 May 2012 - 10:29 AM

actually, i got it to about ~100 milliseconds or something... but its still not really perfect enough for playing a guitar into or whatever. using the computer as an amp is actually impossible, which sucks. unless its not?
you used to be able to fit a game on a disk, then you used to be able to fit a game on a cd, then you used to be able to fit a game on a dvd, now you can barely fit one on your harddrive.

### #9Reedbeta

DevMaster Staff

• 5308 posts
• LocationSanta Clara, CA

Posted 26 May 2012 - 06:08 PM

Hmm, that's too bad. I got good enough latency for guitar playing with ASIO4ALL, but I haven't measured exactly what it is. You'd probably need some kind of high-end hardware to do much better. Consumer PC hardware/software are just not designed for real-time, minimum latency audio processing
reedbeta.com - developer blog, OpenGL demos, and other projects

### #10geon

Senior Member

• Members
• 939 posts

Posted 26 May 2012 - 07:25 PM

Reedbeta, on 26 May 2012 - 06:08 PM, said:

You'd probably need some kind of high-end hardware to do much better.

I believe there is an iPhone app that does realtime autotune on hardware as early as the 3G s.

I also remember playing with a mic on my 486 dx2 on win 3.11 and using the computer as an amp. I also did exactly that on my P2 with win2k. They might have had physical wires for that, but I'm not sure.

### #11TheNut

Senior Member

• Moderators
• 1701 posts
• LocationCyberspace

Posted 26 May 2012 - 07:34 PM

Are you sure you can't go lower than 100msec? I haven't touched my DirectX Sound capture code in a long time, but I just wrote a crude demo capturing audio at 10msec. It could probably go lower, but I'd have to invest more time to do it properly. The sound buffer should also be able to go just as low, so long as you can write the audio data to hardware fast enough. I remember doing some mic stuff way back in the day and I don't recall this being a problem. 100msec is for mobile phones, not desktop PCs
http://www.nutty.ca - Being a nut has its advantages.

### #12rouncer

Senior Member

• Members
• 2722 posts

Posted 27 May 2012 - 02:05 AM

TheNut... so you actually got direct sound to perform at 10ms? I wouldnt mind having a look at your code!

DWORD cp2,rpw2;
capture->GetCurrentPosition(&cp2,&rpw2);

rp=rpw2-(wri+1)*4; //wri is the amount of samples to read

capture->Lock(rp, capture_size, &clpvPtr1, &cdwBytes1, &clpvPtr2, &cdwBytes2, 0);

thats the position of the capture buffer im reading from, just behind the read cursor... is this the most quickest way to get to the data possible?

(note on the thread im reading and outputting every 10 ms, but there is more latency than that for some reason.)
you used to be able to fit a game on a disk, then you used to be able to fit a game on a cd, then you used to be able to fit a game on a dvd, now you can barely fit one on your harddrive.

### #13TheNut

Senior Member

• Moderators
• 1701 posts
• LocationCyberspace

Posted 27 May 2012 - 11:43 AM

The API use is the same, but going about copying the data is a bit different. First, we'll start from the beginning. This is how I initialize the DSound capture buffer.

bool CreateCaptureBuffer (int numChannels, int samplesPerSecond, int bitsPerSample)
{
// Create Capture Device
//
// GUIDs:	DSDEVID_DefaultCapture		- System-wide default audio capture device.
//			DSDEVID_DefaultVoiceCapture	- Default voice capture device.
if ( FAILED(DirectSoundCaptureCreate(&DSDEVID_DefaultVoiceCapture, &mDirectCapture, 0)) )
return false;

// Set direct capture buffer description
memset(&mWF, 0, sizeof(WAVEFORMATEX));
mWF.cbSize			= 0;
mWF.nChannels		= numChannels;
mWF.nSamplesPerSec	= samplesPerSecond;
mWF.wBitsPerSample	= bitsPerSample;
mWF.nBlockAlign		= numChannels * bitsPerSample / 8;
mWF.nAvgBytesPerSec	= mWF.nBlockAlign * samplesPerSecond;
mWF.wFormatTag		= WAVE_FORMAT_PCM;

// Set capture buffer description
memset(&mDSDesc, 0, sizeof(DSCBUFFERDESC));
mDSDesc.dwSize			= sizeof(DSCBUFFERDESC);
mDSDesc.dwBufferBytes	= mWF.nAvgBytesPerSec;
mDSDesc.lpwfxFormat		= &mWF;

// Create capture buffer
if ( FAILED(mDirectCapture->CreateCaptureBuffer(&mDSDesc, &mDSCaptureBuffer, 0)) )
return false;

return true;
}


In my test app, I used a mono channel, 44100 HZ 16 bit per sample capture buffer. As you can see from the code, I'm allocating a 1 second buffer (88200 bytes), which is an optimal comfort zone to deal with all situations. If my CPU starts to bottleneck, I can slow down the capture polling up to at most a 1 second delay.

Now, for the capture code:
int Read (char *buffer, int size)
{
void *capturedData1 = 0;
void *capturedData2 = 0;

int capturedLength1 = 0;
int capturedLength2 = 0;
int capturePos = 0;
int captureSize = 0;

// Get the current capture & read positions
return 0;

// Calculate the size of the captured buffer
// Cycle back?
if ( captureSize < 0 )
captureSize += mDSDesc.dwBufferBytes;
// Too late, missed data
if ( captureSize > size )
captureSize = size;

// Lock the buffer for copy
if ( FAILED(mDSCaptureBuffer->Lock(mCaptureOffset, captureSize, &capturedData1, (LPDWORD)&capturedLength1, &capturedData2, (LPDWORD)&capturedLength2, 0)) )
return 0;

// Deal with first buffer
if ( capturedData1 )
{
memcpy(buffer, capturedData1, capturedLength1);

// Move the capture offset
mCaptureOffset += capturedLength1;
mCaptureOffset %= mDSDesc.dwBufferBytes;	// Circular buffer
}

// Deal with second buffer
if ( capturedData2 )
{
memcpy(&buffer[capturedLength1], capturedData2, capturedLength2);

// Move the capture offset
mCaptureOffset += capturedLength2;
mCaptureOffset %= mDSDesc.dwBufferBytes;	// Circular buffer
}

// Unlock the buffer
mDSCaptureBuffer->Unlock(capturedData1, capturedLength1, capturedData2, capturedLength2);

// Return the copied amount
return (capturedLength1 + capturedLength2);

return 0;
}


I don't play around here. I get pointers to both buffers and copy them quickly. If the first one is set, then I copy over capturedLength1 bytes. If the second buffer is set, then I append capturedLength2 bytes. The total amount of captured bytes is equal to the sum of both buffers, which should be the same as captureSize. In my test app, bufferSize was set to 10 ms, or 882 bytes. Each time I called this method, 882 bytes were successfully read. If my CPU was not fast enough to do this, I would start to lose captured data. If that's what you're experiencing, check that your copy operations and other code is fast enough to poll at that interval.
http://www.nutty.ca - Being a nut has its advantages.

### #14rouncer

Senior Member

• Members
• 2722 posts

Posted 02 June 2012 - 05:55 AM

where does mCaptureOffset come from? i dont see where its initialized...
you used to be able to fit a game on a disk, then you used to be able to fit a game on a cd, then you used to be able to fit a game on a dvd, now you can barely fit one on your harddrive.

### #15TheNut

Senior Member

• Moderators
• 1701 posts
• LocationCyberspace

Posted 02 June 2012 - 10:28 AM

It's a member variable set to 0 when I call the StartCapture method (unlisted code), just before periodically calling the Read method.
http://www.nutty.ca - Being a nut has its advantages.

### #16rouncer

Senior Member

• Members
• 2722 posts

Posted 02 June 2012 - 01:02 PM

ah, thanks.
you used to be able to fit a game on a disk, then you used to be able to fit a game on a cd, then you used to be able to fit a game on a dvd, now you can barely fit one on your harddrive.

### #17rouncer

Senior Member

• Members
• 2722 posts

Posted 03 June 2012 - 03:59 PM

hmm, for some reason the gap between the capture offset and the read cursor is never 882 bytes, its jumping around all over the place! what have I got wrong?

while(1)
{
Sleep(10);

DWORD cp2,rpw2;
capture->GetCurrentPosition(&cp2,&rpw2);

int wri=rpw2-capture_offset;

if(wri<0)
{
wri+=capture_size;
}

int old_cwp=cwp;

HRESULT hr=capture->Lock(capture_offset, wri, &clpvPtr1, &cdwBytes1, &clpvPtr2, &cdwBytes2, 0);
if(hr==S_OK)
{
SHORT* us=(SHORT*)clpvPtr1;
SHORT* us2=(SHORT*)clpvPtr2;

if(us)
{
int j;
for(j=0;j<cdwBytes1/2;j++)
{
int wab=us[j];
capture_copy[cwp]=(short)wab;
cwp++;
if(cwp==10000000) cwp=0;
}
}

if(us2)
{
int j;
for(j=0;j<cdwBytes2/2;j++)
{
int wab=us2[j];
capture_copy[cwp]=(short)wab;
cwp++;
if(cwp==10000000) cwp=0;
}

}
capture->Unlock(clpvPtr1, cdwBytes1, clpvPtr2, cdwBytes2);

capture_offset+=wri;
capture_offset%=capture_size;

}
}


you used to be able to fit a game on a disk, then you used to be able to fit a game on a cd, then you used to be able to fit a game on a dvd, now you can barely fit one on your harddrive.

### #18rouncer

Senior Member

• Members
• 2722 posts

Posted 13 June 2012 - 05:00 AM

funnily enough capture_size wasnt setting, so i set it manually and it started working.

heres something i did stuffing around live.
http://soundcloud.co...r81/phaservoice

81.6 millisecond delay, still, unfortunately.

takes 3600 samples to get to the speaker.
you used to be able to fit a game on a disk, then you used to be able to fit a game on a cd, then you used to be able to fit a game on a dvd, now you can barely fit one on your harddrive.

#### 1 user(s) are reading this topic

0 members, 1 guests, 0 anonymous users