Name: 編碼-自己動手!Sound Synthesizer #4 - Waveout API, Sequencing & Ducktales. (Code-It-Yourself! Sound Synthesizer #4 - Waveout API, Sequencing & Ducktales)
Uploaded: 2021-01-14T10:16:17.000Z
Duration: 24 min 5 s
Description: 【看影片學英語】數萬部 YouTube 影片，搭配英漢字典即點即查，輕鬆掌握單字發音與用法，長久累積看電影不必再看字幕。

未來進行式

You know, got headphones on, then we're gonna be doing a code of yourself synthesizer video.

情態助動詞 B1

And today I thought it would be useful to have a look behind the scenes that what's actually happening If you've seen any of the videos before and frankly, to enjoy this video to its fullest, you probably should have seen the others were going to be looking at this mystery file called olc noisemaker dot age, which is that file I've written, which runs in the background to handle the sound hard work.

You may also notice to be going on in the background, And that's because I've developed a sequence in which now uses the synthesizer tools that we've already developed to produce this structure drum pattern.

It's not a very complicated addition to the synthesizer, so I'll be including it at the end of the video.

If you have been using my synthesizer, you'll know that we always include the OLC noisemaker dot h file at the top, and this includes the utilities to talk to the sound card and produce sounds.

When we start programming with code it yourself synthesizer, the first thing we need to do is create an object off type.

Well see noisemaker, and we talked about this in the first video on.

We also said there were two magic numbers at the end.

I think it's time to understand what these two magic numbers are really for.

Let's take a step back from the code and actually consider how our code it yourself sound synthesizer delivers sound to the speakers so we can hear it.

And I'm going to emphasize that this is an ideal real time scenario.

The synthesizer produces samples in digital form here, 16 bit would to the sound cut, which is also known as a Dacko, a digital to analog converter.

And it takes our digital binary word here and converts a single point off amplitude in the way form in analog.

Now we saw all of this in video one, However, for all programmers, this contains two dreaded words.

Rial time, an ideal all the Windows desktop operating system.

There is no such thing as real time, so we always have to try and come up with hacks and workarounds to make it appear real time on when we're dealing with the real time.

In the real world, things can't be ideal.

Other things get in the way and stop is achieving what we want to achieve.

This is a more realistic look of the system as well as my synthesizer application.

I'm also competing for resources with windows were chrome and O.

B s visual studio and knowing my look and probably mining Bitcoin for some international agency, which means all of these things are competing for resources on the CPU.

And if you're using just the regular windows sound mapper as we are with the one lone code a synthesizer, one of the things we can't escape from is sound Israel Time on dhe for our synthesizer were typically you doing everything at 44,100 hertz.

This means were sending this many samples to the sound card per second.

If we don't match these timings, the sound will sound choppy and broken up or it'll sound speeded up like the chipmunks.

Let's start off with a really naive approach.

Andi, this timer is clocked to output the frequency that we need and the timer generates an interrupt which interrupts windows on windows, then goes away and collects all of the information it requires to generate a single sample to deliver to the sound driver.

Theoretically, there's nothing wrong with this approach.

We know the samples will be delivered in real time to the hard work.

Practically, though, it's disastrous for Windows together, all of the information it needs to produce the sample it has to interrogate all of the processes which are using that particular sound interface on.

We're instructing it to do this at 44,000 times a second.

That means Windows needs to do 44,000 contact switches where it interrupts the currently running process stores.

Its state launches the new process to where it was before, gathers the sound information stores that process back on, then goes on and on and on, and we're asking it to do this of all processes 44,100 times a second.

Time required to do these context, which is is actually quite significant.

We could do a quick calculation to see how much time does the CPU have to produce each sample.

In this case, it's no 0.2 milliseconds approximately so Windows has to manage all of these interruptions and data gathering within a not point to Miller second Window.

Well, the most obvious thing to do is reduce the number of interrupts.

It is something a little bit more manageable, say, 20 Hertz.

Clearly, in this situation now, we must deliver more than one sample in order to achieve our 44,100 output sampling rate.

Doing simple calculation, we can see we need to produce 2205 samples now per into ABS, and this is approximately 50 milliseconds worth of audio.

Creating a packet of audio is just better all round.

As each processes switched in, it can go away and generate 50 milliseconds worth of audio.

This will result in fewer RAM and catch mrs on Be more optimal regarding CPU resources, however, it's introduced now one important dynamic, and that's Leighton.

See, in this case, there will always be a 50 millisecond delay between the process or the synthesizer, in this case, out putting sound honors.

Hearing it on legacy management is quite important to deal with in most say, digital audio workstations.

It's probably okay to have a bit of Leighton see, but in real time situation say, whether Player is playing the keyboard, Leighton see becomes a big problem.

In fact, if he gets over 30 milliseconds, it's very difficult to play an instrument where you press the key and you have to wait 30 milliseconds before you hear the sound.

Your brain just can't reconcile that, so we always try name for late and cease to be as low as possible.

It's worth thinking about an embedded system for this approach, though, so digital instruments say keyboards or guitar effects probably do use this approach of having an interrupt at the sample frequency.

There's nothing else to interrupt the process from doing what it's doing in the real world.

See, we could modify a drawing a little bit here.

Now we don't need a timer because the sound driver can directly tell us when it's done with the sound.

However, drivers don't usually like working with shared memory in this way, so it's no good just having one block of samples here because the sound driver will be too busy sending that through the CPU into the D.

Windows will be wanting to fill it at the same time, so we need to have more than one block in this case.

What we actually would prefer is a cue off sample blocks.

And this is nice, because if we assume that the system is now two parts again like in our ideal system, this side is clocked fundamentally at the frequency that is required to output the sound.

Where is this side could be very variable, and it's quite a common technique to use a Q or buffer like this to cross timing domains.

We'll have a little disclaimer that this layout that I'm showing on the screen is quite an abstraction.

But I believe it gets the point across quite elegantly, that when we're crossing time domains like this, we need to think about how we handle the data, and there are things that we have to be careful off.

If we output too many blocks, we increase the Leighton see because each one of these blocks represents a fixed amount of time on effectively.

So if I press a key on the keyboard, there'll be a delay of how many blocks are waiting in this queue.

On the other hand, if I don't have enough blocks in this cube, I'm starving the sound driver of sound to actually produce.

We could use a synthesizer to explore these effects.

Now I can reveal what these two magic numbers are for.

So the 256 is the number of samples in a block on dhe the eighties.

How many blocks I'm going to make available to put in the queue.

If I go to our ideal real time scenario, I can assume there is one block with one sampling.

This will, of course, require 44,100 updates per second to produce real time sound.

And I apologize if you've just blown up your head phones what we heard.

There was just lots of clicks and pops, and that's because the sound card is significantly starved of data.

I'm going to start playing it again, and we can see here.

Let's go to the other end of the spectrum.

I'm going to say each of my blocks sustain 256 samples, but I'm going to have 1000 offline.

So that delay was the late and see about five seconds worth in this case.

So this arrangement makes it impossible to play any kind of live instrument.

And it's all about finding the correct balance between the Leighton See on the performance.

I found that these two numbers worked well for most of our applications.

I'm calculating laden t by looking at the whole time on the CPU time.

So for every sample that gets processed by a synthesizer, we know we use this detail invariable that you get in your make noise function.

That's according to the CPU, but we also have wall time, which is around us.

That's that's the natural order of the universe progressing through time.

In the ideal scenario, the two should be the same.

字幕列表影片播放

編碼-自己動手!Sound Synthesizer #4 - Waveout API, Sequencing & Ducktales. (Code-It-Yourself! Sound Synthesizer #4 - Waveout API, Sequencing & Ducktales)

relevant

approach

process

scenario