Re: [linux-audio-dev] jack_convolve-0.0.10, libconvolve-0.0.3 released

List overview All Threads
Download

newer

older

[linux-audio-dev] Why alsamixer...

[linux-audio-dev] [ANN] Snd-ls...

Chris Cannam

27 Jun 2005 27 Jun '05

12:25 p.m.

Florian Schmidt:

...

P.S.: I am also currently working on a qt app which uses libconvolve

Any interest in making a DSSI plugin? Chris

Show replies by date

Florian Schmidt

27 Jun 27 Jun

12:49 p.m.

New subject: [linux-audio-dev] jack_convolve-0.0.10, libconvolve-0.0.3 released

On Mon, 27 Jun 2005 14:24:36 +0000 GMT "Chris Cannam" <chris.cannam(a)ferventsoftware.com> wrote:

...

Florian Schmidt:

P.S.: I am also currently working on a qt app which uses libconvolve

Any interest in making a DSSI plugin?

There are some problems with wrapping partitioned convolution into a ladspa/dssi. One of them is that ladspa/dssi do not garantee a chunk size in which the processing needs to be done. Partitioned convolution, like many other fft based algos operates with a fixed chunk size. It basically looks like this: 0] initialization phase ("loading a response file"): split the response file into chunks. zero pad them and fft them. store the fft'ed chunks somewhere.. then for every chunk of audio: 1] zero pad and fft the input chunk 2] store the fft'ed chunk in a ringbuffer (which has space for as many chunks as the response has, too) 3] multiply the fft'ed input chunks from the input ringbuffer to the corresponding fft'ed response chunks and add all these. 4] ifft the result from step 3] 5] save overlap, add previous overlap So, the partition/chunk size is determined in the initialization phase and all processing later on happens in chunks of this size. Now ladspa/dssi make no garantees about the buffer size with which the processing is done. Thus there would need to be done some extra buffering. But to distribute the load (assuming partition size is bigger than ladspa/dssi processing size) it would be nessecary to either a] split the above algorithm into pieces (which is rather difficult due to the different phases each depending on the result of the previous phase) b] use threading Both methods are kinda ugly or work intensive. Another reason is that it is impossible to dynamically change the number of input/output ports of a ladspa/dssi which would restrict the loadable responses to stereo files if the ladspa/dssi has i.e. two outputs. this would then require seperate versions for different channel counts. All in all and as i am an energy saver (mostly my own though) i will refrain from implementing a ladspa/dssi plugin. libconvolve isn't soo hard to use though, so someone else might give it a shot. Contact me though. The api might still change abit in the future (as i'm adding/removing experimental features like for example looped responses). After a discussion with Mario Lang i think i will OSC'ify jack_convolve though and make the qt gui only an OSC controller for jack_convolve. Flo -- Palimm Palimm! http://affenbande.org/~tapas/

Chris Cannam

28 Jun 28 Jun

6:35 p.m.

New subject: [linux-audio-dev] jack_convolve-0.0.10, libconvolve-0.0.3 released

On Monday 27 Jun 2005 15:49, Florian Schmidt wrote:

...

Well, you could always suggest for DSSI to include a mechanism by which a plugin indicates that it must have a fixed block size. Most hosts could probably accommodate that, if they had to. You'd be in good company -- or at least, in company. dssi-vst, for example, will get audibly perturbed if the block size changes as it makes absolutely no attempt to buffer between the DSSI variable block size and the VST fixed one. At the moment this is an obvious bug in dssi-vst, but it could become a perfectly sensible feature with just a little tweak to the API...

...

After a discussion with Mario Lang i think i will OSC'ify jack_convolve though and make the qt gui only an OSC controller for jack_convolve.

It does seem a shame not to end up with a DSSI plugin as well, then, given that it would then have much the same structure already. Chris

Florian Schmidt

8:31 p.m.

New subject: [linux-audio-dev] jack_convolve-0.0.10, libconvolve-0.0.3 released

On Tue, 28 Jun 2005 21:38:32 +0100 Chris Cannam <cannam(a)all-day-breakfast.com> wrote:

...

It does seem a shame not to end up with a DSSI plugin as well, then, given that it would then have much the same structure already.

Yeah, you definetly are right. I wonder though: Why stop at garanteeing a fixed buffer size for the whole runtime. The thing with the partitioned convolution is that, when used purely as an effect for recorded material (i.e. not playing realtime through it, in a host that can compensate for plugin delay), then large buffers definetly are desirable. Even larger than i.e. the maximum period size of my soundcard (which is 2048 frames). So it would be cool, if the plugin could use a fixed buffer size which also differs from the buffer size used by the underlying audio system (i.e. jack). For this the host would have to do some extra work (setup an extra thread to process the plugin (with a slightly smaller priority than the jack audio callback thread for example, to even the load), ringbuffers to feed/consume data to/from it), latency compensation for tracks sent through it. Or should the plugin do this internally and simply report to the host that it needs a fixed buffer size (which then corresponds to the audio system's buffer size).. Are dssi/ladspa's allowed to do threading? Without i wouldn't know how to do it. And even if it were allowed to do threading, how would the dssi know which priorities to use, etc (on a RP kernel it should have prio higher than i.e. hd and net irq's, but lower than the jack audio thread). Plus i wonder whether the (then fixed) buffer size should be user configurable in any way or would the plugin simply report "16k frames is what i want" :) Sometimes it does make sense to use it in realtime mode (with the same buffer size as the audio system), if you have the cpu power or the responses are short enough. Regards, Flo -- Palimm Palimm! http://affenbande.org/~tapas/

Steve Harris

29 Jun 29 Jun

7:48 a.m.

New subject: [linux-audio-dev] jack_convolve-0.0.10, libconvolve-0.0.3 released

On Wed, Jun 29, 2005 at 12:31:49AM +0200, Florian Schmidt wrote:

...

On Tue, 28 Jun 2005 21:38:32 +0100 Chris Cannam <cannam(a)all-day-breakfast.com> wrote:

It does seem a shame not to end up with a DSSI plugin as well, then, given that it would then have much the same structure already.

...

Plus i wonder whether the (then fixed) buffer size should be user configurable in any way or would the plugin simply report "16k frames is what i want" :) Sometimes it does make sense to use it in realtime mode (with the same buffer size as the audio system), if you have the cpu power or the responses are short enough.

I dont think youre going to get host developers to buy this, its basically moving the same problem you have into thier space. There are most hosts than plugins that do partitioned convolution, so the best thing to do is hack it up in the plugin. IMNSHO. NB There is allready a LADSPA plugin that does partitioned convolution (imp, id:1199), it just doesnt guaranteee that it will have constant cpu load, and burst proceswses when its buffers are full. This is a reasonable approach and not hard to implement. If the plugin /hints/ ontop of this that running it with power of two sized buffers of at least 1024 frames (or whatever) is a good idea then the sitatuion is less painful for host developers, as they can safely ignore it. In practice you will find that most hosts dont divide up the buffers they get from JACK/ALSA, so choking slightly when you get non power of two buffers is not that big a gamble. - Steve

Benno Senoner

9:15 a.m.

New subject: [linux-audio-dev] Arbitrary bufsizes in plugins requiring power of 2 bufsizes, Was: jack_convolve-0.0.10, libconvolve-0.0.3 released

My suggestion is to handle buffering in the convolution plugin and accept any buffer size from the host. I'd do it without threading to ensure the lowest possible latency. For example: assume we run convolution at 512 samples. use a ringbuffer structure (eg like RingBuffer.h in LinuxSampler). http://cvs.linuxsampler.org/cgi-bin/viewcvs.cgi/*checkout*/linuxsampler/src… (the process() in the example below is just a pseudo plugin API , input and output is mono) process(float *input,float *output, int numframes) { if(numframes == 512) { convolve(input, output, 512); return; } ringbuffer->write(input, numframes); while(numframes >0) { if(ringbuffer->read_space() >= 512) { ringbuffer->read(temp_buf, 512); convolve(temp_buf, output, 512); // does convolution and writes to the output array numframes -= 512; } else { write_silence(output,512); numframes -= 512; } } } This approach has the advantage that if the host supplies the convolution plugin with 512 frames then the added latency due to buffering is zero since if(numframes == 512) then it calls convolve() and returns without messing with ringbuffers. Otherwise, with the above approach both the number of frames used in the convolver and number of frames supplied by the host can be completely arbitrary. Drawbacks of the approach: especially on high CPU usage plugins (and convolution IS cpu hungry, especially at low buffer sizes), since the host will run the process() callbacks in RT mode, CPU spikes could introduce xruns and other bad stuff. Assume a 512 frames convolution will take 80% of the CPU on a certain machine. At 44.1kHz 512 frames = 11msec. 80% of 11msec =9.2msec If we run the above code in a host enviroment that uses eg 256 frames (5.5msec buffers), the first time process() is called the >=512 condition is not satisfied and thus a 0 filled buffer is returned (silence). At the second process() call, the >=512 condition is satisfied (there are exactly 512 frames in the buffer). And the convolve() function is called, eating 9.2msec of CPU. Since 9.2msec > 5.5msec ... sh*t happens ... XRUN. If numframes supplied by the host is bigger than 512 then there are no CPU spike problems. For example if the host supplies 1024 frames, the above code would call convolve() 2 times outputting 1024 frames. (eating 2x9.2msec out of the 22msec available) It would be a bit inefficient because if the plugin knows that the host supplies at least 1024 frames then you could run the convolution at 1024 achieving greater efficiency. If the host guarantees that it always supplies the same number of frames then the convolver could adjust it's internal framesize to to achieve optimal CPU usage. If not then a scheme like the above one is unavoidable. Just for curiousity, does anyone know that's the current status of the variable/fixed buffer sizes scenarios supplied to plugins by hosts on various plugin platforms like VST, AU etc ? The above code does some memory bouncing (only when numframes supplied by the host does not match the number of frames used in the convolver): it first copies the input to the ringbuffer's own buffer and then back to temp_buf. So some memory bandwidth is wasted but I think as long as you don't run hundreds of convolution plugins (impossible on today's machines) the added overhead is negligible since convolution is so CPU heavy. I think with an approach like the above you achieve the best of both worlds, no added latency if the host calls the plugin with numframes = power of 2 (matching the internal convolver's buffer size), and some added latency if the host does not use powers of 2. Regarding the CPU spikes, if the convolver uses less than 50% of CPU then you can run the host with the half convolver's numframes without getting XRUNs. eg if the convolver uses 40% CPU at 512 frames then running it in a host with 256 frames then the convolver will still use an average of 40% CPU but it will experience 80% CPU spikes. (eg 80% 0% 80% 0% etc ...). This is not so good because if we want to add an other plugin that has a constant CPU usage of 40% which would lead to an average 80% CPU usage we can't because during the 80% CPU spike we have only 20% of CPU headroom left. Florian, since we would like to add convolution to LinuxSampler over time it would be cool if you could add the above ideas to libconvolve so that one can use the lib without worrying about supplying the right buffer sizes etc, and in plugin hosts enviroments it would be handy too since we don't always know what the host will do. cheers, Benno http://www.linuxsampler.org Florian Schmidt wrote

...

Or should the plugin do this internally and simply report to the host that it needs a fixed buffer size (which then corresponds to the audio system's buffer size).. Are dssi/ladspa's allowed to do threading? Without i wouldn't know how to do it. And even if it were allowed to do threading, how would the dssi know which priorities to use, etc (on a RP kernel it should have prio higher than i.e. hd and net irq's, but lower than the jack audio thread). Plus i wonder whether the (then fixed) buffer size should be user configurable in any way or would the plugin simply report "16k frames is what i want" :) Sometimes it does make sense to use it in realtime mode (with the same buffer size as the audio system), if you have the cpu power or the responses are short enough. Regards, Flo

Florian Schmidt

11:43 a.m.

New subject: [linux-audio-dev] Arbitrary bufsizes in plugins requiring power of 2 bufsizes, Was: jack_convolve-0.0.10, libconvolve-0.0.3 released

On Wed, 29 Jun 2005 13:20:31 +0200 Benno Senoner <sbenno(a)gardena.net> wrote:

...

assume we run convolution at 512 samples. process(float *input,float *output, int numframes) { if(numframes == 512) { convolve(input, output, 512); return; }

This has a subtle bug afaict. Let's assume the host called several process() with numframes != 512 first, then one with numframes == 512, i.e.: 1. 123 2. 432 3. 234 4. 512 The 4th process call disregards data already in the ringbuffer which has been put there by previous calls with numframes != 512. There needs to be an additional test for whether the ringbuffer is empty. in the case that the host uses constant buffer size this would work out alright though and is indeed identical to the behaviour with the additional test. I can imagine though that subdividing periods into smaller parts is very very useful especially when automating plugins (and having control data changes which are not at period boundaries), so the fact that the constant buffer size is not garanteed does make very much sense. But in the case of a plugin which operates with fixed buffer sizes (and uses internal buffering as described) this approach wouldn't make sense, as the control data change wouldn't have any effect anyways (or it would be nontrivial to hack it into the algo, even if it is possible (i.e. gain changes could take effect at non buffer size boundaries for partitioned convolution)). So i'm very much in favour of Chris' Proposal to add a hint that makes the host use the same buffer size all the time. I would actually be very much in favour to make this the default behaviour and timestamp control change events as it's done in VST [see below].. [snip]

...

the first time process() is called the >=512 condition is not satisfied and thus a 0 filled buffer is returned (silence). At the second process() call, the >=512 condition is satisfied (there are exactly 512 frames in the buffer). And the convolve() function is called, eating 9.2msec of CPU. Since 9.2msec > 5.5msec ... sh*t happens ... XRUN.

exactly.

...

If numframes supplied by the host is bigger than 512 then there are no CPU spike problems. For example if the host supplies 1024 frames, the above code would call convolve() 2 times outputting 1024 frames. (eating 2x9.2msec out of the 22msec available) It would be a bit inefficient because if the plugin knows that the host supplies at least 1024 frames then you could run the convolution at 1024 achieving greater efficiency. If the host guarantees that it always supplies the same number of frames then the convolver could adjust it's internal framesize to to achieve optimal CPU usage.

Right. That's why the hint suggested by Chris would be useful..

...

If not then a scheme like the above one is unavoidable. Just for curiousity, does anyone know that's the current status of the variable/fixed buffer sizes scenarios supplied to plugins by hosts on various plugin platforms like VST, AU etc ?

Afaik in VST [i heard it somewhere, no garantees about correctness] the plugin knows about the host's buffer size. And the plugin will always be called with that buffer size [dunno if power of two is garanteed, but it would be sensible]. Control params are timestamped and provided as a list of value/frame pairs for the current process buffer, so the need to subdivide the buffer for finer grained automation/etc is not given. At the cost of some extra work on the plugin part. Personally i like this approach better than the LADSPA-not-garanteed-buffer-size approach (but i am biased).

...

Florian, since we would like to add convolution to LinuxSampler over time it would be cool if you could add the above ideas to libconvolve so that one can use the lib without worrying about supplying the right buffer sizes etc, and in plugin hosts enviroments it would be handy too since we don't always know what the host will do.

Actually how to solve this problem is application specific: Are cpu spikes preferrable over context switches (which a threading solution (which would even the load) would require (plus some extra latency)? I mean i could add above (nonthreading) mechanism and provide an extra function call for it, but i'd rather not since it hides a fundamental aspect of the partitioned convolution which every user should be aware of and for which a different solution might be more suited to the application at hand.. Plus i personally don't like the cpu spikey non threading solution at all for exactly the reasons you mentioned ;) Flo -- Palimm Palimm! http://affenbande.org/~tapas/

Alfons Adriaensen

12:19 p.m.

New subject: [linux-audio-dev] Arbitrary bufsizes in plugins requiring power of 2 bufsizes, Was: jack_convolve-0.0.10, libconvolve-0.0.3 released

On Wed, Jun 29, 2005 at 03:43:39PM +0200, Florian Schmidt wrote:

...

There's another subtle bug here in that the plugin assumes that an output buffer of 512 samples is made available by the host, even if the host is using a fixed smaller period size. This is a hidden form of buffering at the output, and in fact, when your plugin algo uses a fixed block size internally but you want to call it with another value, fixed or not, you will always need buffering at both the input and the output. I wrote a convolver library / JACK app similar to Florian's at about the same time (which is why it was never released). Main differences are that the API is a bit more general, it's C++, and it has the required I/O buffering built-in right into the data structures of the convolver engine, so there is no extra overhead in copying. The API it such that the extra delay can be easily avoided if the conditions permit it. -- FA

Benno Senoner

1:39 p.m.

New subject: [linux-audio-dev] Arbitrary bufsizes in plugins requiring power of 2 bufsizes, Was: jack_convolve-0.0.10, libconvolve-0.0.3 released

Alfons Adriaensen wrote:

...

On Wed, Jun 29, 2005 at 03:43:39PM +0200, Florian Schmidt wrote:

There's another subtle bug here in that the plugin assumes that an output buffer of 512 samples is made available by the host, even if the host is using a fixed smaller period size.

amazing how many bugs a few lines of code can have :) You are completely right. I forgot the issue since I concentrated on the input buffering side but forgot that you can output only numframes supplied to the process() function. So a 2nd output ringbuffer buffer would be required.

...

This is a hidden form of buffering at the output, and in fact, when your plugin algo uses a fixed block size internally but you want to call it with another value, fixed or not, you will always need buffering at both the input and the output. I wrote a convolver library / JACK app similar to Florian's at about the same time (which is why it was never released). Main differences are that the API is a bit more general, it's C++, and it has the required I/O buffering built-in right into the data structures of the convolver engine, so there is no extra overhead in copying. The API it such that the extra delay can be easily avoided if the conditions permit it.

Nice. It would be cool if you and Florian could join forces to make such a general convolution lib which contains the goodies described in this thread like no added latency if the host buffer sizes matches the plugin's buffer size and provides the automatic input and output buffering in case of odd frame sizes. Given the ever increasing CPU power, convolution reverbs are probably going to become the common way to do high quality reverbs and such a convolution lib would avoid to have several developers reinventing the wheel all over again committing the same bugs etc. thanks, Benno

Alfons Adriaensen

2:32 p.m.

New subject: [linux-audio-dev] Arbitrary bufsizes in plugins requiring power of 2 bufsizes, Was: jack_convolve-0.0.10, libconvolve-0.0.3 released

On Wed, Jun 29, 2005 at 05:45:12PM +0200, Benno Senoner wrote:

...

So a 2nd output ringbuffer buffer would be required.

Yes. In my lib, the two ringbuffers are part of the convolver, and the FFT and MAC operations operate directly on them. The API to write/read them is similar to ALSA's memory mapped buffer interface using *_prepare(), *_pointer() and *_commit() calls. This is AFAICT the most flexible interface you can have to circular buffers, enabling direct access. To disable the added delay when using a period size N equal to the convolver's, just call output_commit(N) once before starting the loop.

...

It would be cool if you and Florian could join forces to make such a general convolution lib which contains the goodies described in this thread like no added latency if the host buffer sizes matches the plugin's buffer size and provides the automatic input and output buffering in case of odd frame sizes.

Diversity is a good thing IMHO, and it's probably not bad to have both C and C++ versions. Both are (or will be in my case) GPL, so they will cross-fertilise anyway.

...

Given the ever increasing CPU power, convolution reverbs are probably going to become the common way to do high quality reverbs and such a convolution lib would avoid to have several developers reinventing the wheel all over again committing the same bugs etc.

For 'natural' reverb that will be the case. For 'effect' reverb, probably an algorithmic system wil provide much more flexibility and parameters. But you could even combine the two. The main problem with convolution based reverb is to obtain high quality impulse responses. It's not technical, but a matter of organisation, and of obtaining the permission to capture them and make them publicly available. WAVES has a number of very good ones (captured by Angelo Farina), but you first have to buy their reverb software to get access to them, even if most of them were recorded at places sponsored by public money (opera houses and churches). -- FA

Florian Schmidt

2:02 p.m.

New subject: [linux-audio-dev] Arbitrary bufsizes in plugins requiring power of 2 bufsizes, Was: jack_convolve-0.0.10, libconvolve-0.0.3 released

On Wed, 29 Jun 2005 16:19:34 +0200 Alfons Adriaensen <fons.adriaensen(a)alcatel.be> wrote:

...

I wrote a convolver library / JACK app similar to Florian's at about the same time (which is why it was never released). Main differences are that the API is a bit more general, it's C++, and it has the required I/O buffering built-in right into the data structures of the convolver engine, so there is no extra overhead in copying. The API it such that the extra delay can be easily avoided if the conditions permit it.

What other goodies does it have? What do you mean by the API being "a bit more general"? Different data types, etc? I'd say, dump mine if it weren't being simple C (which is sometimes preferable over C++). Flo -- Palimm Palimm! http://affenbande.org/~tapas/

fons adriaensen

5:16 p.m.

New subject: [linux-audio-dev] Arbitrary bufsizes in plugins requiring power of 2 bufsizes, Was: jack_convolve-0.0.10, libconvolve-0.0.3 released

On Wed, Jun 29, 2005 at 06:02:24PM +0200, Florian Schmidt wrote:

...

What other goodies does it have?

Nothing special. I'd like to add a mode using multiple block sizes for minimal delay. This is not trivial and requires multiple threads as well.

...

What do you mean by the API being "a bit more general"?

Mainly the I/O buffering that is included. Also it's designed to be dynamic, i.e. you can add/remove responses to/from the I/O matrix while it's running. No Xfading yet, but that would be relatively easy to add. (by 'a bit more' I mean just that, not 'much more'.) There are two classes: - Convdata : Contains a partitioned and already FFT'ed impulse response, ready for use by - Convolver : Performs convolution on N inputs to M outputs, defined by an N * M array of Convdata *.

...

I'd say, dump mine if it weren't being simple C (which is sometimes preferable over C++).

Don't !!! And yes, sometimes you would prefer C. -- FA

Benno Senoner

6:15 p.m.

New subject: [linux-audio-dev] Arbitrary bufsizes in plugins requiring power of 2 bufsizes, Was: jack_convolve-0.0.10, libconvolve-0.0.3 released

fons adriaensen wrote:

...

On Wed, Jun 29, 2005 at 06:02:24PM +0200, Florian Schmidt wrote:

What other goodies does it have?

Nothing special. I'd like to add a mode using multiple block sizes for minimal delay. This is not trivial and requires multiple threads as well.

Another very useful feature would be tail extension (combine convolution with traditional reverb processing to lighten the CPU load) one commercial product that implements it: read the "Lighten the load on your CPU" section: http://www.tascam.com/Products/GigaPulse.html That way you can run for example a softsampler with more reverb instances (for multi part setups) and lower latencies (thus playable in real time) without killing the CPU. I'm not expert on the matter .... Fons, Florian do you think such a cpu load decreasing trick (without any/only a little audio degradation) would be doable without too much troubles ? cheers, Benno

fons adriaensen

8:18 p.m.

New subject: [linux-audio-dev] Arbitrary bufsizes in plugins requiring power of 2 bufsizes, Was: jack_convolve-0.0.10, libconvolve-0.0.3 released

On Wed, Jun 29, 2005 at 10:20:47PM +0200, Benno Senoner wrote:

...

Another very useful feature would be tail extension (combine convolution with traditional reverb processing to lighten the CPU load)

I don't think this requires special support from the convolution engine - it's jsut an application of it. Suppose you have the start of a reverb response: (imagine the y-scale is logarithmic) |\ | \ | | | | | | --------------------------- and you feed back the output with the correct gain and delay, then the result will be a complete exponential decay. This is too simplistic, and will probably give a 'echo' like character to the reverb, but that's the principle. It should be applied to the first part of the reverb 'tail' only, not the early reflections. But you only need a short convolution. -- FA

Florian Schmidt

1 Jul 1 Jul

2:28 p.m.

New subject: [linux-audio-dev] Arbitrary bufsizes in plugins requiring power of 2 bufsizes, Was: jack_convolve-0.0.10, libconvolve-0.0.3 released

On Wed, 29 Jun 2005 23:07:24 +0200 fons adriaensen <fons.adriaensen(a)skynet.be> wrote:

...

Another very useful feature would be tail extension (combine convolution with traditional reverb processing to lighten the CPU load)

To avoid the echo like character one could do some preprocessing. Take a chunk of the tail and make it constant volume (if it is decaying exponential, this should be possible). Then fiddle some more with it to make it loop cleanly. Then use that as response and apply gain and do the feedback delay thing. This would allow arbitrary envelopes on the tail. Regards, Flo -- Palimm Palimm! http://affenbande.org/~tapas/

Benno Senoner

29 Jun 29 Jun

1:31 p.m.

New subject: [linux-audio-dev] Arbitrary bufsizes in plugins requiring power of 2 bufsizes, Was: jack_convolve-0.0.10, libconvolve-0.0.3 released

Florian Schmidt wrote:

...

On Wed, 29 Jun 2005 13:20:31 +0200 Benno Senoner <sbenno(a)gardena.net> wrote:

assume we run convolution at 512 samples. process(float *input,float *output, int numframes) { if(numframes == 512) { convolve(input, output, 512); return; }

This has a subtle bug afaict. Let's assume the host called several process() with numframes != 512 first, then one with numframes == 512, i.e.:

yes ! I was about to post it but you were faster !

...

1. 123 2. 432 3. 234 4. 512 The 4th process call disregards data already in the ringbuffer which has been put there by previous calls with numframes != 512. There needs to be an additional test for whether the ringbuffer is empty. in the case that the host uses constant buffer size this would work out alright though and is indeed identical to the behaviour with the additional test.

yes basically the code shoud be: process(float *input,float *output, int numframes) { if(ringbuffer->read_space() == 0 && numframes == 512) { convolve(input, output, 512); return; } ringbuffer->write(input, numframes); while(numframes >0) { if(ringbuffer->read_space() >= 512) { ringbuffer->read(temp_buf, 512); convolve(temp_buf, output, 512); // does convolution and writes to the output array numframes -= 512; } else { write_silence(output,512); numframes -= 512; } } }

...

I can imagine though that subdividing periods into smaller parts is very very useful especially when automating plugins (and having control data changes which are not at period boundaries), so the fact that the constant buffer size is not garanteed does make very much sense. But in the case of a plugin which operates with fixed buffer sizes (and uses internal buffering as described) this approach wouldn't make sense, as the control data change wouldn't have any effect anyways (or it would be nontrivial to hack it into the algo, even if it is possible (i.e. gain changes could take effect at non buffer size boundaries for partitioned convolution)). So i'm very much in favour of Chris' Proposal to add a hint that makes the host use the same buffer size all the time. I would actually be very much in favour to make this the default behaviour and timestamp control change events as it's done in VST [see below].. [snip]

exactly.

Right. That's why the hint suggested by Chris would be useful..

I agree.

...

Yes CPU spikes are always bad and that's why host and plugin should try to use the same buffer sizes preferably powers of 2 so that FFT based algorithms etc can work optimally. And in the case of VST as you said the host can always pass timestamped events to the plugin to provide sample accurate parameter changes without resorting to odd buffer sizes. My approach above is mainly useful if the host's buffer size is >= the convolution size since it does not introduce threads and offers the lowest possible latency without requiring low latency intra-thread communication. cheers, Benno

7556

days inactive

7560

days old

linux-audio-dev@lists.linuxaudio.org

Manage subscription

15 comments

7 participants

tags (0)

participants (7)

Alfons Adriaensen
Benno Senoner
Chris Cannam
Chris Cannam
Florian Schmidt
fons adriaensen
Steve Harris