The reason for using floats is, as you said, that their huge dynamic range far
exceeds human hearing or any reproduction hardware, but that range is handy
for internal computations. Floats are practically impossible to clip, and
even very quiet signals get at least 24 significant bits. Many filters
and effects can have internal stages at a level very different from the
signal.
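As a quick illustration (a sketch of my own, not Ardour or jack code):
scaling a float by an exact power of two changes only the exponent, so a
quiet signal keeps the same 24-bit significand as a loud one.

    #include <math.h>
    #include <stdio.h>

    int main(void)
    {
        float loud  = 0.123456789f;
        float quiet = ldexpf(loud, -20);  /* exactly loud * 2^-20, ~ -120 dB */
        int e_loud, e_quiet;
        float m_loud  = frexpf(loud,  &e_loud);
        float m_quiet = frexpf(quiet, &e_quiet);

        /* Both lines print the same significand; only the exponent differs. */
        printf("loud : %.9g * 2^%d\n", m_loud,  e_loud);
        printf("quiet: %.9g * 2^%d\n", m_quiet, e_quiet);
        return 0;
    }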
The reason Ardour uses floats is that it's what jackd uses, so it's
the most straightforward way to record losslessly. Jackd uses floats because
it assumes there may be several processing stages between the input and
output. For example, a given channel may be recorded from a microphone,
go through a compressor, some reverb, and a gain stage, then be mixed with
several other channels. It makes more sense to perform the integer ->
float conversion once after the A->D, and the float -> integer conversion
once before the D->A, than to perform a conversion between each linked
stage. The conversion is not necessarily lossless, so each one has some
cost in noise and distortion.
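The one-time conversion looks something like this (a minimal sketch with
names of my own invention, not JACK's or Ardour's code; a real converter
would also round and dither rather than truncate):

    #include <stdint.h>

    #define SCALE_24 8388608.0f                 /* 2^23 */

    static inline float s24_to_float(int32_t s)
    {
        return (float)s / SCALE_24;             /* [-2^23, 2^23) -> ~[-1, 1) */
    }

    static inline int32_t float_to_s24(float f)
    {
        if (f >=  1.0f) return  8388607;        /* the float never clipped,    */
        if (f <= -1.0f) return -8388608;        /* but the integer output must */
        return (int32_t)(f * SCALE_24);
    }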
Jack isn't the only software using floats; I know at least Apple's Core
Audio uses floats end-to-end too. LADSPA plugins use floats on inputs
and outputs by specification, which I think predates jack. I imagine
floats are used internally by a good many effects that have integer
inputs and outputs, and I seem to recall some DAWs touting their
(sometimes double-precision) floatiness as a feature.
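To make the LADSPA point concrete: the header defines the sample type as a
plain 32-bit float, so a plugin's processing loop is all floats. The toy
gain function below is my own sketch, not one from the SDK.

    typedef float LADSPA_Data;   /* the actual typedef from ladspa.h */

    static void run_gain(const LADSPA_Data *in, LADSPA_Data *out,
                         LADSPA_Data gain, unsigned long nsamples)
    {
        for (unsigned long i = 0; i < nsamples; i++)
            out[i] = in[i] * gain;   /* no overflow or clipping in float */
    }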
On Wed, Aug 01, 2007 at 02:18:14PM -0700, Gregory Alan Hildstrom wrote:
Yes, I understand that any single 32-bit floating point number occupies
32 bits. I am specifically talking about sets of numbers and the number of
bits actually used to represent the set. My basic thought is summarized in
this example:
If you limit your range of possible values to positive numbers, the sign bit
is useless to you because it is always 0. This is true because of how you are
using the variable, not because of the variable's implementation. The sign
bit will still be stored in the float variable, but it is wasted space and
precision if you do not use it. Your data could be represented with 31 bits
instead of 32. Obviously wav data is signed, but this illustrates my point.
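For reference, this is the 32-bit layout I am talking about, pulled apart in
C (a quick sketch of mine, not from any particular program):

    #include <stdint.h>
    #include <string.h>
    #include <stdio.h>

    int main(void)
    {
        float x = 0.5f;
        uint32_t bits;
        memcpy(&bits, &x, sizeof bits);            /* view the raw bits */
        printf("sign     = %u\n", bits >> 31);             /*  1 bit  */
        printf("exponent = %u\n", (bits >> 23) & 0xFF);    /*  8 bits */
        printf("mantissa = 0x%06X\n", bits & 0x7FFFFF);    /* 23 bits */
        return 0;
    }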
I saved a 32-bit 44.1kHz 2-channel wav file using Audacity. The left channel
was a full-amplitude sine wave sweep from 20Hz to 20kHz. The right channel
was a full-amplitude square wave sweep from 20kHz to 20Hz. The exponent
portion of a float variable is 8 bits, which can represent 0-255 if treated
as an unsigned integer. I added a line of code to j2kaudio to print out the
exponent of each float sample while reading the wav file, which I redirected
to a file and plotted with gnuplot. The exponents ranged from 106 to 127, a
span of 21 (22 distinct values), which can be represented with 5 bits
(2^5=32).
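The code I added was essentially this (a sketch; the actual code in j2kaudio
differs in detail):

    #include <stdint.h>
    #include <string.h>
    #include <stdio.h>

    /* Print the biased 8-bit exponent field of each float sample. */
    static void print_exponents(const float *samples, unsigned long n)
    {
        for (unsigned long i = 0; i < n; i++) {
            uint32_t bits;
            memcpy(&bits, &samples[i], sizeof bits);
            printf("%u\n", (bits >> 23) & 0xFF);   /* 0..255 */
        }
    }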
I would argue that my wav file, saved with Audacity and ranging from -1 to
+1, could be saved with a 5-bit exponent instead of an 8-bit exponent. Each
32-bit float value occupies 32 bits, but at most 32-8+5=29 bits needed to be
flipped in my wav file; I did not examine the mantissa.
I then tried to create another 32-bit wav file with a greater dynamic range.
I created a 1Hz sine wave at full amplitude, then used the amplify tool
(-50dB) to reduce its peak amplitude to -600dB, which is an incredibly tiny
signal. I then added a second channel with a full-amplitude sine wave. The
small signal would be totally inaudible next to the normal signal. The
exponent range was 0-10 for the incredibly small sine wave values near 0,
and 106-127 for the full-amplitude sine wave. The total range I was able to
create with Audacity was 0-127, still 1 bit shy of an 8-bit exponent, but
those additional exponent values I manufactured were only used to define
incredibly tiny numbers near the zero crossing.
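For what it's worth, here is my back-of-the-envelope estimate (a sketch, not
measured data) of which exponent field a given peak level lands in; the
samples of any sine sweep far below its peak near each zero crossing, which
is where the near-0 fields come from:

    #include <math.h>
    #include <stdio.h>

    int main(void)
    {
        double db[] = { 0.0, -120.0, -600.0 };
        for (int i = 0; i < 3; i++) {
            double amp = pow(10.0, db[i] / 20.0);   /* peak amplitude  */
            int e = (int)floor(log2(amp)) + 127;    /* biased exponent */
            if (e < 0) e = 0;                       /* denormal range  */
            printf("%7.1f dB -> amplitude %g -> exponent field ~%d\n",
                   db[i], amp, e);
        }
        return 0;
    }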
I guess I do not understand why I would use such a high-dynamic-range
variable (32-bit float) and artificially limit its range to +/-1 (31 bits).
I also do not understand why such a high-dynamic-range variable is necessary
for audio reproduction given the limits of human hearing. I do understand
why you might want to use it for various computations. I do not know if this
is an Audacity-specific limitation or standard industry practice.

These are just a couple of my hangups that I did not do a very good job of
explaining before. Please clue me in if I am missing something.
Thanks. -Greg