Sorry if multiple copies of this appear. The spam filter doesn't like my
choice of titles. I've tried a few variations so far.
I'm looking to develop a music editor/sequencer somewhat in the vein of
Cakewalk/Rosegarden, but looking more towards the future of MIDI and audio
capabilities. I've been thinking about this for a long time, and I think I
have enough of a plan now to make a go at it.
This is a rather long post, so I've divided it into four parts: Why Not
Rosegarden?, Project Overview, Design Goals, and End Notes.
Why Not Rosegarden?
Why not just join the Rosegarden devlopment team? While Rosegarden has a lot
of promise, it's design goal is different than what I'm thinking of. To put
it crudely, Rosegarden's goal is to be a better Cakewalk. This isn't meant to
be disrespectful to the Rosegarden developers. I like Cakewalk. (If you
don't, substitute CueBase, or whatever professional tool of that genre.)
Music is a language. While music itself is much more complex and vague (like
natural language) than a programming language such as C, I will use the
analogy of a programming editor. Rosegarden (and all similar tools) is like a
C editor. When it's mature, it will be a very good C editor. In addition, the
modularity and generalized approach that Rosegarden uses will make it
possible to also be a good C++ editor. I'm thinking of something more like a
multi-editor that will work for Perl or lisp. It will have it's own
limitations, so I'm not trying to create the be-all end-all of music editors.
But there are some fundamental things that Rosegarden will never be able to
accomplish. (An most users won't have any reason to want an editor that will
accomplish these things.)
The first involves the way meter and tempo are handled. There are several
pieces of 20th century music that incorporate 2 or more voices playing
simultaneously in 2 or more meters. An example that chokes every editor I
know of (though I'm told lilypond can do it with difficulty, but it has no
sequencing capability and isn't a WYSIWYG) is Bartok's "Music for Strings,
Percussion, and Celeste". I'm not saying that you can't create a Rosegarden
file that will play this piece. What I am saying is that the notation will be
extremely ugly. The issue is that not all of the voices are playing
simultaneously in the same meter. To notate this in all in one meter means
that some of the voices will look extremely complicated with myriads of ties
and accent marks.
Another interesting thing is to have 2 voices playing with different tempo.
No examples come to mind, but I can think of some cool effects that could be
achieve by having one voice race ahead of another. Again, Rosegarden could
play such a piece, but the notation would be ugly.
A third thing revolves around scales and tunings. Rosegarden has some plans
for different tunings and perhaps support for quarter tones. And I think,
again, that a Rosegarden tuning would be applied to an entire composition,
not a single instrument. But I'm thinking of things more general. Suppose
each track had an element called "tuning". Suppose also that each note is
stored as an integer in a note event (as it is in Rosegarden). The tunings
element contains a mapping between the note number given in the event and
what is actually played. Microtones and temperaments could be stored in the
tunings file as a MIDI note number plus a pitch bend. (Or we could store
frequencies, and some function converts frequencies to MIDI note numbers plus
pitch bends.) More novel things (such as octaves that aren't quite perfect
frequency multiples) could also be accommodated. The tunings file could also
be used to define scales that include added scale degrees. (e.g. for a
Pythagorean tuning, we have to include extra values, so we can notate, say,
an F## instead of a G, as these tones may not be equivalent in a non-equal
temperament.)
Rosegarden is trying to be fairly general about instrument definitions, which
is good. I'm thinking of somewhat different approach that looks to the future
of MIDI, in which we want to achieve more realism of the performance. Each
Track has this abstract thing called an Instrument. Instruments may be atomic
(a string) or composite (a guitar, which is composed of several atomic string
instruments). Let's suppose we use soundfonts. We could create a soundfont
with banks 1-6 for the strings of a guitar. Each single-string Instrument
gets mapped to one of the tone banks, and the guitar Instrument is composed
of these single-string Instruments.
In this way, it would be possible to achieve a playback in which an open D
chord sounds different than a 5th position D chord played a string lower,
just like on a real guitar. Instruments should have "modes" which can be
selected (and changed within the song). A guitar might have pickstyle
(perhaps which eventually get mapped to tonebanks 1-6) and fingerstyle (which
eventually get mapped to tonebanks 7-12). An organ could have modes for
various configurations of the stops. We could get very elaborate here and
have organs with multiple keyboards, etc., but I'm falling into a dreamy
state here. The idea is that the design should allow for this type of
functionality to be added.
So, how can we accomplish such a beast? Let's take a tour of what's in my
brain. No, no! Not over there! Don't open that door. That's the basement
where all the beastly details hide. Whatcha mean it's empty? Of course it's
empty. No, not that door either. That's the closet of confusion and
inconsistency. This way, this way. Here's the beautiful entry...
Project Overview
No pretty pictures, so you'll just have to imagine.
We begin with a main window. The document-view model is appropriate. We'll
have the typical views that you'd expect -- TrackView, ScoreView (StaffView),
MixerView, and some sort of EventView.
TrackView should look similar to that of Rosegarden, but we need to implement
it differently. The window needs to be some sort of split window, in which
each track gets its own pane. At the top, should be a time ruler which can
display in seconds, SMPTE, or beats/measures of a track. It should default to
displaying beats/measures of track 1. Each track pane displays its clips
similar to Rosegarden/Cakewalk. But it should hold a TrackGroup instead of a
single Track (Byt default each Track is in its own group). Additionally, it
should have a hideable ruler that displays time (default to beats/measures)
and tempo markings for that TrackGroup. This means that the TrackGroup data
structure needs to hold info about its meter and tempo. TrackGroups can be
expanded to show each track. They can also be hidden to reduce clutter.
TrackGroups can be nested (e.g. strings, violins, violin1). During playback,
big vertical ruler bar should follow the sequencer, similar to the appearance
in Rosegarden.
ScoreView should display like a score with staff notation. Tracks can be
grouped, and it should be possible to hide groupings. Expanded track
groupings should show each track. When compressed, a single staff with
multiple voices (1 for each track) should be shown. In this way, 4-part
harmony, drums, etc. can be displayed as is most convient to the user.
Tracks should be displayed with appropriate staves (single 5-line staff for
voice, optional tablature for stringed instruments, etc.) Non-MIDI tracks
such as wav audio should be displayed as a simple line or bar. We should
include typical musical symbols, dynamics, guitar tab symbols, lyrics,
section markers, and effects markers (things like echo, flange, etc.) Each
track should have an option to hide various things (e.g. hide the tablature,
lyrics, or effects markers).
Eventually, we should be able to print out sheet music.
The MixerView should be pretty typical, but again, with the ability to create
a nesting of sub-mixers. Each track should get its own effects rack.
Additionally, each sub-mixer (or track group) should get an effects rack.
This way, we can set effects for individual instruments. Then we can layer
another level of effects (say panning or delay) to a group. Finally, we can
add a layer for everybody (say reverb). As much as possible, we shouldn't
care whether we're dealing with wavs or MIDI. If we have a MIDI track, the
popup menu should only display MIDI effects. For aggregate groupings, only
effects that can be used on either MID or wave should be available. (Perhaps
we can take existing MIDI or audio only effects and put wrappers around them
that select which effect is actually used.) Supporting plug-in effects would
be very good. In the end, though, I think effects should be handled in the
same way as instruments. That is, we build complex effects out of simpler
components.
This leads me to instruments. Instruments and effects should be stored in
definition files. Then we can set up our instruments once and use them in
multiple compositions. Ditto for tunings.
Files should be stored as XML (probably compressed, as Rosegarden does).
Design Goals
1) Flexibility is paramount. A rigid desisn isn't going to work here. There
are too many unknowns and open-ended issues.
2) Modularity. Keep the pieces as independent as possible. Try to make
generic base classes that defer as much of the details as possible to derived
classes.
3) Begin with a generic framework. Determine how the system should operate.
Do we run the whole thing like an IDE for a programming language? How
separated should the editor and sequencer be? Do we edit, save to file, and
compile (sequence)? Or should the sequencer be more closely coupled, causing
the data structures in memory to be shared between editor and sequencer? The
first approach would make it difficult (or impossible) to do realtime mixing
of MIDI tracks, but it would provide for very efficient playback. All of the
MIDI tracks would be compiled to a MIDI file, so the on-the-fly processing
would be minimal. Efficient, but limiting. Perhaps it should run more like an
interpreter, processing the file on demand. A "mixdown" option could be
added, that would compile everything down for a final release. Then anyone
with the sequencer could play the file. (Analoguous to Windows Media Player,
in which you can play .avi files, but you can't create or edit them.)
4) Simple things should be simple. The default should be that every track
plays at the same tempo and in the same meter, instruments should map to
General MIDI bank, and equal-temperament tuning with standard key signatures
should be used. Recording, playback, and basic editing should be easy and
intuitive. The most common features and feature groupings should appear in
the menus. An advanced tab should be present to allow for lesser uses
features.
5) Don't lose sight of the big picture. The end goal is to have a product
that allows for audio and MIDI tracks that will play in sync. Other types of
tracks (such as video, or images) could be included as well.
6) Get something that's useful and provides basic functionality up and
running quickly. Add features step by step. A brilliantly conceived program
isn't worth much if it doesn't run.
7) I'm leaning toward C++ as the primary language, though it may be better to
use a scripting language such as Perl (which I know) or Python (which I don't
know) for certain portions. My most common methodology involves writing a
core engine in C++, then developing a simple scripting language (usually in
Perl) to access the features of the core. I'm not sure such an approach is
appropriate here, though.
End Notes for the Curious
I used to be a software designer and programmer. I've been stagnant for
several years now due to an illness that's taken away my presenability in job
interviews. (It's hard to make a good impression in today's world of
corporate fluff where appearance is more important than ability when you look
haggard, shake like an addict in detox, and have periods of brain fog that
never fail to appear in an interview.)
I don't know all that much about the down and dirty of audio programming. I
wrote a little media player tool with DirectX in Windows, a linux tool that
converts files between MIDI and XML (I should probably stick this out there
one SourceForge, as someone else might think it useful. How do I do that?),
and a few other little audio utilities.
I know a good deal about software design and maintenance, and I'm proficient
in c++ (and a few other languages as well). I've been project leader on a few
different pieces of software of substantial (100k-200k lines) size, with
teams of 10 people of less.
My degree is in applied mathematics.
For any university professors out there, I think this would be a great
master's (or phd) thesis. And I just might know a pretty good candidate. ;)