Your post was very long and my response will not directly address what you wrote. I am quoting something I wrote a couple of years ago about interfaces for blind people. It makes a single, but I think critical, point.
-------------
years ago someone paid me to do a text-based UI for ardour. it was centered on very efficient use of the keyboard and using a screen-reader.
the code probably still exists. i don't think it was very successful, partly for the reasons identified in the text you sent. but i think there is a more important reason.
working with audio tends to involve the use of the screen to act as a kind of memory. there are a ton of parameters in play, and it's a huge barrier if you constantly need to remember what they are all set to. the 2d expanse of the screen represents a kind of 2nd level cache of this information, where a sighted person can simply glance around and discover what they need to know about the current state of things. reproducing this functionality without the information-dense medium that the screen represents is a HUGE challenge. i've thought about it on and off ever since the "ksi" interface for ardour was done. i have no ideas on how anyone could make progress on this. i think it's a very interesting, very, very hard problem. i have no time to work on it.
as a practical note, if someone wants to do something like this, basing their efforts on an open source tool is quite likely to offer a lot of possibilities that are simply not available when using closed source tools.
-----------------