Jump to content
  • entries
    945
  • comments
    4,956
  • views
    1,221,634

Roll over! Sit! Speak!


Nathan Strum

2,646 views

So here's the dillyo (word up)...

 

Since several of you regular readers (yes, you) just so happen to be 2600 programmers (who, I might add, are some of the most talented, creative, and staggeringly intelligent people on the face of the planet), I thought I'd ask for some programming help. And I figured it would be more appropriate to do it here, than clutter up the homebrew forums.

 

Now, by "help" I actually mean "do (most of) the programming for me". But I figured you already saw that coming - what with the staggering intelligence and all. I'm not operating under any delusions here of programming something entirely by myself. But, the goal of this little endeavor is to actually try and help programmers, by doing some drudge work for them (you).

 

Specifically, for the AtariVox.

 

avox-pic.jpg

 

My understanding of this little box is, that it takes quite a bit of trial-and-error to come up with decent sounding voices for it. And I'd like to see more homebrew games use it. It's a cool piece of gear. So I'd like to take a shot at doing some of that trial and error.

 

I've downloaded the AtariVox Programming Guide, the SpeakJet User Manual, and the PhraseALator Dictionary, and have a reasonable understanding of what you need to enter in as far as the voice data goes.

 

What I need, is a template.

 

Basically, a simple chunk of code (if there is such a thing) where I can just drop in the appropriate lines of code to create voices (and other AtariVox sounds), so I can then build a working binary, and load it into my Krokodile Cart, and test them. Then go back, modify and refine them, and eventually pass the results on to homebrew authors to use in their games.

 

Think of it as audible sprite drawing. :)

 

If anyone's interested (or even thinks this is a workable idea) let me know. For that matter, if it's not a workable idea, you should let me know that, too. ;)

33 Comments


Recommended Comments



I'd like to see an Mac AtariVox app, more than a GUI version of KrokCom. The current KrokCom works just fine (although a GUI version would be nice, someday).

 

One nice feature to add, that PhraseALator (likely) doesn't do, is generate 2600-ready code. I would think this wouldn't be too hard to do.

 

The key though, is in being able to access all of the sounds easily, and be able to quickly manipulate them (and their parameters) to make the voices sound somewhat-natural. That trial-and-error is what takes forever.

Link to comment
The key though, is in being able to access all of the sounds easily, and be able to quickly manipulate them (and their parameters) to make the voices sound somewhat-natural. That trial-and-error is what takes forever.

That's precisely why I thought that adapting John's 16-char demo to Bob's AtariVOX source would be useful, as mentioned here. The only issue is that the demo doesn't display quite right on real hardware, but I think John is working on that.

 

EDIT: answered by own question - the PhraseAlator can interface to the Avox via USB. So I assume the USB interface is a custom device. Will it ever be sold, and could it (in theory) allow management of the Savekey EEPROM data?

Link to comment
The key though, is in being able to access all of the sounds easily, and be able to quickly manipulate them (and their parameters) to make the voices sound somewhat-natural. That trial-and-error is what takes forever.

That's precisely why I thought that adapting John's 16-char demo to Bob's AtariVOX source would be useful, as mentioned here. The only issue is that the demo doesn't display quite right on real hardware, but I think John is working on that.

 

EDIT: answered by own question - the PhraseAlator can interface to the Avox via USB. So I assume the USB interface is a custom device. Will it ever be sold, and could it (in theory) allow management of the Savekey EEPROM data?

I believe that Richard H. is working on creating the USB device and David Galloway (djmips) is writing the software for it.

 

There also are, floating around (I have one), a serial interface that goes between the PC and the AVox. And I have used this to dump the EEPROM - in fact, I just used it a month or two ago when debugging Elevators Amiss.

Link to comment
I'd like to see an Mac AtariVox app, more than a GUI version of KrokCom. The current KrokCom works just fine (although a GUI version would be nice, someday).
agreed
One nice feature to add, that PhraseALator (likely) doesn't do, is generate 2600-ready code. I would think this wouldn't be too hard to do.
I like this idea
The key though, is in being able to access all of the sounds easily, and be able to quickly manipulate them (and their parameters) to make the voices sound somewhat-natural. That trial-and-error is what takes forever.
Link to comment

cd-w gave me a little 4K program so you can compose speech using a Cuttle Cart or Supercharger rather than relying on the USB interface. I also made an AtariVox header file for it that makes it easier to string allophones together. Let me know if this is useful.

 

Blog Post with Attachment

 

What I do is speak it myself the way I'd like it to sound, then adjust the parameters word by word until it matches how I say it. It works well for me. Riding the pitch is very important in naturalistic delivery. Although the AtariVox does a poor job with certain allophones (p in particular is almost inaudible) when you humanize the phrasing, it is more intelligible because your mind fills in the gaps.

 

The turnaround time between tweaking the bytes and hearing the change with the Cuttle Cart using this method is like 10 seconds. It's not that bad. It's like anything else, though. The more time you spend on it, the better the result.

 

BTW, I think text to speech is for chumps. If you want it to sound good you have to hand pick every allophone. There are just too many ways to pronounce things. TTS will always be a little off. I mean, with direct control over the allophones you can approximate accents like Corcodile Dundee or Shrek. It's just so much more powerful.

Link to comment

I guess now would be the time to post my hack of Supercat's 16-char demo that sends the on-screen data to the Avox (in hex.) Set the numbers with the joystick, and press Reset to talk. Note that this does not work quite right on real hardware (number display is off) but in Stella w/Avox it should work (though I have not tested it this way.)

Link to comment
Riding the pitch is very important in naturalistic delivery. Although the AtariVox does a poor job with certain allophones (p in particular is almost inaudible) when you humanize the phrasing, it is more intelligible because your mind fills in the gaps.

 

See here for a brilliant example of how delivery is more important than phonetic precision. With the right wording, the lesson of the story makes sense:

 

Mural: Yonder nor sorghum stenches shut ladle gulls stopper torque wet strainers.

Link to comment

Guest
Add a comment...

×   Pasted as rich text.   Paste as plain text instead

  Only 75 emoji are allowed.

×   Your link has been automatically embedded.   Display as a link instead

×   Your previous content has been restored.   Clear editor

×   You cannot paste images directly. Upload or insert images from URL.

Loading...
  • Recently Browsing   0 members

    • No registered users viewing this page.
×
×
  • Create New...