GENWiki

Premier IT Outsourcing and Support Services within the UK

User Tools

Site Tools


archive:programming:eurochar
       The Care and Feeding of Foreign Characters
	Copyright 1991 by Alex Gross
   Accessing, displaying, and printing foreign characters

on a computer can become a vast subject, but I will attempt

to keep it within bounds. There are two main reasons why

this field is so complex. First, each of us would naturally

like to summon up almost any character at will in any size on

any kind of monitor, as though the computer could read our

mind. And we'd also like to print it in any style or color

on any printer, as though here too the printer could know

exactly what we wanted. At least that's what many people

would claim they want. But second and more important, the

hackers who cobbled the first computers together had no way

of sharing this breadth of desire. They were happy that the

machines worked at all, and they never really foresaw how

commonplace computers might become. They improvised their

terminology as they went along, borrowing electronic terms

from punchcard operators and telephone jargon from radio

hams. It was assumed that the only language anyone would

ever want to use on a computer was this weird form of

English, though only in capital letters (and even English was

an afterthought to numbers). This

necessarily meant that accents, cedillas, and cute little

Polish "l's" with tails only got put in as a highly

improvised afterthought in various inconsistent ways.

   And this is one reason why there are so many ways of

generating foreign characters today. What follows are

numerous devices–though by no means all of them–to coax

diacritics from your keyboard. Keep in mind, as we said last

month with Cyrillic, that there are trade-offs involved in

most of these solutions. The one you choose will vary

according to your time, knowledge, curiosity and resources.

Each method has advantages and drawbacks.

   1)  Get a Mac.  As noted before, this machine is well

equipped to handle all kinds of graphic characters, including

accents. Drawback: you won't have an IBM compatible PC.

We'll try to discuss this a bit later without getting drawn

into polemics. The following examples assume you are still

IBM- or clone-bound.

   2)  The WordPerfect Solution.  This approach, fairly

close to ideal provided you already use WordPerfect, is also

a good argument for choosing it as your first word processor.

You can either use the Format-Other-Language command, which

will allow you to print (but not view on screen) most

accented characters, or you

can send off for WP's Foreign Language Modules, which come in

twenty-one different languages. These include both Canadian

and continental French, Swiss and orthodox German, Brazilian

and Iberian Portuguese, and-three-count them–versions of

English: American, British, and Australian! Drawback: none,

unless you happen to be wedded to another word processor.

(NOTE: reviews of the French and Spanish modules are still

upcoming). (1)

   And now come the other IBM-based strategies:
   3)  The DOS Formula.  The mother of all solutions, but

not for the faint of heart. You will have to open your DOS

manual and read about not one but four relatively obscure

commands: Country, Code Page, Keyboard, and Mode. These will

have an impact on two of your computer's most sacred centers,

its CONFIG.SYS and AUTOEXEC.BAT files. I will not quote all

the rules here–they are too long, and besides all of our DOS

Manuals are different. But if you (or a more technical

friend) can handle this, then most of your accent problems

should be behind you. Drawbacks: the relative complexity of

the process, but also its near-finality: you can only use one

language at a time, and you will have to be or become adept

at the full foreign keyboard, with its AZERTY for QWERTY in

French and the reversal of "z" and "y"

in German. But once you have mastered this, it is behind you

and really works. You will also be able to keep separate

files for two or more languages, so you can switch between

them.

   4)  The ASCII Dodge.  This is a retail solution, useful

for small quantities of text, provided you are working in

straight ASCII mode or using an ASCII-based word processor.

In this realm each of the common accented letters is produced

by pressing the ALT key plus a specific number. Thus, ALT

plus 136 give you a lower-case "e" with a circumflex. ASCII

tables are widespread in the computer literature, and pop-up

table programs showing the full ASCII "set" on screen can be

found in the screen or print utility file libraries of most

bulletin boards. Drawbacks: this technique's slowness and

clumsiness of use is the lesser complaint. The main problems

involve not only making sure your printer can handle these

characters (alas, no substitute here for hands-on work over

your printer manual) but also transferring these characters

correctly in and out of ASCII and various word processor

formats, which are not identical. For instance, older

versions of MicroSoft Word assign a different character to

ALT plus 136. Michelle Battaglia provided a relatively

simple two-step search-and-replace solution at a recent NY

Circle meeting: search for all your circumflexed "e's" and

replace them with a unique code in Word Processor A, such as

"e6," then convert to ASCII. Pour the ASCII file into word

processor B and then search out all instances of "e6:" globally

replace them with the correct form for circumflexed "e."

   5)  COMMERCIAL PRODUCTS:  Here we come back to Fancy

Font (or Fancy Word), which I mentioned last month. It will

provide disks for French, Spanish, Greek, Russian, and a

generic "Germanic," as well as some weirder alphabets. (2)

But you still won't see accents on the screen. Typical font

sizes for the French and Spanish are 12, 14, 16, and 18

points in regular, italic, and bold, though no doubt the age

of lasers and postscript has now created a far wider range of

choice. Beware: when I received their very first Spanish

disks, the upside down question marks were floating in mid

line, and they simply doubled the "more-than" and "less-than"

signs to create quotation marks. Perhaps this has been

solved by now–ask before you order. Windows Write and Word

for Windows, with their Mac-like environment, are also likely

to handle these characters well, but they require powerful

systems to run at their best. .

   6)  SHAREWARE: weighing in here most impressively is a

new multilingual word processor called INTEXT. You can

download it free on many bulletin boards (most recent

version: INTEXT12.ZIP) or send away for a more complete

version from its creator. (3) It boasts several accompanying

files for many languages, though you can only use one at a time.

The file called "&EUROPE" contains keyboards for most of that

continent's tongues based on the latin alphabet and even

allows you to choose between QWERTY and AZERTY in French.

Other files are entitled Arabic, Urdu, Farsi, Greek, Russian,

Polish, and "Yugoslavian." You will not see the letters on

the screen, however, and I have to confess that unlike others

I haven't got it to print very much yet, either because of

the age of my printer or my aversion to changing my dip

switches again. The program comes up readily on my monitor

(which an earlier version didn't) and claims to work only

with 24-pin dot matrix printers, though this may have changed

by now. One other shareware program, a golden oldie called

CHIWRITER, is also worth mentioning. You will still find

older versions gratis on BBS's, but its creators have now

opted for respectability and put out a still modestly priced

commercial version. (4) It will give you Greek along with

lots of math and symbols, some accented characters, and a few

other typographical gismos. Its typefaces are pretty

rudimentary, but newer versions boast laserjet quality.

   HARDWARE SOLUTIONS:  These include not only messing with

dip switches (I always use the shirt-pocket hook end of a

ballpoint pen-top for these, though a toothpick works too)

but actual boards you can insert inside your machine. Both

Hercules and Rampage boast such boards with character

capability, but make sure your computer and all its

attachments are unplugged before you try opening the box! I

have no first hand experience of either board–can some of

our readers provide more details? Specific query: does

anyone know if the Duke University Language Toolkit still

exists–haven't heard about it for a while.

   As I said about Cyrillic, some translators imagine there

is simply no way to use foreign characters, but the real

wonder is the sheer number of options. If you are using yet

another method our readers could profit from, by all means

let us know, and we'll include it in further coverage of this

subject later this year. It would be helpful if you could

tell us what systems, printers, monitors, graphics cards, or

parent programs it works (or doesn't work) with.

   And now, since I've recommended the Mac so highly,  a

few words on the great Mac-PC debate. My own view is roughly

this: the Mac is already a fine machine and likely to get

even better. The newer versions are much cheaper, though not

if you want color. But if I were starting over again today,

I would still choose an IBM Clone for most (though not all)

purposes for three reasons:

   1)  Compatibility--seven out of eight personal computers

are still IBM's or clones.

   2)  Ultimate expense--software for the Mac is almost

always more expensive than PC Software, not even including:

   3)  Shareware--the vast range of software available for

the PC free or for nominal sums. It keeps being created and

improved at a staggering rate. There is probably as much new

shareware and public domain software available for the PC

every few months as the entire supply of Mac shareware. This

remarkable storehouse of the useful and playful in my opinion

ranks right up there with papyrus scrolls, illuminated

manuscripts, the first encyclopedias, radio, film and

television as one of humanity's major cultural achievements.

And it keeps on growing in front of us.

   One final thought--I hope I'm not sounding too flip or

smug about computers. If I advise you in passing to install

a language module or delve into DOS or open your machine and

plug in a board, I'm not for an instant suggesting that any

of these is necessarily easy to do, especially if you are

trying it for the first time. Some computer tasks are

intrinsically difficult, even for

highly skilled people, and no one should feel guilty if

things are sometimes hard going. But just as frequently such

procedures can actually be easy and usually need to be done

only once. If a particular task takes forever and bogs you

down in endless calls to Tech Support numbers, then it may

not be what you need, and perhaps another solution will work

better.

   (1)  Contact the WordPerfect Corporation, 1555 North

Technology Way, Orem, UT 84057, for more details. TEL:(800)

451-5151.

   (2)  FancyFont & FancyWord are available from SoftCraft,

Inc, 222 State Street, Madison, WI 53703, TEL: (608) 257-

3300.

   (3)  The latest INTEXT program with all language files

is available for $49 from Intex Software Systems

International, P.O. Box 3068, Stamford, CT 06905-0068. They

also have listings of programs for other languages, and some

of these will be reviewed in future articles.

   (4)  The last price I saw for the commercial version of

CHIWRITER was $79, but check with CHIWRITER, Horstmann

Software Design, P.O. Box 4544, Ann Arbor, MI 48106.

NOTE: This piece originally appeared in the May 1991

issue of the ATA Chronicle, published by the American Translators Association.

/data/webs/external/dokuwiki/data/pages/archive/programming/eurochar.txt · Last modified: 2001/11/08 10:27 by 127.0.0.1

Donate Powered by PHP Valid HTML5 Valid CSS Driven by DokuWiki