language agnostic - Are you fluent in Unicode yet? -
almost 5 years ago joel spolsky wrote article, "the absolute minimum every software developer absolutely, positively must know unicode , character sets (no excuses!)".
like many, read carefully, realizing high-time got grips "replacement ascii". unfortunately, 5 years later feel have slipped few bad habits in area. have you?
i don't write many international applications, have helped build many asp.net internet facing websites, guess that's not excuse.
so benefit (and believe many others) can input people on following:
- how "get over" ascii once , all
- fundamental guidance when working unicode.
- recommended (recent) books , websites on unicode (for developers).
- current state of unicode (5 years after joels' article)
- future directions.
i must admit have .net background , happy information on unicode in .net framework. of course shouldn't stop differing background commenting though.
update: see this related question asked on stackoverflow previously.
since read joel article , other i18n articles kept close eye character encoding; , works if consistantly. if work in company standard use utf-8 , knows / work.
here interesting articles (besides joel's article) on subject:
- http://www.tbray.org/ongoing/when/200x/2003/04/06/unicode
- http://www.tbray.org/ongoing/when/200x/2003/04/26/utf
a quote first article; tips using unicode:
- embrace unicode, don't fight it; it's right thing do, , if weren't you'd have anyhow.
- inside software, store text utf-8 or utf-16; say, pick 1 of 2 , stick it.
- interchange data outside world using xml whenever possible; makes whole bunch of potential problems go away.
- try make application browser-based rather write own client; browsers getting quite @ dealing texts of world.
- if you're using else's library code (and of course are), assume unicode handling broken until proved correct.
- if you're doing search, try hand linguistic , character-handling problems off understands them.
- go off amazon or somewhere , buy latest revision of printed unicode standard; contains pretty need know.
- spend time poking around unicode web site , learning how code charts work.
- if you're going have serious work asian languages, go buy o'reilly book on subject ken lunde.
- if have macintosh, run out , grab lord pixel's unicode font inspection tool. totally cool.
- if you're going have down , dirty data, go attend 1 of twice-a-year unicode conferences. experts go , if don't know need know, you'll able find there knows.
Comments
Post a Comment