language agnostic - Are you fluent in Unicode yet? -


almost 5 years ago joel spolsky wrote article, "the absolute minimum every software developer absolutely, positively must know unicode , character sets (no excuses!)".

like many, read carefully, realizing high-time got grips "replacement ascii". unfortunately, 5 years later feel have slipped few bad habits in area. have you?

i don't write many international applications, have helped build many asp.net internet facing websites, guess that's not excuse.

so benefit (and believe many others) can input people on following:

  • how "get over" ascii once , all
  • fundamental guidance when working unicode.
  • recommended (recent) books , websites on unicode (for developers).
  • current state of unicode (5 years after joels' article)
  • future directions.

i must admit have .net background , happy information on unicode in .net framework. of course shouldn't stop differing background commenting though.

update: see this related question asked on stackoverflow previously.

since read joel article , other i18n articles kept close eye character encoding; , works if consistantly. if work in company standard use utf-8 , knows / work.

here interesting articles (besides joel's article) on subject:

a quote first article; tips using unicode:

  • embrace unicode, don't fight it; it's right thing do, , if weren't you'd have anyhow.
  • inside software, store text utf-8 or utf-16; say, pick 1 of 2 , stick it.
  • interchange data outside world using xml whenever possible; makes whole bunch of potential problems go away.
  • try make application browser-based rather write own client; browsers getting quite @ dealing texts of world.
  • if you're using else's library code (and of course are), assume unicode handling broken until proved correct.
  • if you're doing search, try hand linguistic , character-handling problems off understands them.
  • go off amazon or somewhere , buy latest revision of printed unicode standard; contains pretty need know.
  • spend time poking around unicode web site , learning how code charts work.
  • if you're going have serious work asian languages, go buy o'reilly book on subject ken lunde.
  • if have macintosh, run out , grab lord pixel's unicode font inspection tool. totally cool.
  • if you're going have down , dirty data, go attend 1 of twice-a-year unicode conferences. experts go , if don't know need know, you'll able find there knows.

Comments

Popular posts from this blog

c++ - How do I get a multi line tooltip in MFC -

asp.net - In javascript how to find the height and width -

c# - DataTable to EnumerableRowCollection -