Category Archives: Unicode

A code point iterator adapter for C++ strings in UTF-8

As the last post in this series I’ve been writing on Unicode and UTF-8, I thought I would elaborate on an interesting idea I mentioned in my previous post. When discussing how a std::string object that stores UTF-8 text is …

Posted in C/C++, Character encoding, Unicode | 10 Comments

Using UTF-8 as the internal representation for strings in C and C++ with Visual Studio

In today’s long post, I’m going to explain the guidelines we follow at Retibus Software in order to handle Unicode text in Windows programs written in C and C++ with Microsoft Visual Studio. Our approach is based on using the …

Posted in C/C++, Character encoding, Unicode | 73 Comments

Character encodings and the beauty of UTF-8

我愛 UTF-8 أحِبّ In my previous blog post, I discussed what was needed to ensure that a web site uses the UTF-8 character encoding consistently. I thought I should write a post on why I think UTF-8 is superior to …

Posted in Character encoding, Unicode | 3 Comments

Thanks for signing up, Mr. González – Welcome back, Mr. González!

In my previous blog post, I mentioned my frustration with programs that unexpectedly fail at Unicode support and stumble over encoding issues. I thought I should write a post about this because it never ceases to amaze me how, …

Posted in Character encoding, Unicode, Web development