B4J Library [B4A/B4J] UniversalDetector - Encoding (charset) detector

Based on: juniversalchardet v2.4.0 (14. November 2020.)
Detect encoding (charset) of any kind of text based file or string.
juniversalchardet is a Java port of "universalchardet", that is the encoding detector library of Mozilla.

EDIT: I have also tested on Android and library work just fine.

Encodings that can be detected:
  • Chinese
    • ISO-2022-CN
    • BIG-5
    • EUC-TW
    • HZ-GB-2312
    • GB-18030
  • Cyrillic
    • ISO-8859-5
    • KOI8-R
    • WINDOWS-1251
    • MACCYRILLIC
    • IBM866
    • IBM855
  • Greek
    • ISO-8859-7
    • WINDOWS-1253
  • Hebrew
    • ISO-8859-8
    • WINDOWS-1255
  • Japanese
    • ISO-2022-JP
    • Shift_JIS
    • EUC-JP
  • Korean
    • ISO-2022-KR
    • EUC-KR
  • Unicode
    • UTF-8
    • UTF-16BE / UTF-16LE
    • UTF-32BE / UTF-32LE / X-ISO-10646-UCS-4-3412 / X-ISO-10646-UCS-4-2143
  • Others
    • WINDOWS-1252
    • US-ASCII
 

Attachments

  • jUniversalDetectorLibrary.zip
    207.5 KB · Views: 411
  • jUniversalDetectorExample.zip
    2.4 KB · Views: 360
  • AndroidExample.zip
    9.5 KB · Views: 328
Last edited:
Top