Back to Silas S. Brown's home page

Simplifying Unicode punctuation for SMS

Jump to the form

SMS "text messages" on mobile phones are frequently sent in either GSM 03.38 (a 7-bit ASCII-like code with some currency symbols etc) or Unicode. A single special character, such as a curved quotation mark, could cause otherwise-ASCII English text to be sent as Unicode, multiplying its size. This can be a problem if you want to include some text copied from a website or other source in your SMS. The Javascript-based browser tool below replaces common Unicode punctuation with ASCII, and removes diacritics and some formatting. (If you use a phone's browser, it will also attempt to place the result in the phone's SMS composer.)

Paste text into box and press

Length limits in some setups:SMS
 Transfer time of
slower networks
Single text (limit of very old phones and of UK landline "spoken text" 2018)1701606 seconds
Lyca/O2 UK limits sending (but not receiving) to (as of 2014)213430612 seconds
Limit of Sony Ericsson T100 (any longer are shown fragmented)640291836 seconds
Receiving limit apparently introduced by EE / T-Mobile early 2017 (longer can be falsely reported delivered)17113926012 minutes
Seems HTC/WM6.1 firmware bug can confuse delivery reports after80536012,2408 minutes
Limit of Samsung Galaxy S2 (Android 4.1)120804018,36012 minutes
Vodafone UK's limit 2013 (longer not sent)127850919,43113 minutes
Theoretical limit25517,08539,01525 minutes

All material © Silas S. Brown unless otherwise stated.
Android is a trademark of Google LLC.
HTC and Touch are trademarks of HTC Corporation.
Javascript is a trademark of Oracle Corporation in the US.
Samsung is a registered trademark of Samsung.
Sony Ericsson is probably a trademark of Sony Ericsson Mobile Communications AB.
Unicode is a registered trademark of Unicode, Inc. in the United States and other countries.
Vodafone is a trademark of Vodafone Group Plc.
Any other trademarks I mentioned without realising are trademarks of their respective holders.