Urdu shift tables
Nimre Nepoh New member Username: Nimre Post Number: 1 Registered: 05-2021 |
Checking the Urdu shift tables at this link: https://en.wikipedia.org/wiki/GSM_03.38#Urdu_language_(Arabic_and_basic_Latin_scripts) I noticed that they are missing two characters: غ and ي. Do you know if there is any way to send them using GSM encoding?
Des - NowSMS Support Board Administrator Username: Desosms Post Number: 6286 Registered: 08-2008 |
Hi,

It has been quite a few years since I last reviewed 3GPP TS 23.038, which is the specification that defines SMS data coding. Shift tables were added to this standard beginning with 3GPP Release 8 (2008).

Shift tables are a way to support national language characters in SMS messages without requiring the entire message to be encoded as Unicode. If a message contains any character that requires Unicode encoding, the entire message becomes subject to a limit of 70 characters per message instead of the English/Latin limit of 160 characters. Shift tables are a way around this limitation. They do not quite bring the limit back up to 160, but the result is better than 70.

(For others reading this thread, see https://support.nowsms.com/discus/messages/1/70000.html for some additional info about SMS shift tables.)

I reviewed the specifications (found here: https://portal.3gpp.org/desktopmodules/Specifications/SpecificationDetails.aspx?specificationId=745), and it does not appear that there have been any additions or modifications to these shift tables since 3GPP Release 9 (2009), which is when the Urdu shift tables were first defined. The specifications are now up to 3GPP Release 16.

The following characters do not appear in the shift tables defined by this specification:

غ U+063A
ي U+064A

Unfortunately, there is limited space in a shift table. The authors of 3GPP TS 23.038 managed to fit 112 characters into the Urdu shift table, of which 80 can be encoded in 7 bits (the other 32 are each encoded as two 7-bit characters). For whatever reason, these two characters were not included.

The following shift tables were defined in 3GPP Release 8:

Turkish: Ğ ğ Ş ş İ ı ç
Spanish: ç Á Í Ó Ú á í ó ú

Release 8.1 added:

Portuguese: Á À Â Ã ª á à â ã É Ê é ê Í í Ó Ô Õ º ó ô õ Ú Ü ú ü ` ç ∞

Release 9 added:

Bengali
Gujarati
Hindi
Kannada
Malayalam
Oriya
Punjabi
Tamil
Telugu
Urdu

Regards,
Des
NowSMS Support
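To make the 160 / 70 / "somewhere in between" arithmetic above concrete, here is a minimal Python sketch. It is not NowSMS code, and the function names are hypothetical. It only models the GSM 7-bit default alphabet and its default extension table; the national language shift tables themselves (including the Urdu ones) are not modelled. The header sizes used in the examples (4 octets for one national language shift element, 7 octets for a locking plus a single shift element) are my reading of the user data header layout in 3GPP TS 23.040 and should be treated as an assumption.

```python
# Sketch only: GSM 7-bit default alphabet plus its default extension table.
# National language shift tables are NOT modelled here.
GSM7_BASIC = (
    "@£$¥èéùìòÇ\nØø\rÅåΔ_ΦΓΛΩΠΨΣΘΞÆæßÉ"
    " !\"#¤%&'()*+,-./0123456789:;<=>?"
    "¡ABCDEFGHIJKLMNOPQRSTUVWXYZÄÖÑÜ§"
    "¿abcdefghijklmnopqrstuvwxyzäöñüà"
)
GSM7_EXTENSION = "\f^{}\\[~]|€"  # each costs two septets (escape + character)

PAYLOAD_OCTETS = 140  # user data octets in a single SMS


def gsm7_septets(text):
    """Return the septet count in the default alphabet, or None if the
    text contains a character that the default alphabet cannot encode."""
    total = 0
    for ch in text:
        if ch in GSM7_BASIC:
            total += 1
        elif ch in GSM7_EXTENSION:
            total += 2
        else:
            return None
    return total


def septets_available(udh_octets=0):
    """7-bit characters that fit once a user data header of the given
    size (in octets, including its length byte) has been subtracted."""
    return (PAYLOAD_OCTETS - udh_octets) * 8 // 7


def ucs2_chars_available(udh_octets=0):
    """16-bit (UCS-2) characters that fit in the same payload."""
    return (PAYLOAD_OCTETS - udh_octets) // 2


print(septets_available())       # 160: plain GSM 7-bit message
print(septets_available(4))      # 155: assumed header with one shift element
print(septets_available(7))      # 152: assumed header with locking + single shift
print(ucs2_chars_available())    # 70: Unicode message
print(gsm7_septets("\u063a"))    # None: غ is not in the default alphabet
```

Under these assumptions, a message that uses a shift table gets roughly 152 to 155 septets rather than 160, and any character costing two septets reduces that further. A character that is in neither the default alphabet nor the selected shift tables, such as غ or ي here, pushes the whole message to the 70 character Unicode limit.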
Nimre Nepoh New member Username: Nimre Post Number: 2 Registered: 05-2021 |
Thanks, I thought so as well. I will try to find a solution.