Urdu shift tables

Urdu shift tables SearchSearch
Author Message
Nimre Nepoh
New member
Username: Nimre

Post Number: 1
Registered: 05-2021
Posted on Thursday, May 20, 2021 - 09:52 am:   

Checkiing Urdu shift tables in this link https://en.wikipedia.org/wiki/GSM_03.38#Urdu_language_(Arabic_and_basic_Latin_scripts) I noticed that they miss two characters: غ and ي. Do you know if there is any way to send them using gsm encoding?
Des - NowSMS Support
Board Administrator
Username: Desosms

Post Number: 6286
Registered: 08-2008
Posted on Thursday, May 20, 2021 - 09:27 pm:   

Hi,

It has been quite a few years since I last reviewed the specifications for 3GPP TS 23.038, which is the specification that defines SMS data coding. Shift tables were added to this standard beginning with 3GPP Release 8 (2008). These shift tables are a way to support national language characters in SMS messages without requiring the entire message to be encoded with Unicode characters. If a message contains any characters that require Unicode encoding, the entire message becomes subject to a 70 character limitation per message instead of the English/Latin 160 character limit. Shift tables are a way to overcome this limitation, which doesn't quite expand the limit to 160, but is better than 70. (For others reading this thread, see https://support.nowsms.com/discus/messages/1/70000.html for some additional info about SMS shift tables.)

I reviewed specifications (found here: https://portal.3gpp.org/desktopmodules/Specifications/SpecificationDetails.aspx?specificationId=745), and basically it does not appear that there have been any additions/modifications to these shift tables since 3GPP Release 9 (2009), which is when the Urdu shift tables were first defined. The specifications are now up to 3GPP Release 16.

The following characters do not appear in the shift tables defined by this specification:

غ U+063A
ي U+064A

Unfortunately, there is limited space in a shift table. The authors of 3GPP TS 23.038 managed to add 112 characters in the Urdu shift table, of which 80 of these characters can be encoded in 7 bits (the other 32 are encoded as two 7-bit characters). For whatever reason, these two characters were not included.

The following shift tables were defined in 3GPP Release 8:

Turkish: Ğ ğ Ş ş İ ı ç
Spanish: ç Á Í Ó Ú á í ó ú

Release 8.1 added:

Portuguese: Á À Â Ã ª á à â ã É Ê é ê Í í Ó Ô Õ º ó ô õ Ú Ü ú ü ` ç ∞

Release 9 added:

Bengali
Gujarati
Hindi
Kannada
Malayalam
Oriya
Punjabi
Tamil
Telugu
Urdu


Regards,

Des
NowSMS Support
Nimre Nepoh
New member
Username: Nimre

Post Number: 2
Registered: 05-2021
Posted on Monday, May 24, 2021 - 02:24 pm:   

Thanks, I thought so as well. Will try to find solution.

Add Your Message Here, or click here to start a new topic.
Post:
Bold text Italics Underline Create a hyperlink Insert a clipart image
Options: Automatically activate URLs in message
Action: