Skip to main content



      Home
Home » Modeling » TMF (Xtext) » Possible Unicode Bug - Greek Polytonic(Although acute accented letters included in UTF8 range '\u1F00'..'\u1FFF' for Polytonic Greek, XText shows error )
Possible Unicode Bug - Greek Polytonic [message #894157] Fri, 06 July 2012 21:55 Go to next message
Eclipse UserFriend
The body below contains UTF8 Polytonic Greek.

In order to allow for IDs using polytonic Greek, have defined the following rule in my grammar:

terminal ID: '^'?('a'..'z'|'A'..'Z'|'α'..'ω'|'Α'..'Ω'|'ύ'|'\u1F00'..'\u1FFF')('a'..'z'|'A'..'Z'|'α'..'ω'|'Α'..'Ω'|'\u1F00'..'\u1FFF'|'0'..'9')*
;

The generated grammar correctly handles accented Greek characters in the range of 1F00 to 1FFF except for certain letters with an acute accent mark. Examples include 1F71 ά and 1F7B ύ both of which are within the specified range.

When using an XText editor with the generated grammar, these characters with acute accent marks are marked as an error, and the error message states "extraneous input expecting RULE_STRING"

I have also tried explicitly declaring ύ in the terminal ID rule, but I get the same error.

Is this a bug?

[Updated on: Sat, 07 July 2012 14:33] by Moderator

Re: Possible Unicode Bug - Greek Polytonic [message #894217 is a reply to message #894157] Sat, 07 July 2012 14:30 Go to previous messageGo to next message
Eclipse UserFriend
Did you double check the encoding of all files?
We do nothing special in Xtext here, so this could also be a Java or
Antlr issue. A bug with a reproducible example would be welcome.

Am 07.07.12 03:55, schrieb Kokongi Nei:
> The body below contains UTF8 Polytonic Greek.
>
> In order to allow for IDs using polytonic Greek, have defined the
> following rule in my grammar:
>
> terminal ID:
> '^'?('a'..'z'|'A'..'Z'|'α'..'ω'|'Α'..'Ω'|'ύ'|'\u1F00'..'\u1FFF')('a'..'z'|'A'..'Z'|'α'..'ω'|'Α'..'Ω'|'\u1F00'..'\u1FFF'|'0'..'9')*
>
> ;
>
> The generated grammar correctly handles accented Greek characters in the
> range of 1F00 to 1FFF except for certain letters with an acute accent
> mark. Examples include 1F71 ά and 1F7B ύ both of which are within the
> specified range.
>
> When using an XText editor with the generated grammar, these characters
> with acute accent marks are marked as an error, and the error message
> states "extraneous input expecting RULE_STRING"
>
> I have also tried explicitly declaring ύ in the terminal ID rule, but I
> get the same error.
>
> Is this a bug?
>
>


--
Need professional support for Eclipse Modeling?
Go visit: http://xtext.itemis.com
Re: Possible Unicode Bug - Greek Polytonic [message #894218 is a reply to message #894217] Sat, 07 July 2012 14:35 Go to previous message
Eclipse UserFriend
The file is encoded at UTF-8. I have uploaded some screen shots to show what the error looks like.
Previous Topic:Customizing error messages
Next Topic:Import...As
Goto Forum:
  


Current Time: Tue Jul 15 06:13:35 EDT 2025

Powered by FUDForum. Page generated in 0.16355 seconds
.:: Contact :: Home ::.

Powered by: FUDforum 3.0.2.
Copyright ©2001-2010 FUDforum Bulletin Board Software

Back to the top