Eclipse Community Forums
Possible Unicode Bug - Greek Polytonic [message #894157] Sat, 07 July 2012 01:55
Kokongi Nei
The body below contains UTF-8 encoded polytonic Greek.

To allow IDs written in polytonic Greek, I have defined the following rule in my grammar:

terminal ID: '^'?('a'..'z'|'A'..'Z'|'α'..'ω'|'Α'..'Ω'|'ύ'|'\u1F00'..'\u1FFF')('a'..'z'|'A'..'Z'|'α'..'ω'|'Α'..'Ω'|'\u1F00'..'\u1FFF'|'0'..'9')*
;

The generated grammar correctly handles accented Greek characters in the range U+1F00 to U+1FFF, except for certain letters with an acute accent mark. Examples include U+1F71 ά and U+1F7B ύ, both of which are within the specified range.

When I use an Xtext editor with the generated grammar, these characters with acute accent marks are marked as errors, and the error message states "extraneous input expecting RULE_STRING".

I have also tried explicitly declaring ύ in the terminal ID rule, but I get the same error.

Is this a bug?
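
One possible explanation, which nobody in this thread confirms, is Unicode normalization. The Greek Extended letters with oxia (such as U+1F71 and U+1F7B) are canonically equivalent to the tonos letters in the basic Greek block (U+03AC and U+03CD), and NFC normalization replaces the former with the latter. If an editor, clipboard, or input method normalizes the text, the lexer never sees a character from '\u1F00'..'\u1FFF'; it sees a tonos letter that lies outside 'α'..'ω'. The sketch below is a minimal Java check of that assumption. The class name PolytonicCheck and its helper methods are made up for illustration and are not part of Xtext; the range test mirrors the first character class of the ID rule above, leaving out the explicit 'ύ' alternative because its exact code point in the post is unknown.

```java
import java.text.Normalizer;

public class PolytonicCheck {
    // Mirrors the first character class of the ID rule from the post
    // (hypothetical helper; the explicit single-letter alternative is omitted).
    static boolean inIdStartRange(int cp) {
        return (cp >= 'a' && cp <= 'z')
            || (cp >= 'A' && cp <= 'Z')
            || (cp >= 0x03B1 && cp <= 0x03C9)   // small alpha .. small omega
            || (cp >= 0x0391 && cp <= 0x03A9)   // capital alpha .. capital omega
            || (cp >= 0x1F00 && cp <= 0x1FFF);  // Greek Extended block
    }

    // Returns the NFC (canonical composition) form of the input.
    static String nfc(String s) {
        return Normalizer.normalize(s, Normalizer.Form.NFC);
    }

    // Formats each code point of the string as U+XXXX.
    static String hex(String s) {
        StringBuilder sb = new StringBuilder();
        s.codePoints().forEach(cp -> sb.append(String.format("U+%04X ", cp)));
        return sb.toString().trim();
    }

    public static void main(String[] args) {
        // Alpha with oxia, upsilon with oxia, and alpha with varia for comparison.
        for (String s : new String[] {"\u1F71", "\u1F7B", "\u1F70"}) {
            String n = nfc(s);
            System.out.println(hex(s) + " -> NFC " + hex(n)
                + ", accepted by ID start: " + inIdStartRange(n.codePointAt(0)));
        }
    }
}
```

If normalization is the cause, the two oxia letters come out as U+03AC and U+03CD and fail the range test, while the varia letter U+1F70 is unchanged and passes, which would match the symptom that only acute-accented letters are rejected. In that case, extending the terminal rule to cover the tonos letters would be one thing to try.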

[Updated on: Sat, 07 July 2012 18:33]


Re: Possible Unicode Bug - Greek Polytonic [message #894217 is a reply to message #894157] Sat, 07 July 2012 18:30
Jan Koehnlein
Did you double-check the encoding of all files?
We do nothing special in Xtext here, so this could also be a Java or
ANTLR issue. A bug report with a reproducible example would be welcome.
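
One way to double-check the encoding is to decode the file strictly as UTF-8 and list the non-ASCII code points it actually contains. The following is a minimal sketch, not an Xtext facility; the class name EncodingCheck and its methods are hypothetical, and the file path is taken from the first command-line argument.

```java
import java.nio.ByteBuffer;
import java.nio.charset.CharacterCodingException;
import java.nio.charset.CodingErrorAction;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Paths;

public class EncodingCheck {
    // Decodes strictly as UTF-8: malformed bytes raise an error
    // instead of being silently replaced.
    static String decodeStrictUtf8(byte[] bytes) {
        try {
            return StandardCharsets.UTF_8.newDecoder()
                .onMalformedInput(CodingErrorAction.REPORT)
                .onUnmappableCharacter(CodingErrorAction.REPORT)
                .decode(ByteBuffer.wrap(bytes))
                .toString();
        } catch (CharacterCodingException e) {
            throw new IllegalArgumentException("Input is not valid UTF-8", e);
        }
    }

    // Lists every non-ASCII code point, one per line, with its Unicode name.
    static String describeNonAscii(String text) {
        StringBuilder sb = new StringBuilder();
        text.codePoints().filter(cp -> cp > 0x7F).forEach(cp ->
            sb.append(String.format("U+%04X %s%n", cp, Character.getName(cp))));
        return sb.toString();
    }

    public static void main(String[] args) throws Exception {
        // args[0]: path to the file to inspect (for example a model file).
        byte[] bytes = Files.readAllBytes(Paths.get(args[0]));
        System.out.print(describeNonAscii(decodeStrictUtf8(bytes)));
    }
}
```

Running this on both the grammar file and the model file shows whether an accented letter is stored as a Greek Extended code point (a name ending in WITH OXIA) or as a basic Greek one (a name ending in WITH TONOS), which is useful detail to attach to a bug report.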

Re: Possible Unicode Bug - Greek Polytonic [message #894218 is a reply to message #894217] Sat, 07 July 2012 18:35
Kokongi Nei
The file is encoded as UTF-8. I have uploaded some screenshots to show what the error looks like.