B4J Question Problem with JTidy (char coding)

rosippc64a

Active Member
Licensed User
Longtime User
Hi All,
I downloaded a html source of a webpage. In it there are words having utf8 characters, like: "Török Gábor". When I do this:
B4X:
        tid.Parse(File.OpenInput(File.DirTemp, "index.html"), File.DirTemp, "1.xml")
, in the 1.xml contains:
B4X:
Török Gábor
It is eventually wrong, because the char "ö" must be: ö, so the good encoding is:
B4X:
Török Gábor
How can I tell the jTidy,
1. not to transform my utf8 chars into &xxx; code or
2. use the good html entity?

Thanks in advance
Steven
 
Cookies are required to use this site. You must accept them to continue using the site. Learn more…