Hi All,
I downloaded a html source of a webpage. In it there are words having utf8 characters, like: "Török Gábor". When I do this:
tid.Parse(File.OpenInput(File.DirTemp, "index.html"), File.DirTemp, "1.xml")
, in the 1.xml contains:
Török Gábor
It is eventually wrong, because the char "ö" must be: ö, so the good encoding is:
Török Gábor
How can I tell the jTidy,
1. not to transform my utf8 chars into &xxx; code or
2. use the good html entity?
Thanks in advance
Steven