Hi there
Whilst JTidy works well to some extent for my scenario, there is an issue with it parsing html5 attributes as its indicating that things like nav, footer are data-site etc are 'unknown'
I'm using this to parse html content to xml and then run that with xml2map to draw out a structure on a tree and all html5 related elements are not identified.
Is there another method I can use to convert my html5 content to xml besides using jtidy to xmlElements?
I'm still to explore jsoup and it it can do this all the better?
Can someone advise please? Thanks.
Whilst JTidy works well to some extent for my scenario, there is an issue with it parsing html5 attributes as its indicating that things like nav, footer are data-site etc are 'unknown'
I'm using this to parse html content to xml and then run that with xml2map to draw out a structure on a tree and all html5 related elements are not identified.
Is there another method I can use to convert my html5 content to xml besides using jtidy to xmlElements?
I'm still to explore jsoup and it it can do this all the better?
Can someone advise please? Thanks.