My goal is to get text from any web page, which does not provide any feed. Now I know that I can use Jtidy to convert a HTML page to XHTML page. And then XHTML can be parsed to text with a XML parser. Sadly, I cant get it working.
Can anyone post a working code for the same?
Can anyone post a working code for the same?