My goal is to get text from any web page that does not provide a feed. I know that I can use JTidy to convert an HTML page to XHTML, and that the XHTML can then be parsed to text with an XML parser. Sadly, I can't get it working.
This is what I want to do:
-Pick an arbitrary web page, for example a Google search results page,
-Convert the page into XML (XHTML),
-Take the relevant data from the parsed XML.
In the end I want to have this data in my app as plain text.
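
A minimal sketch of the last step, assuming the page has already been converted to well-formed XHTML (for example by JTidy). It uses only the JDK's built-in DOM parser. The class name XhtmlText and the method extractText are made up for illustration, and the input string stands in for the converted page.

```java
import java.io.StringReader;
import javax.xml.parsers.DocumentBuilder;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.Node;
import org.xml.sax.InputSource;

// Hypothetical helper: the name and structure are assumptions, not part of JTidy.
final class XhtmlText {

    // Parses a well-formed XHTML string and returns its text with whitespace collapsed.
    static String extractText(String xhtml) throws Exception {
        DocumentBuilderFactory factory = DocumentBuilderFactory.newInstance();
        DocumentBuilder builder = factory.newDocumentBuilder();
        Document doc = builder.parse(new InputSource(new StringReader(xhtml)));

        StringBuilder out = new StringBuilder();
        collect(doc.getDocumentElement(), out);
        return out.toString().replaceAll("\\s+", " ").trim();
    }

    // Walks the DOM tree depth-first and appends every text node it finds.
    private static void collect(Node node, StringBuilder out) {
        if (node.getNodeType() == Node.ELEMENT_NODE) {
            String name = node.getNodeName();
            // Skip content that is not meant to be read as text.
            if (name.equalsIgnoreCase("script") || name.equalsIgnoreCase("style")) {
                return;
            }
        }
        if (node.getNodeType() == Node.TEXT_NODE) {
            // A space is added after each text node so that neighbouring
            // elements do not run together; this can split a word that is
            // broken up by inline tags.
            out.append(node.getNodeValue()).append(' ');
        }
        for (Node child = node.getFirstChild(); child != null; child = child.getNextSibling()) {
            collect(child, out);
        }
    }

    public static void main(String[] args) throws Exception {
        // Stand-in for the XHTML produced by the conversion step.
        String xhtml = "<html><head><title>Demo</title></head>"
                + "<body><h1>Hello</h1><p>Some <b>bold</b> text.</p></body></html>";
        System.out.println(extractText(xhtml));
    }
}
```

Real pages usually carry a DOCTYPE and named entities such as &nbsp;, so the parser may need extra configuration before it accepts them. This sketch leaves that out.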