I am trying my first attempt at "scraping" and using NJDude's helpful basic example here to guide me.
What puzzles me is that the SearchString.IndexOf(whatever) construction will find some phrases on the page but not others. I know my phrases are correctly typed because I cut and paste them from the "View Source" version of the site which Google Chrome kindly gives me. And I have been making sure that all double quotes are converted to single quotes as in the example, using
Surely my actual B4A code must be right or it wouldn't find any phrases at all ?
Is there some reason why this doesn't work with some characters, or are there "invisible" characters that Chrome's "view source" isn't showing me ?
An example of the page I'm trying to read is: Site Search 'G-EZWA' - Planespotters.net Just Aviation and I'm trying to get to the items in the table below the word "Registrations".
The result of
is -1 but as far as I can see it's there, albeit with double quotes in the original. I just can't find it with SearchString.IndexOf().
Can anyone help please ? zip file should be attached.
Caravelle
What puzzles me is that the SearchString.IndexOf(whatever) construction will find some phrases on the page but not others. I know my phrases are correctly typed because I cut and paste them from the "View Source" version of the site which Google Chrome kindly gives me. And I have been making sure that all double quotes are converted to single quotes as in the example, using
B4X:
SearchString = SearchString.Replace(QUOTE, "'")
Is there some reason why this doesn't work with some characters, or are there "invisible" characters that Chrome's "view source" isn't showing me ?
An example of the page I'm trying to read is: Site Search 'G-EZWA' - Planespotters.net Just Aviation and I'm trying to get to the items in the table below the word "Registrations".
The result of
B4X:
SearchString.IndexOf("<td class='nowrap dt-asc'>")
Can anyone help please ? zip file should be attached.
Caravelle