FUDforum
Fast Uncompromising Discussions. FUDforum will get your users talking.

Home » Imported messages » comp.lang.php » Scrape dynamically generated hyperlinks
Show: Today's Messages :: Polls :: Message Navigator
Switch to threaded view of this topic Create a new topic Submit Reply
Scrape dynamically generated hyperlinks [message #179682] Sat, 17 November 2012 19:46 Go to next message
student4lifer is currently offline  student4lifer
Messages: 2
Registered: November 2012
Karma: 0
Junior Member
Could someone let me know if it is possible to scrape dynamically generated hyperlinks like those from the following site:

https://fox.taleo.net/careersection/fox_external_career_section/jobsearch.f tl?


TIA.
Re: Scrape dynamically generated hyperlinks [message #179684 is a reply to message #179682] Sat, 17 November 2012 22:05 Go to previous messageGo to next message
Jerry Stuckle is currently offline  Jerry Stuckle
Messages: 2598
Registered: September 2010
Karma: 0
Senior Member
On 11/17/2012 2:46 PM, student4lifer wrote:
> Could someone let me know if it is possible to scrape dynamically generated hyperlinks like those from the following site:
>
> https://fox.taleo.net/careersection/fox_external_career_section/jobsearch.f tl?
>
>
> TIA.
>
>
It's always possible - the browser does it all the time. All you need
to do is load the iframe(s), parse and execute the javascript and make
the appropriate calls back to the server based on the parameters you wish.

Now if you want to ask "Is it practical?", that's another story.

--
==================
Remove the "x" from my email address
Jerry Stuckle
JDS Computer Training Corp.
jstucklex(at)attglobal(dot)net
==================
RE: Scrape dynamically generated hyperlinks [message #179685 is a reply to message #179682] Sun, 18 November 2012 00:03 Go to previous messageGo to next message
student4lifer is currently offline  student4lifer
Messages: 2
Registered: November 2012
Karma: 0
Junior Member
Thanks for your reply, Jerry. I tried to isolate a single hyperlink (from the parsed page with all its javasript) but clicking on it brought back the whole 'generic' starting page with many other hyperlinks as well. Only then, clicking on any of the hyperlinks truly brought back the detail record. Is it possible to manipulate/isolate the parsed page so that clicking on the hyperlink only once would return the detail record? Thanks again.
Re: Scrape dynamically generated hyperlinks [message #179686 is a reply to message #179685] Sun, 18 November 2012 00:36 Go to previous messageGo to next message
Jerry Stuckle is currently offline  Jerry Stuckle
Messages: 2598
Registered: September 2010
Karma: 0
Senior Member
On 11/17/2012 7:03 PM, student4lifer wrote:
> Thanks for your reply, Jerry. I tried to isolate a single hyperlink (from the parsed page with all its javasript) but clicking on it brought back the whole 'generic' starting page with many other hyperlinks as well. Only then, clicking on any of the hyperlinks truly brought back the detail record. Is it possible to manipulate/isolate the parsed page so that clicking on the hyperlink only once would return the detail record? Thanks again.
>
>

It doesn't look like that option is available on the first page. But
then I haven't gone through all the html and javascript, either.

--
==================
Remove the "x" from my email address
Jerry Stuckle
JDS Computer Training Corp.
jstucklex(at)attglobal(dot)net
==================
Re: Scrape dynamically generated hyperlinks [message #179690 is a reply to message #179682] Sun, 18 November 2012 04:23 Go to previous message
Thomas 'PointedEars'  is currently offline  Thomas 'PointedEars'
Messages: 701
Registered: October 2010
Karma: 0
Senior Member
student4lifer wrote:

> Could someone let me know if it is possible to scrape dynamically
> generated hyperlinks like those from the following site:
>
>
https://fox.taleo.net/careersection/fox_external_career_section/jobsearch.f tl?

It is possible. You need an ECMAScript-capable browser emulation written in
PHP. Good luck.


PointedEars
--
Danny Goodman's books are out of date and teach practices that are
positively harmful for cross-browser scripting.
-- Richard Cornford, cljs, <cife6q$253$1$8300dec7(at)news(dot)demon(dot)co(dot)uk> (2004)
  Switch to threaded view of this topic Create a new topic Submit Reply
Previous Topic: Data sanitation for mysql queries.
Next Topic: Repetetive code question
Goto Forum:
  

-=] Back to Top [=-
[ Syndicate this forum (XML) ] [ RSS ]

Current Time: Wed Nov 27 02:58:59 GMT 2024

Total time taken to generate the page: 0.02350 seconds