Rebuilding Process Information Thread

The VG Resource

Wiki Sprites Models Textures Sounds Login

VGFacts DidYouKnowGaming?

Users browsing this thread: 1 Guest(s)

Rebuilding Process Information Thread

Raz

Offline

Member

Retired Staff

Posts: 245
Threads: 5
Joined: May 2008

Steam

#15

01-07-2014, 03:50 PM

(01-07-2014, 02:18 PM)Phaze Wrote: On the subject of retrieving data, wouldn't it be theoretically possible to make a DOM-parsing program that will auto-download the cached pages for you from Google and parse the data? I've never done this sort of thing before* but if it's really gonna take ages to do, it might be worth for me or one of the staff to look into.

*While I haven't done something to scrape pages automatically, I once made a DOM-parsing program in PHP that I never finished to clean up saved pages from a forum to archive them neatly and fix the broken CSS.

And a chat with Dazz tells me that Google rejects crawling-like activity. Oh well '_;

I wrote a scraper but the IP address I was using got banned pretty quickly, even if I randomized the IP/interval between requests Google looks for patterns in the search requests and will block too many similar requests with a CAPTCHA. The funny thing is, the biggest web crawler in the world doesn't let you crawl their servers. Huh.

Website

« Next Oldest | Next Newest »

Messages In This Thread

Rebuilding Process Information Thread - by Dazz - 01-07-2014, 11:44 AM

RE: Rebuilding Process Information Thread - by Ton - 01-07-2014, 12:27 PM

RE: Rebuilding Process Information Thread - by Kedric - 01-07-2014, 01:12 PM

RE: Rebuilding Process Information Thread - by Petie - 01-07-2014, 01:20 PM

RE: Rebuilding Process Information Thread - by Ploaj - 01-07-2014, 01:23 PM

RE: Rebuilding Process Information Thread - by Kedric - 01-07-2014, 01:44 PM

RE: Rebuilding Process Information Thread - by Dazz - 01-07-2014, 01:45 PM

RE: Rebuilding Process Information Thread - by Dazz - 01-07-2014, 01:57 PM

RE: Rebuilding Process Information Thread - by Phaze - 01-07-2014, 02:12 PM

RE: Rebuilding Process Information Thread - by Kedric - 01-07-2014, 02:12 PM

RE: Rebuilding Process Information Thread - by Phaze - 01-07-2014, 02:18 PM

RE: Rebuilding Process Information Thread - by Raz - 01-07-2014, 03:50 PM

RE: Rebuilding Process Information Thread - by Phaze - 01-07-2014, 08:55 PM

RE: Rebuilding Process Information Thread - by Random Talking Bush - 01-07-2014, 02:26 PM

RE: Rebuilding Process Information Thread - by Garamonde - 01-07-2014, 03:10 PM

RE: Rebuilding Process Information Thread - by Dazz - 01-07-2014, 03:10 PM

RE: Rebuilding Process Information Thread - by Garamonde - 01-07-2014, 03:56 PM

RE: Rebuilding Process Information Thread - by DJane Coco - 01-07-2014, 04:38 PM

RE: Rebuilding Process Information Thread - by daemoth - 01-07-2014, 05:14 PM

RE: Rebuilding Process Information Thread - by puggsoy - 01-07-2014, 05:18 PM

RE: Rebuilding Process Information Thread - by psychospacecow - 01-07-2014, 05:22 PM

RE: Rebuilding Process Information Thread - by Pik - 01-07-2014, 07:44 PM

RE: Rebuilding Process Information Thread - by JosephSeraph - 01-07-2014, 08:03 PM

RE: Rebuilding Process Information Thread - by Dazz - 01-07-2014, 08:39 PM

RE: Rebuilding Process Information Thread - by MadkaT - 01-07-2014, 09:21 PM

RE: Rebuilding Process Information Thread - by Dazz - 01-07-2014, 09:24 PM

RE: Rebuilding Process Information Thread - by Raz - 01-07-2014, 10:51 PM

RE: Rebuilding Process Information Thread - by Deathbringer - 01-07-2014, 11:11 PM

RE: Rebuilding Process Information Thread - by Petie - 01-07-2014, 11:12 PM

RE: Rebuilding Process Information Thread - by Palculator - 01-08-2014, 03:52 AM

RE: Rebuilding Process Information Thread - by Dazz - 01-08-2014, 04:25 AM

RE: Rebuilding Process Information Thread - by Palculator - 01-08-2014, 05:35 AM

RE: Rebuilding Process Information Thread - by Petie - 01-08-2014, 05:48 AM

RE: Rebuilding Process Information Thread - by Dazz - 01-08-2014, 09:40 AM

RE: Rebuilding Process Information Thread - by Ton - 01-08-2014, 10:29 AM

RE: Rebuilding Process Information Thread - by tigerlily - 01-08-2014, 04:05 PM

RE: Rebuilding Process Information Thread - by Dunkelschwamm - 01-08-2014, 05:55 PM

RE: Rebuilding Process Information Thread - by Quirby64 - 01-08-2014, 06:18 PM

RE: Rebuilding Process Information Thread - by Raz - 01-08-2014, 06:49 PM

RE: Rebuilding Process Information Thread - by Ton - 01-08-2014, 07:49 PM

RE: Rebuilding Process Information Thread - by Dazz - 01-08-2014, 11:24 PM

RE: Rebuilding Process Information Thread - by Petie - 01-08-2014, 11:59 PM

RE: Rebuilding Process Information Thread - by Phaze - 01-09-2014, 04:50 PM

RE: Rebuilding Process Information Thread - by Raz - 01-09-2014, 07:56 PM

RE: Rebuilding Process Information Thread - by Phaze - 01-10-2014, 06:55 PM

RE: Rebuilding Process Information Thread - by Raz - 01-10-2014, 08:33 PM

RE: Rebuilding Process Information Thread - by Petie - 01-10-2014, 10:03 PM

RE: Rebuilding Process Information Thread - by Garamonde - 01-09-2014, 04:22 PM

RE: Rebuilding Process Information Thread - by Petie - 01-09-2014, 08:44 PM

View a Printable Version

Forum Jump: