Bill Patterson on Sat, 9 Aug 2003 00:12:13 -0400



[PLUG] www.ecommercics.com/spider/


Improved versions of the spider.php and search.php programs are now
available at www.ecommercics.com/spider/.

I have found the problem that kept some of the spidered web pages out of
our database Wednesday night.  I was not adding escape characters before
quotes, so the SQL was invalid for SOME of the web pages being saved to
the database (especially those from Chris's site).  I now call the PHP
function addslashes().  I targeted www.cjfearnley.com, built a (manually
truncated) database from the spider of it, and can now search
successfully for "deanforamerica", which we had not found before!  I have
also added error checking to the mysql_query() call in spiderlib.php. <g>
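
A rough sketch of the two changes described above, for anyone following
along.  This is not the actual spiderlib.php code: the function names
build_insert_sql() and run_checked(), and the table and column names
(pages, url, body), are assumptions made up for illustration.  The query
runner is passed in as a callable so the sketch does not depend on a live
database connection.

```php
<?php
// Hypothetical helper, NOT the real spiderlib.php code.
// Table and column names (pages, url, body) are assumptions.
function build_insert_sql($url, $body)
{
    // addslashes() puts a backslash before ', ", \ and NUL bytes, so a
    // quote inside the page text no longer ends the SQL string early.
    return "INSERT INTO pages (url, body) VALUES ('"
        . addslashes($url) . "', '"
        . addslashes($body) . "')";
}

// Hypothetical wrapper: $run_query is any callable that returns false on
// failure, which is how mysql_query() signalled an error.
function run_checked($run_query, $sql)
{
    $result = $run_query($sql);
    if ($result === false) {
        // Report the failing statement instead of silently losing the page.
        echo "Query failed: $sql\n";
    }
    return $result;
}

// Page text containing both kinds of quotes, like the pages that failed.
$sql = build_insert_sql(
    "http://www.cjfearnley.com/",
    'A page with "double" and \'single\' quotes'
);
echo $sql, "\n";
```

Without the addslashes() calls, the single quote in the page text would
close the SQL string literal early and the statement would fail to parse,
which matches the symptom of pages going missing from the database.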

Please feel free to play with the spider.php and search.php programs.
(Note that I have disabled the INSERT statement in spider.php so that the
database won't grow too much.)  If you want to look at the real code, it
is at www.ecommercics.com/spider/spidisplay/*.show .

Bill
_________________________________________________________________________
Philadelphia Linux Users Group        --       http://www.phillylinux.org
Announcements - http://lists.netisland.net/mailman/listinfo/plug-announce
General Discussion  --   http://lists.netisland.net/mailman/listinfo/plug