| | #1 (permalink) |
| Luv My Treo !!!!! Join Date: Dec 2002 Location: SE Wisconsin Posts: 5,556
Phone(s): Treo Pro, Nokia 6131, Moto i325 IS Provider(s): at&t/at&t/Nextel Devices: Assorted handheld & installed GPS Thanks: 35
Thanked 32 Times in 27 Posts
Images: 167 |
Joe, Sorry for bugging you again. If I look at "Who's Online" from the Quick links, other than members & guests, I see multiple entries for "Yahoo! Slurp Spider" & I also see "yahooaskjeeves". Who/What on earth are these? Thanks Much
__________________ The secret of success is sincerity. Once you can fake that you've got it made. Jean Giraudoux (1882-1944) |
| | |
| | #2 (permalink) |
| Join Date: Dec 2006 Location: Bay Area, CA Posts: 1,738
Phone(s): Sony Ericsson K850i, Nokia 6131 v5.50, Nokia 6820 v5.30 Provider(s): AT&T Blue; T-Mobile To Go Devices: Palm TX Thanks: 7
Thanked 16 Times in 15 Posts
| I know I'm not JFB, but having worked with a spider bot before, I can give you some info. Google, Yahoo Search, and other services "crawl" the WWW: they load, store, index, and analyze pages. "Good" bots follow the rules (a site can limit bot access by publishing its rules in a robots.txt file) and also identify themselves, much as browsers do, by setting the User-Agent header in the HTTP requests they send to the site. The site can then use that header to identify which bots are active.
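A minimal sketch of the two mechanisms described above, using Python's standard `urllib.robotparser`. The robots.txt contents, the `ExampleBadBot` name, the example.com URLs, and the helper names are all made up for illustration, and the name fragments in `looks_like_known_bot` are only a guess at what a site might match on.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt contents, invented for this example.
ROBOTS_TXT = """\
User-agent: *
Disallow: /private/

User-agent: ExampleBadBot
Disallow: /
"""


def allowed(user_agent: str, url: str) -> bool:
    """Return True if the rules above let this user agent fetch the URL."""
    parser = RobotFileParser()
    parser.parse(ROBOTS_TXT.splitlines())
    return parser.can_fetch(user_agent, url)


def looks_like_known_bot(user_agent_header: str) -> bool:
    """Crude check of a User-Agent header against a few bot name fragments."""
    known_fragments = ("slurp", "googlebot", "ask jeeves")  # illustrative list
    header = user_agent_header.lower()
    return any(fragment in header for fragment in known_fragments)
```

A "good" bot would run something like `allowed(...)` before each request; a site's "Who's Online" page would do something closer to `looks_like_known_bot(...)` on the header it receives.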
|
| | |
| | #3 (permalink) |
| Luv My Treo !!!!! Join Date: Dec 2002 Location: SE Wisconsin Posts: 5,556
Phone(s): Treo Pro, Nokia 6131, Moto i325 IS Provider(s): at&t/at&t/Nextel Devices: Assorted handheld & installed GPS Thanks: 35
Thanked 32 Times in 27 Posts
Images: 167 | Wirelessly posted (SAMSUNG-SGH-I607/I607FG1 Mozilla/4.0 (compatible; MSIE 4.01; Windows CE; Smartphone; 320x240) UP.Link/6.3.1.17.0) Thanks dmapr (even though you are not JFB). So what would be an example of a "bad" bot? Thanks Much
__________________ The secret of success is sincerity. Once you can fake that you've got it made. Jean Giraudoux (1882-1944) Last edited by Charlyee; 06-21-2007 at 7:50 PM. |
| | |
| | #4 (permalink) | |
| Join Date: Dec 2006 Location: Bay Area, CA Posts: 1,738
Phone(s): Sony Ericsson K850i, Nokia 6131 v5.50, Nokia 6820 v5.30 Provider(s): AT&T Blue; T-Mobile To Go Devices: Palm TX Thanks: 7
Thanked 16 Times in 15 Posts
| Quote:
A "bad" bot would be one that shows up on the site and, ignoring any directives that may have been specified in the robots.txt file, proceeds to read every link as fast as it can, clogging up the server's resources. Here's a Web robots FAQ; hopefully it can answer some questions. I'm about seven years removed from robots and my memory may be failing me. | |
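One way a server might notice the "as fast as it can" behavior is to count each client's requests in a short time window. This is only a sketch: the `RateWatcher` class, its default thresholds, and the client identifiers are hypothetical and not taken from any particular forum or server software.

```python
from collections import defaultdict, deque


class RateWatcher:
    """Flags clients that make more than max_hits requests within window seconds."""

    def __init__(self, max_hits: int = 10, window: float = 1.0):
        self.max_hits = max_hits
        self.window = window
        self.hits = defaultdict(deque)  # client id -> recent request timestamps

    def record(self, client: str, now: float) -> bool:
        """Record one request at time `now`; return True if the client is over the limit."""
        recent = self.hits[client]
        recent.append(now)
        # Drop timestamps that have fallen out of the window.
        while recent and now - recent[0] > self.window:
            recent.popleft()
        return len(recent) > self.max_hits
```

A client that requests a page every couple of seconds never trips the limit, while one that fires several requests a second does so almost immediately.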
| | |
| | #5 (permalink) |
| Administrator Join Date: Jan 1998 Location: New Jersey, USA Posts: 2,493
Phone(s): Verizon (HTC) VX6800 Provider(s): Verizon Wireless Devices: Garmin eTrex Legend, Sirius Starmate Thanks: 29
Thanked 22 Times in 13 Posts
Images: 6 |
Yup, dmapr explained it nicely. It is good to see those because it means the search engines are indexing all of the great info we have here. I could exclude them from the "Who's Online" list, but I think it is interesting to see when they are active.
__________________ Joe Are you new to WA? Introduce yourself in our Welcome Forum! Find out how to get a FREE WA T-shirt --> HERE |
| | |
| | #6 (permalink) |
| Easy,Cheap & Sleazy Join Date: Sep 2002 Location: Union County NJ Posts: 8,331
Phone(s): Razr V3xx, enV, Adventure, 6010 Provider(s): AT&T, Verizon, T-Mobile ToGo Thanks: 0
Thanked 0 Times in 0 Posts
Images: 293 |
Wow, that is pretty wild, and I never knew about them. I agree, Joe: leave them be if they are not hurting anything.
|
| | |
| | #7 (permalink) | ||
| Battery mgmt is my life Join Date: Oct 2002 Location: Cambridge, MA Posts: 1,468
Phone(s): LG CU500,BlackBerry 8830, Previous: BB 8703e, Nokia 6200, Siemens S46, Ericsson R280LX Provider(s): T-Mobile (personal), Verizon (work) Devices: Palm T2 Thanks: 8
Thanked 9 Times in 8 Posts
| Quote:
Quote:
Sounds like your memory is fine. I haven't worked with bots per se, but in the early days of the Web I did a bit of playing around to make my sites appear higher in search engines, and I also tracked bot behavior on the Web server I ran. As you implied, allowing bots is the price we pay to have the Web indexed efficiently, enabling Google and other search engines. To Charlyee's point, they have to do essentially what a person would do, because there is only one protocol, HTTP, that all Web servers use, and the HTML pages served by that protocol have a fairly small number of commands that are relevant. Links and images are still a large part of what's out there. Tim Berners-Lee's insight was to understand that this was enough to do lots of cool stuff. Of course, there are plenty of ways to protect your site from bots in addition to robots.txt, which is essentially the honor system. You can require a login, for example, like corporate intranets do. But if you want your information available to the world, you have to accommodate bots. SW
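To make the "links and images" point concrete, here is a rough sketch of the part of a crawler that pulls link and image URLs out of a fetched page, using Python's standard `html.parser`. The `LinkCollector` and `extract_links` names and the example.com URLs are invented for illustration; a real crawler does far more than this.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class LinkCollector(HTMLParser):
    """Collects absolute URLs from <a href=...> and <img src=...> tags."""

    def __init__(self, base_url: str):
        super().__init__()
        self.base_url = base_url
        self.links = []

    def handle_starttag(self, tag, attrs):
        # Only two tag/attribute pairs matter for this sketch.
        wanted = {"a": "href", "img": "src"}.get(tag)
        if wanted is None:
            return
        for name, value in attrs:
            if name == wanted and value:
                # Resolve relative URLs against the page's own address.
                self.links.append(urljoin(self.base_url, value))


def extract_links(html: str, base_url: str) -> list:
    """Return every link and image URL found in the page, in document order."""
    collector = LinkCollector(base_url)
    collector.feed(html)
    return collector.links
```

Each URL returned would then go into the bot's queue of pages to fetch next, which is how it works its way through a site.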
__________________ One Zero: "It is a great pleasure..." Zero One: "...to work on such a large mobile computer." -- Star Trek TNG, Season 1, Ep. 15, (1988) | ||
| | |
| | #8 (permalink) | |
| Luv My Treo !!!!! Join Date: Dec 2002 Location: SE Wisconsin Posts: 5,556
Phone(s): Treo Pro, Nokia 6131, Moto i325 IS Provider(s): at&t/at&t/Nextel Devices: Assorted handheld & installed GPS Thanks: 35
Thanked 32 Times in 27 Posts
Images: 167 | Wirelessly posted (SAMSUNG-SGH-I607/I607FG1 Mozilla/4.0 (compatible; MSIE 4.01; Windows CE; Smartphone; 320x240) UP.Link/6.3.1.17.0) Quote:
Steve, thanks for the additional piece of information, it made it more interesting yet. dmapr, thanks for the link. Fire, aren't you glad I asked, now we both learnt something additional today
__________________ The secret of success is sincerity. Once you can fake that you've got it made. Jean Giraudoux (1882-1944) Last edited by Charlyee; 06-21-2007 at 10:43 PM. | |
| | |
| | #9 (permalink) | |
| Easy,Cheap & Sleazy Join Date: Sep 2002 Location: Union County NJ Posts: 8,331
Phone(s): Razr V3xx, enV, Adventure, 6010 Provider(s): AT&T, Verizon, T-Mobile ToGo Thanks: 0
Thanked 0 Times in 0 Posts
Images: 293 | Quote:
| |
| | |
| | #10 (permalink) | ||
| Join Date: Dec 2006 Location: Bay Area, CA Posts: 1,738
Phone(s): Sony Ericsson K850i, Nokia 6131 v5.50, Nokia 6820 v5.30 Provider(s): AT&T Blue; T-Mobile To Go Devices: Palm TX Thanks: 7
Thanked 16 Times in 15 Posts
| Quote:
Quote:
Steve's point just triggered another piece of memory: bot traps. Bot traps are usually obscure links that lead bots on wild goose chases and that can be hidden or disguised from a user who is browsing. For instance, you can place a number of "legitimate"-looking links on your web page and hide them from view using CSS or other attributes. Bots usually do not analyze the full HTML to work out whether a link will be visible (it would slow them down too much), so they may fall into the trap. | ||
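A bare-bones sketch of the trap idea, assuming the site hides a link with inline CSS and then watches for anyone who requests it. The trap path, the link text, and the `TrapMonitor` class are all hypothetical; a real setup might also list the trap path under `Disallow` in robots.txt so that well-behaved bots steer clear of it.

```python
TRAP_PATH = "/trap/do-not-follow"  # hypothetical path, not a real URL on any site


def trap_link_html() -> str:
    """Return a link that a browser will not display but a naive bot will still see."""
    return f'<a href="{TRAP_PATH}" style="display:none">archive</a>'


class TrapMonitor:
    """Remembers which clients have requested the trap path."""

    def __init__(self):
        self.flagged = set()

    def observe(self, client: str, path: str) -> bool:
        """Record one request; return True if this client has ever hit the trap."""
        if path.startswith(TRAP_PATH):
            self.flagged.add(client)
        return client in self.flagged
```

Since a person browsing never sees the link, any client that ends up in `flagged` is almost certainly a bot that reads raw HTML without checking visibility.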
| | |