WirelessAdvisor.com Forums, The WirelessAdvisor Community (Site Information)
Subject: JFB - Who is online question :)

Old 06-21-2007, 5:48 PM   #1 (permalink)
Charlyee
Luv My Treo !!!!!
Join Date: Dec 2002
Location: SE Wisconsin
Posts: 5,556
Phone(s): Treo Pro, Nokia 6131, Moto i325 IS
Provider(s): at&t/at&t/Nextel
Devices: Assorted handheld & installed GPS
Thanks: 35
Thanked 32 Times in 27 Posts
Images: 167

JFB - Who is online question :)

Joe,

Sorry for bugging you again. If I look at "Who's Online" from the Quick links, other than members & guests, I see multiple entries for "Yahoo! Slurp Spider" & I also see "yahooaskjeeves".

Who/What on earth are these?

Thanks Much
__________________


The secret of success is sincerity. Once you can fake that you've got it made.
Jean Giraudoux (1882-1944)
Charlyee is offline
Old 06-21-2007, 7:18 PM   #2 (permalink)
dmapr
Join Date: Dec 2006
Location: Bay Area, CA
Posts: 1,738
Phone(s): Sony Ericsson K850i, Nokia 6131 v5.50, Nokia 6820 v5.30
Provider(s): AT&T Blue; T-Mobile To Go
Devices: Palm TX
Thanks: 7
Thanked 16 Times in 15 Posts
Re: JFB - Who is online question :)

I know I'm not JFB, but having worked on a spider bot before, I can give you some info. Google, Yahoo Search, and other services "crawl" the Web: they load pages, then store, index, and analyze them. "Good" bots follow the rules (a site can limit bot access by publishing a robots.txt file that sets its access rules) and identify themselves the same way browsers do, by setting the User-Agent header in each HTTP request they send. The site can then use that header to recognize which bots are active.
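To make those two mechanisms concrete, here is a minimal Python sketch, not how this forum or any particular search engine actually does it. The robots.txt content, the bot names, and the helpers `build_rules` and `label_visitor` are made up for illustration; only `urllib.robotparser` is a real standard-library module.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt content, for illustration only.
ROBOTS_TXT = """\
User-agent: *
Disallow: /private/

User-agent: BadBot
Disallow: /
"""


def build_rules(robots_txt: str) -> RobotFileParser:
    """Parse robots.txt text into a rule set a polite bot can consult."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser


# Assumed substrings that mark a crawler's User-Agent header.
# A real forum would keep a much longer, maintained list.
KNOWN_BOT_MARKERS = {
    "slurp": "Yahoo! Slurp Spider",
    "googlebot": "Googlebot",
}


def label_visitor(user_agent: str) -> str:
    """Return a display label for a visitor based on its User-Agent."""
    lowered = user_agent.lower()
    for marker, label in KNOWN_BOT_MARKERS.items():
        if marker in lowered:
            return label
    return "Guest"


if __name__ == "__main__":
    rules = build_rules(ROBOTS_TXT)
    # A "good" bot asks before fetching each URL.
    print(rules.can_fetch("ExampleBot", "http://example.com/forum/thread.html"))
    print(rules.can_fetch("ExampleBot", "http://example.com/private/stats.html"))
    print(label_visitor("Mozilla/5.0 (compatible; Yahoo! Slurp)"))
```

Nothing forces a bot to run the `can_fetch` check, and the User-Agent header is whatever the client chooses to send, so both mechanisms only work for bots that cooperate.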
dmapr is online
Old 06-21-2007, 7:48 PM   #3 (permalink)
Charlyee

Wirelessly posted (SAMSUNG-SGH-I607/I607FG1 Mozilla/4.0 (compatible; MSIE 4.01; Windows CE; Smartphone; 320x240) UP.Link/6.3.1.17.0)

Thanks dmapr (even though you are not JFB), very interesting. They actually show up as looking at posts just like a member/guest. So they mimic human behavior for the purpose of information gathering?

So what would be an example of a "bad" bot?

Thanks Much

Last edited by Charlyee; 06-21-2007 at 7:50 PM.
Charlyee is offline
Old 06-21-2007, 8:09 PM   #4 (permalink)
dmapr

Re: JFB - Who is online question :)

The bots typically parse HTML looking for links and then follow them and repeat, a lot more methodically than a human would.

A "bad" bot would be one that shows up on the site and, ignoring any directives that may have been specified in the robots.txt file, proceeds to read every link as fast as it can, clogging up the server's resources. Here's a Web robots FAQ; hopefully it can answer some questions. I'm about seven years removed from robots and my memory may be failing me.
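The parse, follow, and repeat loop can be sketched in a few lines of Python. This is only an illustration under stated assumptions: `LinkExtractor`, `extract_links`, and `crawl` are hypothetical names, and the "site" is a dictionary of pages in memory rather than real HTTP requests, so the example runs without touching a network.

```python
from collections import deque
from html.parser import HTMLParser
from urllib.parse import urljoin


class LinkExtractor(HTMLParser):
    """Collect the href of every <a> tag, resolved against a base URL."""

    def __init__(self, base_url: str) -> None:
        super().__init__()
        self.base_url = base_url
        self.links: list[str] = []

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return
        for name, value in attrs:
            if name == "href" and value:
                self.links.append(urljoin(self.base_url, value))


def extract_links(html: str, base_url: str) -> list[str]:
    parser = LinkExtractor(base_url)
    parser.feed(html)
    return parser.links


def crawl(pages: dict[str, str], start_url: str, max_pages: int = 100) -> list[str]:
    """Visit pages breadth-first; `pages` stands in for fetching over HTTP."""
    seen = {start_url}
    queue = deque([start_url])
    visited = []
    while queue and len(visited) < max_pages:
        url = queue.popleft()
        visited.append(url)
        for link in extract_links(pages.get(url, ""), url):
            if link in pages and link not in seen:
                seen.add(link)
                queue.append(link)
    return visited


if __name__ == "__main__":
    site = {
        "http://example.com/": '<a href="/a">A</a> <a href="/b">B</a>',
        "http://example.com/a": '<a href="/b">B</a> <a href="/">Home</a>',
        "http://example.com/b": "<p>No links here.</p>",
    }
    print(crawl(site, "http://example.com/"))
```

A well-behaved bot would add two things to this loop: a robots.txt check before each fetch and a pause between requests. Leaving both out is roughly what makes a bot "bad" in the sense described above.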
dmapr is online
Old 06-21-2007, 9:39 PM   #5 (permalink)
JFB
Administrator
Join Date: Jan 1998
Location: New Jersey, USA
Posts: 2,493
Phone(s): Verizon (HTC) VX6800
Provider(s): Verizon Wireless
Devices: Garmin eTrex Legend, Sirius Starmate
Thanks: 29
Thanked 22 Times in 13 Posts
Images: 6

Re: JFB - Who is online question :)

Yup, dmapr explained it nicely. It is good to see those because it means the search engines are indexing all of the great info we have here.

I could exclude them from the "who's online" list, but I think it is interesting to see when they are active.
__________________
Joe

JFB is online
Old 06-21-2007, 9:42 PM   #6 (permalink)
Fire14
Easy,Cheap & Sleazy
Join Date: Sep 2002
Location: Union County NJ
Posts: 8,331
Phone(s): Razr V3xx, enV, Adventure, 6010
Provider(s): AT&T, Verizon, T-Mobile ToGo
Thanks: 0
Thanked 0 Times in 0 Posts
Images: 293

Re: JFB - Who is online question :)

Wow, that is pretty wild & I never knew about them. I agree, Joe: leave them be if they are not hurting anything.
Fire14 is offline
Old 06-21-2007, 9:50 PM   #7 (permalink)
SteveW
Battery mgmt is my life
Join Date: Oct 2002
Location: Cambridge, MA
Posts: 1,468
Phone(s): LG CU500,BlackBerry 8830, Previous: BB 8703e, Nokia 6200, Siemens S46, Ericsson R280LX
Provider(s): T-Mobile (personal), Verizon (work)
Devices: Palm T2
Thanks: 8
Thanked 9 Times in 8 Posts
Re: JFB - Who is online question :)

Quote:
Originally Posted by Charlyee
They actually show up as looking at posts just like a member/guest. So they mimic human behavior for the purpose of information gathering?

Quote:
Originally Posted by dmapr
The bots typically parse HTML looking for links and then follow them and repeat, a lot more methodically than a human would.
...
I'm about seven years removed from robots and my memory may be failing me.


Sounds like your memory is fine. I haven't worked with bots per se, but in the early days of the Web I did a bit of playing around to make my sites appear higher in search engines, and I also tracked bot behavior on the Web server I ran. As you implied, allowing bots is the price we pay to have the Web indexed efficiently, enabling Google and other search engines.

To Charlyee's point, they have to do essentially what a person would do, because there is only one protocol, HTTP, that all Web servers use, and the HTML pages served by that protocol have a fairly small number of commands that are relevant. Links and images are still a large part of what's out there. Tim Berners-Lee's insight was to understand that this was enough to do lots of cool stuff.

Of course, there are plenty of ways to protect your site from bots in addition to robots.txt, which is essentially the honor system. You can require a login, for example, as corporate intranets do. But if you want your information available to the world, you have to accommodate bots.


SW
__________________
One Zero: "It is a great pleasure..."
Zero One: "...to work on such a large mobile computer."
-- Star Trek TNG, Season 1, Ep. 15, (1988)
SteveW is online
Old 06-21-2007, 10:32 PM   #8 (permalink)
Charlyee

Wirelessly posted (SAMSUNG-SGH-I607/I607FG1 Mozilla/4.0 (compatible; MSIE 4.01; Windows CE; Smartphone; 320x240) UP.Link/6.3.1.17.0)

Quote:
Originally Posted by JFB
Yup, dmapr explained it nicely. It is good to see those because it means the search engines are indexing all of the great info we have here.

I could exclude them from the "who's online" list, but I think it is interesting to see when they are active.

Joe, thanks. Yes, it is interesting to see when they are active. Now that I know what they are, I will be looking for them.

Steve, thanks for the additional piece of information; it made it more interesting yet.

dmapr, thanks for the link. Yes, your memory is just fine; being modest, aren't you?

Fire, aren't you glad I asked? Now we have both learnt something new today.

Last edited by Charlyee; 06-21-2007 at 10:43 PM.
Charlyee is offline
Old 06-21-2007, 10:35 PM   #9 (permalink)
Fire14

Re: JFB - Who is online question :)

Yes, I am glad, but I think my head's going to explode now.
Fire14 is offline
Old 06-21-2007, 11:30 PM   #10 (permalink)
dmapr

Re: JFB - Who is online question :)

Quote:
Originally Posted by SteveW
Sounds like your memory is fine.

Of course, there are plenty of ways to protect your site from bots in addition to robots.txt, which is essentially the honor system. You can require a login, for example, as corporate intranets do. But if you want your information available to the world, you have to accommodate bots.

Quote:
Originally Posted by Charlyee
Steve, thanks for the additional piece of information; it made it more interesting yet.

dmapr, thanks for the link. Yes, your memory is just fine; being modest, aren't you?

I worked on the other side of the bot, writing one, so we were very particular about honoring the system. When I say my memory is failing, I mean I wouldn't be able to write an honorable one without some serious refreshing.

Steve's point just triggered another memory: bot traps. Bot traps are usually obscure links that lead bots on wild goose chases and that can be hidden or disguised from a person browsing the site. For instance, you can place a number of "legitimate"-looking links on your web page and hide them from view using CSS or other attributes. Bots usually do not analyze the full HTML to work out whether a link will be visible (it would slow them down too much), so they may fall into the trap.
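A bot trap along those lines might look like the following Python sketch. Everything in it is an assumption made for illustration: the trap path `/do-not-follow/`, the helper `trap_link_html`, and the `TrapMonitor` class are hypothetical, and a real site would hook this into its web server or forum software rather than call it by hand.

```python
# Hypothetical path that no human visitor should ever request,
# because the only link to it is hidden from view.
TRAP_PATH = "/do-not-follow/"


def trap_link_html() -> str:
    """Return a link that is present in the HTML but hidden by CSS."""
    return f'<a href="{TRAP_PATH}" style="display:none">archive</a>'


class TrapMonitor:
    """Remember which clients have requested the trap path."""

    def __init__(self) -> None:
        self.flagged: set[str] = set()

    def record_request(self, client_ip: str, path: str) -> bool:
        """Log one request and report whether the client is now flagged."""
        if path.startswith(TRAP_PATH):
            self.flagged.add(client_ip)
        return client_ip in self.flagged


if __name__ == "__main__":
    monitor = TrapMonitor()
    # Documentation-range IP addresses used as stand-ins for real clients.
    print(monitor.record_request("192.0.2.1", "/forum/thread.html"))
    print(monitor.record_request("192.0.2.2", "/do-not-follow/page1"))
    print(monitor.record_request("192.0.2.2", "/forum/thread.html"))
```

If the site also lists the trap path under Disallow in robots.txt, bots that honor the file never request it, so in principle only the ones ignoring the rules get flagged.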
dmapr is online
All times are GMT -4.