Affiliate Marketing Forum

Googlebot indexing behaviour?

Guest






Posted: Sat Jan 17, 2004 8:15 pm    Post subject: Googlebot indexing behaviour?

Hello everyone,

How does a search engine robot, e.g. Google, index a page with lots of external links, if the meta tag includes "index, follow, all"?

Will the Googlebot leave the page being indexed to follow an external link, therefore failing to index the rest of the current page, and indeed failing to index the rest of the site, or does a Googlebot follow a link and then turn round and come back to index the rest of the page and site?

Or alternatively, does it spawn multiple copies of itself and continue to index the current page and site while following the external links?

My concern is that by having external links, plus the "index, follow, all" tag for the benefit of the internal links, Googlebot might be led away from the site, which would then not be indexed. Is this correct, and if so, how can Googlebot be prevented from following external links while still following internal links and indexing the entire site?

As a general question, how does Google manage to index the entire Web? The bandwidth requirement to copy billions of pages back to its databases must be absolutely astronomical! Does it really index the entire web as it finds it, or does it just index what it can and leave the rest?

Thanks very much.
Debs



Joined: 16 Aug 2003
Posts: 4296
Location: NY

Posted: Sun Jan 18, 2004 12:02 am    Post subject: Re: Googlebot indexing behaviour?

1. Google and other SEs spider the site they are at, recording all data and links. After the spider is done with what it is "programmed to pick up" from that site, it continues to the next site in the list, and so on.

I say programmed to pick up because sometimes GB will pick up your index page, and that's it, other times it will pick up a few pages, or the whole site.
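
A rough sketch of the idea in point 1, purely as an illustration and not Google's actual code: the spider reads a whole page, records every link it finds in a to-do list, and only visits those links later. It never "leaves" a page halfway. The URLs and the in-memory FAKE_WEB table are made up for the example.

```python
from collections import deque

# Hypothetical in-memory "web": URL -> links found on that page.
FAKE_WEB = {
    "http://mysite.example/": [
        "http://mysite.example/a",
        "http://other.example/",      # external link
        "http://mysite.example/b",
    ],
    "http://mysite.example/a": ["http://mysite.example/b"],
    "http://mysite.example/b": [],
    "http://other.example/": ["http://other.example/x"],
    "http://other.example/x": [],
}


def crawl(seed):
    """Read each page completely, queue its links, visit them later."""
    frontier = deque([seed])  # URLs waiting to be fetched
    seen = {seed}             # URLs already queued, to avoid repeats
    visited = []              # order in which pages were actually read
    while frontier:
        url = frontier.popleft()
        visited.append(url)
        # Record ALL links on the page before moving on to anything else.
        for link in FAKE_WEB.get(url, []):
            if link not in seen:
                seen.add(link)
                frontier.append(link)
    return visited


for page in crawl("http://mysite.example/"):
    print(page)
```

In this toy version the external link is simply one more entry in the list, so the internal pages /a and /b still get visited.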

2. Astronomical servers? Absolutely. Having the technology to do such indexing requires phenomenal resources we ordinary folk can only dream about ;)

Googlebot, like most major SE spiders, has a "restrainer" built in so it doesn't hit a site and pull pages so fast that it overloads the server it is indexing.
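
How the real "restrainer" works is not public, so treat the following as a guess at the general technique only: keep track of when each host was last hit and wait until a minimum gap has passed. The class name PoliteFetcher and the 2 second gap are invented for the example.

```python
import time


class PoliteFetcher:
    """Enforces a minimum gap between requests to the same host."""

    def __init__(self, min_delay_seconds, clock=time.monotonic, sleep=time.sleep):
        self.min_delay = min_delay_seconds
        self.clock = clock    # injectable so the logic can be tested
        self.sleep = sleep
        self.last_hit = {}    # host -> time of the previous request

    def wait_turn(self, host):
        """Block until it is polite to hit `host`; return seconds waited."""
        now = self.clock()
        waited = 0.0
        last = self.last_hit.get(host)
        if last is not None:
            remaining = self.min_delay - (now - last)
            if remaining > 0:
                self.sleep(remaining)
                waited = remaining
                now = self.clock()
        self.last_hit[host] = now
        return waited


# Hypothetical usage: at most one request every 2 seconds per host.
fetcher = PoliteFetcher(min_delay_seconds=2.0)
fetcher.wait_turn("mysite.example")   # first hit, no wait
```

A second call for the same host straight away would sleep for roughly the remaining part of the 2 seconds, while a call for a different host would not wait at all.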

Sometimes, having a dynamic site, or a very large content site, can cause a bot to hang around your site so long your host might shut down access to your site for going over your bandwidth limit for the month!

Debs
_________________
Learn how to turn keyphrases into quality, well-targeted articles your visitors and SE's will love with Gary Antosh's new ebook "Web Content Made Easy!"
Guest






Posted: Sun Jan 18, 2004 12:25 am

Thanks Debs!

Does the meta tag: <meta name="robots" content="index, follow, all">

have any real value? Should it be included or excluded? Should "robots.txt" be used instead?

Thanks again.
Sean Burns



Joined: 11 Oct 2003
Posts: 232
Location: Sydney

Posted: Sun Jan 18, 2004 12:31 am

That meta tag doesn't do anything. A crawler will index a page and follow all links by default.

You only really need to use that tag or robots.txt if you want to stop them from crawling part of your site.
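
To make the "by default" point concrete, here is a small sketch of how a crawler might read the robots meta tag. It is an illustration under my own assumptions (the names RobotsMetaParser and robots_policy are made up), not how any particular search engine does it. The policy starts as index + follow and only changes when a restrictive value such as noindex, nofollow or none shows up.

```python
from html.parser import HTMLParser


class RobotsMetaParser(HTMLParser):
    """Collects the index/follow policy from <meta name="robots"> tags."""

    def __init__(self):
        super().__init__()
        self.index = True    # default: page may be indexed
        self.follow = True   # default: links may be followed

    def handle_starttag(self, tag, attrs):
        if tag != "meta":
            return
        attrs = dict(attrs)
        if (attrs.get("name") or "").lower() != "robots":
            return
        for directive in (attrs.get("content") or "").lower().split(","):
            directive = directive.strip()
            if directive in ("noindex", "none"):
                self.index = False
            if directive in ("nofollow", "none"):
                self.follow = False
            # "index", "follow" and "all" just restate the defaults.


def robots_policy(html):
    """Return (may_index, may_follow) for a page's HTML."""
    parser = RobotsMetaParser()
    parser.feed(html)
    return parser.index, parser.follow


print(robots_policy('<meta name="robots" content="index, follow, all">'))
print(robots_policy("<html><head></head></html>"))
print(robots_policy('<meta name="robots" content="noindex, nofollow">'))
```

The first two calls give the same answer, which is why the "index, follow, all" tag adds nothing.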

Cheers

Sean Burns
edburdo



Joined: 14 Jul 2003
Posts: 1761
Location: Bangor, Maine

Posted: Sun Jan 18, 2004 1:14 pm

To give an example of what Sean mentions...

Say you have a Members only portion... you can set the Robot to ignore that. Or your stats directory, or your autoresponder directory...

Or just a junk directory for testing layouts/code...
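
As a sketch of what that could look like, here is a robots.txt that blocks those kinds of directories, checked with Python's standard urllib.robotparser module. The directory names (/members/, /stats/, /autoresponder/, /junk/) and the mysite.example domain are placeholders, so substitute your own.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt; the directory names are examples only.
ROBOTS_TXT = """\
User-agent: *
Disallow: /members/
Disallow: /stats/
Disallow: /autoresponder/
Disallow: /junk/
"""


def make_parser(robots_txt):
    """Build a parser from robots.txt text without touching the network."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser


parser = make_parser(ROBOTS_TXT)

# Ask whether a well-behaved crawler would be allowed to fetch each URL.
for url in (
    "http://mysite.example/index.html",
    "http://mysite.example/members/login.html",
    "http://mysite.example/junk/test-layout.html",
):
    print(url, parser.can_fetch("Googlebot", url))
```

Anything not listed under Disallow stays crawlable, so the rest of the site is unaffected. Keep in mind that robots.txt is only a request that polite robots honour, so it is no substitute for password protection on a members area.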
_________________
Eric D. Burdo
They Made $6,513 a day With Clickbank Doing This...
Gamezing



Joined: 30 Oct 2003
Posts: 66

Posted: Sun Jan 18, 2004 7:49 pm

I think a lot of people think of Googlebot as this one robot that goes out and does all the work, instead of the numerous spiders they have all working out there at once, hehe.

I have this odd picture in my head of the googlebot coming and all the natives bowing down to it lol
Larry Chamberlain



Joined: 01 Aug 2003
Posts: 1126
Location: London, England

Posted: Sun Jan 18, 2004 10:16 pm

Quote:
I have this odd picture in my head of the googlebot coming and all the natives bowing down to it lol


This native would bow down to it and kiss its butt to get top ranking :D

All the best,
Larry Chamberlain
_________________
Why Do Most Affiliates Make Less Than $500 Per Month?
All The Tools = Business Success.
All times are GMT





Your host: Allan Gardyne.
Earning a good living from affiliate programs since 1998.