Hey John,
Thank you. I had figured this wasn’t the first time this came up; the
idea was hatched from thinking “there’s gotta be a better way”.
It would be complicated to implement and fraught with peril for the
uninformed user, I agree.
Don’t suppose I could persuade you guys to take a look at what I did
with my forum? Basically there are some global restrictions in the
robots and then using nofollow, limiting the “path” for the bot to the
sitemap urls generated with ( a modified ) GSitecrawler’s phpbb sample
project.
It seems to be working for the bot, but I have been there before.
Bill
On Dec 3, 3:53 am, JohnMu wrote:
> Hi Bill!
> That's something we've considered before and pretty much dropped. The
> big problem is that it's just too easy to break things completely with
> something like that. If you forget URLs in your Sitemap file or if you
> forget to update your Sitemap file or even if you forget that you have
> a Sitemap file, you could accidentally limit your site's indexing
> without knowing it. I do however agree that the problem you mentioned
> (duplicate content through URL parameters) is an important one - and
> it's one that we (and all other search engines) are always working on
> improving.
> John