From: wreilly
Date: Mon, 1 Dec 2008 08:22:10 -0800 (PST)
Subject: Sitemap Protocol suggestion
Not sure if this belongs here or in the Sitemap Protocol section.

What do you guys think of adding a new directive to the sitemap?
Something like <index_exclusive>url</index_exclusive>.

Including this in the sitemap would direct the bot to only index the
pages listed and ignore any other internal links found on the pages
listed.
It would have to be understood by the user that including this
directive would mean they are taking responsibility for telling the
bot explicitly which URLs to index and that anything left out would
not be indexed.
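
To make the proposal concrete, here is a minimal Python sketch of how a crawler might honor such a directive. Everything in it is an assumption for illustration: `index_exclusive` is not part of the Sitemap protocol, its placement directly under `<urlset>` is a guess, and `urls_to_index` and the example.com URLs are hypothetical.

```python
import xml.etree.ElementTree as ET

# Namespace used by standard Sitemap files.
NS = "{http://www.sitemaps.org/schemas/sitemap/0.9}"

# Hypothetical sitemap: <index_exclusive> is the proposed, non-standard element.
SITEMAP = """<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
  <index_exclusive>url</index_exclusive>
  <url><loc>http://example.com/forum/viewtopic.php?t=1</loc></url>
  <url><loc>http://example.com/forum/viewtopic.php?t=2</loc></url>
</urlset>"""


def urls_to_index(sitemap_xml, discovered_links):
    """Return the URLs a crawler would index under the proposed rule.

    If the sitemap carries <index_exclusive>, only the listed URLs are kept
    and links discovered on the pages are ignored. Otherwise both are kept.
    """
    root = ET.fromstring(sitemap_xml)
    listed = {loc.text.strip() for loc in root.iter(NS + "loc")}
    exclusive = root.find(NS + "index_exclusive") is not None
    if exclusive:
        return sorted(listed)
    return sorted(listed | set(discovered_links))


# A duplicate view of topic 1 that the bot found by following a link.
discovered = ["http://example.com/forum/viewtopic.php?p=5#5"]
print(urls_to_index(SITEMAP, discovered))
```

With the directive present, the link the bot discovered on its own is dropped; without it, the link would be indexed alongside the listed URLs.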

I came up with this (“bright idea”) while trying to use nofollow to
eliminate the duplication “errors” that Webmaster Tools (WMT) reports
for a phpBB forum, but this would apply to almost any forum or blog.

So rather than peppering nofollow all over the place, one entry in
the sitemap would direct the bot to ignore all of the different ways
provided to the user to jump to content and eliminate the duplicate
Title and Meta Tag “errors”.

Bill


From: wreilly
Date: Mon, 1 Dec 2008 13:09:24 -0800 (PST)
Subject: Re: Sitemap Protocol suggestion
Thud ; -))

From: JohnMu (Google employee)
Date: Wed, 3 Dec 2008 01:53:45 -0800 (PST)
Subject: Re: Sitemap Protocol suggestion
Hi Bill!

That's something we've considered before and pretty much dropped. The
big problem is that it's just too easy to break things completely with
something like that. If you forget URLs in your Sitemap file or if you
forget to update your Sitemap file or even if you forget that you have
a Sitemap file, you could accidentally limit your site's indexing
without knowing it. I do however agree that the problem you mentioned
(duplicate content through URL parameters) is an important one - and
it's one that we (and all other search engines) are always working on
improving.
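
A minimal Python sketch of that failure mode, assuming a hypothetical exclusive directive like the one proposed. This is only an illustration, not how any search engine actually works; `silently_dropped` and the example.com URLs are made up.

```python
def silently_dropped(live_pages, sitemap_urls):
    """Pages that exist on the site but would never be indexed
    if the crawler trusted only the Sitemap file."""
    return sorted(set(live_pages) - set(sitemap_urls))


# Hypothetical case: the Sitemap file was generated a while ago,
# and two topics have been posted since then.
sitemap_urls = [
    "http://example.com/forum/viewtopic.php?t=1",
    "http://example.com/forum/viewtopic.php?t=2",
]
live_pages = sitemap_urls + [
    "http://example.com/forum/viewtopic.php?t=3",
    "http://example.com/forum/viewtopic.php?t=4",
]

print(silently_dropped(live_pages, sitemap_urls))
```

Nothing in the stale file signals a problem; the newer topics simply never appear in the index.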

John


From: wreilly
Date: Wed, 3 Dec 2008 03:53:18 -0800 (PST)
Subject: Re: Sitemap Protocol suggestion
Hey John,

Thank you. I had figured this wasn’t the first time this came up; the
idea was hatched from thinking “there’s gotta be a better way”.

It would be complicated to implement and fraught with peril for the
uninformed user, I agree.

Don’t suppose I could persuade you guys to take a look at what I did
with my forum? Basically there are some global restrictions in
robots.txt, and nofollow then limits the “path” for the bot to the
sitemap URLs generated with (a modified) GSiteCrawler’s phpBB sample
project.
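
One way to sanity-check a setup like this is to confirm that none of the sitemap URLs are themselves blocked by the robots.txt restrictions. The sketch below uses Python's standard `urllib.robotparser`; the rules and URLs are hypothetical stand-ins, since the actual robots.txt is not shown in this thread, and `blocked_sitemap_urls` is a made-up helper name.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt rules; the real file is not shown in this thread.
ROBOTS_TXT = """\
User-agent: *
Disallow: /forum/search.php
Disallow: /forum/memberlist.php
Disallow: /forum/posting.php
"""


def blocked_sitemap_urls(robots_txt, sitemap_urls, agent="Googlebot"):
    """Return the sitemap URLs that the given robots.txt would block."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return [url for url in sitemap_urls if not parser.can_fetch(agent, url)]


# Hypothetical sitemap entries: one topic page and one search page.
sitemap_urls = [
    "http://example.com/forum/viewtopic.php?t=1",
    "http://example.com/forum/search.php?search_id=newposts",
]

print(blocked_sitemap_urls(ROBOTS_TXT, sitemap_urls))
```

Any URL this reports is listed in the sitemap but disallowed for the bot, which usually means the two files disagree.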

It seems to be working for the bot, but I have been there before.

Bill
