异常代码 (Exception Code) Blog

robots.txt settings for WordPress

November 10, 2009. Category: Common Resources.

User-agent: *
Disallow: /cgi-bin
Disallow: /wp-admin
Disallow: /wp-includes
Disallow: /wp-content/plugins
Disallow: /wp-content/cache
Disallow: /wp-content/themes
Disallow: /trackback
Disallow: /feed
Disallow: /comments
Disallow: /category/*/*
Disallow: */trackback
Disallow: */feed
Disallow: */comments
Disallow: /*?*
Disallow: /*?
Allow: /wp-content/uploads

# Google Image
User-agent: Googlebot-Image
Disallow:
Allow: /*

# Google AdSense
User-agent: Mediapartners-Google*
Disallow:
Allow: /*

# Internet Archiver Wayback Machine
User-agent: ia_archiver
Disallow: /

# digg mirror
User-agent: duggmirror
Disallow: /
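The listing above contains several User-agent groups, and a crawler only obeys the one that matches its name. The sketch below illustrates that selection on a shortened copy of the file. The function names `parse_groups` and `select_group` are hypothetical, and the matching rule (longest case-insensitive token contained in the crawler name, falling back to `*`, with a trailing `*` on a token ignored) is an assumption; real crawlers may differ in the details.

```python
# Shortened copy of the robots.txt above, enough to show group selection.
ROBOTS_TXT = """\
User-agent: *
Disallow: /wp-admin

User-agent: Googlebot-Image
Disallow:
Allow: /*

User-agent: Mediapartners-Google*
Disallow:
Allow: /*

User-agent: ia_archiver
Disallow: /
"""


def parse_groups(text):
    """Map each User-agent token (lowercased) to its list of (field, value) rules.

    Simplification: every User-agent line starts a new group.
    """
    groups = {}
    current = None
    for raw in text.splitlines():
        line = raw.split("#", 1)[0].strip()  # drop comments
        if not line or ":" not in line:
            continue
        field, value = (part.strip() for part in line.split(":", 1))
        field = field.lower()
        if field == "user-agent":
            # Assumption: a trailing "*" on a token is ignored.
            current = value.lower().rstrip("*") or "*"
            groups.setdefault(current, [])
        elif field in ("allow", "disallow") and current is not None:
            groups[current].append((field, value))
    return groups


def select_group(groups, crawler_name):
    """Return the most specific matching token, or "*" as the fallback."""
    name = crawler_name.lower()
    candidates = [t for t in groups if t != "*" and t in name]
    if candidates:
        return max(candidates, key=len)
    return "*" if "*" in groups else None


groups = parse_groups(ROBOTS_TXT)
for bot in ("Googlebot-Image", "Mediapartners-Google", "ia_archiver", "Bingbot"):
    print(bot, "->", select_group(groups, bot))
```

A crawler with no group of its own, such as the Bingbot name used here, falls through to the `*` group and is subject to the general rules.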

Notes on each element of the robots.txt file:

User-agent: *
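Disallow: /cgi-bin  # default directory on the hosting account
Disallow: /wp-admin  # admin back end
Disallow: /wp-includes  # WordPress core files
Disallow: /wp-content/plugins  # plugins
Disallow: /wp-content/cache  # cache
Disallow: /wp-content/themes  # theme templates
Disallow: /trackback  # trackback path
Disallow: /feed  # feed path (avoids duplicating the content pages)
Disallow: /comments  # comments path (avoids duplicating the content pages)
Disallow: /category/*/*  # subcategory paths (avoids duplicating the parent category "/*")
Disallow: */trackback  # trackback path
Disallow: */feed  # feed path
Disallow: */comments  # comments path
Disallow: /*?*  # dynamic pages (URLs with a query string)
Disallow: /*?  # dynamic pages (URLs with a query string)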
Allow: /wp-content/uploads

# Google Image: crawler for Google image search
User-agent: Googlebot-Image
Disallow:
Allow: /*

# Google AdSense: crawler for AdSense
User-agent: Mediapartners-Google*
Disallow:
Allow: /*

# Internet Archiver Wayback Machine (prevents the archive tool from crawling and caching the site's pages from different periods)
User-agent: ia_archiver
Disallow: /

# digg mirror (crawler that mirrors pages submitted to digg)
User-agent: duggmirror
Disallow: /

# The sitemap goes on its own line; it can also be placed at the very top of robots.txt.
Sitemap: http://www.xxx.com/sitemap.xml
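To see which URLs the `User-agent: *` rules above actually block, here is a minimal sketch of a path matcher. It assumes Google-style semantics: `*` matches any run of characters, a trailing `$` anchors the end, rules are prefix matches, the longest matching pattern wins, and Allow wins a tie. Not every crawler implements wildcards this way. The names `pattern_to_regex` and `is_allowed` and the sample paths are hypothetical.

```python
import re

# The rules of the "User-agent: *" group, in file order.
RULES = [
    ("disallow", "/cgi-bin"),
    ("disallow", "/wp-admin"),
    ("disallow", "/wp-includes"),
    ("disallow", "/wp-content/plugins"),
    ("disallow", "/wp-content/cache"),
    ("disallow", "/wp-content/themes"),
    ("disallow", "/trackback"),
    ("disallow", "/feed"),
    ("disallow", "/comments"),
    ("disallow", "/category/*/*"),
    ("disallow", "*/trackback"),
    ("disallow", "*/feed"),
    ("disallow", "*/comments"),
    ("disallow", "/*?*"),
    ("disallow", "/*?"),
    ("allow", "/wp-content/uploads"),
]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a regex anchored at the start."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape literal pieces and join them with ".*" where "*" appeared.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(rules, path: str) -> bool:
    """Longest matching pattern decides; Allow wins ties; no match means allowed."""
    best_len, best_allow = -1, True
    for kind, pattern in rules:
        if not pattern:
            continue  # an empty Disallow matches nothing
        if pattern_to_regex(pattern).match(path):
            allow = kind == "allow"
            if len(pattern) > best_len or (len(pattern) == best_len and allow):
                best_len, best_allow = len(pattern), allow
    return best_allow


for path in (
    "/2009/11/robots/",                   # ordinary post
    "/2009/11/robots/feed",               # per-post feed
    "/?p=123",                            # dynamic URL
    "/wp-content/uploads/2009/11/a.png",  # uploaded image
    "/category/news/tech",                # subcategory
    "/category/news",                     # top-level category
):
    print(path, "allowed" if is_allowed(RULES, path) else "blocked")
```

Under these semantics `/category/news` stays crawlable while `/category/news/tech` is blocked, which matches the stated intent. A top-level category URL with a trailing slash, such as `/category/news/`, also matches `/category/*/*`, so it is worth testing the category URLs a site really uses.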
