Meta Robots 详解

2025-05-12 27

Bing 管理员工具

robots.txt利用

百度收录资源平台

robots.txt 生成

robots.txt sitemap

robots.txt文件

robots.txt 语法

百度收录解析与操作指南

<meta name="robots"> 是 HTML 的元标签，用于控制搜索引擎爬虫如何索引和跟踪网页内容。

在网页的 <head> 部分添加以下代码：

<meta name="robots" content="指令1,指令2">

其中 content 属性可包含多个指令，用逗号分隔。

指令	作用	示例
`index`	允许搜索引擎索引该页面	`<meta name="robots" content="index">`
`noindex`	禁止搜索引擎索引该页面	`<meta name="robots" content="noindex">`
`follow`	允许爬虫跟踪页面上的链接	`<meta name="robots" content="follow">`
`nofollow`	禁止爬虫跟踪页面上的链接	`<meta name="robots" content="nofollow">`
`none`	等同于 `noindex, nofollow`	`<meta name="robots" content="none">`
`noarchive`	禁止搜索引擎缓存页面快照	`<meta name="robots" content="noarchive">`
`nosnippet`	禁止在搜索结果中显示摘要	`<meta name="robots" content="nosnippet">`
`notranslate`	禁止自动翻译该页面	`<meta name="robots" content="notranslate">`
`noimageindex`	禁止索引页面上的图片	`<meta name="robots" content="noimageindex">`
`unavailable_after:[date]`	在指定日期后停止索引	`<meta name="robots" content="unavailable_after: 31-Dec-2024">`

<meta name="robots" content="index,follow">
<!-- 或直接省略，搜索引擎默认会索引和跟踪 -->

<meta name="robots" content="noindex,follow">
<!-- 适用于不想被收录但希望传递权重的页面 -->

<meta name="robots" content="index,nofollow">
<!-- 适用于允许收录但不想传递权重的页面 -->

<meta name="robots" content="noindex,nofollow">
<!-- 或简写为 -->
<meta name="robots" content="none">

<meta name="robots" content="noarchive,nosnippet">
<!-- 适用于敏感内容 -->

✅ 禁止收录登录页、隐私政策页（noindex）
✅ 允许收录但阻止权重传递（index,nofollow）
✅ 禁止缓存敏感内容（noarchive）
✅ 设置页面过期时间（unavailable_after）

这样设置后，搜索引擎会按照你的指令处理网页。