示例 采集百度搜索结果列表的标题和链接。
**采集代码**
$data = QueryList::get('https://www.baidu.com/s?wd=QueryList', null, [
'headers' => [
'User-Agent' => 'Mozilla/5.0 (Macintosh; Intel Mac OS X 10_15_3) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/80.0.3987.149 Safari/537.36',
'Accept-Encoding' => 'gzip, deflate, br',
]
])->rules([
'title' => ['h3', 'text'],
'link' => ['h3>a', 'href']
])
->range('.result')
->queryData();
print_r($data);
采集结果
Array
(
[0] => Array
(
[title] => QueryList|基于phpQuery的无比强大的PHP采集工具
[link] => http://www.baidu.com/link?url=GU_YbDT2IHk4ns1tjG2I8_vjmH0SCJEAPuuZN
)
[1] => Array
(
[title] => PHP 用QueryList抓取网页内容 - wb145230 - 博客园
[link] => http://www.baidu.com/link?url=zn0DXBnrvIF2ibRVW34KcRVFG1_bCdZvqvwIhUqiXaS
)
[2] => Array
(
[title] => 介绍- QueryList指导文档
[link] => http://www.baidu.com/link?url=pSypvMovqS4v2sWeQo5fDBJ4EoYhXYi0Lxx
)
//...
)
QueryList安装:https://querylist.cc/docs/guide/v4/installation
好用的工具:
- 八爪鱼
- 后羿
- 火车头
文章摘要:https://querylist.cc/docs/guide/v4/example
QueryList手册:https://querylist.cc/docs/guide/v4
评论 (0)