First, take a look at the website
Steps
- Write the code
- Chrome tip: quickly generate a CSS selector
Side note 1: configuring the Code Runner extension to print output in the terminal
Press Ctrl+Shift+P
Type settings
- Select
Open User Settings
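As a rough sketch, the same option can also be set by hand in the user `settings.json`. `code-runner.runInTerminal` is the Code Runner setting that sends program output to the integrated terminal rather than the Output panel. Whether these notes used this exact setting is an assumption.

```json
{
    "code-runner.runInTerminal": true
}
```

If the file already contains other settings, add only the `code-runner.runInTerminal` line inside the existing braces.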
Side note 2: setting the default terminal in VS Code
PHP code
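One possible way to do this, assuming Windows, is the `terminal.integrated.defaultProfile.windows` setting in the user `settings.json`. The profile name `Command Prompt` below is only an example. On Linux or macOS the key ends in `.linux` or `.osx` instead.

```json
{
    "terminal.integrated.defaultProfile.windows": "Command Prompt"
}
```

The same choice can be made interactively with the "Terminal: Select Default Profile" command from the Command Palette.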
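<?php
// QueryList is assumed to be installed via Composer: composer require jaeger/querylist
require 'vendor/autoload.php';

use QL\QueryList;

// One shared QueryList instance, reused by the functions below.
$ql = new QueryList();

// Scrape the publish date, title, and link of every post on the listing page.
function get_title_date($url){
    global $ql;
    return $ql->get($url)->rules([
        'date' => ['header > div > a > time.entry-date.published','text'],
        'title' => ['header > h2 > a','text'],
        'url'=>['header > h2 > a','href']
    ])->queryData();
}

// Fetch a single post page and return the text of its article element.
function get_content($url){
    global $ql;
    return $ql->get($url)->find('article.post.type-post.status-publish.format-standard.hentry.category-uncategorized')->text();
}

// Write each post to mymd.md: a heading, the creation date, and the content in a fenced block.
function make_markdown($content_array){
    $md_obj = fopen('mymd.md','w+');
    foreach ($content_array as $key => $value) {
        fwrite($md_obj,"## {$value['title']}\n");
        fwrite($md_obj,"Created: `{$value['date']}`\n");
        fwrite($md_obj,"```\n");
        // The stray backtick after the content has been removed so the fence closes cleanly.
        fwrite($md_obj,"{$value['content']}\n");
        fwrite($md_obj,"```\n");
    }
    fclose($md_obj);
}

// Entry point: collect the post list, attach each post's content, then write the Markdown file.
function start($url){
    $data = get_title_date($url);
    foreach ($data as $key => $value) {
        $data[$key]['content'] = get_content($value['url']);
    }
    make_markdown($data);
}

start("https://wordpress-edu-3autumn.localprod.forc.work/");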
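A small homework assignment
- Scrape the book titles and their prices under a category and save them to books.txt
- books.toscrape.com
- Final result...
Bonus tip (click to jump to a function's definition)
Next section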
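A possible starting point for the homework, offered as a sketch rather than the expected solution. It assumes QueryList is installed as in the script above, and that each book on a category page sits in an `article.product_pod` element with the full title in the link's `title` attribute and the price in `p.price_color`. The names `format_books` and `scrape_books` are hypothetical helpers, not part of QueryList.

```php
<?php
// Sketch only: assumes QueryList was installed with Composer.
require 'vendor/autoload.php';

use QL\QueryList;

// Turn scraped rows into "title<TAB>price" lines (hypothetical helper).
function format_books(array $books): string {
    $out = '';
    foreach ($books as $book) {
        $out .= "{$book['title']}\t{$book['price']}\n";
    }
    return $out;
}

// Scrape title and price for every book on one category page.
// The selectors are assumptions about the page markup.
function scrape_books(string $url): array {
    $ql = new QueryList();
    return $ql->get($url)
        ->rules([
            'title' => ['h3 > a', 'title'],
            'price' => ['p.price_color', 'text'],
        ])
        ->range('article.product_pod') // evaluate the rules once per book element
        ->queryData();
}

// Run only when a category URL is passed on the command line.
if (isset($argv[1])) {
    file_put_contents('books.txt', format_books(scrape_books($argv[1])));
}
```

Saved as, say, `books.php`, it could be run with `php books.php` followed by the URL of a category page on books.toscrape.com. Category pages that span several pages would need an extra loop, which is left out here.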