A Spider Webbot Crawler Based on PHP/CURL/CodeIgniter [4]: Using remove() to Strip Unwanted Text


$uncommented_page = remove($web_page, "<!--", "-->");             // remove HTML comments
$links_removed = remove($web_page, "<a", "</a>");                 // remove hyperlinks
$images_removed = remove($web_page, "<img", ">");                 // remove images
$javascript_removed = remove($web_page, "<script", "</script>");  // remove embedded JavaScript

//

$removed = remove($web_page, " - T", "XT全文下载");  // remove everything from " - T" through "XT全文下载"
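
To make the behaviour of these calls concrete, here is a minimal sketch of a function that deletes every span between an opening and a closing delimiter, delimiters included. The name `remove_between()` and its case-insensitive matching are assumptions made for illustration; this is not the source of the `remove()` used above, which may differ in detail.

```php
<?php
// Hypothetical helper (an assumption, not the library's remove()):
// deletes every span that starts with $open and ends with $close,
// including both delimiters. Matching is case-insensitive.
function remove_between(string $text, string $open, string $close): string
{
    if ($open === '' || $close === '') {
        return $text; // an empty delimiter cannot be matched sensibly
    }
    $offset = 0;
    while (($start = stripos($text, $open, $offset)) !== false) {
        // Look for the closing delimiter only after the opening one.
        $end = stripos($text, $close, $start + strlen($open));
        if ($end === false) {
            break; // unterminated span: leave the remainder untouched
        }
        // Cut out the span and resume scanning at the cut position.
        $text = substr($text, 0, $start) . substr($text, $end + strlen($close));
        $offset = $start;
    }
    return $text;
}

$web_page = 'Intro<!-- note --> <a href="/x">link</a> <img src="a.png"> end';
echo remove_between($web_page, '<!--', '-->'), "\n";
echo remove_between($web_page, '<a', '</a>'), "\n";
```

Because the match is a plain substring search, an opening delimiter such as "<a" would also match tags like "<abbr"; a stricter delimiter (for example "<a ") may be safer on pages that use such tags.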