修改:/e/class/connect.php文件,在该文件最上面加上以下函数:
//获取https链接内容 function getHTTPS($url) { $ch = curl_init(); curl_setopt($ch, CURLOPT_SSL_VERIFYPEER, FALSE); curl_setopt($ch, CURLOPT_HEADER, false); curl_setopt($ch, CURLOPT_FOLLOWLOCATION, true); curl_setopt($ch, CURLOPT_URL, $url); curl_setopt($ch, CURLOPT_REFERER, $url); curl_setopt($ch, CURLOPT_RETURNTRANSFER, TRUE); $result = curl_exec($ch); curl_close($ch); return $result; }
找到ReadFiletext函数如下代码:
function ReadFiletext($filepath){ $filepath=trim($filepath); $htmlfp=@fopen($filepath,"r"); //远程 if(strstr($filepath,"://")) { while($data=@fread($htmlfp,500000)) { $string.=$data; } } //本地 else { $string=@fread($htmlfp,@filesize($filepath)); } @fclose($htmlfp); return $string; }
改成:
function ReadFiletext($filepath){ $filepath=trim($filepath); $htmlfp=@fopen($filepath,"r"); //远程 if(strstr($filepath,"https://")){ return getHTTPS($filepath); } if(strstr($filepath,"://")) { while($data=@fread($htmlfp,500000)) { $string.=$data; } } //本地 else { $string=@fread($htmlfp,@filesize($filepath)); } @fclose($htmlfp); return $string; }
自此可实现采集https开头的网页链接
- 帝国cms自带采集和火车头采集器哪个更好用 [2024-04-25]
- [Chrome浏览器插件]anypicker可视化爬虫采集插件 [2024-04-22]
- TTC线报网实时自动采集程序源码,带模板和采集器 [2024-01-16]
- 超强站群蜘蛛池+采集系统源码一键安装版v9.0 [2024-01-11]
- GoFilm在线影视网站源码,多播放源自动采集,Vue+Gin开发 [2023-12-31]