CodeIgniter Forums
Web Scraping in CI4


Web Scraping in CI4 - nvl - 04-23-2024

I'm working on a CodeIgniter 4 project that uses a web scraper, specifically Goutte. Is this the right approach, or are there web scrapers better suited to CI4? And where should the scraping scripts live: in which folder or file?


RE: Web Scraping in CI4 - InsiteFX - 04-23-2024

Not sure, but this may set you on the right path.

freeCodeCamp - Web Scraping with PHP – How to Crawl Web Pages Using Open Source Tools


RE: Web Scraping in CI4 - minsk832 - 06-19-2024

You can easily use and integrate PHP libraries like

https://github.com/voku/simple_html_dom
https://github.com/spekulatius/PHPScraper

in CodeIgniter. Simply create a library class in app/Libraries, for example:

PHP Code:
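namespace App\Libraries;

// Assumes a Composer package that provides simplehtmldom\HtmlWeb is installed.
use simplehtmldom\HtmlWeb;

class HtmlDomParser
{
    // Fetch the page at $url and return it as a parsed DOM object.
    public function parseFromUrl($url)
    {
        $htmlWeb = new HtmlWeb();
        $html = $htmlWeb->load($url);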

        return $html;
    }