Software #web scraping#dom pruning
New AI Framework Co-Scraper Achieves 94.78% Accuracy for Web Data Extraction with Reusable Scrapers
Researchers introduced Co-Scraper, a two-stage framework for automated web data extraction that integrates query-aware DOM pruning with a fine-tuned Qwen3-8B model. On the SWDE test set, it achieved an F1 score of 94.78% and a reuse success rate of 90.39%, enabling lightweight, reusable scrapers for heterogeneous web content.
Jun 16, 2026 1 source