专题:Web Data Mining and Analysis

This cluster of papers focuses on techniques and technologies for extracting structured data from web pages, including web crawling, automatic wrapper generation, page segmentation, and mining data records. It also covers topics related to the hidden web, information retrieval, and content adaptation for different devices.
最新文献
Benchmark Test-Time Scaling of General LLM Agents

preprint Full Text OpenAlex

Resilience, Volume, and Temporal Trends Across 25 Years of the Wayback Machine

article Full Text OpenAlex

ReUseIt: Synthesizing Reusable AI Agent Workflows for Web Automation

article Full Text OpenAlex

On the Importance of Context Filtering in Retrieval-Augmented Code Completion

article Full Text OpenAlex

A Multimodal Phishing Website Detection System Using Explainable Artificial Intelligence Technologies

article Full Text OpenAlex

A-MINT: An LLM Pipeline for Automated Modeling of iPricings from SaaS Pricing Pages

book-chapter Full Text OpenAlex

Automated journalism

book-chapter Full Text OpenAlex

LLM-driven bot infiltration: protecting web surveys through prompt injections

article Full Text OpenAlex

ST-Raptor: LLM-Powered Semi-Structured Table Question Answering

article Full Text OpenAlex

Detecting AI adoption at scale: a web mining and LLM methodology

article Full Text OpenAlex

近5年高被引文献
Adapting Feature Selection Algorithms for the Classification of Chinese Texts

article Full Text OpenAlex 330 FWCI25.7659

Improving Tag-Clouds as Visual Information Retrieval Interfaces

preprint Full Text OpenAlex 317 FWCI0

Proceedings of the Sixteenth ACM International Conference on Web Search and Data Mining

paratext Full Text OpenAlex 168 FWCI0

Sentiment analysis on Twitter data integrating TextBlob and deep learning models: The case of US airline industry

article Full Text OpenAlex 151 FWCI19.4479

A guide to secondary coordination sphere editing

review Full Text OpenAlex 125 FWCI10.5507

A survey on deep learning approaches for text-to-SQL

article Full Text OpenAlex 124 FWCI21.9691

Large Language Models can Accurately Predict Searcher Preferences

article Full Text OpenAlex 118 FWCI96.2898

International Journal of Computer Science and Information Technology

paratext Full Text OpenAlex 112 FWCI0

An effective detection approach for phishing websites using URL and HTML features

article Full Text OpenAlex 108 FWCI31.8223

Learning correlation information for multi-label feature selection

article Full Text OpenAlex 104 FWCI18.3713