Towards Autonomous Web Navigation with LLM-based Agents
Loading...
Identifiers
Publication date
Authors
Advisors
Editors
Journal Title
Journal ISSN
Volume Title
Publisher
Abstract
Autonomous web navigation remains a complex
challenge, particularly due to the dynamic, diverse and unstructured nature of web environments. Traditional web scraping techniques, while effective, require rigid configurations tied to specific
website structures, limiting their generalizability. To address
these challenges, this work explores the usage of autonomous
agents powered by Large Language Models for autonomous web
navigation, focusing on the retrieval of academic publications
from webs of preprint repositories.
The proposed solution, based on hyperlink exploration, is
designed as a component of a potentially broader system for
AI-driven paper search assistance. It leverages a multi-agent
architecture and a structured tree-traversal like approach to
explore and extract relevant documents. Each agent is assigned
a specific role, including relevant URL extraction, document
collection, planning, presentation and quality control. The system
is implemented using AutoGen, which enables flexible agent
interactions and modular design. Unlike traditional web information extraction techniques, this approach generalizes navigation
patterns across different websites without relying on predefined
HTML selectors, allowing its usage on different websites.
Experimental results are promising, demonstrating the system’s effectiveness in retrieving relevant academic content. However, challenges such as increased response times and occasional
hallucinations indicate areas for refinement. Future work aims
to enhance interactivity by integrating advanced form-based
search capabilities, optimize retrieval efficiency, and implement
more robust evaluation frameworks. These improvements could
contribute to fully automated AI-driven web exploration, facilitating the development of more generalizable autonomous web
navigation tools.
Description
Traballo Fin de Máster en Intelixencia Artificial. Curso 20024-2025
Keywords
Bibliographic citation
Relation
Has part
Has version
Is based on
Is part of
Is referenced by
Is version of
Requires
Sponsors
Rights
Attribution-NonCommercial-NoDerivatives 4.0 International








