Towards Autonomous Web Navigation with LLM-based Agents

dc.contributor.affiliationUniversidade de Santiago de Compostela. Centro de Investigación en Tecnoloxías Intelixentes da USC (CiTIUS)
dc.contributor.authorIzquierdo Álvarez, Mario
dc.contributor.tutorMorón Reyes, Francisco José
dc.contributor.tutorFernández Pichel, Marcos
dc.date.accessioned2026-01-12T14:14:33Z
dc.date.available2026-01-12T14:14:33Z
dc.date.issued2025-02-21
dc.descriptionTraballo Fin de Máster en Intelixencia Artificial. Curso 20024-2025
dc.description.abstractAutonomous web navigation remains a complex challenge, particularly due to the dynamic, diverse and unstructured nature of web environments. Traditional web scraping techniques, while effective, require rigid configurations tied to specific website structures, limiting their generalizability. To address these challenges, this work explores the usage of autonomous agents powered by Large Language Models for autonomous web navigation, focusing on the retrieval of academic publications from webs of preprint repositories. The proposed solution, based on hyperlink exploration, is designed as a component of a potentially broader system for AI-driven paper search assistance. It leverages a multi-agent architecture and a structured tree-traversal like approach to explore and extract relevant documents. Each agent is assigned a specific role, including relevant URL extraction, document collection, planning, presentation and quality control. The system is implemented using AutoGen, which enables flexible agent interactions and modular design. Unlike traditional web information extraction techniques, this approach generalizes navigation patterns across different websites without relying on predefined HTML selectors, allowing its usage on different websites. Experimental results are promising, demonstrating the system’s effectiveness in retrieving relevant academic content. However, challenges such as increased response times and occasional hallucinations indicate areas for refinement. Future work aims to enhance interactivity by integrating advanced form-based search capabilities, optimize retrieval efficiency, and implement more robust evaluation frameworks. These improvements could contribute to fully automated AI-driven web exploration, facilitating the development of more generalizable autonomous web navigation tools.
dc.identifier.urihttps://hdl.handle.net/10347/45053
dc.language.isoeng
dc.rightsAttribution-NonCommercial-NoDerivatives 4.0 Internationalen
dc.rights.accessRightsopen access
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/4.0/
dc.subjectLLM-based
dc.subjectWeb navigation
dc.titleTowards Autonomous Web Navigation with LLM-based Agents
dc.typemaster thesis
dspace.entity.typePublication
relation.isTutorOfPublicationad1c87f4-64b2-44aa-ab80-4709cef31dfe
relation.isTutorOfPublication.latestForDiscoveryad1c87f4-64b2-44aa-ab80-4709cef31dfe

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Mario Izquierdo.pdf
Size:
3.97 MB
Format:
Adobe Portable Document Format