This page presently lists publications to which contributions have been made, but does not include any authored publications.

2025

A Domain-Agnostic Framework for Visual Element Detection from High-Level Descriptions

Maroun Ayli, Youssef Bakouny, Nader Jalloul, Hani Seifeddine, Rima Kilany

IEEE Access 2025 Accepted Contributor

Modern web applications feature dynamic and visually complex interfaces that challenge the reliability and maintainability of traditional automated testing frameworks. These conventional approaches often rely on fixed document object model locators, such as XPath or CSS selectors, which are highly sensitive to minor user interface changes. As a result, tests frequently break, leading to increased maintenance costs and reduced accessibility for non-technical users. To address these challenges, we propose a unified and lightweight framework that combines a domain-specific restricted natural language for intuitive, high-level test specification, with a vision-language module capable of real-time user interface element detection directly from live screenshots. This eliminates the brittleness of fixed locators and enables robust test execution even under frequent user interface changes. To validate our approach, we implemented and deployed a fully functional prototype, demonstrating that th...

A Domain-Agnostic Framework for Visual Element Detection from High-Level Descriptions

Maroun Ayli, Youssef Bakouny, Nader Jalloul, Hani Seifeddine, Rima Kilany

IEEE Access 2025 Accepted Contributor

Modern web applications feature dynamic and visually complex interfaces that challenge the reliability and maintainability of traditional automated testing frameworks. These conventional approaches often rely on fixed document object model locators, such as XPath or CSS selectors, which are highly sensitive to minor user interface changes. As a result, tests frequently break, leading to increased maintenance costs and reduced accessibility for non-technical users. To address these challenges, we propose a unified and lightweight framework that combines a domain-specific restricted natural language for intuitive, high-level test specification, with a vision-language module capable of real-time user interface element detection directly from live screenshots. This eliminates the brittleness of fixed locators and enables robust test execution even under frequent user interface changes. To validate our approach, we implemented and deployed a fully functional prototype, demonstrating that th...