Close Menu
    DevStackTipsDevStackTips
    • Home
    • News & Updates
      1. Tech & Work
      2. View All

      CodeSOD: A Unique Way to Primary Key

      July 22, 2025

      BrowserStack launches Figma plugin for detecting accessibility issues in design phase

      July 22, 2025

      Parasoft brings agentic AI to service virtualization in latest release

      July 22, 2025

      Node.js vs. Python for Backend: 7 Reasons C-Level Leaders Choose Node.js Talent

      July 21, 2025

      The best CRM software with email marketing in 2025: Expert tested and reviewed

      July 22, 2025

      This multi-port car charger can power 4 gadgets at once – and it’s surprisingly cheap

      July 22, 2025

      I’m a wearables editor and here are the 7 Pixel Watch 4 rumors I’m most curious about

      July 22, 2025

      8 ways I quickly leveled up my Linux skills – and you can too

      July 22, 2025
    • Development
      1. Algorithms & Data Structures
      2. Artificial Intelligence
      3. Back-End Development
      4. Databases
      5. Front-End Development
      6. Libraries & Frameworks
      7. Machine Learning
      8. Security
      9. Software Engineering
      10. Tools & IDEs
      11. Web Design
      12. Web Development
      13. Web Security
      14. Programming Languages
        • PHP
        • JavaScript
      Featured

      The Intersection of Agile and Accessibility – A Series on Designing for Everyone

      July 22, 2025
      Recent

      The Intersection of Agile and Accessibility – A Series on Designing for Everyone

      July 22, 2025

      Zero Trust & Cybersecurity Mesh: Your Org’s Survival Guide

      July 22, 2025

      Execute Ping Commands and Get Back Structured Data in PHP

      July 22, 2025
    • Operating Systems
      1. Windows
      2. Linux
      3. macOS
      Featured

      A Tomb Raider composer has been jailed — His legacy overshadowed by $75k+ in loan fraud

      July 22, 2025
      Recent

      A Tomb Raider composer has been jailed — His legacy overshadowed by $75k+ in loan fraud

      July 22, 2025

      “I don’t think I changed his mind” — NVIDIA CEO comments on H20 AI GPU sales resuming in China following a meeting with President Trump

      July 22, 2025

      Galaxy Z Fold 7 review: Six years later — Samsung finally cracks the foldable code

      July 22, 2025
    • Learning Resources
      • Books
      • Cheatsheets
      • Tutorials & Guides
    Home»Development»How to Perform Visual Testing Using Selenium: A Detailed Guide

    How to Perform Visual Testing Using Selenium: A Detailed Guide

    June 25, 2025

    Visual testing is an essential part of UI validation, ensuring that web applications appear and function as intended across different browsers, devices, and screen resolutions. While Selenium is primarily used for functional testing, it can also be leveraged for visual regression testing with the help of additional tools and libraries.

    In this blog, we’ll explore how to perform visual testing using Selenium, covering key concepts, tools, and step-by-step implementation.

    Table of Contents

    1. What is Visual Testing?

    2. Why Use Selenium for Visual Testing?

    3. Tools for Visual Testing with Selenium

    4. Step-by-Step Guide to Perform Visual Testing

      • Prerequisites

      • Setting Up Selenium WebDriver

      • Capturing Screenshots

      • Comparing Screenshots

      • Generating Test Reports

    5. General Steps to Perform Visual Testing
    6. Best Practices for Visual Testing

    7. Conclusion


    1. What is Visual Testing?

    Visual testing (or visual regression testing) compares screenshots of a web application’s UI against baseline images to detect unintended visual changes. It helps identify issues like:

    • Layout shifts

    • Broken fonts or images

    • Overlapping elements

    • Responsive design failures

    Unlike functional testing, which verifies behavior, visual testing ensures the UI looks correct.


    2. Why Use Selenium for Visual Testing?

    Selenium WebDriver is widely used for automating browser interactions. While it doesn’t natively support visual comparisons, it can:

    • Capture screenshots of web pages.

    • Integrate with visual testing libraries (e.g., Applitools, Percy, or OpenCV).

    • Run cross-browser tests to ensure consistency.


    3. Tools for Visual Testing with Selenium

    Here are some popular tools for visual testing with Selenium:

    Tool Description
    Applitools AI-powered visual testing with automatic baseline management.
    Percy Cloud-based visual testing by BrowserStack.
    AShot Java-based screenshot comparison library.
    OpenCV Open-source computer vision library for image processing.
    SikuliX Uses image recognition for UI testing.

    We’ll use AShot (for Java) and Pillow (for Python) in this guide.


    4. Step-by-Step Guide to Perform Visual Testing

    Prerequisites 

    • Java/Python installed

    • Selenium WebDriver

    • Maven/Gradle (for Java) or pip (for Python)

    • A testing framework (JUnit/TestNG for Java, pytest for Python)


    Setting Up Selenium WebDriver

    Java (Maven)

    xml

    <dependency>
        <groupId>org.seleniumhq.selenium</groupId>
        <artifactId>selenium-java</artifactId>
        <version>4.10.0</version>
    </dependency>
    <dependency>
        <groupId>ru.yandex.qatools.ashot</groupId>
        <artifactId>ashot</artifactId>
        <version>1.5.4</version>
    </dependency>

    Python (pip)

    bash


    pip install selenium pillow opencv-python

    Capturing Screenshots

    Java (Using AShot)

    java


    import org.openqa.selenium.WebDriver;
    import org.openqa.selenium.chrome.ChromeDriver;
    import ru.yandex.qatools.ashot.AShot;
    import ru.yandex.qatools.ashot.Screenshot;
    import javax.imageio.ImageIO;
    import java.io.File;
    
    public class VisualTest {
        public static void main(String[] args) throws Exception {
            WebDriver driver = new ChromeDriver();
            driver.get("https://example.com");
            
            // Capture screenshot
            Screenshot screenshot = new AShot().takeScreenshot(driver);
            ImageIO.write(screenshot.getImage(), "PNG", new File("screenshot.png"));
            
            driver.quit();
        }
    }

    Python (Using Pillow)

    python


    from selenium import webdriver
    from PIL import Image
    
    driver = webdriver.Chrome()
    driver.get("https://example.com")
    
    # Capture screenshot
    driver.save_screenshot("screenshot.png")
    driver.quit()

    Comparing Screenshots

    Java (Using AShot)

    java


    import java.awt.image.BufferedImage;
    import javax.imageio.ImageIO;
    
    public class ImageComparator {
        public static boolean compareImages(String img1Path, String img2Path) throws Exception {
            BufferedImage img1 = ImageIO.read(new File(img1Path));
            BufferedImage img2 = ImageIO.read(new File(img2Path));
            
            if (img1.getWidth() != img2.getWidth() || img1.getHeight() != img2.getHeight()) {
                return false;
            }
            
            for (int y = 0; y < img1.getHeight(); y++) {
                for (int x = 0; x < img1.getWidth(); x++) {
                    if (img1.getRGB(x, y) != img2.getRGB(x, y)) {
                        return false;
                    }
                }
            }
            return true;
        }
    }

    Python (Using OpenCV)

    python


    import cv2
    import numpy as np
    
    def compare_images(img1_path, img2_path):
        img1 = cv2.imread(img1_path)
        img2 = cv2.imread(img2_path)
        
        if img1.shape != img2.shape:
            return False
        
        difference = cv2.subtract(img1, img2)
        return not np.any(difference)

    Generating Test Reports 

    Use testing frameworks like TestNG (Java) or pytest (Python) to log results:

    java


    @Test
    public void testVisualComparison() throws Exception {
        Assert.assertTrue(ImageComparator.compareImages("expected.png", "actual.png"));
    }

    5. General Steps When Using a Visual Testing Platform

    When using a dedicated visual testing platform (e.g., Percy, Applitools), follow these steps:

    1. Set Up Your Selenium Project

    Ensure you have a working Selenium automation framework in your preferred language (Java, Python, C#, JavaScript, etc.).

    2. Integrate the Visual Testing SDK

    Install the SDK provided by your chosen platform. Examples:

    • Python (Percy)

      bash


      pip install percy-selenium
    • JavaScript (Percy)

      bash


      npm install @percy/selenium-webdriver

    3. Capture Baselines

    The first time you run visual tests, the tool captures “baseline” screenshots (expected UI state).

    Example (Python with Percy)

    python


    from selenium import webdriver
    from percy import percy_snapshot
    
    driver = webdriver.Chrome()
    driver.get("https://your-application.com")
    percy_snapshot(driver, "Homepage - Initial State")
    
    # Perform actions
    driver.get("https://your-application.com/some-other-page")
    percy_snapshot(driver, "Another Page - After Interaction")
    driver.quit()

    4. Run Tests and Compare

    In subsequent runs, the tool compares new screenshots against baselines.

    5. Review and Approve Changes

    • Differences are highlighted in a dashboard.

    • Approve intentional changes (updates baseline).

    • Flag unintended changes as bugs.

    6. Integrate with CI/CD

    Run visual tests in pipelines (e.g., GitHub Actions, Jenkins) for continuous feedback.


    6. Best Practices for Visual Testing

    1. Strategic Snapshotting

    • Focus on critical UI components (headers, forms, key interactions) rather than capturing every element.

    • Prioritize page layouts and areas prone to visual regressions (e.g., responsive breakpoints).

    2. Handle Dynamic Content

    • Ignore/Mask dynamic elements (e.g., ads, timestamps, user-generated content) to avoid false positives.

    • Use tools like Percy’s ignoreRegions or Applitools’ ignoreDisplacements to exclude volatile areas.

    3. Maintain Baselines

    • Review baselines regularly and update them only for intentional UI changes.

    • Store baselines in version control (e.g., Git) to track historical changes.

    4. Cross-Browser & Device Testing

    • Test across multiple browsers (Chrome, Firefox, Safari) and viewport sizes (desktop, tablet, mobile).

    • Leverage cloud platforms (e.g., BrowserStack, Sauce Labs) for broader coverage.

    5. Configure Thresholds & Sensitivity

    • Adjust pixel-matching thresholds to balance between catching bugs and reducing noise.

    • Example: Set a 5% difference threshold for minor anti-aliasing changes.

    6. Component-Level Testing

    • Test isolated UI components (buttons, modals, cards) for design system consistency.

    • Tools like Storybook + Percy enable visual testing of individual components.

    Bonus: CI/CD Integration

    • Run visual tests automatically in pipelines (GitHub Actions, Jenkins) to catch regressions early.

    • Fail builds on critical visual deviations but allows manual review for minor changes.


    7. Conclusion

    Visual testing with Selenium helps ensure UI consistency across releases. While Selenium itself doesn’t support visual comparisons, integrating tools like AShot, OpenCV, or Applitools makes it possible.

    By following this guide, you can implement automated visual regression testing and catch UI bugs early in development.

    🚀 Next Steps: Try integrating visual testing into your CI/CD pipeline for seamless validation!


    Have questions? Drop them in the comments!
    #Selenium #VisualTesting #Automation #QA

    Source: Read More

    Facebook Twitter Reddit Email Copy Link
    Previous ArticleNew TeamViewer Vulnerability Puts Windows Systems at Risk of Privilege Escalation
    Next Article How to Build a Custom Visual Testing Tool with Selenium: A Practical Guide

    Related Posts

    Development

    GPT-5 is Coming: Revolutionizing Software Testing

    July 22, 2025
    Development

    Win the Accessibility Game: Combining AI with Human Judgment

    July 22, 2025
    Leave A Reply Cancel Reply

    For security, use of Google's reCAPTCHA service is required which is subject to the Google Privacy Policy and Terms of Use.

    Continue Reading

    BrowserStack launches Figma plugin for detecting accessibility issues in design phase

    Tech & Work

    CVE-2024-41502 – Jetimob Plataforma Imobiliaria XSS in Observaces Field

    Common Vulnerabilities and Exposures (CVEs)

    Xbox co-founder now leads mysterious ‘ZeroOne’ team at Amazon

    News & Updates

    Brody King Abolish ICE Shirt

    Web Development

    Highlights

    This HP laptop I tested is a dream machine for traveling professionals. Here’s why

    April 11, 2025

    HP’s EliteBook Ultra G1i is lightweight, performs reliably, and boasts a long battery life. But…

    CVE-2025-44866 – Tenda W20E Command Injection Vulnerability

    May 1, 2025

    CVE-2025-5295 – FreeFloat FTP Server Buffer Overflow Vulnerability

    May 28, 2025

    Google’s Veo 3: The Dawn of AI-Powered Hollywood-Level Moviemaking?

    May 23, 2025
    © DevStackTips 2025. All rights reserved.
    • Contact
    • Privacy Policy

    Type above and press Enter to search. Press Esc to cancel.