Close Menu
    DevStackTipsDevStackTips
    • Home
    • News & Updates
      1. Tech & Work
      2. View All

      CodeSOD: A Unique Way to Primary Key

      July 22, 2025

      BrowserStack launches Figma plugin for detecting accessibility issues in design phase

      July 22, 2025

      Parasoft brings agentic AI to service virtualization in latest release

      July 22, 2025

      Node.js vs. Python for Backend: 7 Reasons C-Level Leaders Choose Node.js Talent

      July 21, 2025

      The best CRM software with email marketing in 2025: Expert tested and reviewed

      July 22, 2025

      This multi-port car charger can power 4 gadgets at once – and it’s surprisingly cheap

      July 22, 2025

      I’m a wearables editor and here are the 7 Pixel Watch 4 rumors I’m most curious about

      July 22, 2025

      8 ways I quickly leveled up my Linux skills – and you can too

      July 22, 2025
    • Development
      1. Algorithms & Data Structures
      2. Artificial Intelligence
      3. Back-End Development
      4. Databases
      5. Front-End Development
      6. Libraries & Frameworks
      7. Machine Learning
      8. Security
      9. Software Engineering
      10. Tools & IDEs
      11. Web Design
      12. Web Development
      13. Web Security
      14. Programming Languages
        • PHP
        • JavaScript
      Featured

      The Intersection of Agile and Accessibility – A Series on Designing for Everyone

      July 22, 2025
      Recent

      The Intersection of Agile and Accessibility – A Series on Designing for Everyone

      July 22, 2025

      Zero Trust & Cybersecurity Mesh: Your Org’s Survival Guide

      July 22, 2025

      Execute Ping Commands and Get Back Structured Data in PHP

      July 22, 2025
    • Operating Systems
      1. Windows
      2. Linux
      3. macOS
      Featured

      A Tomb Raider composer has been jailed — His legacy overshadowed by $75k+ in loan fraud

      July 22, 2025
      Recent

      A Tomb Raider composer has been jailed — His legacy overshadowed by $75k+ in loan fraud

      July 22, 2025

      “I don’t think I changed his mind” — NVIDIA CEO comments on H20 AI GPU sales resuming in China following a meeting with President Trump

      July 22, 2025

      Galaxy Z Fold 7 review: Six years later — Samsung finally cracks the foldable code

      July 22, 2025
    • Learning Resources
      • Books
      • Cheatsheets
      • Tutorials & Guides
    Home»Development»Machine Learning»Meet Amazon Nova Act: An AI Agent that can Automate Web Tasks

    Meet Amazon Nova Act: An AI Agent that can Automate Web Tasks

    April 2, 2025

    Amazon has revealed a new artificial intelligence (AI) model called Amazon Nova Act. This AI agent is designed to operate and take actions within a web browser, automating tasks like filling out forms, navigating interfaces, and handling popups. Think of it as an assistant working directly on websites. Amazon has also released Nova Act SDK, which lets developers experiment with the technology. Developers can create agents to handle simple online tasks.

    Current Status of AI Agents

    AI agents mostly talk or find information, responding in natural language or searching knowledge bases. According to Amazon, they envision AI agents being able to complete tasks in digital environments for users.

    However, agentic AI technology is still developing, meaning most AI agents rely heavily on existing application programming interfaces (APIs). Most real-world tasks lack comprehensive APIs, limiting what current agents can achieve reliably.

    Amazon hopes agents will eventually manage complex, multi-step jobs, such as planning large events or handling IT support tasks. Currently, AI agents still need constant human guidance and checking, making them less practical for truly independent work.

    What is Amazon Nova Act? Key Features and Functions

    Amazon Nova Act is an AI agent that can control and perform tasks within a web browser. This new AI model is trained to complete tasks in a web browser using simple commands. It is available as a research preview through the Nova Act SDK. The tool allows agents to handle tasks like scheduling and email management. It is designed to complete real-world tasks without human intervention at every step.

    Here are some features and functions:

    • Web Action Focus: Amazon Nova Act is trained specifically to operate and interact with web browser elements.
    • Developer SDK: A research preview SDK allows developers to build and test AI agent prototypes.
    • Task Automation: The goal is to automate simple browser tasks. This includes filling out forms or managing calendar entries. It can also handle tasks like ordering items online.
    • Atomic Commands: The SDK helps break down complex processes. It uses reliable basic commands like ‘search’ or ‘checkout.’
    • Detailed Instructions: Developers can add specific guidance to commands. For example, instructing the agent to decline optional add-ons.
    • API and Code Integration: The system allows calling external APIs, meaning developers can also insert Python code for checks or custom logic.
    • Reliability Emphasis: Amazon focused on high accuracy for tricky web elements. These include date pickers, dropdown menus, and pop-up windows. Internal tests show strong performance here.
    • Background Operation: AI agents can run without direct observation once set up using Amazon Nova Act. They can operate headlessly or on a schedule.
    • Cross-Environment Potential: Early tests suggest Nova Act can apply its interface understanding to new areas. Surprisingly, this includes environments like web-based games.

    Amazon stresses that Nova Act prioritizes reliability for foundational actions. Amazon is focused on targeting over 90% success on internal tests for specific web interactions. This focus means that built agents should work consistently once configured.

    Amazon Nova Act AI agent has claimed strong results on benchmarks measuring direct web control ability. The browser-based AI agent performs well against competitors in specific interaction tests. However, it hasn’t been compared using all common AI agent evaluations yet.

    Challenges to Autonomous AI Agent Workflow

    The main challenge for all AI agents is consistency. Early AI systems often prove slow or error-prone, and they struggle with tasks humans find simple. Amazon hopes its focus on reliable building blocks will offer an advantage. The true test will be how Nova Act performs in real-world developer applications.

    Conclusion

    Amazon Nova Act clearly shows Amazon’s step and move into the AI agent domain. Its emphasis on reliable task components addresses a key weakness in current agent technology. Amazon hopes to encourage practical applications by providing developers with tools to create AI agents to automate browser tasks. This release from Amazon intensified competition in agentic AI workflow automation and its potential impact on productivity. A truly autonomous AI agent needs to sustain consistent performance; only then will true workflow automation be achieved.


    Check out the Technical details and Try it here. All credit for this research goes to the researchers of this project. Also, feel free to follow us on Twitter and don’t forget to join our 85k+ ML SubReddit.

    🔥 [Register Now] miniCON Virtual Conference on OPEN SOURCE AI: FREE REGISTRATION + Certificate of Attendance + 3 Hour Short Event (April 12, 9 am- 12 pm PST) + Hands on Workshop [Sponsored]

    The post Meet Amazon Nova Act: An AI Agent that can Automate Web Tasks appeared first on MarkTechPost.

    Source: Read More 

    Facebook Twitter Reddit Email Copy Link
    Previous ArticleA Comprehensive Guide to LLM Routing: Tools and Frameworks
    Next Article DeltaProduct: An AI Method that Balances Expressivity and Efficiency of the Recurrence Computation, Improving State-Tracking in Linear Recurrent Neural Networks

    Related Posts

    Machine Learning

    How to Evaluate Jailbreak Methods: A Case Study with the StrongREJECT Benchmark

    July 22, 2025
    Machine Learning

    Boolformer: Symbolic Regression of Logic Functions with Transformers

    July 22, 2025
    Leave A Reply Cancel Reply

    For security, use of Google's reCAPTCHA service is required which is subject to the Google Privacy Policy and Terms of Use.

    Continue Reading

    Microsoft Edge gets Copilot AI based New Tab Page, ditches MSN on Windows 11

    Operating Systems

    “I wish everybody could play my game.” The Outer Worlds 2 director doesn’t like its $80 price tag either, and says it was an Xbox decision

    News & Updates

    Tribblix is an illumos-based operating system with a retro style

    Linux

    AI Thumbnails Are Ruining Fortnite Discovery, But Epic Doesn’t Care

    Artificial Intelligence

    Highlights

    Run Multiple AI Coding Agents in Parallel with Container-Use from Dagger

    June 12, 2025

    In AI-driven development, coding agents have become indispensable collaborators. These autonomous or semi-autonomous tools can…

    I’ve never seen an Android phone that does everything that this one can (including night vision)

    May 6, 2025

    NVIDIA’s leaked APU could change gaming laptop design forever. Here’s why.

    May 30, 2025

    CVE-2025-2942 – WordPress Order Delivery Date Information Disclosure Vulnerability

    July 11, 2025
    © DevStackTips 2025. All rights reserved.
    • Contact
    • Privacy Policy

    Type above and press Enter to search. Press Esc to cancel.