Close Menu
    DevStackTipsDevStackTips
    • Home
    • News & Updates
      1. Tech & Work
      2. View All

      CodeSOD: A Unique Way to Primary Key

      July 22, 2025

      BrowserStack launches Figma plugin for detecting accessibility issues in design phase

      July 22, 2025

      Parasoft brings agentic AI to service virtualization in latest release

      July 22, 2025

      Node.js vs. Python for Backend: 7 Reasons C-Level Leaders Choose Node.js Talent

      July 21, 2025

      The best CRM software with email marketing in 2025: Expert tested and reviewed

      July 22, 2025

      This multi-port car charger can power 4 gadgets at once – and it’s surprisingly cheap

      July 22, 2025

      I’m a wearables editor and here are the 7 Pixel Watch 4 rumors I’m most curious about

      July 22, 2025

      8 ways I quickly leveled up my Linux skills – and you can too

      July 22, 2025
    • Development
      1. Algorithms & Data Structures
      2. Artificial Intelligence
      3. Back-End Development
      4. Databases
      5. Front-End Development
      6. Libraries & Frameworks
      7. Machine Learning
      8. Security
      9. Software Engineering
      10. Tools & IDEs
      11. Web Design
      12. Web Development
      13. Web Security
      14. Programming Languages
        • PHP
        • JavaScript
      Featured

      The Intersection of Agile and Accessibility – A Series on Designing for Everyone

      July 22, 2025
      Recent

      The Intersection of Agile and Accessibility – A Series on Designing for Everyone

      July 22, 2025

      Zero Trust & Cybersecurity Mesh: Your Org’s Survival Guide

      July 22, 2025

      Execute Ping Commands and Get Back Structured Data in PHP

      July 22, 2025
    • Operating Systems
      1. Windows
      2. Linux
      3. macOS
      Featured

      A Tomb Raider composer has been jailed — His legacy overshadowed by $75k+ in loan fraud

      July 22, 2025
      Recent

      A Tomb Raider composer has been jailed — His legacy overshadowed by $75k+ in loan fraud

      July 22, 2025

      “I don’t think I changed his mind” — NVIDIA CEO comments on H20 AI GPU sales resuming in China following a meeting with President Trump

      July 22, 2025

      Galaxy Z Fold 7 review: Six years later — Samsung finally cracks the foldable code

      July 22, 2025
    • Learning Resources
      • Books
      • Cheatsheets
      • Tutorials & Guides
    Home»Development»Machine Learning»Salesforce AI Research Introduces New Benchmarks, Guardrails, and Model Architectures to Advance Trustworthy and Capable AI Agents

    Salesforce AI Research Introduces New Benchmarks, Guardrails, and Model Architectures to Advance Trustworthy and Capable AI Agents

    May 1, 2025

    Salesforce AI Research has outlined a comprehensive roadmap for building more intelligent, reliable, and versatile AI agents. The recent initiative focuses on addressing foundational limitations in current AI systems—particularly their inconsistent task performance, lack of robustness, and challenges in adapting to complex enterprise workflows. By introducing new benchmarks, model architectures, and safety mechanisms, Salesforce is establishing a multi-layered framework to scale agentic systems responsibly.

    Addressing “Jagged Intelligence” Through Targeted Benchmarks

    One of the central challenges highlighted in this research is what Salesforce terms jagged intelligence: the erratic behavior of AI agents across tasks of similar complexity. To systematically diagnose and reduce this problem, the team introduced the SIMPLE benchmark. This dataset contains 225 straightforward, reasoning-oriented questions that humans answer with near-perfect consistency but remain non-trivial for language models. The goal is to reveal gaps in models’ ability to generalize across seemingly uniform problems, particularly in real-world reasoning scenarios.

    Complementing SIMPLE is ContextualJudgeBench, which evaluates an agent’s ability to maintain accuracy and faithfulness in context-specific answers. This benchmark emphasizes not only factual correctness but also the agent’s ability to recognize when to abstain from answering—an important trait for trust-sensitive applications such as legal, financial, and healthcare domains.

    Strengthening Safety and Robustness with Trust Mechanisms

    Recognizing the importance of AI reliability in enterprise settings, Salesforce is expanding its Trust Layer with new safeguards. The SFR-Guard model family has been trained on both open-domain and domain-specific (CRM) data to detect prompt injections, toxic outputs, and hallucinated content. These models serve as dynamic filters, supporting real-time inference with contextual moderation capabilities.

    Another component, CRMArena, is a simulation-based evaluation suite designed to test agent performance under conditions that mimic real CRM workflows. This ensures AI agents can generalize beyond training prompts and operate predictably across varied enterprise tasks.

    Specialized Model Families for Reasoning and Action

    To support more structured, goal-directed behavior in agents, Salesforce introduced two new model families: xLAM and TACO.

    The xLAM (eXtended Language and Action Models) series is optimized for tool use, multi-turn interaction, and function calling. These models vary in scale (from 1B to 200B+ parameters) and are built to support enterprise-grade deployments, where integration with APIs and internal knowledge sources is essential.

    TACO (Thought-and-Action Chain Optimization) models aim to improve agent planning capabilities. By explicitly modeling intermediate reasoning steps and corresponding actions, TACO enhances the agent’s ability to decompose complex goals into sequences of operations. This structure is particularly relevant for use cases like document automation, analytics, and decision support systems.

    Operationalizing Agents via Agentforce

    These capabilities are being unified under Agentforce, Salesforce’s platform for building and deploying autonomous agents. The platform includes a no-code Agent Builder, which allows developers and domain experts to specify agent behaviors and constraints using natural language. Integration with the broader Salesforce ecosystem ensures agents can access customer data, invoke workflows, and remain auditable.

    A study by Valoir found that teams using Agentforce can build production-ready agents 16 times faster compared to traditional software approaches, while improving operational accuracy by up to 75%. Importantly, Agentforce agents are embedded within the Salesforce Trust Layer, inheriting the safety and compliance features required in enterprise contexts.

    Conclusion

    Salesforce’s research agenda reflects a shift toward more deliberate, architecture-aware AI development. By combining targeted evaluations, fine-grained safety models, and purpose-built architectures for reasoning and action, the company is laying the groundwork for next-generation agentic systems. These advances are not only technical but structural—emphasizing reliability, adaptability, and alignment with the nuanced needs of enterprise software.


    Check out the Technical details. Also, don’t forget to follow us on Twitter and join our Telegram Channel and LinkedIn Group. Don’t Forget to join our 90k+ ML SubReddit.

    🔥 [Register Now] miniCON Virtual Conference on AGENTIC AI: FREE REGISTRATION + Certificate of Attendance + 4 Hour Short Event (May 21, 9 am- 1 pm PST) + Hands on Workshop

    The post Salesforce AI Research Introduces New Benchmarks, Guardrails, and Model Architectures to Advance Trustworthy and Capable AI Agents appeared first on MarkTechPost.

    Source: Read More 

    Facebook Twitter Reddit Email Copy Link
    Previous ArticleDeepSeek-AI Released DeepSeek-Prover-V2: An Open-Source Large Language Model Designed for Formal Theorem, Proving through Subgoal Decomposition and Reinforcement Learning
    Next Article Meta AI Introduces First Version of Its Llama 4-Powered AI App: A Standalone AI Assistant to Rival ChatGPT

    Related Posts

    Machine Learning

    How to Evaluate Jailbreak Methods: A Case Study with the StrongREJECT Benchmark

    July 22, 2025
    Machine Learning

    Boolformer: Symbolic Regression of Logic Functions with Transformers

    July 22, 2025
    Leave A Reply Cancel Reply

    For security, use of Google's reCAPTCHA service is required which is subject to the Google Privacy Policy and Terms of Use.

    Continue Reading

    CVE-2025-37833 – Linux Niu PCI-MSIX Touch Entry Data Vulnerability

    Common Vulnerabilities and Exposures (CVEs)

    CodeSOD: The Pirate’s Code

    News & Updates

    CVE-2025-48415 – Cisco USB Backdoor Command Injection Vulnerability

    Common Vulnerabilities and Exposures (CVEs)

    Step aside, Siri: Perplexity’s new AI voice assistant for iPhone can take it from here

    News & Updates

    Highlights

    CVE-2025-34079 – NSClient++ Remote Code Execution Vulnerability

    July 2, 2025

    CVE ID : CVE-2025-34079

    Published : July 2, 2025, 8:15 p.m. | 1 hour, 45 minutes ago

    Description : An authenticated remote code execution vulnerability exists in NSClient++ version 0.5.2.35 when the web interface and ExternalScripts module are enabled. A remote attacker with the administrator password can authenticate to the web interface (default port 8443), inject arbitrary commands as external scripts via the /settings/query.json API, save the configuration, and trigger the script via the /query/{name} endpoint. The injected commands are executed with SYSTEM privileges, enabling full remote compromise.

    This capability is an intended feature, but the lack of safeguards or privilege separation makes it risky when exposed to untrusted actors.

    Severity: 0.0 | NA

    Visit the link for more details, such as CVSS details, affected products, timeline, and more…

    CVE-2025-47665 – Bistromatic N360 Splash Screen Stored Cross-site Scripting Vulnerability

    May 7, 2025

    Sam Altman says OpenAI could need a “significant fraction” of the Earth’s power for future artificial intelligence computing

    June 18, 2025

    CVE-2025-24916 – Tenable Network Monitor Local Privilege Escalation

    May 23, 2025
    © DevStackTips 2025. All rights reserved.
    • Contact
    • Privacy Policy

    Type above and press Enter to search. Press Esc to cancel.