Organizations deploying generative AI applications need robust ways to evaluate their performance and reliability. When we launched LLM-as-a-judge (LLMaJ) and…
Robotics has advanced significantly. For many years, there have been expectations of human-like robots that can navigate…
In the drive to remain competitive, businesses today are turning to AI to help them minimize cost and maximize efficiency.…
AI agents are increasingly vital in helping engineers efficiently handle complex coding tasks. However, one significant challenge has been accurately…
In today’s information-rich digital landscape, navigating extensive web content can be overwhelming. Whether you’re researching for a project, studying complex…
This post is co-written with Paul Pagnan from Lumi. Lumi is a leading Australian fintech lender empowering small businesses with…
Large language models (LLMs) can be used to perform natural language processing (NLP) tasks ranging from simple dialogues and information…
Multimodal Large Language Models (MLLMs) have advanced the integration of visual and textual modalities, enabling progress in tasks such as…
Inspecting wind turbines, power lines, 5G towers, and pipelines is a tough job. It’s often dangerous, time-consuming, and prone to…
This post is co-written with Emrah Kaya and Xinyi Zhou from Omron Europe. Data is one of the most critical…
Every year, AWS Sales personnel draft in-depth, forward-looking strategy documents for established AWS customers. These documents help the AWS…
GPUs are widely recognized for their efficiency in handling high-performance computing workloads, such as those found in artificial intelligence and…
As LLMs scale, their computational and bandwidth demands increase significantly, posing challenges for AI training infrastructure. Following scaling laws, LLMs…
The Model Context Protocol (MCP) is an open standard (open-sourced by Anthropic) that defines a unified way to connect AI…
Large language models have transformed how machines comprehend and generate text, especially in complex problem-solving areas like mathematical reasoning. These…
Automatic speech recognition (ASR) technologies have advanced significantly, yet notable disparities remain in their ability to accurately recognize diverse languages.…
Process-supervised reward models (PRMs) offer fine-grained, step-wise feedback on model responses, aiding in selecting effective reasoning paths for complex tasks.…
Text-to-SQL translation, the task of transforming natural language queries into structured SQL statements, is essential for facilitating user-friendly database interactions.…
In this work, we propose Mutual Reinforcing Data Synthesis (MRDS) within LLMs to improve few-shot dialogue summarization. Unlike prior…
Nomic has announced the release of “Nomic Embed Multimodal,” a groundbreaking embedding model that achieves state-of-the-art performance on visual document…