Machine Learning

TIS-DPO: Token-level Importance Sampling for Direct Preference Optimization

April 18, 2025

Direct Preference Optimization (DPO) has been widely adopted for preference alignment of Large Language Models (LLMs) due to its simplicity…

DART: Denoising Autoregressive Transformer for Scalable Text-to-Image Generation

April 18, 2025

Diffusion models have become the dominant approach for visual generation. They are trained by denoising a Markovian process which gradually…

LLM Unlearning Benchmarks are Weak Measures of Progress

April 18, 2025

TL;DR: “Machine unlearning” aims to remove data from models without retraining the model completely. Unfortunately, state-of-the-art benchmarks for evaluating unlearning…

Meta AI Introduces Perception Encoder: A Large-Scale Vision Encoder that Excels Across Several Vision Tasks for Images and Video

April 18, 2025

The Challenge of Designing General-Purpose Vision Encoders: As AI systems grow increasingly multimodal, the role of visual perception models becomes…

Stream ingest data from Kafka to Amazon Bedrock Knowledge Bases using custom connectors

April 18, 2025

Retrieval Augmented Generation (RAG) enhances AI responses by combining the generative AI model’s capabilities with information from external data sources,…

Build a FinOps agent using Amazon Bedrock with multi-agent capability and Amazon Nova as the foundation model

April 18, 2025

AI agents are revolutionizing how businesses enhance their operational capabilities and enterprise applications. By enabling natural language interactions, these agents…

Do Reasoning Models Really Need Transformers?: Researchers from TogetherAI, Cornell, Geneva, and Princeton Introduce M1—A Hybrid Mamba-Based AI that Matches SOTA Performance at 3x Inference Speed

April 18, 2025

Effective reasoning is crucial for solving complex problems in fields such as mathematics and programming, and LLMs have demonstrated significant…

A Hands-On Tutorial: Build a Modular LLM Evaluation Pipeline with Google Generative AI and LangChain

April 18, 2025

Evaluating LLMs has emerged as a pivotal challenge in advancing the reliability and utility of artificial intelligence across both academic…

Google Unveils Gemini 2.5 Flash in Preview through the Gemini API via Google AI Studio and Vertex AI

April 18, 2025

Google has introduced Gemini 2.5 Flash, an early-preview AI model accessible via the Gemini API through Google AI Studio and…

OpenAI Releases a Practical Guide to Building LLM Agents for Real-World Applications

April 18, 2025

OpenAI has published a detailed and technically grounded guide, A Practical Guide to Building Agents, tailored for engineering and product…

IBM Releases Granite 3.3 8B: A New Speech-to-Text (STT) Model that Excels in Automatic Speech Recognition (ASR) and Automatic Speech Translation (AST)

April 18, 2025

As artificial intelligence continues to integrate into enterprise systems, the demand for models that combine flexibility, efficiency, and transparency has…

Integrating Figma with Cursor IDE Using an MCP Server to Build a Web Login Page

April 17, 2025

Model Context Protocol makes it incredibly easy to integrate powerful tools directly into modern IDEs like Cursor, dramatically boosting productivity.…

Uploading Datasets to Hugging Face: A Step-by-Step Guide

April 17, 2025

Part 1: Uploading a Dataset to Hugging Face Hub. Introduction: This part of the tutorial walks you through the process…

Researchers from AWS and Intuit Propose a Zero Trust Security Framework to Protect the Model Context Protocol (MCP) from Tool Poisoning and Unauthorized Access

April 17, 2025

AI systems are becoming increasingly dependent on real-time interactions with external data sources and operational tools. These systems are now…

CoMotion: Concurrent Multi-Person 3D Motion

April 17, 2025

We introduce an approach for detecting and tracking detailed 3D poses of multiple people from a single monocular camera stream.…

Disentangled Representational Learning with the Gromov-Monge Gap

April 17, 2025

Learning disentangled representations from unlabelled data is a fundamental challenge in machine learning. Solving it may unlock other problems, such…

How Salesforce achieves high-performance model deployment with Amazon SageMaker AI

April 17, 2025

This post is a joint collaboration between Salesforce and AWS and is being cross-published on both the Salesforce Engineering Blog…

Automate video insights for contextual advertising using Amazon Bedrock Data Automation

April 17, 2025

Contextual advertising, a strategy that matches ads with relevant digital content, has transformed digital marketing by delivering personalized experiences to…

The future of quality assurance: Shift-left testing with QyrusAI and Amazon Bedrock

April 17, 2025

This post is co-written with Ameet Deshpande and Vatsal Saglani from Qyrus. As businesses embrace accelerated development cycles to stay…

Do We Still Need Complex Vision-Language Pipelines? Researchers from ByteDance and WHU Introduce Pixel-SAIL—A Single Transformer Model for Pixel-Level Understanding That Outperforms 7B MLLMs

April 17, 2025

MLLMs have recently advanced in handling fine-grained, pixel-level visual understanding, thereby expanding their applications to tasks such as precise region-based…