Building cost-effective RAG applications with Amazon Bedrock Knowledge Bases and Amazon S3 Vectors

Vector embeddings have become essential for modern Retrieval Augmented Generation (RAG) applications, but organizations face significant cost challenges as they scale. As knowledge bases grow and require more granular embeddings, many vector databases that rely on high-performance storage such as SSDs or in-memory solutions become prohibitively expensive. This cost barrier often forces organizations to limit…

Evaluating generative AI models with Amazon Nova LLM-as-a-Judge on Amazon SageMaker AI

Evaluating the performance of large language models (LLMs) goes beyond statistical metrics like perplexity or bilingual evaluation understudy (BLEU) scores. For most real-world generative AI scenarios, it’s crucial to understand whether a model is producing better outputs than a baseline or an earlier iteration. This is especially important for applications such as summarization, content generation,…

Roblox’s New Age Verification Feature Uses AI to Scan Teens’ Video Selfies

In the briefing, Kaufman called Roblox “one of the safest places online for people to come together and spend time with their friends and their family.” Kirra Pendergast, founder and CEO of Safe on Social—an online safety organization operating worldwide—says Roblox’s latest safety measures are largely opt-in, therefore putting “responsibility on minors to identify and…

This AI Warps Live Video in Real Time

Dean Leitersdorf introduces himself over Zoom, then types a prompt that makes me feel like I’ve just taken psychedelic mushrooms: “wild west, cosmic, Roman Empire, golden, underwater.” He feeds the words into an artificial intelligence model developed by his startup, Decart, which manipulates live video in real time. “I have no idea what’s going to…

How to run an LLM on your laptop

For Pistilli, opting for local models as opposed to online chatbots has implications beyond privacy. “Technology means power,” she says. “And so who[ever] owns the technology also owns the power.” States, organizations, and even individuals might be motivated to disrupt the concentration of AI power in the hands of just a few companies by running…

Hackers Are Finding New Ways to Hide Malware in DNS Records

Hackers are stashing malware in a place that’s largely out of the reach of most defenses—inside domain name system (DNS) records that map domain names to their corresponding numerical IP addresses. The practice allows malicious scripts and early-stage malware to fetch binary files without having to download them from suspicious sites or attach them to…

Building enterprise-scale RAG applications with Amazon S3 Vectors and DeepSeek R1 on Amazon SageMaker AI

Organizations are adopting large language models (LLMs), such as DeepSeek R1, to transform business processes, enhance customer experiences, and drive innovation at unprecedented speed. However, standalone LLMs have key limitations such as hallucinations, outdated knowledge, and no access to proprietary data. Retrieval Augmented Generation (RAG) addresses these gaps by combining semantic search with generative AI,…

Where Are All the AI Drugs?

A new drug usually starts with a tragedy. Peter Ray knows that. Born in what is now Zimbabwe, the child of a mechanic and a radiology technician, Ray fled with his family to South Africa during the Zimbabwean War of Liberation. He remembers the journey there in 1980 in a convoy of armored cars. As…

Google France hosted a hackathon to tackle healthcare’s biggest challenges

From improving clinical trials to easing administrative workloads, AI is already changing what’s possible in healthcare. To help accelerate this progress, Google France recently brought together 130 experts for a 12-hour hackathon focused on building new medical prototypes using open AI models. Twenty-six teams used Google’s open models — including Gemma, MedGemma and TxGemma —…
