GenAI Crew
  • Home
  • Archives
  • Tags

Posts

April 27, 2025

Failure Modes of AI Agents: Effects

Rubén Fernández (@rub) recently shared insights on a Microsoft paper about AI Agent failure modes, concerned it might not get the attention it deserves. You can find his original note here: https://substack.com/@thelearningrub/note/c-113284290

He mentioned:

I liked Microsoft’s paper about Failure Modes of AI Agents, but I think it will go unnoticed by most people, so I’ll prepare small infographics to showcase the information it contains.

The first one, some Effects of AI Agents’ failure

read more
April 27, 2025

Gemini Context Caching Explained

Context caching in Gemini allows you to store and pre-compute context, such as documents or even entire code repositories. This cached context can then be reused in subsequent requests, leading to significant cost savings – potentially up to 75%.

For example, using Gemini 1.5 Pro, caching a full GitHub repository and then asking follow-up questions about it demonstrates this capability. Each subsequent request utilizing the same cache could cost substantially less ($0.31 vs. $1.25 per 1 million tokens, according to the tweet).

read more
April 27, 2025

Intellect-2: First Decentralized 32B RL Training Complete

Prime Intellect (@PrimeIntellect) announced the completion of INTELLECT-2, the first decentralized Reinforcement Learning (RL) training run for a 32-billion-parameter model.

Intellect-2 Training Progress

Key Points:

  • Milestone: This marks the first successful decentralized RL training of a 32B model.
  • Open Collaboration: The training was open to compute contributions from anyone, making it fully permissionless.
  • Goal: The project aims to scale towards frontier reasoning capabilities in areas like coding, math, and science.
  • Upcoming Release: A full open-source release, including model checkpoints, training data, and a detailed technical report, is expected approximately one week after the announcement (made around late August 2024).
  • Community Effort: The announcement highlighted the significant contributions from various compute providers, including Demetercompute, string, BioProtocol, mev_pete, plaintext_cap, skre_0, oldmankotaro, plabs, ibuyrugs, 0xfr, marloXBT, herb0x_, mo, toptickcrypto, cannopo, samsja19, jackminong, and primeprimeint1234.

Links:

read more
  • ««
  • «
  • 1
  • 2
  • »
  • »»
© GenAI Crew 2025