← Archive Get every issue free →

Gradient Descent

Monday, April 20 2026

Good morning, Pierluigi — 4 items • ~3 min read

Today's Brief

Inference cost curves are now driving product decisions faster than capability benchmarks, which is why every major lab is rethinking how large language models are served at scale. This shift has second-order consequences for capital markets, where the ability to securely serve AI models at scale will become a key differentiator for financial institutions, potentially altering the competitive landscape in areas like fraud detection and instant payments. You should be evaluating whether your current workflow for integrating AI models into your financial analysis toolkit is optimized for the new cost curves, and considering a review of your technology investments to ensure they align with the emerging regulatory landscape in Europe and Asia.

📊 AI & Data Tools
Moonshot AI and Tsinghua Researchers Propose PrfaaS: A Cross-Datacenter KVCache Architecture that Rethinks How LLMs are Served at Scale

Researchers have proposed a new cross-datacenter KVCache architecture that rethinks how large language models are served at scale, potentially reducing inference costs. This changes your workflow for evaluating AI model performance, as you will need to consider the impact of datacenter architecture on model serving costs.

 •  → Read

Claude Token Counter, now with model comparisons

The Claude Token Counter tool now allows for model comparisons, enabling you to evaluate the performance of different AI models on the same task. This changes your metric for evaluating AI model performance, as you will need to consider the token count and model comparisons when selecting a model for your workflow.

 •  → Read

Even the best AI models lose about half their performance when charts get complicated, new benchmark finds

A new benchmark has found that even the best AI models lose about half their performance when dealing with complicated charts, highlighting the need for more robust model evaluation. This changes your decision to use AI models for data analysis, as you will need to carefully evaluate the limitations of the models and consider alternative approaches for complex data visualizations.

 •  → Read

🎧 Podcasts
How Capital One Delivers Multi-Agent Systems with Rashmi Shetty - #765

Capital One is designing and delivering multi-agent systems using AI, highlighting the potential for more effective and efficient financial systems. This changes your decision to invest in AI research and development, as you will need to consider the potential applications and benefits of multi-agent systems in finance.

 •  → Listen

That's your edge for today.

See you tomorrow morning with the next gradient step.

Subscribe free → Share this issue
Gradient Descent • Powered by Groq • Sources: curated RSS across 15+ publications
Delivered to pierluigi.derogatis@live.com
SharePost on X