Skip to main content
🚀 Claude Code Bootcamp — May 30 5 hours from prompting to production. Build 10 real-world projects with AI-assisted development. Register Now
Luca Berton selfie at AI on the Amstel meetup with packed auditorium at VU Amsterdam
AI

AI on the Amstel: How Frontier AI Models Are Built — Panel with DeepMind, NVIDIA, and Mistral

Recap of the June 2026 AI on the Amstel meetup at VU Amsterdam — a packed-auditorium panel with researchers from Google DeepMind, NVIDIA, and Mistral/Emmi on how frontier models are built and where the cutting edge is heading.

LB
Luca Berton
· 7 min read

The biggest lecture hall at VU Amsterdam — every seat taken, people standing in the aisles. That is what happens when you put researchers from Google DeepMind, NVIDIA, and Mistral on a panel and ask them how frontier AI models are actually built.

Tonight’s AI on the Amstel meetup delivered exactly what the community asked for: a hype-free, technical discussion about what is really happening at the cutting edge.

AI on the Amstel logo projected on stage with panelists

Panel introduction slide — Maurits Bleeker (Emmi/Mistral), Rick Lamers (NVIDIA), Thomas Mensink (Google DeepMind) — packed auditorium

The Panel

Three researchers representing different elements of the frontier:

Thomas Mensink — AI Researcher at Google DeepMind. The team behind Gemini, one of the top AI models in the world with over 750 million monthly active users.

Rick Lamers — AI Researcher at NVIDIA. A unique perspective — working on model and LLM research at the company that builds the chips that train these models.

Maurits Bleeker — Research Scientist at Emmi/Mistral. Mistral acquired Emmi in May 2026 as part of their move beyond chat-based AI. Emmi specializes in “Physics AI” — using AI to simulate and predict what happens in the real world.

The tagline on the slide says it all: “Panel with LLM + hardware + ‘real-world’ AI expertise!”

Close-up of panel slide — Maurits Bleeker, Rick Lamers, Thomas Mensink with roles

Panel from mid-auditorium — full house, panelists on stage

Panel discussion on stage — agenda slide showing panel, Q&A, and networking

Key Themes

The discussion covered two big questions the community had been requesting:

The Big AI Domains: Language, Image/Video, and Domain-Focused Models

The panelists mapped out the current landscape with refreshing clarity:

Rick Lamers (NVIDIA): “LLMs with strong tool calling abilities like GPT-5.5, Claude Opus 4.8 or Nemotron 3 Super are the biggest category today, followed by image/video generation/edit models, and then there’s a long tail of a lot of interesting more specialized work. These first two categories are what I would call most ‘bitter lesson pilled.’”

Maurits Bleeker (Emmi/Mistral): “We work mainly on physics-based systems to run large scale simulations for engineering. Although we use Transformers at the backbone for the learning algorithm, the current learning paradigm has nothing to do with how LLMs and vision language models (VLMs) are trained.”

Thomas Mensink (DeepMind): “AI systems are everywhere, and already in use for years. But the AI systems currently in the news are mostly the generative models, more specifically the language-based LLMs.”

Panel discussion slide — each panelist's perspective on AI domains

Wide angle of packed auditorium during the AI domains discussion

How Are Frontier Models Actually Built?

The panelists dove into the engineering reality behind models like Gemini and Mistral’s latest releases — the infrastructure decisions, training pipelines, data curation challenges, and the gap between what papers describe and what production actually looks like.

Where Is the Cutting Edge Heading?

A hype-free look at the next 1-2 years. What the researchers see in their day-to-day work that hints at where models are evolving — beyond the marketing announcements and benchmark races.

Most Promising Methods to Improve Frontier Models

The panelists shared what they are actively exploring:

Rick Lamers (NVIDIA): “Learning efficiency and credit assignment in RL to make ‘agentic models’ learn by search work even better, effective use of long-term memory/long context windows (compaction, hybrid attention), expansion of what are considered ‘verifiable’ tasks.”

Maurits Bleeker (Emmi/Mistral): “Building Large Engineering Models (LEMs). Also, the ‘next token’ training object for physics/AI4Engineering has not been found yet; pretraining large physics models is still an unsolved question.”

Thomas Mensink (DeepMind): “Data, data, data, I can’t make bricks without clay — Sherlock Holmes.”

Panel slide — most promising methods to improve frontier AI models

Wide auditorium shot during promising methods discussion

Reducing Hallucinations

Rick (NVIDIA): “It’s a full stack problem. You can’t use a simple trick to reduce hallucinations. Some of the most effective ways of reducing hallucinations today is using post-training validation test-time-compute to discourage models from claiming factual results that they haven’t fact checked against credible sources using tool calling.”

Maurits (Emmi/Mistral): “The notion of ‘hallucinations’ is somewhat ill-defined in AI4Simulation. It is often a question of how far a model’s predictions deviate from the true underlying simulation or real-world phenomenon. There’s a lot of effort in incorporating uncertainty estimation into these models, enabling a better assessment of when predictions may be unreliable.”

Thomas (DeepMind): “Image Generative Models vs. LLMs” — a personal opinion: with image models, we want hallucination (imagination). The challenge is knowing when to hallucinate and when to be factual. It is a multi-objective game.

Panel slide on reducing AI hallucinations — full stack problem, uncertainty estimation, image vs LLM

The Multimodal Challenge

Rick (NVIDIA): “The entirety of the pre-training, mid-training, and post-training stack should be made to work with sensible tokenization of both text, image, video, audio inputs such that rich information can be presented to models. This will make it feasible to build ‘one model to rule them all.’”

Maurits (Emmi/Mistral): “AI4Simulation is by nature multi-modal. We usually work with discretized (unstructured) mesh representations, and predict all kinds of quantities on those meshes. The standard 1/2/3D modalities (i.e., text/audio/images/video) and modelling techniques do not really work for AI4Simulation.”

Thomas (DeepMind): “Large models have the capacity to deal with multiple modalities. But tokenization is key to work in an aligned latent space.”

Hardware Roadmap Shaping AI Research

Rick (NVIDIA): “Hardware feature development like new low-precision forms (NVFP4) can act as multipliers on the continuous improvements (total FLOPs, memory bandwidth). Training and inference stacks need to maximally lean into these new hardware features to lock in the gains. Coding agents can help with some of the engineering burden (like DeepSeek-V4 launch).”

Maurits (Emmi/Mistral): “There are teams working on accelerating numerical solvers on GPUs. However, that is a different art. We are ‘limited’ to what we can run on GPUs, and we mainly rely on Transformer-based architectures. Any major breakthroughs will likely be incorporated into our models. We steal a lot of optimization tricks from language modelling/computer vision.”

Key data point: H100 pricing went from $2/hour a year ago to $4/hour now — compute demand outpacing supply.

Hardware roadmap slide — NVFP4, memory bandwidth, coding agents for engineering burden

Predictions for the Next 1-2 Years

Rick (NVIDIA): “Humans driving single agents will quickly move to humans driving small armies of agents, inference speed will continue to increase putting more pressure on humans to get better at keeping up/reviewing output of agents (and coming up with clever strategies for how to do this efficiently, probably more AI, turtles all the way down)”

Maurits (Emmi/Mistral): “We will release our first Large Engineering Model(s) this year. The question is how much domain specific those models will still be. Next, engineering will become more agentic, where the engineer serves as a human-in-the-design-loop, the steer the process.”

Thomas (DeepMind): “Generative AI will be used in many more domains at a higher level.”

Predictions slide — armies of agents, Large Engineering Models, generative AI expanding domains

Packed auditorium from the back — hundreds of attendees, AI on the Amstel logo on screen

The Venue and Community

AI on the Amstel is Amsterdam’s premier technical AI meetup — over 5,000 members, 4.7/5 rating, and 200-400+ attendees monthly depending on venue capacity. Tonight clearly pushed the upper boundary.

The event ran at the NU building (De Boelelaan 1111, VU Campus):

  • 17:00–18:00 — Networking in the atrium
  • 18:00–19:30 — Panel in Theaterzaal 1 (the VU’s biggest lecture hall)
  • 19:30+ — More networking

Luca Berton selfie with packed auditorium behind — AI on the Amstel at VU Amsterdam

Partners and Sponsors

  • Demonstrator Lab VU — Early stage startup incubator at VU, open to all students and researchers
  • VU StartHub — Startup pavilion in the heart of VU campus
  • AISO — Leading student-led AI society in the Netherlands, 500+ members in Amsterdam
  • Alibaba/Qwen — New meetup sponsor; Qwen is the leading AI model in China

Selfie with fellow attendees — packed blue-seat auditorium at VU Amsterdam

The Origin Story

The organizer (in the orange shirt) shared how AI on the Amstel started: “I couldn’t find the technical meetups I experienced in SF here in Amsterdam — so I started organizing it myself.”

That San Francisco-level quality shows. This is not a corporate event with sponsored talks — it is researchers having honest conversations about their work.

Origin story slide — SF skyline with quote about bringing SF-quality AI meetups to Amsterdam

Wide shot of packed auditorium with panel introduction slide

My Takeaway

What I appreciate about AI on the Amstel is the signal-to-noise ratio. No vendor pitches, no product demos — just researchers talking honestly about how this technology works and where it is going. The fact that they can fill the VU’s largest hall on a Tuesday evening shows how hungry the Amsterdam tech community is for substance over hype.

The panel format worked especially well here. Having DeepMind, NVIDIA, and Mistral/Emmi perspectives side by side — the model builder, the hardware maker, and the physics-AI startup — gave a multi-dimensional view you do not get from single-speaker events.

Another angle of the crowd and networking before the panel

Join the Community

If you are in Amsterdam and care about AI beyond the buzzwords, this is the meetup to attend.

Free 30-min AI & Cloud consultation

Book Now