18 of the top 20 pharmaceutical companies already use NVIDIA BioNeMo, and now that same stack is popping up inside Anthropic’s Claude Science workbench. That’s the headline that landed on the NVIDIA blog this week, and that’s a signal that the AI‑for‑science market is finally getting a unified front‑end.
Key Takeaways
- Claude Science lets scientists talk to AI agents in plain language, removing the need to hand‑craft model calls.
- The new BioNeMo Agent Toolkit bundles accelerated models like Evo 2, Boltz-2 and OpenFold3 as callable skills.
- Researchers can run genomics, proteomics or cheminformatics pipelines on any NVIDIA compute resource, from on‑prem GPUs to cloud clusters.
- 18 of the top 20 pharma firms rely on BioNeMo, underscoring its ecosystem reach.
- The integration promises faster iteration cycles, letting scientists refine hypotheses without leaving the Claude interface.
What NVIDIA BioNeMo Agent Toolkit Brings to Claude Science
When Anthropic announced Claude Science earlier this week, they emphasized that the platform lets scientists converse with agents to run end‑to‑end research. What the NVIDIA blog adds is that those agents now have direct access to a library of GPU‑accelerated tools, all wrapped as micro‑services. That’s not just a convenience layer; it’s a move that stitches together the model zoo, the data pipelines, and the compute fabric. Because the toolkit can call out to any NVIDIA compute node, researchers aren’t locked into a single cloud vendor. They’re free to spin up a local DGX system for confidential drug designs, or tap a public cloud for massive screening runs.
How the Agentic Workflow Works
Claude Science starts with a natural‑language prompt—say, “predict the 3‑D structure of this protein sequence.” The agent parses the request, picks the right skill from BioNeMo, and then formats the input so the underlying model can digest it. That’s where the BioNeMo Agent Toolkit shines: each skill includes metadata about required inputs, output formats, and compute footprints. The system then fires off a call to an NVIDIA NIM microservice, which runs the job on the fastest GPU you have available. When the job finishes, Claude returns the results right in the chat window, and you can ask follow‑up questions like “show me the top five binding pockets.” The loop repeats until the scientist is satisfied.
Natural Language to Accelerated Compute
What makes this integration feel less like a scripted API and more like a genuine research assistant is the way the agent handles context. If you ask for a genomic analysis, the toolkit automatically pulls in the correct reference genome, aligns the reads, and then runs variant calling using NVIDIA‑optimized libraries. If you switch to protein design, it swaps in OpenFold3 for structure prediction, then hands the result to Boltz-2 for energy minimization. That’s a lot of plumbing for a single sentence, but the user never sees the plumbing. They’re just getting the answer they asked for, and they can iterate instantly.
Accelerated Models in the Toolkit
Among the models bundled with BioNeMo, three stand out for life‑science workloads. Evo 2 is a protein‑design engine that can generate thousands of candidate sequences in minutes, thanks to tensor‑core optimizations. Boltz-2 focuses on molecular dynamics, delivering faster simulations of binding interactions without sacrificing accuracy. Finally, OpenFold3 pushes the envelope on protein‑structure prediction, cutting inference time roughly in half compared to its predecessor. Because each model runs on NVIDIA GPUs, the speed gains are real, not just marketing hype. And because the NIM microservices expose a consistent API, developers can swap one model for another without rewriting their workflow code.
Industry Adoption Signals
The fact that 18 of the top 20 pharmaceutical companies already rely on BioNeMo tells you the ecosystem is already mature enough for enterprise‑grade research. That’s a concrete metric that should calm any doubts about vendor lock‑in. Below is a snapshot of the adoption landscape:
- Pfizer, Merck, and Novartis use BioNeMo for high‑throughput screening.
- Roche and Johnson & Johnson use the toolkit for protein‑engineering pipelines.
- Amgen and AstraZeneca run genomics analyses on NVIDIA‑accelerated clusters.
- GSK and Sanofi integrate the NIM microservices into their cloud‑based drug‑discovery platforms.
Those names aren’t just marketing fluff; they’re the companies that spend billions on R&D each year. Their adoption signals that the toolkit can handle the scale and regulatory constraints of real‑world drug development.
Historical Context
AI for scientific discovery has been gathering momentum for several years. Early attempts relied on generic language models that had to be fine‑tuned for each domain. The lack of a common interface forced labs to write custom glue code, slowing down adoption. NVIDIA’s answer was to package domain‑specific models together with the compute infrastructure needed to run them efficiently. By exposing those models as micro‑services, BioNeMo created a shared language between researchers and the underlying hardware. Anthropic’s Claude Science builds on that foundation, turning the micro‑service catalog into a conversational partner. The partnership marks a step beyond isolated toolkits; it brings together a conversational UI, a curated model suite, and a hardware‑agnostic compute layer.
Implications for Researchers
For a scientist, the biggest win is speed. When you can ask Claude Science to design a set of inhibitors and get a ranked list back in under an hour, you can test more hypotheses in a day. That translates to faster cycles of synthesis, assay, and redesign. It’s also a democratizing force: you don’t need a dedicated ML engineer to spin up a GPU cluster, because the toolkit abstracts that away. You just need to phrase your question well, and the agent does the heavy lifting.
What This Means For You
If you’re building a bioinformatics pipeline, you can now expose your existing scripts as BioNeMo skills and let Claude handle orchestration. That means you can keep your legacy code while gaining a natural‑language front‑end. If you’re a startup focused on AI‑driven drug design, the integration offers a shortcut to production‑grade compute without the overhead of managing GPUs yourself. You can plug into NVIDIA’s NIM services, pay per inference, and still benefit from the same optimizations that the pharma giants enjoy.
Developers should also note that the toolkit’s skill definitions are JSON‑based, so extending them to custom models is straightforward. You just need to declare the input schema, the required GPU type, and the endpoint URL. Once that’s done, Claude Science will automatically know how to call your model, which cuts down the time to market for new AI‑enabled assays.
Scenario 1: Building a Custom Pipeline
Imagine you have a proprietary variant‑calling script that runs on a specific version of a genome reference. By wrapping that script as a BioNeMo skill, you give Claude the ability to invoke it directly from chat. The agent will handle data staging, launch the job on the fastest available GPU, and return the VCF file. You can then ask follow‑up questions like “highlight variants in the kinase domain” without ever leaving the conversation.
Scenario 2: Scaling a Startup
A small biotech team often scrambles to provision cloud GPUs for each new model they want to test. With the Agent Toolkit, they can declare their models as services, let Claude decide where to run them, and pay only for the compute that actually gets used. The result is a leaner cost structure and a faster feedback loop, which is critical when you’re racing against larger competitors.
Scenario 3: Enterprise Integration
Large pharma groups already have internal data lakes and compliance pipelines. By exposing those pipelines as BioNeMo skills, they can let Claude act as a secure front‑door. Researchers ask for “a SAR analysis on the latest series,” and the agent pulls the relevant data, runs the analysis on an on‑prem DGX, and streams the results back. The process stays within the corporate firewall while still benefiting from the conversational workflow.
Competitive Landscape
Claude Science is not the only platform trying to turn AI models into conversational assistants. Other vendors have released their own chat‑driven research tools, but most of them still require users to manage the underlying compute themselves. The distinguishing factor for the NVIDIA‑Anthropic combo is the smooth hand‑off to GPU‑accelerated micro‑services. That creates a lower barrier to entry and a more predictable performance profile. As more players experiment with agent‑centric designs, the market will likely converge on a set of common standards for skill definition and model serving. Until that happens, the BioNeMo Agent Toolkit offers the most mature, production‑ready implementation.
Key Questions Remaining
- How will data provenance be tracked when agents chain multiple skills together? Ensuring reproducibility will require strong metadata capture.
- What governance mechanisms will emerge to control which external models can be added as skills? Organizations will need policies to avoid accidental exposure of sensitive data.
- Will future versions of Claude Science support multi‑agent collaboration, where two or more agents negotiate a complex workflow? That could unlock even richer experimental designs.
Answering those questions will shape the next wave of AI‑enabled science platforms. For now, the integration of NVIDIA BioNeMo with Claude Science offers a clear path to faster, more accessible research.
Sources: NVIDIA Blog, original report

