By using this site, you agree to the Privacy Policy and Terms of Use.
Accept
AIModelKitAIModelKitAIModelKit
  • Home
  • News
    NewsShow More
    Laserfiche Introduces AI Agents to Streamline Natural Language Workflows
    Laserfiche Introduces AI Agents to Streamline Natural Language Workflows
    5 Min Read
    Hugging Face Hosts Malicious Software Disguised as OpenAI Release: A Security Alert
    Hugging Face Hosts Malicious Software Disguised as OpenAI Release: A Security Alert
    5 Min Read
    Thinking Machines Aims to Create Conversational AI That Listens Effectively While Communicating
    Thinking Machines Aims to Create Conversational AI That Listens Effectively While Communicating
    4 Min Read
    OpenAI Unveils Its Response to Claude Mythos: A Comprehensive Overview
    OpenAI Unveils Its Response to Claude Mythos: A Comprehensive Overview
    4 Min Read
    Discover the Latest Developments at Mira Murati’s AI Company: What’s Happening Now?
    Discover the Latest Developments at Mira Murati’s AI Company: What’s Happening Now?
    5 Min Read
  • Open-Source Models
    Open-Source ModelsShow More
    Enhancing Scientific Impact with Global Partnerships and Open Resources
    Enhancing Scientific Impact with Global Partnerships and Open Resources
    5 Min Read
    Top 4 Ways Google Research Scientists Utilize Empirical Research Assistance
    Top 4 Ways Google Research Scientists Utilize Empirical Research Assistance
    5 Min Read
    Unlocking DeepInfra on Hugging Face: Explore Powerful Inference Providers 🔥
    Unlocking DeepInfra on Hugging Face: Explore Powerful Inference Providers 🔥
    5 Min Read
    How AI-Generated Synthetic Neurons are Revolutionizing Brain Mapping
    How AI-Generated Synthetic Neurons are Revolutionizing Brain Mapping
    5 Min Read
    Discover HoloTab by HCompany: Your Ultimate AI Browser Companion
    4 Min Read
  • Guides
    GuidesShow More
    Creating Type-Safe LLM Agents Using Pydantic AI: A Comprehensive Guide | Real Python
    Creating Type-Safe LLM Agents Using Pydantic AI: A Comprehensive Guide | Real Python
    5 Min Read
    Mastering List Flattening in Python: A Quiz from Real Python
    Mastering List Flattening in Python: A Quiz from Real Python
    4 Min Read
    Test Your Knowledge: Python Memory Management Quiz – Real Python
    Test Your Knowledge: Python Memory Management Quiz – Real Python
    2 Min Read
    Mastering OpenCode: AI-Assisted Python Coding Quiz Guide | Real Python
    Mastering OpenCode: AI-Assisted Python Coding Quiz Guide | Real Python
    2 Min Read
    Master Python & APIs: Your Ultimate Quiz Guide to Accessing Public Data – Real Python
    Master Python & APIs: Your Ultimate Quiz Guide to Accessing Public Data – Real Python
    4 Min Read
  • Tools
    ToolsShow More
    Optimizing Use-Case Based Deployments with SageMaker JumpStart
    Optimizing Use-Case Based Deployments with SageMaker JumpStart
    5 Min Read
    Safetensors Partners with PyTorch Foundation: Strengthening AI Development
    Safetensors Partners with PyTorch Foundation: Strengthening AI Development
    5 Min Read
    High Throughput Computer Use Agent: Understanding 12B for Optimal Performance
    High Throughput Computer Use Agent: Understanding 12B for Optimal Performance
    5 Min Read
    Introducing the First Comprehensive Healthcare Robotics Dataset and Essential Physical AI Models for Advancing Healthcare Robotics
    Introducing the First Comprehensive Healthcare Robotics Dataset and Essential Physical AI Models for Advancing Healthcare Robotics
    6 Min Read
    Creating Native Multimodal Agents with Qwen 3.5 VLM on NVIDIA GPU-Accelerated Endpoints
    Creating Native Multimodal Agents with Qwen 3.5 VLM on NVIDIA GPU-Accelerated Endpoints
    5 Min Read
  • Events
    EventsShow More
    NVIDIA and SAP Enhance Trust in Specialized Agents Through Collaboration
    NVIDIA and SAP Enhance Trust in Specialized Agents Through Collaboration
    7 Min Read
    Introducing NVIDIA Spectrum-X: The Open, AI-Native Ethernet Fabric for Gigascale AI with Enhanced MRC Capabilities
    Introducing NVIDIA Spectrum-X: The Open, AI-Native Ethernet Fabric for Gigascale AI with Enhanced MRC Capabilities
    5 Min Read
    NVIDIA and ServiceNow Collaborate on Next-Gen Autonomous AI Agents for Enterprise Solutions
    NVIDIA and ServiceNow Collaborate on Next-Gen Autonomous AI Agents for Enterprise Solutions
    6 Min Read
    Exploring Hack The Box’s Role in Locked Shields 2026: Contributions and Insights
    Exploring Hack The Box’s Role in Locked Shields 2026: Contributions and Insights
    5 Min Read
    Expert Educator Warns: The AI Bubble Is Deflating – Here’s Why
    Expert Educator Warns: The AI Bubble Is Deflating – Here’s Why
    5 Min Read
  • Ethics
    EthicsShow More
    Ilya Sutskever Defends His Role in Sam Altman’s OpenAI Ouster: ‘I Aimed to Protect the Company’
    Ilya Sutskever Defends His Role in Sam Altman’s OpenAI Ouster: ‘I Aimed to Protect the Company’
    6 Min Read
    Understanding AI Behavior: Distinguishing Artificial Intelligence from Consciousness
    Understanding AI Behavior: Distinguishing Artificial Intelligence from Consciousness
    5 Min Read
    Understanding Speech Transcription: How It Influences Power Dynamics and Bias
    Understanding Speech Transcription: How It Influences Power Dynamics and Bias
    6 Min Read
    Trump-Xi Summit in Beijing: Prioritizing Shared AI Risks for Global Cooperation
    Trump-Xi Summit in Beijing: Prioritizing Shared AI Risks for Global Cooperation
    6 Min Read
    Exploring AI in the Emergency Department: Promising Potential, Powerful Tools, but Unproven Results
    Exploring AI in the Emergency Department: Promising Potential, Powerful Tools, but Unproven Results
    5 Min Read
  • Comparisons
    ComparisonsShow More
    CodeBrain: Integrating Decoupled Tokenization with Multi-Scale Architecture for Enhanced EEG Foundation Models
    CodeBrain: Integrating Decoupled Tokenization with Multi-Scale Architecture for Enhanced EEG Foundation Models
    5 Min Read
    EgoMemReason: Benchmarking Memory-Driven Reasoning for Long-Horizon Egocentric Video Analysis
    EgoMemReason: Benchmarking Memory-Driven Reasoning for Long-Horizon Egocentric Video Analysis
    5 Min Read
    Unlocking the Potential of Order: Misleading LLMs with Adversarial Table Permutations in Research 2605.00445
    Unlocking the Potential of Order: Misleading LLMs with Adversarial Table Permutations in Research 2605.00445
    5 Min Read
    Enhanced Transformer Language Models: Achieving Sparser, Faster, and Lighter Architectures
    Enhanced Transformer Language Models: Achieving Sparser, Faster, and Lighter Architectures
    5 Min Read
    Enhancing Long-Term Talking Head Generation: AsymTalker for Identity Consistency through Asymmetric Distillation
    Enhancing Long-Term Talking Head Generation: AsymTalker for Identity Consistency through Asymmetric Distillation
    4 Min Read
Search
  • Privacy Policy
  • Terms of Service
  • Contact Us
  • FAQ / Help Center
  • Advertise With Us
  • Latest News
  • Model Comparisons
  • Tutorials & Guides
  • Open-Source Tools
  • Community Events
© 2025 AI Model Kit. All Rights Reserved.
Reading: Introducing the First Comprehensive Healthcare Robotics Dataset and Essential Physical AI Models for Advancing Healthcare Robotics
Share
Notification Show More
Font ResizerAa
AIModelKitAIModelKit
Font ResizerAa
  • 🏠
  • 🚀
  • 📰
  • 💡
  • 📚
  • ⭐
Search
  • Home
  • News
  • Models
  • Guides
  • Tools
  • Ethics
  • Events
  • Comparisons
Follow US
  • Latest News
  • Model Comparisons
  • Tutorials & Guides
  • Open-Source Tools
  • Community Events
© 2025 AI Model Kit. All Rights Reserved.
AIModelKit > Tools > Introducing the First Comprehensive Healthcare Robotics Dataset and Essential Physical AI Models for Advancing Healthcare Robotics
Tools

Introducing the First Comprehensive Healthcare Robotics Dataset and Essential Physical AI Models for Advancing Healthcare Robotics

aimodelkit
Last updated: March 17, 2026 1:01 am
aimodelkit
Share
Introducing the First Comprehensive Healthcare Robotics Dataset and Essential Physical AI Models for Advancing Healthcare Robotics
SHARE

Authors: Nigel Nelson, Lukas Zbinden, Mostafa Toloui, Sean Huver

Healthcare AI has primarily revolved around perception-based models, concentrating on interpreting signals to classify or segment pathologies and anatomy. Yet, healthcare fundamentally encompasses “doing,” rendering past perception-only datasets inadequate due to their static nature, which doesn’t account for embodiment, contact dynamics, or closed-loop control. The healthcare sector calls for standardized robotic bodies, synchronized vision–force–kinematics data, sim-to-real pairing, and cross-embodiment benchmarks to establish a solid foundation for Physical AI.

1. Open-H-Embodiment

Open-H-Embodiment is a collaborative, community-driven dataset initiative aimed at creating a shared foundation essential for training and evaluating AI autonomy and world foundation models in surgical robotics and ultrasound applications. Spearheaded by a steering committee featuring notable figures such as Prof. Axel Krieger from Johns Hopkins, Prof. Nassir Navab from the Technical University of Munich, and Dr. Mahdi Azizian from NVIDIA, this initiative has now expanded to include participation from over 35 organizations worldwide.

Collectively, these participants have joined forces to construct the first large-scale dataset aimed at propelling the advancement of Physical AI within healthcare robotics.

Participants

Notable participants include:

  • Balgrist
  • CMR Surgical
  • The Chinese University of Hong Kong
  • Great Bay University
  • Hong Kong Baptist University
  • Hamlyn
  • ImFusion
  • Johns Hopkins University
  • Leeds University
  • Mohamed bin Zayed University of Artificial Intelligence
  • Moon Surgical
  • NVIDIA
  • Northwell Health
  • Obuda University
  • The Hong Kong Polytechnic University
  • Qilu Hospital of Shandong University
  • Rob Surgical
  • Sanoscience
  • Surgical Data Science Collective
  • Semaphor Surgical
  • Stanford
  • Dresden University of Technology
  • Technical University of Munich
  • Tuodao
  • Turin
  • University of British Columbia
  • UC Berkeley
  • UC San Diego
  • University of Illinois Chicago
  • University of Tennessee
  • University of Texas
  • Vanderbilt
  • Virtual Incision

The Dataset

  • Comprises 778 hours of CC-BY-4.0 healthcare robotics training data, primarily focused on surgical robotics, along with ultrasound and colonoscopy autonomy data.
  • Includes simulations, benchtop exercises (such as suturing), and actual clinical procedures.
  • Utilizes both commercial robots (like CMR Surgical, Rob Surgical, and Tuodao) and research robots (including dVRK, Franka, and Kuka).
  • Accompanied by the release of two new, permissively open-source models trained on this dataset.

2. GR00T-H: Vision Language Action Model for Surgical Robotics

One of the significant innovations birthed from this initiative is GR00T-H, a derivative of NVIDIA’s Isaac GR00T N series of Vision-Language-Action (VLA) models. With training based on approximately 600 hours of Open-H-Embodiment data, GR00T-H is pioneering as the first policy model tailored for surgical robotics tasks.

Leveraging NVIDIA’s open-source ecosystem, Gr00T-H utilizes Cosmos Reason 2 2B as its Vision-Language Model (VLM) backbone.

pyramid

Architectural Design Choices

Developing surgical robotics calls for acute precision, and specialized hardware like cable-driven systems complicates imitation learning (IL). To tackle this, GR00T-H incorporates four pivotal design choices:

  • Unique Embodiment Projectors: A distinct, learnable MLP maps each robot’s specific kinematics to a uniform, normalized action space.
  • State Dropout (100%): Proprioceptive input is dropped during inference, generating a learned bias term for every system, which enhances real-world results.
  • Relative EEF Actions: Training employs a common relative End-Effector (EEF) action space to mitigate kinematic inconsistencies.
  • Metadata in Task Prompts: Directly injects instrument names and control index mapping into the VLM task prompt.

A prototype of GR00T-H has successfully executed a complete, end-to-end suture as demonstrated in the SutureBot benchmark, showcasing robust long-horizon dexterity.

gr00t_sutureGR00T-H performing end-to-end suturing.


3. Cosmos-H-Surgical-Simulator

Another groundbreaking creation is the Cosmos-H-Surgical-Simulator, designed as a World Foundation Model (WFM) for action-conditioned surgical robotics. Traditional simulators have struggled due to the complexities of real-world conditions, such as soft tissue, reflections, blood, and smoke.

Key Capabilities

  • Overcoming the Sim-to-Real Gap: Fine-tuned from NVIDIA Cosmos Predict 2.5 2B, it generates physically plausible surgical video directly from kinematic actions.
  • Efficiency Gains: Completing 600 rollouts took only 40 minutes in simulation compared to 2 days required for real-world benchtop methods.
  • WFM as a Physics Simulator: This model learns tissue deformation and tool interaction implicitly from data.
  • Synthetic Data Generation: Capable of generating realistic synthetic video-action pairs to enhance underrepresented datasets.

cosmos_h_surg_sim

Fine-Tuning Details

The model underwent fine-tuning using the Open-H-Embodiment dataset (utilizing 9 robot embodiments across 32 datasets), employing 64x A100 GPUs over approximately 10,000 GPU-hours and utilizing a unified 44-dimensional action space.


4. What is Next: Towards Reasoning For Surgical Robotics

Looking ahead, the goal for version 2 of the Open-H-Embodiment initiative is to transition from mere perceptual control to the development of reasoning-capable autonomy—a significant leap reminiscent of a surgical robotics ChatGPT moment—where systems can explain, plan, and adapt throughout long procedures. Achieving this goal necessitates extending Open-H-Embodiment into reasoning-ready data, enriched with annotated task traces that capture intents, outcomes, and failure modes. This transformative effort urges community engagement, and we invite you to participate. For more details, visit our Open-H GitHub Repo to help shape the future of healthcare robotics.


5. Get started today

Ready to dive in? Access the following resources to start working with the Open-H-Embodiment dataset and models:

Inspired by: Source

Contents
  • 1. Open-H-Embodiment
    • Participants
    • The Dataset
  • 2. GR00T-H: Vision Language Action Model for Surgical Robotics
    • Architectural Design Choices
  • 3. Cosmos-H-Surgical-Simulator
    • Key Capabilities
    • Fine-Tuning Details
  • 4. What is Next: Towards Reasoning For Surgical Robotics
  • 5. Get started today
Accelerated Assisted Generation Support for Intel Gaudi: Enhance Performance and Efficiency
Unlocking Agentic AI: Join the AWS & NVIDIA Hackathon to Shape the Future of Intelligent Agents
Hugging Face Partners with Microsoft to Introduce Hugging Face Model Catalog on Azure
Unlocking Groq on Hugging Face: Fast Inference Providers Explained 🔥
Revolutionizing Parkinson’s Detection: How AI Utilizes Standard MRI Scans for Early Diagnosis

Sign Up For Daily Newsletter

Get AI news first! Join our newsletter for fresh updates on open-source models.

By signing up, you agree to our Terms of Use and acknowledge the data practices in our Privacy Policy. You may unsubscribe at any time.
Share This Article
Facebook Copy Link Print
Previous Article Warren Urges Pentagon to Explain xAI’s Access to Classified Networks: Key Concerns Raised Warren Urges Pentagon to Explain xAI’s Access to Classified Networks: Key Concerns Raised
Next Article Ensure Consistent Dataset for Comprehensive Peer Review and Multi-Turn Rebuttal Discussions Ensure Consistent Dataset for Comprehensive Peer Review and Multi-Turn Rebuttal Discussions

Stay Connected

XFollow
PinterestPin
TelegramFollow
LinkedInFollow

							banner							
							banner
Explore Top AI Tools Instantly
Discover, compare, and choose the best AI tools in one place. Easy search, real-time updates, and expert-picked solutions.
Browse AI Tools

Latest News

Creating Type-Safe LLM Agents Using Pydantic AI: A Comprehensive Guide | Real Python
Creating Type-Safe LLM Agents Using Pydantic AI: A Comprehensive Guide | Real Python
Guides
Laserfiche Introduces AI Agents to Streamline Natural Language Workflows
Laserfiche Introduces AI Agents to Streamline Natural Language Workflows
News
CodeBrain: Integrating Decoupled Tokenization with Multi-Scale Architecture for Enhanced EEG Foundation Models
CodeBrain: Integrating Decoupled Tokenization with Multi-Scale Architecture for Enhanced EEG Foundation Models
Comparisons
NVIDIA and SAP Enhance Trust in Specialized Agents Through Collaboration
NVIDIA and SAP Enhance Trust in Specialized Agents Through Collaboration
Events
//

Leading global tech insights for 20M+ innovators

Quick Link

  • Latest News
  • Model Comparisons
  • Tutorials & Guides
  • Open-Source Tools
  • Community Events

Support

  • Privacy Policy
  • Terms of Service
  • Contact Us
  • FAQ / Help Center
  • Advertise With Us

Sign Up for Our Newsletter

Get AI news first! Join our newsletter for fresh updates on open-source models.

AIModelKitAIModelKit
Follow US
© 2025 AI Model Kit. All Rights Reserved.
Welcome Back!

Sign in to your account

Username or Email Address
Password

Lost your password?