By using this site, you agree to the Privacy Policy and Terms of Use.
Accept
AIModelKitAIModelKitAIModelKit
  • Home
  • News
    NewsShow More
    Stay Ahead: The Future of IVF and the Latest in AI Innovations
    Stay Ahead: The Future of IVF and the Latest in AI Innovations
    6 Min Read
    Key Highlights from Day Two at TechEx North America: Strengthening Your Case for Innovation
    Key Highlights from Day Two at TechEx North America: Strengthening Your Case for Innovation
    7 Min Read
    Pope Leo Issues Caution on AI Risks in Landmark Papal Document
    Pope Leo Issues Caution on AI Risks in Landmark Papal Document
    5 Min Read
    OpenAI Solves 80-Year-Old Mathematics Problem: A Breakthrough Achievement
    OpenAI Solves 80-Year-Old Mathematics Problem: A Breakthrough Achievement
    5 Min Read
    Google I/O 2023: Unveiling the New Directions in AI-Driven Scientific Research
    Google I/O 2023: Unveiling the New Directions in AI-Driven Scientific Research
    5 Min Read
  • Open-Source Models
    Open-Source ModelsShow More
    ITBench-AA Report: Agentic Enterprise IT Models from IBM Fall Short with Scores Below 50% on Initial Benchmark — Insights from Artificial Analysis
    ITBench-AA Report: Agentic Enterprise IT Models from IBM Fall Short with Scores Below 50% on Initial Benchmark — Insights from Artificial Analysis
    4 Min Read
    OlmoEarth v1.1: Discover the Enhanced Efficiency of Our New Model Family
    OlmoEarth v1.1: Discover the Enhanced Efficiency of Our New Model Family
    5 Min Read
    Enhancing Scientific Impact with Global Partnerships and Open Resources
    Enhancing Scientific Impact with Global Partnerships and Open Resources
    5 Min Read
    Top 4 Ways Google Research Scientists Utilize Empirical Research Assistance
    Top 4 Ways Google Research Scientists Utilize Empirical Research Assistance
    5 Min Read
    Unlocking DeepInfra on Hugging Face: Explore Powerful Inference Providers 🔥
    Unlocking DeepInfra on Hugging Face: Explore Powerful Inference Providers 🔥
    5 Min Read
  • Guides
    GuidesShow More
    Master Sending Emails with Python: Take Our Quiz – Real Python
    Master Sending Emails with Python: Take Our Quiz – Real Python
    3 Min Read
    Integrating LLMs with Your Data Using Python MCP Servers – A Comprehensive Guide from Real Python
    Integrating LLMs with Your Data Using Python MCP Servers – A Comprehensive Guide from Real Python
    5 Min Read
    Ultimate Quiz to Optimize Your Python Development Environment – Real Python
    Ultimate Quiz to Optimize Your Python Development Environment – Real Python
    3 Min Read
    Mastering Scatter Plots in Python: A Comprehensive Quiz on Using plt.scatter() – Real Python Guide
    Mastering Scatter Plots in Python: A Comprehensive Quiz on Using plt.scatter() – Real Python Guide
    3 Min Read
    5 Essential Python Concepts You Need to Master
    5 Essential Python Concepts You Need to Master
    8 Min Read
  • Tools
    ToolsShow More
    Optimizing Use-Case Based Deployments with SageMaker JumpStart
    Optimizing Use-Case Based Deployments with SageMaker JumpStart
    5 Min Read
    Safetensors Partners with PyTorch Foundation: Strengthening AI Development
    Safetensors Partners with PyTorch Foundation: Strengthening AI Development
    5 Min Read
    High Throughput Computer Use Agent: Understanding 12B for Optimal Performance
    High Throughput Computer Use Agent: Understanding 12B for Optimal Performance
    5 Min Read
    Introducing the First Comprehensive Healthcare Robotics Dataset and Essential Physical AI Models for Advancing Healthcare Robotics
    Introducing the First Comprehensive Healthcare Robotics Dataset and Essential Physical AI Models for Advancing Healthcare Robotics
    6 Min Read
    Creating Native Multimodal Agents with Qwen 3.5 VLM on NVIDIA GPU-Accelerated Endpoints
    Creating Native Multimodal Agents with Qwen 3.5 VLM on NVIDIA GPU-Accelerated Endpoints
    5 Min Read
  • Events
    EventsShow More
    AI-Driven Shift Transforming Cybersecurity Skills and Talent Strategy: Insights from the Hack The Box Report
    AI-Driven Shift Transforming Cybersecurity Skills and Talent Strategy: Insights from the Hack The Box Report
    6 Min Read
    NVIDIA and Ineffable Intelligence Join Forces to Revolutionize Reinforcement Learning Infrastructure
    NVIDIA and Ineffable Intelligence Join Forces to Revolutionize Reinforcement Learning Infrastructure
    5 Min Read
    UK Financial Services Security Hackathon: Lloyds Banking Group, Hack The Box, and Google Cloud Join Forces
    UK Financial Services Security Hackathon: Lloyds Banking Group, Hack The Box, and Google Cloud Join Forces
    6 Min Read
    NVIDIA and SAP Enhance Trust in Specialized Agents Through Collaboration
    NVIDIA and SAP Enhance Trust in Specialized Agents Through Collaboration
    7 Min Read
    Introducing NVIDIA Spectrum-X: The Open, AI-Native Ethernet Fabric for Gigascale AI with Enhanced MRC Capabilities
    Introducing NVIDIA Spectrum-X: The Open, AI-Native Ethernet Fabric for Gigascale AI with Enhanced MRC Capabilities
    5 Min Read
  • Ethics
    EthicsShow More
    Transforming Organizational Design for the Era of Agentic AI
    Transforming Organizational Design for the Era of Agentic AI
    5 Min Read
    How the AI Era is Sparking an Intense Bug Hunting Arms Race
    How the AI Era is Sparking an Intense Bug Hunting Arms Race
    6 Min Read
    Ensuring Kids’ Pajamas Are Safe: Why Shouldn’t Their AI Be Just as Secure?
    Ensuring Kids’ Pajamas Are Safe: Why Shouldn’t Their AI Be Just as Secure?
    6 Min Read
    Palantir Responds to Sadiq Khan After £50 Million Metropolitan Police Contract Blocked
    Palantir Responds to Sadiq Khan After £50 Million Metropolitan Police Contract Blocked
    6 Min Read
    Can AI Help You Find True Love? How Dating Apps Are Betting on Artificial Intelligence
    Can AI Help You Find True Love? How Dating Apps Are Betting on Artificial Intelligence
    6 Min Read
  • Comparisons
    ComparisonsShow More
    Insights from Sarang Kulkarni: Key Lessons Learned in Developing Deep Research Agents for Production
    Insights from Sarang Kulkarni: Key Lessons Learned in Developing Deep Research Agents for Production
    6 Min Read
    Exploring OCR-Reasoning Benchmark: Assessing MLLMs’ Performance in Complex Text-Rich Image Reasoning
    Exploring OCR-Reasoning Benchmark: Assessing MLLMs’ Performance in Complex Text-Rich Image Reasoning
    5 Min Read
    Enhancing Azure Logic Apps: Introducing Sandboxed Code Interpreters for Agent Workflows
    Enhancing Azure Logic Apps: Introducing Sandboxed Code Interpreters for Agent Workflows
    0 Min Read
    Exploring AI Content Moderation for Safe and Effective Therapy Conversations
    Exploring AI Content Moderation for Safe and Effective Therapy Conversations
    6 Min Read
    Join the InfoQ Online Certification Program: New Cohorts for AI Engineering and Organizational Architecture
    Join the InfoQ Online Certification Program: New Cohorts for AI Engineering and Organizational Architecture
    5 Min Read
Search
  • Privacy Policy
  • Terms of Service
  • Contact Us
  • FAQ / Help Center
  • Advertise With Us
  • Latest News
  • Model Comparisons
  • Tutorials & Guides
  • Open-Source Tools
  • Community Events
© 2025 AI Model Kit. All Rights Reserved.
Reading: Insights from Sarang Kulkarni: Key Lessons Learned in Developing Deep Research Agents for Production
Share
Notification Show More
Font ResizerAa
AIModelKitAIModelKit
Font ResizerAa
  • 🏠
  • 🚀
  • 📰
  • 💡
  • 📚
  • ⭐
Search
  • Home
  • News
  • Models
  • Guides
  • Tools
  • Ethics
  • Events
  • Comparisons
Follow US
  • Latest News
  • Model Comparisons
  • Tutorials & Guides
  • Open-Source Tools
  • Community Events
© 2025 AI Model Kit. All Rights Reserved.
AIModelKit > Comparisons > Insights from Sarang Kulkarni: Key Lessons Learned in Developing Deep Research Agents for Production
Comparisons

Insights from Sarang Kulkarni: Key Lessons Learned in Developing Deep Research Agents for Production

aimodelkit
Last updated: May 27, 2026 9:00 pm
aimodelkit
Share
Insights from Sarang Kulkarni: Key Lessons Learned in Developing Deep Research Agents for Production
SHARE

### Understanding Deep Research Agentic Systems

Deep Research Agentic Systems, such as OpenAI and Gemini Deep Research Agent, represent a groundbreaking leap in artificial intelligence technology. Specifically designed for conducting multi-step research on the internet for complex tasks, these AI agents utilize dynamic reasoning and multi-hop information retrieval. Their primary strength lies in their ability to generate comprehensive, structured analytical reports akin to those produced by seasoned research analysts.

### Insights from the Arc of AI Conference 2026

At the Arc of AI Conference 2026, Sarang Kulkarni from the Thoughtworks team shared invaluable insights into the design and deployment of these advanced multi-agent research systems. His talk focused on the principles of deep reasoning and synthesis, derived from practical applications in healthcare and pharmaceutical R&D projects. Kulkarni emphasized the innovative techniques his team employed, including agentic loops and harness engineering, to optimize their solution’s performance.

### The Need for Advanced AI in Healthcare and R&D

In critical industries like healthcare and clinical trials, traditional AI models that provide simple Q&A capabilities are insufficient. Researchers require systems capable of discovering, connecting, and reasoning across diverse data sources, including internal databases and the vast expanse of the Internet. Such systems must not only maintain reliability and transparency but also comply with stringent industry standards.

### The High Cost of Drug Development

Kulkarni began his presentation by highlighting a stark reality: it typically costs around $2.6 billion to bring a new drug to market. Remarkably, nearly half of all research studies are conducted without prior evidence, often due to a lack of access to existing knowledge. This challenge is particularly evident in the drug discovery pipeline, where obtaining the right data at the right time proves to be a daunting task.

### Creating the Agentic RAG++ System

To address these challenges, Kulkarni’s team developed a Retrieval Augmented Generation (RAG) based chatbot two years ago, designed to sift through unstructured data. While this initial RAG solution worked well for simple queries, it proved inadequate for complex inquiries. Consequently, they evolved their approach into an enhanced system called Agentic RAG++.

This sophisticated framework comprises several crucial components. One key feature is the clarification loop, which helps to ensure that queries are well-defined. The research loop includes tasks like thinking, planning, executing, reflecting, and adjusting plans based on findings. Lastly, the writing loop focuses on documenting insights and reflecting on those documents for clarity and comprehensiveness.

### Tools for Enhanced Research Capabilities

The initial version of the researcher agent utilized two main tools: the RAG tool and a text2sql tool. The RAG tool, designed for weighted hybrid searches, processes multiple context chunks to enhance retrieval accuracy. Meanwhile, the text2sql tool aids in feeding back SQL query errors to the AI model, thus improving the accuracy of query execution over time.

### Challenges in AI Retrieval and Solutions

Kulkarni pointed out several potential pitfalls that researchers must navigate when using AI agents. These include high token costs, poor performance, and latency issues that can stem from incomplete data. To combat these challenges, the reflection loop is employed, which helps verify the completeness of data while also prompting the improvement of processes.

### Addressing Long-Horizon Tasks

When it comes to long-horizon tasks—those that require multiple steps to complete—Kulkarni emphasized the necessity of having an explicit think-act loop in place. Incorporating multiple steps like think, plan, inspect, and update ensures that the research remains coherent and produces useful outcomes. For this, they utilized solutions like Anthropic’s “think” tool, which provides a structured pause for reasoning.

### The Importance of Reflection Steps

The reflection phase in Kulkarni’s model extends beyond simple data verification; it encompasses a process reflection that assesses the overall completion of the project. An additional third step, termed the Draft Writing Loop, addresses any synthesis gaps. This ensures that no pivotal information derived from research is lost during the writing process.

### The Future of Harness Engineering in AI

Kulkarni concluded his presentation by discussing the emerging field of harness engineering. This approach focuses on designing tools, memory systems, and validation checks that make autonomous AI agents more reliable and accountable. The goal of harness engineering is to elevate AI solutions from mere prompt engineering to fully automated task execution.

In essence, since AI agents merge models with harnesses, the quality of the models directly influences the required complexity of the harness. The better the models are, the simpler the harness can be, paving the way for more efficient and effective AI systems in deep research applications.

Inspired by: Source

Enhancing Language Models through Graph-Guided Fine-Tuning Techniques
Advanced Language-Image Pre-Training Techniques for Enhanced 3D Medical Image Understanding in Research Paper [2510.15042]
Enhancing Robustness in Vision-Language Models with Partially Recentralization Softmax Loss
Unlocking Speed and Conversational Power: OpenAI’s Enhanced GPT-5.1 Models
Exploring GLM-4.5 and SGLang: Insights into Reasoning, Coding Skills, and Agentic Abilities

Sign Up For Daily Newsletter

Get AI news first! Join our newsletter for fresh updates on open-source models.

By signing up, you agree to our Terms of Use and acknowledge the data practices in our Privacy Policy. You may unsubscribe at any time.
Share This Article
Facebook Copy Link Print
Previous Article ITBench-AA Report: Agentic Enterprise IT Models from IBM Fall Short with Scores Below 50% on Initial Benchmark — Insights from Artificial Analysis ITBench-AA Report: Agentic Enterprise IT Models from IBM Fall Short with Scores Below 50% on Initial Benchmark — Insights from Artificial Analysis

Stay Connected

XFollow
PinterestPin
TelegramFollow
LinkedInFollow

							banner							
							banner
Explore Top AI Tools Instantly
Discover, compare, and choose the best AI tools in one place. Easy search, real-time updates, and expert-picked solutions.
Browse AI Tools

Latest News

ITBench-AA Report: Agentic Enterprise IT Models from IBM Fall Short with Scores Below 50% on Initial Benchmark — Insights from Artificial Analysis
ITBench-AA Report: Agentic Enterprise IT Models from IBM Fall Short with Scores Below 50% on Initial Benchmark — Insights from Artificial Analysis
Open-Source Models
Stay Ahead: The Future of IVF and the Latest in AI Innovations
Stay Ahead: The Future of IVF and the Latest in AI Innovations
News
Exploring OCR-Reasoning Benchmark: Assessing MLLMs’ Performance in Complex Text-Rich Image Reasoning
Exploring OCR-Reasoning Benchmark: Assessing MLLMs’ Performance in Complex Text-Rich Image Reasoning
Comparisons
Master Sending Emails with Python: Take Our Quiz – Real Python
Master Sending Emails with Python: Take Our Quiz – Real Python
Guides
//

Leading global tech insights for 20M+ innovators

Quick Link

  • Latest News
  • Model Comparisons
  • Tutorials & Guides
  • Open-Source Tools
  • Community Events

Support

  • Privacy Policy
  • Terms of Service
  • Contact Us
  • FAQ / Help Center
  • Advertise With Us

Sign Up for Our Newsletter

Get AI news first! Join our newsletter for fresh updates on open-source models.

AIModelKitAIModelKit
Follow US
© 2025 AI Model Kit. All Rights Reserved.
Welcome Back!

Sign in to your account

Username or Email Address
Password

Lost your password?