Pet-Bench: A Novel Benchmark for Large Language Models as E-Pets in Social Networks
Submitted on 4 Jun 2025 (v1), last revised 15 Dec 2025 (v3)
In the realm of artificial intelligence, particularly with Large Language Models (LLMs), the journey from mere text generation to interactive companionship is gaining traction. One of the pioneering efforts in this field is encapsulated in the research paper titled "Pet-Bench: Benchmarking the Abilities of Large Language Models as E-Pets in Social Network Services," authored by Hongcheng Guo and eight collaborators. This groundbreaking study introduces a new benchmark, Pet-Bench, that aims to fundamentally reshape our understanding of how LLMs can be utilized in creating emotionally resonant virtual pet experiences.
Understanding the Need for Pet-Bench
As society grows increasingly fascinated by interactive digital environments, especially those that simulate emotional connections, the demand for realistic virtual companionship has surged. Traditional applications of LLMs in pet simulations have primarily revolved around basic role-playing interactions. However, these interactions often lack the depth and complexity that define true companionship. Therefore, Pet-Bench steps in as a necessary initiative to evaluate LLMs beyond superficial exchanges.
What is Pet-Bench?
At its core, Pet-Bench is a meticulously crafted benchmark that assesses LLMs based on two key dimensions: self-interaction and human interaction. Unlike prior research efforts that may have focused on straightforward conversational tasks, Pet-Bench pioneers the inclusion of developmental behaviors and self-evolution—elements essential for fostering a more authentic pet-owner relationship.
Key Features of Pet-Bench
1. Comprehensive Interaction Models:
Pet-Bench comprises over 7,500 interaction instances designed to replicate a broad range of pet behaviors. This diversity allows for a deeper exploration of how LLMs can capture the nuances of companionship, offering insights not previously feasible in existing benchmarks.
2. Tasks Beyond Simple Conversations:
The benchmark challenges LLMs to perform tasks such as intelligent scheduling, where virtual pets can plan activities akin to real-life pet care. It also pushes models to engage in memory-based dialogues—a critical aspect of long-term relationships—where past interactions inform present conversations.
3. Psychological Engagement:
Another innovative feature is the focus on psychological conversations, where the LLM must simulate understanding and emotional responses. This aspect not only elevates the interactivity of virtual pets but also ensures that the engagements can influence users’ emotional states in meaningful ways.
Evaluation and Findings
In evaluating 28 LLMs, the study highlights significant performance variations that correlate with model size and their inherent capabilities. This finding emphasizes the necessity for specialized optimization when developing LLMs for applications in companionship. The data collected through Pet-Bench not only serves as an evaluative reference but also plays a vital role in guiding future advancements in the field.
The Future of Human-Pet Interactions
With the introduction of Pet-Bench, researchers and developers now have a foundational resource for benchmarking pet-related LLM abilities. The emphasis on self-evolution and developmental behaviors paves the way for creating more engaging and emotionally immersive experiences. As LLMs continue to evolve, the insights derived from Pet-Bench may transform how we perceive virtual companionship in social networks and beyond.
Submission History
- Version 1: Submitted on Wed, 4 Jun 2025
- Version 2: Revised on Fri, 5 Dec 2025
- Version 3: Most recent revision on Mon, 15 Dec 2025
As LLM technology and its applications develop, Pet-Bench stands as a significant step towards creating emotionally intelligent virtual companions. The research reflects not just a technological advancement but also an understanding of the deep human desire for connection, even in digital forms. Exploring the synthesis of artificial intelligence and emotional companionship has never been more critical, and Pet-Bench is at the forefront of this exploration.
Inspired by: Source

