Lessons from Phil Calçado’s Keynote on Generative AI at the InfoQ Dev Summit
At the recent InfoQ Dev Summit in Boston, Phil Calçado, the founder and CEO of Outropy, captivated an audience of software leaders with insights on the challenges of developing and scaling generative AI products. With over 30 years in software architecture and three years of in-the-field experience with AI systems, Calçado brought a wealth of practical knowledge to the table.
Understanding the Landscape of Generative AI
As he opened his keynote, Calçado acknowledged the diverse backgrounds of the audience and the rapidly evolving landscape of generative AI. He highlighted a stark contrast: while individual engineers benefit from an abundance of IDE automations, managers and reliability engineers often find themselves lacking adequate tools. “My initial idea back in 2021 was like, okay, can we automate this? Can I create basically VS Code for everything either the manager does or everything an engineer does?” he recalled, laying bare the need for innovation in management tools.
Learning from Early Missteps
Calçado did not shy away from discussing his initial product, which he candidly admitted “failed miserably.” The failure was attributed to a fundamental misalignment of user incentives. Rather than using the assistant for productivity, users leaned towards reverse-engineering its internal design. This experience underscored a crucial distinction: there’s a difference between tools that showcase technological novelty and those that deliver sustained user value—something every AI product must achieve to resonate with its audience.
Three Mindsets in AI Development
From his experiences, Calçado identified three prevalent mindsets in AI development that could hinder growth:
-
The Model Chasers: Teams focused on the latest model releases and funding cycles, sidelining immediate limitations. These teams often wait for the elusive promise of “AGI next year” instead of making iterative improvements.
-
Data Science Labs: These groups often engage in lengthy projects aimed at achieving marginal accuracy improvements, neglecting a product-focused perspective. The result is classifiers that barely outperform their predecessors.
- Software Engineering-Driven Teams: This mindset emphasizes rapid, iterative development using lightweight, pluggable components, fostering an adaptable workflow that can be tested and refined in production.
Structuring AI Systems: Workflows and Agents
To navigate the complexities of AI systems, Calçado introduced two fundamental abstractions: workflows and agents.
-
Workflows are static pipelines that focus on task execution, such as processing messages, eliminating duplicate conversations, and generating summarizations. They represent a structured approach to leveraging AI capabilities.
- Agents, in contrast, are semi-autonomous components equipped with memory and goal-oriented functions. They possess the ability to utilize tools dynamically, akin to stateful objects in classical object-oriented design. “Agents have memory, they’re goal-oriented, dynamic, and they like to collaborate,” Calçado noted, emphasizing their role in facilitating real-time interactions.
Avoiding Common Pitfalls
One significant warning from the keynote was about treating agents as fine-grained microservices. The inherent stateful and nondeterministic nature of LLM-driven agents can lead to operational fragility. Instead of conventional service boundaries, Calçado advocated for the use of semantic event buses. These buses utilize structured, intent-driven events to decouple components, allowing for more discoverable and pluggable behaviors throughout the system.
Decomposing AI Pipelines
When assembling AI pipelines, Calçado stressed the necessity of breaking monolithic projects into smaller, well-defined slices. Each slice should feature clear semantic interfaces. For instance, in Autopi’s daily-briefing feature, distinct components manage Slack ingestion, duplicate conversation detection, semantic summarization, and calendar context enrichment. This modularity fosters greater flexibility and efficiency in AI product development.
Rethinking Traditional Models
Calçado also challenged the applicability of the Twelve-Factor App model to AI systems. He highlighted critical differences, such as the static nature of configurations in traditional apps versus the dynamic configurations necessary for AI. In particular, he noted that the dependencies tied to LLM calls introduce latency bottlenecks, making it imperative for teams to explore durable-workflow frameworks like Temporal. Such frameworks can effectively separate orchestration from side effects, manage retries and timeouts, and facilitate checkpointing for long-running, stateful processes.
Extending Proven Engineering Principles
In closing, Calçado underscored that while AI demands new architectural considerations, the core solutions often lie in extending established software-engineering principles rather than starting from scratch. “We don’t need to rebuild everything from scratch,” he emphasized. “We need better platforms that integrate these patterns so teams can iterate rapidly without reinventing the plumbing.”
For developers eager to delve deeper into these insights, videos from the event will be available soon on InfoQ, offering a wealth of knowledge on navigating the complexities of generative AI in today’s fast-paced environment.
Inspired by: Source

