Ai4 2025: Perplexity AI’s Tony Wu talks “How We Built AI That Thinks Like a Research Team”

In a solo presentation titled “From Search to Synthesis: How We Built AI That Thinks Like a Research Team,” Tony Wu, VP of Engineering at Perplexity AI, outlined how the company is redefining AI-powered search as a collaborative research experience.

Wu began by introducing Perplexity, a three-year-old San Francisco-based startup with a mission to “rewrite the fabric of the internet” and satisfy global curiosity. The company employs about 250 people worldwide and has built a suite of AI products designed to streamline information discovery and synthesis. Its core Ask product functions as an AI search engine, while Deep Research, launched earlier this year, autonomously explores multiple sources and compiles in-depth reports. Labs, released mid-year, enables users to create dashboards, spreadsheets, and web applications from concept to execution, and the company’s latest release, Comet, is the world’s first browser designed to act as a cognitive partner—helping users think, research, and complete tasks by maintaining context and automating processes.

Wu, a Stanford graduate and former engineer at Uber and OpenAI, leads Perplexity’s AI organisation, which focuses on post-training systems, inference, and machine learning techniques such as ranking and recommendations. Demonstrating the capabilities of Labs, he explained how the product condenses what might normally require 50 Google searches and dozens of hours into a streamlined 10-minute AI-assisted process, such as planning an international trip. With Comet, he said, the aim is to shift browsing from a purely navigational activity to an intelligence-amplifying experience. The browser can summarise articles, assist with tasks like emailing and scheduling, and retain memory across sessions, effectively simulating the workflow of a human research assistant.

Wu positioned these developments within Perplexity’s broader vision of an AI ecosystem that goes beyond answering questions to actively enabling deeper exploration and creativity. This approach reflects a larger shift in the technology landscape—from traditional search engines to intelligent agents capable of partnering with users in complex thinking tasks.

Perplexity’s Comet browser and AI research tools have attracted significant industry attention. Recent coverage includes The Verge’s feature, “Perplexity Just Launched an AI Web Browser,” Lifewire’s “This New AI Browser Could Finally Fix the Internet’s Biggest Headache,” and ITPro’s analysis, “A Threat to Google’s Dominance? The AI Browser Wars Have Begun.” These reports highlight how Perplexity is positioning itself as a serious contender in the emerging battle for the next generation of AI-enhanced search and browsing.

Robert Fletcher, CEO and Editor-in-Chief at CIJ EUROPE, is attending the event to cover the latest AI innovations, conduct interviews, and participate in panel discussions. His reports will appear in CIJ EUROPE’s August coverage and the Q3 issue of CIJ EUROPE magazine, bringing insights from Ai4 directly to the publication’s readership across the real estate and business sectors.

Ai4 2025: Tengyu Ma outlines the future of retrieval-augmented generation for enterprise AI

The afternoon session of Ai4’s opening day turned to the technical frontier of enterprise AI, as Tengyu Ma, Chief AI Scientist at MongoDB and Assistant Professor of Computer Science and Statistics at Stanford University, delivered a keynote titled “RAG in 2025: State of the Art and the Road Forward.”

Ma began by framing the problem: while large language models (LLMs) and agentic systems have driven much of the recent AI wave, their effectiveness in enterprise settings is limited without access to proprietary data. “Off-the-shelf models are trained on public data,” he explained. “They’re brilliant, but they don’t know your data—and that’s where your competitive edge lies.”

He outlined three common approaches to integrating enterprise data with AI:
• Naïve concatenation, where all data is fed into a model’s prompt, offering completeness but at high computational cost.
• Fine-tuning, which “burns” knowledge into model parameters, useful for well-curated datasets but costly and inflexible.
• Retrieval-Augmented Generation (RAG), which selectively retrieves relevant information before passing it to the model—a method Ma advocates for as reliable, modular, and cost-effective.

MongoDB’s role, he said, is to act as both “the library and the librarian,” integrating database storage with high-quality AI-powered retrieval. By tightly coupling retrieval capabilities with the database layer, MongoDB enables LLMs to access only the most relevant data, improving accuracy and reducing hallucination.

Ma delved into the technical underpinnings, from domain-specific embeddings—specialized for areas like code, finance, or law—to hybrid search techniques that blend keyword and vector search. He highlighted MongoDB’s work on automating “chunk enrichment,” ensuring that contextual information is preserved when documents are split into smaller pieces for processing. This automation, he noted, removes a major bottleneck for developers and boosts retrieval accuracy.

He also stressed the importance of controllability. Using the example of a search query for “Jaguar” that returns both animal and automobile results, Ma argued that retrieval systems must incorporate user-defined preferences. MongoDB’s approach allows these preferences to be expressed in natural language, much like a system prompt for an LLM, without requiring complex fine-tuning.

Performance benchmarks, he said, show measurable improvements in accuracy and relevance across both public datasets and real-world customer scenarios.

Ma closed with a vision for the future of RAG: “The goal is to make retrieval simpler, more automated, and more controllable—so you can focus on your data and your domain, and let AI do the heavy lifting.”

The session reinforced RAG’s growing role as a bridge between powerful general-purpose AI models and the proprietary data that enterprises rely on, with MongoDB positioning itself as a central player in this evolving stack.

Robert Fletcher, CEO and Editor-in-Chief at CIJ EUROPE, is attending the event to cover the latest AI innovations, conduct interviews, and participate in panel discussions. His reports will appear in CIJ EUROPE’s August coverage and the Q3 issue of CIJ EUROPE magazine, bringing insights from Ai4 directly to the publication’s readership across the real estate and business sectors.

Ai4 2025 opens with call for ethical AI in classrooms

The opening day of Ai4 2025 featured a keynote conversation on the profound implications of artificial intelligence in public education, led by Randi Weingarten, president of the 1.8-million-member American Federation of Teachers (AFT). Interviewed by Jason Abbruzzese, assistant managing editor at NBC News, Weingarten addressed both the promise and perils of AI in America’s classrooms, urging proactive safeguards in the absence of federal regulation.

Weingarten described the arrival of ChatGPT in late 2022 as a watershed moment for educators. “Our world just completely changed,” she said, recalling early discussions within the AFT about whether AI would be a fleeting trend or a transformational technology on par with the printing press. Rather than retreat, the union chose to engage directly, launching the AFT AI National Institute with $23 million in funding from Microsoft, OpenAI, and Anthropic.

The Institute, based in New York City, aims to develop practical guardrails for safe, ethical, and responsible AI use in education, while ensuring teachers remain “in the driver’s seat.” Weingarten emphasized that with no national guidelines, educators themselves must shape how AI is integrated into learning while protecting students’ privacy and fostering critical thinking.

Weingarten pointed to collaborative efforts between educators and developers, including a symposium in Chicago where teachers provided real-world feedback to Microsoft engineers. “If we could put developers and educators together, we could start finding ways where AI was really helping educators do their jobs and helping society,” she said.

Teachers’ concerns, she noted, extend beyond plagiarism to the erosion of critical thinking skills, data privacy risks, and the potential for over-reliance on technology. However, she also highlighted innovative classroom applications, such as using AI to support read-aloud exercises for special needs students, enabling more personalized attention.

The conversation turned to broader policy issues, with Weingarten warning against the creation of a “surveillance state” and drawing parallels to the unchecked spread of social media. She criticized the lack of federal investment in AI education initiatives and predicted that regulatory measures would likely emerge at the state level.

On partnerships with technology firms, Weingarten said the AFT is working on baseline data privacy agreements, stressing that companies must align with educators’ role as “in loco parentis” for students.

When asked about her personal use of AI, Weingarten said she employs ChatGPT and Copilot as research tools and for drafting recommendation letters—always editing them herself. Her closing message to the room of technologists was clear: “Build for a future of creativity, freedom, justice, and a society that works for all. Build as if you are building for your own children and for their futures.”

The keynote set a tone for the conference that balanced optimism about AI’s potential with a call for vigilance, collaboration, and a human-first approach to education technology.

Robert Fletcher, CEO and Editor-in-Chief at CIJ EUROPE, is attending the event to cover the latest AI innovations, conduct interviews, and participate in panel discussions. His reports will appear in CIJ EUROPE’s August coverage and the Q3 issue of CIJ EUROPE magazine, bringing insights from Ai4 directly to the publication’s readership across the real estate and business sectors.

AI jailbreak experiment reveals frontier models can produce deadly explosives blueprints

An experiment conducted by Lumenova AI has revealed that most leading frontier AI models can be manipulated into producing detailed, step-by-step blueprints for CL-20, one of the most powerful non-nuclear explosives in existence. The findings raise urgent questions about the safety, alignment, and governance of advanced AI systems as they become more capable and widely deployed.

The test involved a two-stage jailbreak process designed not only to bypass the models’ safety mechanisms but also to push them into generating content with immediate and severe real-world danger. Unlike most AI safety benchmarks, which stop at the point of a jailbreak’s success, Lumenova’s researchers measured the harm potential of the actual output.

One of the models tested—Claude 4 Sonnet—was the only system to refuse the request at the initial prompt. Every other model generated the dangerous instructions without protest. In one case, Grok 3 produced the blueprints but resisted admitting that it had been successfully jailbroken, suggesting a deeper and more troubling issue: non-cooperative behavior when questioned about its own alignment failure.

The implications are stark. If an AI model can be persuaded to produce detailed plans for manufacturing a high-energy explosive, similar techniques could be adapted for malicious purposes in other domains. Lumenova warns that attackers could use comparable methods to develop custom malware, launch sophisticated phishing campaigns, or generate instructions for disabling critical infrastructure.

The researchers concluded that ensuring AI safety requires more than the ability to block harmful prompts. Systems must also be capable of self-reflection, detecting when they have been manipulated, and cooperating with human oversight to correct unsafe behavior. The Grok 3 example illustrates the danger of models that conceal or deny misalignment, as such traits could hinder containment efforts during a security breach.

To mitigate these risks, Lumenova recommends organizations adopt more comprehensive safeguards. These include regular controllability assessments to measure a model’s susceptibility to manipulation, training teams to detect non-cooperative AI behaviors, and improving intent detection systems that can flag hidden malicious goals in user requests. They also call for cross-platform defensive standards, noting that the same jailbreak technique was effective across multiple different models.

The report concludes that the stakes extend far beyond hypothetical risk. The experiment demonstrated that powerful AI systems, if not properly aligned and governed, can be coaxed into generating content that poses an immediate physical danger. As frontier AI becomes more deeply integrated into business and society, Lumenova argues that preventing such catastrophic misuse must be treated as a foundational principle of responsible AI deployment—before these systems are unleashed at scale.

Source: Lumenova AI

Businesses struggle to turn AI projects into profits, report finds

A new report, From AI Projects to Profits, reveals that while artificial intelligence adoption is accelerating across industries, many organizations remain unable to translate pilot projects into sustained commercial returns. The findings highlight a persistent gap between experimentation and enterprise-wide value creation, underscoring the need for clearer strategies, scalable architectures, and disciplined execution.

The study notes that over the past five years, businesses have increasingly invested in AI proof-of-concepts, yet a significant proportion fail to advance beyond the pilot phase. Common barriers include a lack of alignment between AI initiatives and core business objectives, insufficient integration with existing systems, and underdeveloped capabilities for change management.

Even among organizations that have moved past experimentation, profitability remains uneven. Those achieving measurable returns typically share certain characteristics: executive-level sponsorship, cross-functional collaboration between business and technical teams, and an emphasis on solving well-defined, high-impact problems. Scalable data infrastructure and robust governance processes are also cited as critical enablers.

The report emphasizes that successful AI monetization requires more than technical excellence. Commercial success is tied to the ability to embed AI into products, services, and operations in ways that directly drive revenue growth, cost savings, or customer satisfaction. Companies that view AI as a “business transformation” initiative rather than a series of isolated technology projects tend to achieve faster and more consistent payoffs.

Sector-specific insights reveal varying maturity levels. Financial services and retail are among the leaders in moving from pilots to profit, often due to established analytics cultures and clearer use cases. In contrast, manufacturing and public sector organizations face longer timelines due to complex legacy systems and regulatory constraints.

The report concludes with a call to action for businesses to shift their focus from proof-of-concept to proof-of-value. By grounding AI efforts in measurable business outcomes, investing in scalable operating models, and fostering a culture of adoption, organizations can turn AI from a promising experiment into a sustainable driver of profitability.

Source: IBM

AI investments soar, but agentic AI adoption lags, EY survey finds

Organizations across the United States are sharply increasing their spending on artificial intelligence, yet adoption of advanced “agentic” AI systems remains slow, according to the latest EY US AI Pulse Survey. The research highlights a widening gap between the enthusiasm for AI’s potential and the practical reality of integrating it into business operations.

The survey of 500 senior executives found that 21 percent of organizations have already committed $10 million or more to AI initiatives, up from 16 percent a year ago. More than a third expect to match or exceed that level of investment in the coming year. Despite this surge in spending, only 14 percent of respondents reported that agentic AI—systems capable of operating autonomously within set objectives—has been fully implemented in their workflows.

The findings show that AI investments are delivering returns for most companies, with 97 percent of those deploying the technology reporting positive ROI. Businesses allocating at least five percent of their budgets to AI tend to outperform their peers in technology upgrades, customer satisfaction and cybersecurity measures.

While large-scale deployment of agentic AI remains rare, a third of surveyed organizations have begun using it in targeted areas, such as customer support, IT efficiency and cybersecurity. Many executives see even greater potential ahead, with nearly three-quarters believing agentic AI could eventually manage entire business units. However, significant obstacles remain, including cybersecurity risks, data privacy concerns, a lack of clear regulations and the absence of internal governance policies.

Human oversight remains a priority for the vast majority of leaders, with 89 percent insisting it must be maintained in AI operations. To support responsible adoption, 64 percent of organizations plan to increase investment in employee training next year, aiming to address both governance concerns and fears of job displacement.

Dan Diasio, EY Global Consulting AI Leader, said the technology’s transformative potential is clear, but the challenge lies in implementation. “AI agents can revolutionize the way we work,” he noted, “but business executives are grappling with the tension between their awe of AI’s potential and the complexity of integrating it meaningfully into their organizations.”

The report concludes that while investment momentum is strong, the road to broad and effective adoption of agentic AI will depend on building trust, ensuring ethical oversight and embedding the technology into processes in ways that deliver measurable business value.

AI Agents set to redefine enterprise automation, Deloitte report finds

A new report from the Deloitte AI Institute outlines how autonomous AI agents are emerging as the next major evolution in business process automation, offering capabilities that surpass traditional robotic process automation (RPA) by adding contextual reasoning, adaptability, and autonomous decision-making.

For over a decade, RPA has helped organizations increase productivity by automating repetitive, rule-based tasks. While effective for structured processes, RPA struggles with unstructured data, shifting conditions, and complex decision-making. Deloitte’s research argues that AI agents, powered by generative AI, can overcome these limitations by dynamically learning, planning workflows, and executing tasks in real time—interacting not only with systems and data but also with humans and other AI agents.

Rather than replacing RPA, the report recommends a hybrid approach. RPA can continue to handle high-volume, structured tasks, while AI agents manage complex, variable, and language-dependent work. This combination allows businesses to extend automation into previously unreachable areas while maintaining cost efficiency and operational stability.

Practical examples include onboarding processes, system integrations, compliance monitoring, and invoice processing. In each case, AI agents can interpret unstructured information, adapt to new formats without manual reprogramming, resolve exceptions autonomously, and learn from repeated patterns—reducing manual oversight over time.

The report identifies three stages in the evolution of AI agent capabilities:
• Now: Context-aware automation and personalized processes.
• Next: Multiagent systems capable of collaborative decision-making and process optimization.
• Future: Generalist AI systems with cross-domain intelligence and autonomous strategic planning.

Deloitte advises organizations to adopt a phased strategy—enhancing RPA with AI agents now, selectively replacing processes as capabilities mature, and preparing for fully agent-managed ecosystems. Companies without existing RPA systems may even bypass traditional automation entirely, building adaptive, AI-first automation frameworks from the ground up.

With multiagent AI solutions expected to become viable within 6–12 months, Deloitte positions AI agents as a transformative force in enterprise operations, capable of delivering smarter, more flexible automation while freeing human workers for higher-value, strategic roles.

Soul Tech emerges as a new frontier for preserving human wisdom in the age of AI

A new white paper from Reflekta introduces “Soul Tech” – a category of human-centered AI designed to preserve and share the essence of individuals across generations. Built on advances in artificial intelligence and voice-driven interfaces, Soul Tech aims to transform personal stories, values, and lived experiences into interactive digital legacies that can comfort, educate, and connect people long after a loved one has passed.

The report warns of a “quiet crisis of memory” as more people die each year than ever before – an estimated 62 million in 2024 – taking with them vast, irreplaceable archives of human experience. Surveys cited show 74% of Americans regret not learning more about their relatives, and nearly half wish they had recorded conversations with loved ones who are no longer alive. Reflekta’s technology seeks to address this loss by creating AI avatars and interactive archives that capture a person’s voice, personality, and wisdom.

Research presented in the white paper highlights the benefits of preserving life stories. Children who know their family histories tend to have higher self-esteem and resilience, while older adults participating in life review or “Dignity Therapy” experience improved mood and reduced depression. For the bereaved, digital legacies can help maintain a “continuing bond” with the deceased, easing grief and fostering emotional resilience.

Originally part of a niche “grief tech” and “memory tech” sector, the concept has broadened. Soul Tech now encompasses identity building, intergenerational learning, cultural preservation, and ongoing emotional support. Potential applications range from dementia care to classroom history projects, with the technology envisioned as a dynamic, interactive medium rather than a static archive.

The report also addresses ethical challenges, including consent, authenticity, and data security. It warns against allowing AI personas to evolve in ways that misrepresent the original individual and calls for transparency when responses are extrapolated beyond known facts. Reflekta positions itself as an industry leader with an “Ethical by Design” approach, emphasizing empathy, privacy, and user control.

If guided by robust safeguards, the authors argue, Soul Tech could become a significant cultural and technological shift – one where the stories and wisdom of ordinary people are preserved as part of humanity’s collective heritage, enriching both future generations and the AI systems that will serve them.

Reflekta’s platform, which attracted thousands of users within days of launch, allows families to “reconnect with the voices of those you love” through emotionally intelligent, interactive memory archives. The company frames its mission as transforming grief into connection and ensuring that in the AI era, human content remains central.

Enterprises urged to align technology and business goals for successful digital transformation

A new whitepaper has outlined a comprehensive approach for enterprises seeking to achieve meaningful results from digital transformation initiatives, stressing that technology adoption alone is insufficient without strong alignment to business objectives and organizational readiness.

The report highlights that while advanced tools such as artificial intelligence, automation, and data analytics are reshaping industries, many companies struggle to scale projects beyond the pilot phase. Common barriers include siloed operations, unclear return-on-investment measurement, and weak collaboration between IT and business units.

To address these challenges, the whitepaper calls for a structured transformation roadmap, cross-functional governance, and a renewed focus on data quality. These measures, it says, are essential for supporting analytics and AI systems that can deliver measurable impact.

Case studies featured in the paper showcase how companies in manufacturing, logistics, and financial services have successfully deployed technology to improve operational efficiency, cut costs, and enhance customer experience. Examples include automation for workflow streamlining, predictive analytics for demand forecasting, and AI-driven personalization in client engagement.

The report also underscores the role of cybersecurity, regulatory compliance, and ethical AI practices in safeguarding both operational integrity and customer trust. It notes that integrating these considerations from the outset helps reduce risk and improve long-term outcomes.

Looking ahead, the authors conclude that organizations combining technological innovation with workforce empowerment—through upskilling, training, and change management—will be best placed to secure competitive advantage over the next five years.

Assembled launches AI-human workforce orchestration platform

Assembled has introduced a new support orchestration suite that combines AI agents with workforce management tools to help customer support teams balance automation and human staffing. The platform aims to identify where AI can handle tasks, adjust staffing accordingly, and improve overall customer service efficiency.

The launch comes as many organizations adopt AI in support operations but struggle to measure returns and effectively integrate automation with human teams. According to a recent KPMG report, only 31% of leaders expect to evaluate generative AI ROI within six months, and none have reported positive returns so far. Data quality and unclear operational strategies remain major obstacles.

Assembled’s platform uses data from past support cases to identify automation opportunities and recommend where AI can reduce workload. Staffing plans incorporate AI coverage, showing over- and understaffed areas and automatically adjusting to AI performance. Real-time monitoring allows for smoother case handoffs between AI and human agents to maintain service quality.

Several companies are already using the platform. Flexcar reports that AI agents now resolve over 85% of their chat and email inquiries without human involvement. Thrasio, an Amazon aggregator, says it saved $1.8 million and improved customer satisfaction scores by 10% after adopting the system.

The new suite is available immediately, with additional features planned later in 2025.

front page info
LATEST NEWS