New AI agent boosts game testing

Researchers from Zhejiang University and NetEase Fuxi AI Lab have developed Titan, an AI-powered agent transforming MMORPG testing. Using large-language-model reasoning, Titan navigates MMORPGs, efficiently completing tasks and identifying issues.

In trials across two commercial games, Titan achieved a 95% task completion rate and uncovered four previously undetected bugs. Outperforming human testers in speed and coverage, the AI agent offers a faster, more thorough approach to quality assurance in game development.

Titan mimics expert testers by perceiving game states, selecting actions, and diagnosing problems. Using simplified text and screenshots, it reasons through objectives, streamlining a traditionally slow process that can cost millions in labour.
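The perceive, act, diagnose cycle described above can be sketched roughly as follows. This is a minimal illustration and not Titan's actual implementation: the `GameState` fields, the `choose_action` stub (standing in for the large-language-model reasoning step), and the toy one-dimensional game are all hypothetical names invented for this example.

```python
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class GameState:
    """Hypothetical simplified view of the game (not Titan's real schema)."""
    position: int = 0
    goal: int = 3
    log: list = field(default_factory=list)


def perceive(state: GameState) -> str:
    # Reduce the game state to a short text description for the reasoning step.
    return f"position={state.position} goal={state.goal}"


def choose_action(observation: str) -> str:
    # Stand-in for LLM reasoning: a real agent would prompt a model here.
    fields = dict(part.split("=") for part in observation.split())
    if int(fields["position"]) < int(fields["goal"]):
        return "move_forward"
    return "stop"


def apply_action(state: GameState, action: str) -> None:
    # Toy game rule: moving forward advances the position by one step.
    if action == "move_forward":
        state.position += 1
    state.log.append(action)


def diagnose(before: int, after: int, action: str) -> Optional[str]:
    # Flag a possible bug when an action has no visible effect on the state.
    if action == "move_forward" and after == before:
        return f"possible bug: '{action}' did not change position {before}"
    return None


def run_task(state: GameState, max_steps: int = 10):
    """Loop perceive -> choose -> act -> diagnose until done or out of steps."""
    issues = []
    for _ in range(max_steps):
        action = choose_action(perceive(state))
        if action == "stop":
            return True, issues
        before = state.position
        apply_action(state, action)
        issue = diagnose(before, state.position, action)
        if issue:
            issues.append(issue)
    return state.position >= state.goal, issues


completed, issues = run_task(GameState())
print(completed, issues)
```

In this toy setup the task completes with no issues reported. A game rule that silently ignored `move_forward` would instead fill the `issues` list, which is the kind of signal a testing agent would surface to developers.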

Already integrated into QA pipelines, Titan signals a shift toward AI-driven game testing. As studios increasingly adopt AI tools, such agents could redefine efficiency across PC and mobile game development.

Would you like to learn more about AI, tech and digital diplomacy? If so, ask our Diplo chatbot!

Doctors and nurses outperform AI in patient triage

Human staff are more accurate than AI in assessing patient urgency in emergency departments, according to research presented at the European Emergency Medicine Congress in Barcelona.

The study, led by Dr Renata Jukneviciene of Vilnius University, tested ChatGPT 3.5 against clinicians and nurses using real case studies.

Doctors achieved an overall accuracy of 70.6% and nurses 65.5%, compared with 50.4% for AI. Doctors also outperformed AI in surgical and therapeutic cases, while nurses were more reliable than AI overall.

AI did show strength in recognising the most critical cases, surpassing nurses in both accuracy and specificity. Researchers suggested that AI may help prioritise life-threatening situations and support less experienced staff instead of acting as a replacement.

However, over-triaging by AI could lead to inefficiencies, making human oversight essential.
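The terms used above (accuracy, specificity, over-triage) can be made concrete with a small sketch that treats one urgency category as a yes/no classification. The counts below are invented purely for illustration and are not figures from the study, and the over-triage rate is defined here as the share of non-critical cases flagged as critical, which is only one of the definitions in use.

```python
def triage_metrics(tp: int, fp: int, tn: int, fn: int) -> dict:
    """Metrics for one urgency category treated as a binary classification.

    tp: critical cases correctly flagged as critical
    fp: non-critical cases wrongly flagged as critical (over-triage)
    tn: non-critical cases correctly left unflagged
    fn: critical cases that were missed (under-triage)
    """
    total = tp + fp + tn + fn
    return {
        "accuracy": (tp + tn) / total,
        "specificity": tn / (tn + fp),
        "sensitivity": tp / (tp + fn),
        "over_triage_rate": fp / (fp + tn),
    }


# Illustrative counts only; these are NOT results from the study.
metrics = triage_metrics(tp=8, fp=5, tn=85, fn=2)
for name, value in metrics.items():
    print(f"{name}: {value:.3f}")
```

Under this definition, specificity and the over-triage rate always sum to one, so a system that over-triages more necessarily has lower specificity for that category.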

Future studies will explore newer AI models, ECG interpretation, and integration into nurse training, particularly in mass-casualty scenarios.

Commenting on the findings, Dr Barbra Backus from Amsterdam said AI has value in certain areas, such as interpreting scans, but it cannot yet replace trained staff for triage decisions.

Visa unveils stablecoin pilot for faster payments

Visa unveiled a stablecoin prefunding pilot for Visa Direct at SIBOS 2025, enabling faster, more flexible global payments. By integrating stablecoins, the pilot aims to modernise treasury operations, offering a solution tailored for the digital-first economy.

Traditional cross-border payments often rely on slow, costly systems that require businesses to hold large fiat balances in advance. The pilot lets companies prefund Visa Direct with stablecoins, cutting friction and freeing up capital that would otherwise sit idle.

Financial institutions, banks, and remitters benefit from this approach, as stablecoins provide a consistent settlement layer, minimising exposure to currency volatility. Funds move in minutes, not days, enabling more dynamic liquidity management and more predictable treasury operations for high-volume payouts.

Set to expand in 2026, the pilot builds on Visa’s global network and blockchain programmability, transforming how businesses handle cross-border transactions. Recipients still receive payments in local currency, ensuring seamless integration with existing systems.

Google unveils smarter AI Mode for visual searches

Google’s upgraded AI Mode in Google Search now supports conversational queries and image uploads, delivering highly relevant visual results. Launched in the US in English, the feature allows users to refine searches naturally, perfect for finding inspiration or specific items effortlessly.

AI Mode simplifies shopping; users describe items like ‘barrel jeans, not too baggy,’ to get tailored, shoppable results. Google’s Shopping Graph, boasting over 50 billion product listings, provides details like reviews, deals, and availability, with 2 billion listings refreshed hourly.

The update harnesses Gemini 2.5’s advanced multimodal capabilities and a ‘visual search fan-out’ technique, enabling deeper image analysis. The approach identifies subtle details and secondary objects, ensuring results align closely with the user’s intent and the image’s full context.

On mobile, users can dive deeper by searching within specific images, asking follow-up questions to explore creative ideas or pinpoint exact items. The intuitive experience transforms how users seek inspiration or shop online, making searches more natural and precise.

UK users lose access to Imgur amid watchdog probe

Imgur has cut off access for UK users after regulators warned its parent company, MediaLab AI, of a potential fine over child data protection.

Visitors to the platform since 30 September have been met with a notice saying that content is unavailable in their region, with embedded Imgur images on other sites also no longer visible.

The UK’s Information Commissioner’s Office (ICO) began investigating the platform in March, questioning whether it complied with data laws and the Children’s Code.

The regulator said it had issued MediaLab with a notice of intent to fine the company following provisional findings. Officials also emphasised that leaving the UK would not shield Imgur from responsibility for any past breaches.

Some users speculated that the withdrawal was tied to new duties under the Online Safety Act, which requires platforms to check whether visitors are over 18 before allowing access to harmful content.

However, both the ICO and Ofcom stated that Imgur's withdrawal was a commercial decision. Other MediaLab services, such as Kik Messenger, continue to operate in the UK with age verification measures in place.

OpenAI launches Instant Checkout to enable in-chat purchases

OpenAI has launched Instant Checkout, a feature that lets users make direct purchases within ChatGPT. The initial rollout applies to US Etsy sellers, with Shopify merchants to follow.

The system is powered by the Agentic Commerce Protocol, which OpenAI co-developed with Stripe, and currently supports single-item purchases. Future updates will add multi-item carts and expand to more regions.

According to OpenAI, product results in ChatGPT are organic and ranked for relevance. The e-commerce framework will be open-sourced to accelerate integrations for merchants and developers. Users can pay using cards already on file, and transactions involve explicit confirmation steps, scoped payment tokens, and limited data sharing to build trust.

Michelle Fradin, OpenAI’s product lead for ChatGPT commerce, said the goal is to move beyond information retrieval and support real-world actions. Stripe’s president for technology and business, Will Gaybrick, described the partnership as laying economic infrastructure for AI.

Merchants will pay a small fee on completed purchases, while users are not charged extra and product prices remain unchanged.

Reuters reported that Etsy and Shopify shares rose significantly following the announcement, with Etsy closing up nearly 16 percent and Shopify more than 6 percent. OpenAI plans to extend the system to more merchants and payment types over time.

Claude Sonnet 4.5 expands developer options with rollbacks and longer-running agents

Anthropic has released Claude Sonnet 4.5, with a suite of upgrades aimed at coding, automation, and creativity. The update improves Claude Code, extends Computer Use, and introduces experimental tools to boost productivity and support real-world applications.

Claude Code now features checkpoints, allowing developers to roll back projects to earlier versions. The Claude API has also been expanded, supporting longer-running agents to generate files such as slides, spreadsheets, and documents directly within chats.

The model’s Computer Use function has been strengthened, enabling agents to operate applications for up to 30 hours autonomously. Anthropic says Claude Sonnet 4.5 built a Slack-style app with 11,000 lines of code in one session.

A new feature, Imagine with Claude, focuses on generating creative software. The system produced a Shakespeare-themed desktop with customised scripts and performance schedules from a single prompt, highlighting its versatility.

Anthropic has maintained steady pricing for free and premium users, positioning Sonnet 4.5 as its most practical and feature-rich release yet, combining reliability with expanded creative and developer-friendly tools.

Athens Democracy Forum highlights AI challenge for democracy

The 2025 Athens Democracy Forum opened in Athens with a dedicated session on AI, ethics and democracy, co-organised by Kathimerini in partnership with The New York Times.

Held at the Athens Conservatoire, the event placed AI at the heart of discussions on the future of democratic governance.

Speakers underlined the urgency of addressing systemic challenges created by AI.

Achilleas Tsaltas, president of the Democracy & Culture Foundation, described AI as the central concern of the era. At the same time, Greece’s minister of digital governance, Dimitris Papastergiou, warned that AI should remain a servant instead of becoming a master.

Axel Dauchez, founder of Make.org, pointed to the conflict between democratic and authoritarian governance models and called for stronger civic education.

The opening panel brought together academics such as Oxford’s Stathis Kalyvas and Yale’s Hélène Landemore, who examined how AI affects liberal democracies, global inequalities and political accountability.

Discussions concluded with a debate on Aristotle’s ethics as a framework for evaluating opportunities and risks in AI development, moderated by Stephen Dunbar-Johnson of The New York Times.

The forum continues with panels on Greece's AI transformation blueprint, the regulation of AI, and the emerging concept of AI sovereignty as a business model.

Gen Z most vulnerable to phishing scams

A global survey commissioned by Yubico suggests that younger workers are more vulnerable to phishing scams than older generations. Gen Z respondents reported the highest level of interaction with phishing messages, with 62 percent admitting they engaged with a scam in the past year.

The study gathered responses from 18,000 employed adults in nine countries, including the UK, US, France, and Japan. In the past twelve months, 44 percent of participants admitted to clicking on or replying to a phishing message.

AI is raising the stakes for cybersecurity. Seventy percent of those surveyed believe phishing has become more effective due to AI, and 78 percent said the attacks seem more sophisticated. More than half could not confidently identify a phishing email when shown one.

Despite growing risks, cyber defences remain patchy. Only 48 percent said their workplace used multi-factor authentication across all services, and 40 percent reported never receiving cybersecurity training from their employer.

OpenAI reports $4.3 billion revenue in first half of 2025

OpenAI posted approximately $4.3 billion in revenue in the first half of 2025, according to a report by The Information cited in Cyprus Mail. That figure is roughly 16 percent higher than what the company is said to have earned in 2024.

During the same period, OpenAI reportedly burned around $2.5 billion due to heavy research and development investment and operational costs tied to ChatGPT. Total R&D spending for H1 2025 is reported to have reached $6.7 billion, and the company held about $17.5 billion in cash and securities at the period's close.

OpenAI is targeting full-year revenue of $13 billion and aims to cap annual cash burn at $8.5 billion. Meanwhile, in August, the company was reportedly in early discussions about a potential stock sale that would give employees access to liquidity and could value the company at close to $500 billion.
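The reported figures imply some simple arithmetic, sketched below. It uses only the numbers quoted above, takes the 16 percent comparison at face value, and assumes the full-year targets refer to calendar 2025. The derived values are rough estimates, not reported results.

```python
# All amounts in USD billions, as quoted in the report.
h1_revenue = 4.3        # H1 2025 revenue
growth_vs_2024 = 0.16   # H1 2025 revenue is ~16% above full-year 2024 revenue
h1_cash_burn = 2.5      # H1 2025 cash burn
target_revenue = 13.0   # full-year revenue target
burn_cap = 8.5          # full-year cash burn cap

# Back out the 2024 revenue implied by the 16 percent comparison.
implied_2024_revenue = h1_revenue / (1 + growth_vs_2024)

# What the second half would need to deliver, or could still spend.
h2_revenue_needed = target_revenue - h1_revenue
h2_burn_headroom = burn_cap - h1_cash_burn

print(f"Implied 2024 revenue: ~${implied_2024_revenue:.1f}B")
print(f"H2 revenue needed to hit target: ${h2_revenue_needed:.1f}B")
print(f"Cash burn headroom left for H2: ${h2_burn_headroom:.1f}B")
```

On these assumptions, second-half revenue would need to be roughly double the first half's for the company to reach its stated target.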
