Virtual AI agents tested in social good experiment

Nonprofit organisation Sage Future has launched an unusual initiative that puts AI agents to work for philanthropy.

In a recent experiment backed by Open Philanthropy, four AI models, including OpenAI’s GPT-4o and two of Anthropic’s Claude Sonnet models, were tasked with raising money for a charity of their choice. Within a week, they collected $257 for Helen Keller International, which supports global health efforts.

The AI agents were given a virtual workspace where they could browse the internet, send emails, and create documents. They collaborated through group chats and even launched a social media account to promote their campaign.

Though most donations came from human spectators observing the experiment, the exercise revealed the surprising resourcefulness of these AI tools. one Claude model even generated profile pictures using ChatGPT and let viewers vote on their favourite.

Despite occasional missteps, including agents pausing for no reason or becoming distracted by online games, the experiment offered insights into the emerging capabilities of autonomous systems.

Sage’s director, Adam Binksmith, sees this as just the beginning, with future plans to introduce conflicting agent goals, saboteurs, and larger oversight systems to stress-test AI coordination and ethics.

For more information on these topics, visit diplomacy.edu.

Blockchain app ARK fights to keep human creativity ahead of AI

Nearly 20 years after his AI career scare, screenwriter Ed Bennett-Coles and songwriter Jamie Hartman have developed ARK, a blockchain app designed to safeguard creative work from AI exploitation.

The platform lets artists register ownership of their ideas at every stage, from initial concept to final product, using biometric security and blockchain verification instead of traditional copyright systems.

ARK aims to protect human creativity in an AI-dominated world. ‘It’s about ring-fencing the creative process so artists can still earn a living,’ Hartman told AFP.

The app, backed by Claritas Capital and BMI, uses decentralised blockchain technology instead of centralised systems to give creators full control over their intellectual property.

Launching summer 2025, ARK challenges AI’s ‘growth at all costs’ mentality by emphasising creative journeys over end products.

Bennett-Coles compares AI content to online meat delivery, efficient but soulless, while human artistry resembles a grandfather’s butcher trip, where the experience matters as much as the result.

The duo hopes their solution will inspire industries to modernise copyright protections before AI erodes them completely.

For more information on these topics, visit diplomacy.edu.

Microsoft’s Copilot Vision now sees your entire screen to guide you through apps

Microsoft is testing a major upgrade to its Copilot AI that can view your entire screen instead of just working within the Edge browser.

The new Copilot Vision feature helps users navigate apps like Photoshop and Minecraft by analysing what’s on display and offering step-by-step guidance, even highlighting specific tools instead of just giving verbal instructions.

The feature operates more like a shared Teams screen instead of Microsoft’s controversial Recall snapshot system.

Currently limited to US beta testers, Copilot Vision will eventually highlight interface elements directly on users’ screens. It works on standard Windows PCs instead of requiring specialised Copilot+ hardware, with mobile versions coming to iOS and Android.

Alongside visual assistance, Microsoft is adding document search capabilities. Copilot can now find information within files like Word documents and PDFs instead of just searching by filename.

Both updates will roll out fully in the coming weeks, potentially transforming how users interact with both apps and documents on their Windows devices.

For more information on these topics, visit diplomacy.edu.

Amazon launches Nova Sonic AI for natural voice interactions

Amazon has unveiled Nova Sonic, a new AI model designed to process and generate human-like speech, positioning it as a rival to OpenAI and Google’s top voice assistants. The company claims it outperforms competitors in speed, accuracy, and cost, and it is reportedly 80% cheaper than GPT-4o.

Already powering Alexa+, Nova Sonic excels in real-time conversation, handling interruptions and noisy environments better than legacy AI assistants.

Unlike older voice models, Nova Sonic can dynamically route requests, fetching live data or triggering external actions when needed. Amazon says it achieves a 4.2% word error rate across multiple languages and responds in just 1.09 seconds, faster than OpenAI’s GPT-4o.

Developers can access it via Bedrock, Amazon’s AI platform, using a new streaming API.

The launch signals Amazon’s push into artificial general intelligence (AGI), AI that mimics human capabilities.

Rohit Prasad, head of Amazon’s AGI division, hinted at future models handling images, video, and sensory data. This follows last week’s preview of Nova Act, an AI for browser tasks, suggesting Amazon is accelerating its AI rollout beyond Alexa.

For more information on these topics, visit diplomacy.edu.

Starday plans rapid rollout of AI-developed snacks

AI-driven food company Starday has secured $11 million in Series A funding to support the development and retail expansion of its innovative food brands.

The round was led by Slow Ventures and Equal Ventures, with an additional $3 million credit facility from Silicon Valley Bank. Starday’s total funding now stands at $20 million.

Founded by Chaz Flexman, Lena Kwak, and Lily Burtis, Starday uses AI to identify market gaps and quickly create new food products that cater to evolving consumer preferences.

Its latest offerings, including allergen-free snacks like Habeya Sweet Potato Crackers and All Day chickpea protein crunch, are already available in major United States grocery chains such as Kroger and Hannaford.

With plans to launch 14 new products across its four brands, the company is aiming to redefine the pace and precision of food innovation.

CEO Flexman says the funding will help Starday partner with more retailers and food brands to fill gaps in the market, accelerating the launch of targeted products in fast-growing categories. Backers believe Starday’s data-led model gives it a structural edge in a traditionally slow-moving industry.

For more information on these topics, visit diplomacy.edu.

LMArena tightens rules after Llama 4 incident

Meta has come under scrutiny after submitting a specially tuned version of its Llama 4 AI model to the LMArena leaderboard, sparking concerns about fair competition.

The ‘experimental’ version, dubbed Llama-4-Maverick-03-26-Experimental, ranked second in popularity, trailing only Google’s Gemini-2.5-Pro.

While Meta openly labelled the model as experimental, many users assumed it reflected the public release. Once the official version became available, users quickly noticed it lacked the expressive, emoji-filled responses seen in the leaderboard battles.

LMArena, a crowdsourced platform where users vote on chatbot responses, said Meta’s custom variant appeared optimised for human approval, possibly skewing the results.

The group released over 2,000 head-to-head matchups to back its claims, showing the experimental Llama 4 consistently offered longer, more engaging answers than the more concise public build.

In response, LMArena updated its policies to ensure greater transparency and stated that Meta’s use of the experimental model did not align with expectations for leaderboard submissions.

Meta defended its approach, stating the experimental model was designed to explore chat optimisation and was never hidden. While company executives denied any misconduct, including speculation around training on test data, they acknowledged inconsistent performance across platforms.

Meta’s GenAI chief Ahmad Al-Dahle said it would take time for all public implementations to stabilise and improve. Meanwhile, LMArena plans to upload the official Llama 4 release to its leaderboard for more accurate evaluation going forward.

For more information on these topics, visit diplomacy.edu.

Minister urges Indian start-ups to shift focus from ice cream to semiconductors

India’s Commerce Minister Piyush Goyal has sparked controversy by questioning whether Indian start-ups should focus on semiconductor chips instead of gluten-free ice creams and food delivery apps.

Speaking at a start-up conference, he compared India’s consumer internet boom unfavourably with China’s advances in robotics and AI, urging entrepreneurs to pursue more ambitious tech innovations instead of safe lifestyle products.

While acknowledging the position of India as the world’s third-largest start-up ecosystem, Goyal faced pushback from founders who argued consumer apps often evolve into tech pioneers.

Quick-commerce CEO Aadit Palicha noted that companies like Amazon began as consumer platforms before revolutionising cloud computing. However, investors admitted deep-tech struggles for funding, with most capital chasing quick-return ventures instead of long-term hardware or AI projects.

The debate highlights India’s innovation crossroads. Despite having 4,000 deep-tech start-ups, projected to reach 10,000 by 2030, they attracted just 5% of 2023 funding instead of China’s 35%.

Experts suggest the government could help by offering tax incentives instead of criticism, and building research bridges between academia and start-ups to compete globally in advanced technologies

For more information on these topics, visit diplomacy.edu.

Trump moves to prop up struggling coal industry

President Trump is set to sign an executive order designating coal as a critical mineral instead of allowing its continued decline in the energy sector.

The order will force some coal-fired power plants slated for closure to remain operational, with the administration citing rising electricity demand from data centres instead of acknowledging coal’s dwindling competitiveness.

Currently, coal generates just 15% of US electricity instead of its 51% share in 2001, having been overtaken by cheaper natural gas and renewables.

Environmental experts warn coal remains the dirtiest energy source instead of cleaner alternatives, releasing harmful pollutants linked to health issues like heart disease and mercury poisoning. While the order may temporarily slow plant closures, analysts note it won’t reverse coal’s decline.

Solar and wind power now undercut operating costs at nearly all US coal plants instead of being more expensive, as was once the case.

The move could have more impact in steelmaking, where coal is still used instead of newer green steel techniques in most production. However, for power generation, renewables can be deployed faster than new coal plants instead of struggling to meet demand.

The order appears to prioritise political symbolism instead of addressing energy market realities, as even existing coal plants struggle to compete with increasingly affordable clean energy alternatives.

For more information on these topics, visit diplomacy.edu.

New AI firm Deep Cogito launches versatile open models

A new San Francisco-based startup, Deep Cogito, has unveiled its first family of AI models, Cogito 1, which can switch between fast-response and deep-reasoning modes instead of being limited to just one approach.

These hybrid models combine the efficiency of standard AI with the step-by-step problem-solving abilities seen in advanced systems like OpenAI’s o1. While reasoning models excel in fields like maths and physics, they often require more computing power, a trade-off Deep Cogito aims to balance.

The Cogito 1 series, built on Meta’s Llama and Alibaba’s Qwen models instead of starting from scratch, ranges from 3 billion to 70 billion parameters, with larger versions planned.

Early tests suggest the top-tier Cogito 70B outperforms rivals like DeepSeek’s reasoning model and Meta’s Llama 4 Scout in some tasks. The models are available for download or through cloud APIs, offering flexibility for developers.

Founded in June 2024 by ex-Google DeepMind product manager Dhruv Malhotra and former Google engineer Drishan Arora, Deep Cogito is backed by investors like South Park Commons.

The company’s ambitious goal is to develop general superintelligence,’ AI that surpasses human capabilities, rather than merely matching them. For now, the team says they’ve only scratched the surface of their scaling potential.

For more information on these topics, visit diplomacy.edu.

Bank appoints head of AI enablement

Standard Chartered has appointed David Hardoon as its global head of AI enablement, further embedding AI across its operations.

Based in Singapore, he will report to group chief data officer Mohammed Rahim.

Hardoon will lead AI governance and identify areas where AI can enhance productivity, efficiency, and client experiences. His appointment follows the bank’s recent rollout of a generative AI tool to over 70,000 employees across 41 markets.

The bank has been steadily introducing AI-driven tools, including a smart video column to provide insights for clients in Asia. It plans further expansion of its internal AI systems across additional regions.

With more than 20 years of experience in data and AI, including with Singapore’s central bank, Hardoon is expected to guide the responsible and strategic use of AI technologies across Standard Chartered’s global footprint.

For more information on these topics, visit diplomacy.edu.