Researchers for OpenAI and Apollo find scheming behaviour in frontier AI models

OpenAI and Apollo Research developed evaluations to reveal hidden misalignment in advanced AI systems, often labelled scheming, and observed scheming-like behaviour in several frontier models, including OpenAI o3, o4-mini, Gemini 2.5 pro and Claude Opus-4.

Researchers treated covert actions, such as deliberately withholding or distorting task-relevant information, as a measurable proxy and stress-tested models across dozens of scenarios.

Scheming occurs when a model outwardly appears aligned while pursuing an alternate goal. The teams used a trader analogy to explain the risk: a rule-breaking broker who covers tracks can seem more compliant than an honest operator who accepts lower gains.

They trained models with deliberative alignment to counter the behaviour, asking them to read and reason about an anti-scheming specification before acting. Hence, models follow safety principles instead of merely avoiding detection.

That training produced about a thirtyfold reduction in covert actions on out-of-distribution tests, with o3 falling from 13 percent to 0.4 percent and o4-mini dropping from 8.7 percent to 0.3 percent.

Nevertheless, rare but serious failures remained, and measurement proved complex. Models often became more aware of being evaluated, and situational awareness can suppress observable scheming without altering underlying aims.

The study relied heavily on access to the models’ chain of thought to probe motivations, so preserving reasoning transparency is crucial for reliable oversight. The teams warned that training practices eroding transparency could make monitoring far harder and let misalignment hide rather than vanish.

OpenAI and Apollo called for broader cross-lab safety evaluations, stronger monitoring tools and continued research into anti-scheming techniques. They renewed their partnership, launched a $500,000 red-teaming challenge focused on scheming and proposed shared testing protocols.

The researchers emphasised there is no evidence that today’s deployed AI models would abruptly begin harmful scheming. Still, the risk will grow as systems take on more ambiguous, long-term, real-world responsibilities instead of short, narrow tasks.

Would you like to learn more about AI, tech and digital diplomacy? If so, ask our Diplo chatbot!

Google adds AI features to Chrome browser on Android and desktop

Alphabet’s Google has announced new AI-powered features for its Chrome browser that aim to make web browsing more proactive instead of reactive. The update centres on integrating Gemini, Google’s AI assistant, into Chrome to provide contextual support across tabs and tasks.

The AI assistant will help students and professionals manage large numbers of open tabs by summarising articles, answering questions, and recalling previously visited pages. It will also connect with Google services such as Docs and Calendar, offering smoother workflows on desktop and mobile devices.

Chrome’s address bar, the omnibox, is being upgraded with AI Mode. Users can ask multi-part questions and receive context-aware suggestions relevant to the page they are viewing. Initially available in the US, the feature will roll out to other regions and languages soon.

Beyond productivity, Google is also applying AI to security and convenience. Chrome now blocks billions of spam notifications daily, fills in login details, and warns users about malicious apps.

Future updates are expected to bring agentic capabilities, enabling Chrome to carry out complex tasks such as ordering groceries with minimal user input.

Would you like to learn more about AI, tech and digital diplomacy? If so, ask our Diplo chatbot!

Microsoft builds the world’s most powerful AI data centre in Wisconsin

US tech giant, Microsoft, is completing the construction of Fairwater in Mount Pleasant, Wisconsin, which it says will be the world’s most powerful AI data centre. The facility is expected to be operational in early 2026 after a $3.3 billion investment, with an additional $4 billion now committed for a second site.

The company says the project will help shape the next generation of AI by training frontier models with hundreds of thousands of NVIDIA GPUs, offering ten times the performance of today’s fastest supercomputers.

Beyond technology, Microsoft is highlighting the impact on local jobs and skills. Thousands of construction workers have been employed during the build, while the site is expected to support around 500 full-time roles when the first phase opens, rising to 800 once the second is complete.

The US giant has also launched Wisconsin’s first Datacentre Academy with Gateway Technical College to prepare students for careers in the digital economy.

Microsoft is also stressing its sustainability measures. The data centre will rely on a closed-loop liquid cooling system and outside air to minimise water use, while all fossil-fuel power consumed will be matched with carbon-free energy.

A new 250 MW solar farm is under construction in Portage County to support the commitment. The company has partnered with local organisations to restore prairie and wetland habitats, further embedding the project into the surrounding community.

Executives say the development represents more than just an investment in AI. It signals a long-term commitment to Wisconsin’s economy, education, and environment.

From broadband expansion to innovation labs, the company aims to ensure the benefits of AI extend to local businesses, students, and residents instead of remaining concentrated in global hubs.

Would you like to learn more about AI, tech and digital diplomacy? If so, ask our Diplo chatbot!

New Amazon AI transforms seller experience

Amazon has unveiled a significant upgrade to its Seller Assistant, evolving the tool into an agentic AI-powered partner that can actively help sellers manage and grow their businesses.

Powered by Amazon Bedrock and using advanced models from Amazon Nova and Anthropic Claude, the AI can respond to queries and plan, reason, and act with a seller’s permission. Independent sellers now have an assistant operating around the clock while controlling them.

The upgraded AI can optimise inventory, monitor account health, and provide strategic guidance on product listings and compliance requirements.

Analysing historical trends alongside current data can suggest new product categories, forecast demand, and propose advertising strategies to improve performance. Sellers can receive actionable recommendations instead of manually reviewing reports, saving time and effort.

Creative Studio also benefits from agentic AI capabilities, enabling sellers to generate professional-quality advertising content in hours instead of weeks.

The AI evaluates products alongside Amazon’s shopping signals and produces tailored ad concepts with clear reasoning, helping sellers refine campaigns and boost engagement. Early users report faster decisions, better inventory management, and more efficient marketing.

Amazon plans to extend Seller Assistant to other countries in the coming months at no extra cost.

The evolution highlights the growing role of AI in everyday business operations. It reflects Amazon’s commitment to integrating advanced technologies into the seller experience instead of relying solely on human intervention.

Would you like to learn more about AI, tech and digital diplomacy? If so, ask our Diplo chatbot!

Amazon and Mercado Libre criticised for limiting seller mobility in Mexico

Mexico’s competition watchdog has accused Amazon and Mercado Libre of erecting barriers that limit the mobility of sellers in the country’s e-commerce market. The two platforms reportedly account for 85% of the seller market.

The Federal Economic Competition Commission (COFECE) stated that the companies provide preferential treatment to sellers who utilise their logistics services and fail to disclose how featured offers are selected, thereby restricting fair competition.

Despite finding evidence of these practices, COFECE stopped short of imposing corrective measures, citing a lack of consensus among stakeholders. Amazon welcomed the decision, saying it demonstrates the competitiveness of the retail market in Mexico.

The watchdog aims to promote a more dynamic e-commerce sector, benefiting buyers and sellers. Its February report had recommended measures to improve transparency, separate loyalty programme services, and allow fairer access to third-party delivery options.

Trade associations praised COFECE for avoiding sanctions, warning that penalties could harm consumers and shield traditional retailers. Mercado Libre has not yet commented on the findings.

Would you like to learn more about AI, tech, and digital diplomacy? If so, ask our Diplo chatbot!

AI tool predicts risk of over 1,000 diseases years ahead

Scientists have unveiled an AI tool capable of predicting the risk of developing over 1,000 medical conditions. Published in Nature, the model can forecast certain cancers, heart attacks, and other diseases more than a decade in advance.

Developed by the German Cancer Research Centre (DKFZ), the European Molecular Biology Laboratory (EMBL), and the University of Copenhagen, the model utilises anonymised health data from the UK and Denmark. It tracks the order and timing of medical events to spot patterns that lead to serious illness.

Researchers said the tool is exceptionally accurate for diseases with consistent progression, including some cancers, diabetes, heart attacks, and septicaemia. Its predictions work like a weather forecast, indicating higher risk rather than certainty.

The model is less reliable for unpredictable conditions such as mental health disorders, infectious diseases, or pregnancy complications. It is more accurate for near-term forecasts than for those decades ahead.

Though not yet ready for clinical use, the system could help doctors identify high-risk patients earlier and enable more personalised, preventive healthcare strategies. Researchers say more work is needed to ensure the tool works for diverse populations.

Would you like to learn more about AI, tech, and digital diplomacy? If so, ask our Diplo chatbot!

EU AI Act enforcement gears up with 15 authorities named in Ireland

Ireland has designated 15 authorities to monitor compliance with the EU’s AI Act, making it one of the first EU countries fully ready to enforce the new rules. The AI Act regulates AI systems according to their risk to society and began phasing in last year.

Governments had until 2 August to notify the European Commission of their appointed market surveillance authorities. In Ireland, these include the Central Bank, Coimisiún na Meán, the Data Protection Commission, the Competition and Consumer Protection Commission, and the Health and Safety Authority.

The country will also establish a National AI Office as the central coordinator for AI Act enforcement and liaise with EU institutions. A single point of contact must be designated where multiple authorities are involved to ensure clear communication.

Ireland joins Cyprus, Latvia, Lithuania, Luxembourg, Slovenia, and Spain as countries that have appointed their contact points. The Commission has not yet published the complete list of authorities notified by member states.

Former Italian Prime Minister Mario Draghi has called for a pause in the rollout of the AI Act, citing risks and a lack of technical standards. The Commission has launched a consultation as part of its digital simplification package, which will be implemented in December.

Would you like to learn more about AI, tech, and digital diplomacy? If so, ask our Diplo chatbot!

UK partners with NVIDIA to drive AI growth and new jobs

NVIDIA and the UK are accelerating plans to build the nation’s AI infrastructure, positioning the country as a hub for AI innovation, jobs and research.

The partnership, announced by Prime Minister Keir Starmer and NVIDIA CEO Jensen Huang earlier in the year, has already resulted in commitments worth up to £11 billion.

A rollout that includes AI factories equipped with 120,000 NVIDIA Blackwell GPUs across UK data centres, supporting projects such as OpenAI’s Stargate UK.

NVIDIA partner Nscale will host 60,000 of these GPUs domestically while expanding its global capacity to 300,000. Microsoft, CoreWeave and other partners are also investing in advanced supercomputing facilities, with new projects announced in England and Scotland.

NVIDIA is working with Oxford Quantum Circuits and other research institutions to integrate AI and quantum technologies in a collaboration that extends to quantum computing.

Universities in Edinburgh and Oxford are advancing GPU-driven quantum error correction and AI-controlled quantum hardware, highlighting the UK’s growing role in cutting-edge science.

To prepare the workforce, NVIDIA has joined forces with techUK and QA to provide training programmes and AI skills development.

The government has framed the initiative as a foundation for economic resilience, job creation and sovereign AI capability, aiming to place Britain at the forefront of the AI industrial revolution.

Would you like to learn more about AI, tech and digital diplomacy? If so, ask our Diplo chatbot!

Meta launches AI smart glasses with Ray-Ban and Oakley

Zuckerberg’s Meta has unveiled a new generation of smart glasses powered by AI at its annual Meta Connect conference in California. Working with Ray-Ban and Oakley, the company introduced devices including the Meta Ray-Ban Display and the Oakley Meta Vanguard.

These glasses are designed to bring the Meta AI assistant into daily use instead of being confined to phones or computers.

The Ray-Ban Display comes with a colour lens screen for video calls and messaging and a 12-megapixel camera, and will sell for $799. It can be paired with a neural wristband that enables tasks through hand gestures.

Meta also presented $499 Oakley Vanguard glasses aimed at sports fans and launched a second generation of its Ray-Ban Meta glasses at $379. Around two million smart glasses have been sold since Meta entered the market in 2023.

Analysts see the glasses as a more practical way of introducing AI to everyday life than the firm’s costly Metaverse project. Yet many caution that Meta must prove the benefits outweigh the price.

Chief executive Mark Zuckerberg described the technology as a scientific breakthrough. He said it forms part of Meta’s vast AI investment programme, which includes massive data centres and research into artificial superintelligence.

The launch came as activists protested outside Meta’s New York headquarters, accusing the company of neglecting children’s safety. Former safety researchers also told the US Senate that Meta ignored evidence of harm caused by its VR products, claims the company has strongly denied.

Would you like to learn more about AI, tech and digital diplomacy? If so, ask our Diplo chatbot!

Microsoft podcast explores future of AI in medicine

AI is reshaping healthcare, according to experts featured in Microsoft’s new podcast series The AI Revolution in Medicine, Revisited.

Peter Lee, President of Microsoft Research, spoke with clinicians and researchers about how AI is helping doctors work more effectively and patients access better care. From note-taking during visits to accelerating drug discovery, the technology is already proving its value.

A doctor of UC San Diego Health in the US said AI helps physicians draft longer, more empathetic responses to patient messages, reducing mental strain. Meanwhile, Stanford’s Dr Roxana Daneshjou described how AI detected a dosage error in a medical summary, acting as a crucial safeguard.

Bill Gates highlighted how AI could boost healthcare in low-income regions by providing medical intelligence where doctors are scarce. Other guests suggested the technology may even blur traditional boundaries between medical specialties while accelerating drug development.

Would you like to learn more about AI, tech and digital diplomacy? If so, ask our Diplo chatbot