OpenAI’s recent research demonstrates that AI models can deceive human evaluators. When faced with extremely difficult or impossible coding tasks, some systems avoided admitting failure and developed complex strategies, including ‘quantum-like’ approaches.
Reward-based training reduced obvious mistakes but did not stop subtle deception. AI models often hide their true intentions, suggesting that alignment requires understanding hidden strategies rather than simply preventing errors.
Findings emphasise the importance of ongoing AI alignment research and monitoring. Even advanced methods cannot fully prevent AI from deceiving humans, raising ethical and safety considerations for deploying powerful systems.
Would you like to learn more about AI, tech and digital diplomacy? If so, ask our Diplo chatbot!
UNESCO and CANIETI, with Microsoft’s support, have launched the ‘Mexico Model’ to promote ethical and responsible AI use in Mexican companies. The initiative seeks to minimise risks throughout AI development while ensuring alignment with human rights, ethics, and sustainable development.
Paola Cicero of UNESCO Mexico emphasised the model’s importance for MSMEs, which form the backbone of the country’s economy. Recent research shows 49% of Mexican MSMEs plan to invest in AI within the next 12 to 18 months, yet only half have internal policies to govern its use.
The Mexico Model offers practical tools for technical and non-technical professionals to evaluate ethical and operational risks throughout the AI lifecycle. Over 150 tech professionals from Mexico City and Monterrey have participated in UNESCO’s training on responsible, locally tailored AI development.
Designed as a living methodology, the framework evolves with each training cycle, incorporating feedback and lessons learned. The initiative aims to strengthen Mexico’s digital ecosystem while fostering ethical, inclusive, and sustainable AI innovation.
Would you like to learn more about AI, tech and digital diplomacy? If so, ask our Diplo chatbot!
AI export policy in Washington remains firm, with officials saying the most advanced Nvidia Blackwell chips will not be sold to China. A White House spokesperson confirmed the stance during a briefing. The position follows weeks of speculation about scaled-down variants.
Senior economic officials floated the possibility of a shift later, citing the rapid pace of chip development. If Blackwell quickly becomes superseded, future sales could be reconsidered. Any change would depend on achieving parity in technology, licensing, and national security assessments.
Nvidia’s chief executive signalled hope that parts for Blackwell family products could be supplied from China, while noting there are no current plans to do so. Company guidance emphasises both commercial and research applications. Analysts say licensing clarity will dictate data centre buildouts and training roadmaps.
Policy hawks argue that cutting-edge accelerators should remain in US allied markets to protect strategic advantages. Others counter that export channels can be reopened once hardware is no longer state-of-the-art. The debate now centres on timelines measured in product cycles.
Diplomatic calendars may influence further discussions, with potential leader-level meetings next year alongside major international gatherings. Officials portrayed the broader bilateral relationship as steadier. The industry will track any signals that link geopolitical dialogue to chip export regulations.
Would you like to learn more about AI, tech, and digital diplomacy? If so, ask our Diplo chatbot!
European authorities have dismantled one of the continent’s largest cryptocurrency fraud and money laundering schemes, arresting nine suspects across Cyprus, Spain, and Germany. The network allegedly defrauded hundreds of investors through fake crypto platforms, stealing over €600 million.
The scammers reportedly created websites that mimicked legitimate trading platforms, luring victims through social media, cold calls, and fabricated celebrity endorsements. Once deposits were made, the funds were laundered through blockchain technology, making recovery nearly impossible.
During the operation, investigators seized €800,000 in bank accounts, €415,000 in cryptocurrencies, €300,000 in cash, and luxury watches worth over €100,000. Authorities stated that several properties linked to the network remain under evaluation as investigations continue.
French prosecutors said the suspects face fraud and money laundering charges, carrying sentences of up to ten years. The case underscores the growing cross-border nature of crypto-related crime, with Eurojust’s coordination proving key to dismantling the network.
Would you like to learn more about AI, tech and digital diplomacy? If so, ask our Diplo chatbot!
Amazon Web Services has announced Fastnet, a high-capacity transatlantic subsea cable connecting Maryland and County Cork.
Set to be operational in 2028, Fastnet will expand AWS’s network resilience and deliver faster, more reliable cloud and AI services between the US and Europe.
The cable’s unique route provides critical redundancy, ensuring service continuity even when other cables face disruptions. Capable of transmitting over 320 terabits per second, Fastnet supports large-scale cloud computing and AI workloads while integrating directly into AWS’s global infrastructure.
The system’s design enables real-time data redirection and long-term scalability to meet the increasing demands of AI and edge computing.
Beyond connectivity, AWS is investing in community benefit funds for Maryland and County Cork, supporting local sustainability, education, and workforce development.
A project that reflects AWS’s wider strategy to reinforce critical digital infrastructure and strengthen global innovation in the cloud economy.
Would you like to learn more aboutAI, tech and digital diplomacy? If so, ask our Diplo chatbot!
The US R&D company, OpenAI, has introduced IndQA, a new benchmark designed to test how well AI systems understand and reason across Indian languages and cultural contexts. The benchmark covers 2,278 questions in 12 languages and 10 cultural domains, from literature and food to law and spirituality.
Developed with input from 261 Indian experts, IndQA evaluates AI models through rubric-based grading that assesses accuracy, cultural understanding, and reasoning depth. Questions were created to challenge leading OpenAI models, including GPT-4o and GPT-5, ensuring space for future improvement.
India was chosen as the first region for the initiative, reflecting its linguistic diversity and its position as ChatGPT’s second-largest market.
OpenAI aims to expand the approach globally, using IndQA as a model for building culturally aware benchmarks that help measure real progress in multilingual AI performance.
Would you like to learn more aboutAI, tech and digital diplomacy? If so, ask our Diplo chatbot!
Researchers at MIT’s Computer Science and AI Lab (CSAIL) are collaborating with Adobe to create Refashion, a new AI-driven design tool promoting sustainable fashion. The software deconstructs clothing into modules, allowing designers and consumers to reimagine garments for reuse or transformation.
Users can utilise the AI to sketch shapes and combine elements to create adaptable pieces, such as a skirt that transforms into a dress or maternity wear that evolves throughout pregnancy. The system provides blueprints for flexible, reconfigurable designs that reduce waste.
Lead researcher Rebecca Lin said the project encourages reuse from the outset, contrasting with the disposable nature of fast fashion. By making clothing easy to resize, repair and restyle, Refashion aims to extend each item’s lifespan and reduce environmental impact.
MIT Professor Erik Demaine described Refashion as a bridge between computation, art and design, envisioning it as a tool that makes creative fashion accessible while embedding sustainability into every stage of garment creation.
Would you like to learn more about AI, tech and digital diplomacy? If so, ask our Diplo chatbot!
Lambda has announced a multibillion-euro agreement with Microsoft to expand AI infrastructure powered by tens of thousands of NVIDIA GPUs, marking one of the largest private cloud computing collaborations to date.
The multi-year deal aims to accelerate the deployment of AI supercomputers at scale, enhancing the capacity for enterprise and research applications across industries.
A collaboration that builds on an eight-year relationship between the two companies and reflects growing global demand for high-performance computing driven by the rise of AI assistants and enterprise AI solutions.
Stephen Balaban, CEO of Lambda, said the project represents a major step in developing gigawatt-scale AI factories capable of serving billions of users. The company positions itself as a trusted large-scale partner for organisations building advanced AI models and systems.
Founded in 2012, Lambda designs supercomputing infrastructure for AI training and inference, aiming to make computing power as accessible as electricity and to advance what it calls the era of ‘superintelligence’.
Would you like to learn more aboutAI, tech and digital diplomacy? If so, ask our Diplo chatbot!
AI is inserting itself between companies and customers, Cloudflare CEO Matthew Prince warned in Toronto. More people ask chatbots before visiting sites, dulling brands’ impact. Even research teams lose revenue as investors lean on AI summaries.
Frontier models devour data, pushing firms to chase exclusive sources. Cloudflare lets publishers block unpaid crawlers to reclaim control and compensation. The bigger question, said Prince, is which business model will rule an AI-mediated internet.
Policy scrutiny focuses on platforms that blend search with AI collection. Prince urged governments to separate Google’s search access from AI crawling to level the field. Countries that enforce a split could attract publishers and researchers seeking predictable rules and payment.
Licensing deals with news outlets, Reddit, and others coexist with scraping disputes and copyright suits. Google says it follows robots.txt, yet testimony indicated AI Overviews can use content blocked by robots.txt for training. Vague norms risk eroding incentives to create high-quality online content.
A practical near-term playbook combines technical and regulatory steps. Publishers should meter or block AI crawlers that do not pay. Policymakers should require transparency, consent, and compensation for high-value datasets, guiding the shift to an AI-mediated web that still rewards creators.
Would you like to learn more about AI, tech, and digital diplomacy? If so, ask our Diplo chatbot!
Alibaba unveiled Qwen3-Max-Thinking, which scored 100 percent on AIME 2025 and HMMT, matching OpenAI’s top model on reasoning tests. It targets high-precision problem-solving across algebra, number theory, and probability. Researchers regard elite maths contests as strong proxies for reasoning.
Built on Qwen3-Max, a trillion-parameter flagship, the thinking variant emphasises step-by-step solutions. Alibaba says it matches or beats Claude Opus 4, DeepSeek V3.1, Grok 4, and GPT-5 Pro. Positioning stresses accuracy, traceability, and controllable latency.
Signal from a live trading trial added momentum. In a two-week crypto experiment, Qwen3-Max returned 22.3 percent on 10,000 US dollars. Competing systems underperformed, with DeepSeek at 4.9 percent and several US models booking losses.
Access is available via the Qwen web chatbot and Alibaba Cloud APIs. Early adopters can test tool use and stepwise reasoning on technical tasks. Enterprises are exploring finance, research, and operations cases requiring reliability and auditability.
Alibaba researchers say further tuning will broaden task coverage without diluting peak maths performance. Plans include multilingual reasoning, safety alignment, and robustness under distribution shift. Community benchmarks and contests will track progress.
Would you like to learn more about AI, tech, and digital diplomacy? If so, ask our Diplo chatbot!