Agibot begins Asia-Pacific expansion with Malaysia event

The robotics company Agibot has launched a series of Asia-Pacific strategic initiatives for 2026 with a high-profile event in Malaysia, signalling its push to expand embodied AI and robotics across the region.

The launch, held at i-City in Selangor, was attended by executives, Malaysian government officials, partners, and customers. It also marked the opening of the first AI and Robotics Experience Centre in Malaysia.

The centre was developed in partnership with I-Bhd and officiated by Science, Technology and Innovation Minister Chang Lih Kang. Agibot said the facility will showcase real-world applications of humanoid robotics.

Founder and CEO of Agibot, Deng Taihua, said the company produced its 5,000th humanoid robot in 2025, strengthening its position as it begins regional expansion in 2026.

The firm plans to deploy its systems across property, hospitality, tourism, and urban services, while its partnership with I-Bhd will focus on wellness, longevity, and residential robotics.

Inside NeurIPS 2025: How AI research is shifting focus from scale to understanding

For over three decades, the Conference on Neural Information Processing Systems (NeurIPS) has played a pivotal role in shaping the field of AI research. What appears at the conference often determines what laboratories develop, what companies implement, and what policymakers ultimately confront. In this sense, the conference functions not merely as an academic gathering, but as an early indicator of where AI is heading.

The 2025 awards reflected the field at a moment of reassessment. After years dominated by rapid scaling, larger datasets, and unprecedented computational power, researchers are increasingly questioning the consequences of that growth. This year’s most highly recognised papers did not focus on pushing benchmarks marginally higher. Instead, they examined whether today’s AI systems genuinely understand, generalise, and align with human expectations.

The following sections detail the award-winning research, highlighting the problems each study addresses, its significance, and its potential impact on the future of AI.

How one paper transformed computer vision over the course of a decade

Faster R‑CNN: Towards Real-Time Object Detection with Region Proposal Networks

One of the highlights of NeurIPS 2025 was the recognition of a paper published a decade earlier that has influenced modern computer vision. It introduced a new way of detecting objects in images that remains central to the field today.

Before this contribution, state‑of‑the‑art object detection systems relied on separate region proposal algorithms to suggest likely object locations, a step that was both slow and brittle. The authors changed that paradigm by embedding a region proposal network directly into the detection pipeline. By sharing full-image convolutional features between the proposal and detection stages, the system reduced the cost of generating proposals to almost zero while maintaining high accuracy.
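
For readers who want to see the two-stage design in action, the sketch below loads a pretrained Faster R-CNN from torchvision and runs it on a dummy image. This is a generic usage example of a widely available implementation (the model name and the `weights` argument follow recent torchvision versions), not code from the original paper.

```python
# Minimal sketch: running a pretrained Faster R-CNN from torchvision.
# Illustrates the two-stage detector described above; the model name and
# weights argument are torchvision conventions, not the paper's own code.
import torch
import torchvision

# Faster R-CNN with a ResNet-50 FPN backbone, COCO-pretrained weights.
model = torchvision.models.detection.fasterrcnn_resnet50_fpn(weights="DEFAULT")
model.eval()

# A dummy RGB image (3 x H x W, values in [0, 1]) stands in for a real photo.
image = torch.rand(3, 480, 640)

with torch.no_grad():
    # The region proposal network and detection head share the same backbone
    # features internally, so one forward pass returns boxes, labels, scores.
    predictions = model([image])

print(predictions[0]["boxes"].shape, predictions[0]["scores"][:5])
```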

The design proved highly effective on benchmark datasets and could run in near real time on contemporary GPUs, allowing fast and reliable object detection in practical settings. Its adoption paved the way for a generation of two-stage detectors and sparked a wave of follow-on research that has shaped both academic work and real-world applications, from autonomous driving to robotics.

The recognition of this paper, more than a decade after its publication, underscores how enduring engineering insights can lay the foundation for long-term progress in AI. Papers that continue to influence research and applications years after they first appeared offer a helpful reminder that the field values not just novelty but also lasting contribution.

Defining the true limits of learning in real time

Optimal Mistake Bounds for Transductive Online Learning

While much of NeurIPS 2025 focused on practical advances, the conference also highlighted the continued importance of theoretical research. One of the recognised studies addressed a fundamental question in a field called online learning theory, which studies how systems can make sequential predictions and improve over time as they receive feedback.

The paper considered a learner, meaning any system that makes predictions on a sequence of problems, and examined how much it can improve when it sees the upcoming problems in advance but does not yet know their correct answers, known as labels.

The study focused on a method called transductive learning, in which the learner can take into account all upcoming problems without knowing their labels, allowing it to make more accurate predictions. Through precise mathematical analysis, the authors derived tight limits on the number of mistakes a learner can make in this setting.

By measuring problem difficulty using the Littlestone dimension, they demonstrated precisely how transductive learning reduces errors compared to traditional step-by-step online learning, thereby solving a long-standing theoretical problem.
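
To make the notion of a mistake bound concrete, the toy sketch below runs the classic halving algorithm over a small finite hypothesis class, where the number of mistakes is provably at most log2 of the class size. It is a textbook illustration of mistake-bounded online learning, not the paper's transductive construction.

```python
# Toy illustration of mistake-bounded online learning (the classic halving
# algorithm), not the paper's transductive setting. With a finite hypothesis
# class H containing the true hypothesis, mistakes are at most log2(|H|).
import math

# Hypothesis class: threshold classifiers on integers 0..9
# (h_t predicts 1 for x >= t, else 0). The true threshold is 6.
hypotheses = {t: (lambda x, t=t: int(x >= t)) for t in range(10)}
true_h = hypotheses[6]

version_space = set(hypotheses)   # hypotheses still consistent with feedback
mistakes = 0

for x in [2, 7, 5, 6, 9, 0, 3, 8]:        # the online sequence of problems
    # Predict by majority vote over the current version space.
    votes = sum(hypotheses[t](x) for t in version_space)
    prediction = int(votes * 2 >= len(version_space))
    label = true_h(x)                      # feedback arrives after predicting
    if prediction != label:
        mistakes += 1
    # Discard every hypothesis that disagreed with the revealed label.
    version_space = {t for t in version_space if hypotheses[t](x) == label}

print(f"mistakes = {mistakes}, bound = {math.floor(math.log2(len(hypotheses)))}")
```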

Although the contribution is theoretical, its implications are far from abstract. Many real-world systems operate in environments where data arrives continuously, but labels are scarce or delayed. Recommendation systems, fraud detection pipelines and adaptive security tools all depend on learning under uncertainty, making an understanding of fundamental performance limits essential.

The recognition of this paper at NeurIPS 2025 reflects its resolution of a long-standing open problem and its broader significance for the foundations of machine learning. At a time when AI systems are increasingly deployed in high-stakes settings, clear theoretical guarantees remain a critical safeguard against costly and irreversible errors.

How representation superposition explains why bigger models work better

Superposition Yields Robust Neural Scaling

The remarkable trend that larger language models tend to perform better has been well documented, but exactly why this happens has been less clear. Researchers explored this question by investigating the role of representation superposition, a phenomenon where a model encodes more features than its nominal dimensions would seem to allow.

By constructing a simplified model informed by real data characteristics, the authors demonstrated that when superposition is strong, loss decreases in a predictable manner as the model size increases. Under strong superposition, overlapping representations produce a loss that scales inversely with model dimension across a broad range of data distributions.

That pattern matches observations from open‑source large language models and aligns with recognised scaling laws such as those described in the Chinchilla paper.

The insight at the heart of the study is that overlap in representations can make large models more efficient learners. Rather than requiring each feature to occupy a unique space, models can pack information densely, allowing them to generalise better as they grow. This helps explain why simply increasing model size often yields consistent improvements in performance.
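
A toy numerical sketch can convey the intuition: when more features than dimensions are packed into a space as random directions, the average interference between any two features falls roughly as one over the dimension, which is the flavour of scaling the paper formalises. The code below is an illustrative toy under that assumption, not the authors' model.

```python
# Toy sketch of superposition: pack many features into a d-dimensional space
# as random unit vectors and measure how much any two features interfere.
# For random directions, the mean squared overlap is ~1/d, the intuition
# behind loss falling inversely with model width. Illustrative only.
import numpy as np

rng = np.random.default_rng(0)
n_features = 512

for d in [32, 64, 128, 256]:
    # n_features random unit vectors in d dimensions (n_features > d, so the
    # features necessarily overlap, i.e. they are stored in superposition).
    vecs = rng.normal(size=(n_features, d))
    vecs /= np.linalg.norm(vecs, axis=1, keepdims=True)

    overlaps = vecs @ vecs.T
    np.fill_diagonal(overlaps, 0.0)
    mean_interference = (overlaps ** 2).sum() / (n_features * (n_features - 1))

    print(f"d={d:4d}  mean squared overlap = {mean_interference:.4f}  (1/d = {1/d:.4f})")
```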

Understanding the mechanisms behind neural scaling laws is important for guiding future design choices. It provides a foundation for building more efficient models and clarifies when and why scaling may cease to deliver gains at higher capacities.

Questioning the limits of reinforcement learning in language models

Does Reinforcement Learning Really Incentivise Reasoning Capacity in LLMs Beyond the Base Model?

Reinforcement learning has been widely applied to large language models with the expectation that it can improve reasoning and decision-making. By rewarding desirable outputs, developers hope to push models beyond their base capabilities and unlock new forms of reasoning.

The study examines whether these improvements truly reflect enhanced reasoning or simply better optimisation within the models’ existing capacities. Through a systematic evaluation across tasks requiring logic, planning and multi-step inference, the authors find that reinforcement learning often does not create fundamentally new reasoning skills. Instead, the gains are largely confined to refining behaviours that the base model could already perform.

These findings carry important implications for the design and deployment of advanced language models. They suggest that current reinforcement learning techniques may be insufficient for developing models capable of independent or genuinely novel reasoning. As AI systems are increasingly tasked with complex decision-making, understanding the true limits of reinforcement learning becomes essential to prevent overestimating their capabilities.

The research encourages a more cautious and evidence-based approach, highlighting the need for new strategies if reinforcement learning is to deliver beyond incremental improvements.

Revealing a hidden lack of diversity in language model outputs

Artificial Hivemind: The Open-Ended Homogeneity of Language Models (and Beyond)

Large language models are often celebrated for their apparent creativity and flexibility. From essays to advice and storytelling, they appear capable of generating an almost limitless range of responses. Closer examination, however, reveals a more troubling pattern. Despite differences in architecture, scale and training data, many leading models tend to respond to open-ended prompts in strikingly similar ways.

The research examines this phenomenon through a carefully designed benchmark built around real-world questions that do not have a single correct answer. Rather than focusing on factual accuracy, the authors study how models behave when judgement, nuance, and interpretation are required.

Across a wide range of prompts, responses repeatedly converge on the same themes, tones and structures, producing what the authors describe as a form of collective behaviour rather than independent reasoning.
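
As a rough illustration of what measuring such convergence can look like, the sketch below computes pairwise lexical similarity between a few invented model responses; the paper's benchmark and metrics are considerably more sophisticated.

```python
# Minimal sketch of quantifying response homogeneity: pairwise Jaccard
# similarity over word sets. The responses below are invented placeholders;
# the paper's benchmark and metrics are far more sophisticated.
from itertools import combinations

responses = {
    "model_a": "work life balance matters most set boundaries and rest",
    "model_b": "set clear boundaries prioritise rest and protect balance",
    "model_c": "balance matters set boundaries rest and prioritise health",
}

def jaccard(a: str, b: str) -> float:
    """Overlap of the two responses' word sets (1.0 = identical vocabulary)."""
    wa, wb = set(a.lower().split()), set(b.lower().split())
    return len(wa & wb) / len(wa | wb)

for (name_a, text_a), (name_b, text_b) in combinations(responses.items(), 2):
    print(f"{name_a} vs {name_b}: {jaccard(text_a, text_b):.2f}")
```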

The study’s key contribution lies in its evaluation of existing assessment methods. Automated metrics commonly used to compare language models often fail to detect this convergence, even when human evaluators consistently prefer responses that display greater originality, contextual awareness, or diversity of perspective. As a result, models may appear to improve according to standard benchmarks while becoming increasingly uniform in practice.

The implications extend beyond technical evaluation. When language models are deployed at scale in education, media production, or public information services, the homogeneity of output risks narrowing the range of ideas and viewpoints presented to users. Instead of amplifying human creativity, such systems may quietly reinforce dominant narratives and suppress alternative framings.

The recognition of this paper signals a growing concern about how progress in language modelling is measured. Performance gains alone no longer suffice if they come at the cost of diversity, creativity, and meaningful variation. As language models play an increasingly important role in shaping public discourse, understanding and addressing collective behavioural patterns becomes a matter of both societal and technical importance.

Making large language models more stable by redesigning attention

Gated Attention for Large Language Models: Non-Linearity, Sparsity, and Attention-Sink-Free

As large language models grow in size and ambition, the mechanisms that govern how they process information have become a central concern. Attention, the component that allows models to weigh different parts of the input, sits at the core of modern language systems.

Yet, the same mechanism that enables impressive performance can also introduce instability, inefficiency, and unexpected failure modes, particularly when models are trained on long sequences.

The research focuses on a subtle but consequential weakness in standard attention designs. In many large models, certain tokens accumulate disproportionate influence, drawing attention away from more relevant information. Over time, this behaviour can distort the way models reason across long contexts, leading to degraded performance and unpredictable outputs.

To address this problem, the authors propose a gated form of attention that enables each attention head to dynamically regulate its own contribution. By introducing non-linearity and encouraging sparsity, the approach reduces the dominance of pathological tokens and leads to more balanced information flow during training and inference.
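
The sketch below illustrates the general idea in PyTorch: a learned sigmoid gate lets each attention head scale down its own output. It is a minimal interpretation of head-wise gating, and the paper's exact gate placement and parameterisation may differ.

```python
# Minimal sketch of head-wise gated attention: a learned sigmoid gate scales
# each head's output so a head can suppress its own contribution. Illustrates
# the general idea only; the paper's exact formulation may differ.
import torch
import torch.nn as nn
import torch.nn.functional as F

class GatedSelfAttention(nn.Module):
    def __init__(self, dim: int, n_heads: int):
        super().__init__()
        assert dim % n_heads == 0
        self.n_heads, self.head_dim = n_heads, dim // n_heads
        self.qkv = nn.Linear(dim, 3 * dim)
        self.gate = nn.Linear(dim, n_heads)   # one gate value per head, per token
        self.out = nn.Linear(dim, dim)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        b, t, d = x.shape
        q, k, v = self.qkv(x).chunk(3, dim=-1)
        # Reshape to (batch, heads, tokens, head_dim).
        q, k, v = (z.view(b, t, self.n_heads, self.head_dim).transpose(1, 2)
                   for z in (q, k, v))
        attn = F.softmax(q @ k.transpose(-2, -1) / self.head_dim ** 0.5, dim=-1)
        heads = attn @ v                                  # (b, heads, t, head_dim)
        # Sigmoid gate in (0, 1) lets each head damp its own output, which
        # discourages any single token or head from dominating the mix.
        g = torch.sigmoid(self.gate(x)).transpose(1, 2).unsqueeze(-1)
        return self.out((heads * g).transpose(1, 2).reshape(b, t, d))

x = torch.randn(2, 16, 64)
print(GatedSelfAttention(dim=64, n_heads=8)(x).shape)   # torch.Size([2, 16, 64])
```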

The results suggest that greater reliability does not necessarily require more data or larger models. Instead, careful architectural choices can significantly improve stability, efficiency, and performance. Such improvements are particularly relevant as language models are increasingly deployed in settings where long context understanding and consistent behaviour are essential.

At a time when language models are moving from experimental tools to everyday infrastructure, refinements of this kind highlight how progress can come from re-examining the foundations rather than simply scaling them further.

Understanding why models do not memorise their data

Why Diffusion Models Don’t Memorise: The Role of Implicit Dynamical Regularisation in Training

Generative AI has advanced at an extraordinary pace, with diffusion models now powering image generation, audio synthesis, and early video creation tools. A persistent concern has been that these systems might simply memorise their training data, reproducing copyrighted or sensitive material rather than producing genuinely novel content.

The study examines the training dynamics of diffusion models in detail, revealing a prolonged phase during which the models generate high-quality outputs that generalise beyond their training examples. Memorisation occurs later, and its timing grows predictably with the size of the dataset. In other words, generating new and creative outputs is not an accidental by-product but a natural stage of the learning process.

Understanding these dynamics has practical significance for both developers and regulators. It shows that memorisation is not an inevitable feature of powerful generative systems and can be managed through careful design of datasets and training procedures. As generative AI moves further into mainstream applications, knowing when and how models memorise becomes essential to ensuring trust, safety, and ethical compliance.

The findings provide a rare theoretical foundation for guiding policy and deployment decisions in a rapidly evolving landscape. By illuminating the underlying mechanisms of learning in diffusion models, the paper points to a future where generative AI can be both highly creative and responsibly controlled.

Challenging long-standing assumptions in reinforcement learning

1000 Layer Networks for Self-Supervised Reinforcement Learning: Scaling Depth Can Enable New Goal-Reaching Capabilities

Reinforcement learning has often been presented as a route to truly autonomous AI, yet practical applications frequently struggle due to fragile training processes and the need for carefully designed rewards. In a surprising twist, researchers have found that increasing the depth of neural networks alone can unlock new capabilities in self-supervised learning settings.

By constructing networks up to a thousand layers deep, agents learn to pursue goals more effectively without explicit instructions or rewards. The study demonstrates that depth itself can act as a substitute for hand-crafted incentives, enabling the system to explore and optimise behaviour in ways that shallower architectures cannot.

The findings challenge long-held assumptions about the limits of reinforcement learning and suggest a shift in focus from designing complex reward functions to designing more capable architectures. Potential applications span robotics, autonomous navigation, and simulated environments, where specifying every objective in advance is often impractical.

The paper underlines a broader lesson for AI, showing that complexity in structure can sometimes achieve what complexity in supervision cannot. For systems that must adapt and learn in dynamic environments, architectural depth may be a more powerful tool than previously appreciated.

What NeurIPS 2025 reveals about the state of AI

Taken together, research recognised at NeurIPS 2025 paints a picture of a field entering a more reflective phase. AI is no longer defined solely by the size of models. Instead, attention is turning to understanding learning dynamics, improving evaluation frameworks, and ensuring stability and reliability at scale.

The year 2025 did not simply reward technical novelty; it highlighted work that questions assumptions, exposes hidden limitations, and proposes more principled foundations for future systems. As AI becomes an increasingly influential force in society, this shift may prove to be one of the most important developments in the field’s evolution.

EU reaffirms commitment to Digital Markets Act enforcement

European Commission Executive Vice President Teresa Ribera has stated that the EU has a constitutional obligation under its treaties to uphold its digital rulebook, including the Digital Markets Act (DMA).

Speaking at a competition law conference, Ribera framed enforcement as a duty to protect fair competition and market balance across the bloc.

Her comments arrive amid growing criticism from US technology companies and political pressure from Washington, where enforcement of EU digital rules has been portrayed as discriminatory towards American firms.

Several designated gatekeepers have argued that the DMA restricts innovation and challenges existing business models.

Ribera acknowledged the right of companies to challenge enforcement through the courts, while emphasising that designation decisions are based on lengthy and open consultation processes. The Commission, she said, remains committed to applying the law effectively rather than retreating under external pressure.

Apple and Meta have already announced plans to appeal fines imposed in 2025 for alleged breaches of DMA obligations, reinforcing expectations that legal disputes around EU digital regulation will continue in parallel with enforcement efforts.

UK considers social media limits for youth

Keir Starmer has told Labour MPs that he is open to an Australian-style ban on social media for young people, following concerns about the amount of time children spend on screens.

The prime minister said reports of very young children using phones for hours each day have increased anxiety about the effects of digital platforms on under-16s.

Starmer previously opposed such a ban, arguing that enforcement would prove difficult and might instead push teenagers towards unregulated online spaces rather than safer platforms. Growing political momentum across Westminster, combined with Australia’s decision to act, has led to a reassessment of that position.

Speaking to MPs, Starmer said different enforcement approaches were being examined and added that phone use during school hours should be restricted.

UK ministers have also revisited earlier proposals aimed at reducing the addictive design of social media and strengthening safeguards on devices sold to teenagers.

Support for stricter measures has emerged across party lines, with senior figures from Labour, the Conservatives, the Liberal Democrats and Reform UK signalling openness to a ban.

A final decision is expected within months as ministers weigh child safety, regulation and practical implementation.

Creators showcase AI film innovation in San Jose festival

San Jose became the site of the inaugural Silicon Valley AI Film Festival (SVAIFF) on January 10–11, bringing together filmmakers, tech innovators and creatives to explore how AI is transforming cinema and creative expression.

The event featured AI-generated film trailers such as ‘Revolutionary’ and ‘Cosmic’, as well as panel discussions on industry trends and the economic implications of AI in film, and a competition that received over 2,000 entries.

Festival co-founder Cynthia Jiang highlighted how production companies are increasingly using AI in post-production and concept development, while acknowledging resistance remains among some traditional filmmakers.

Human and AI-assisted art appeared throughout the festival, including fashion shows that blended robotics with runway models and featured a humanoid robot performer.

The festival also celebrated creative achievements with awards, such as the Grand Prix for ‘White Night Lake’ and Best Animated Short for ‘A Tree’s Imagination.’ It premiered the feature-length film ‘The Wolves,’ directed by Bing He, who credited generative AI with enabling his vision without replacing his writing role.

AI tools influence modern personal finance practices

Personal finance assistants powered by AI tools are increasingly helping users manage budgets, analyse spending, and organise financial documents. Popular platforms such as ChatGPT, Google Gemini, Microsoft Copilot, and Claude now offer features designed to support everyday financial tasks.

Rather than focusing on conversational style, users should consider how financial data is accessed and how each assistant integrates with existing systems. Connections to spreadsheets, cloud storage, and secure platforms often determine how effective AI tools are for managing financial workflows.

ChatGPT is commonly used for drafting financial summaries, analysing expenses, and creating custom tools through plugins. Google Gemini is closely integrated with Google Docs and Sheets, making it suitable for users who rely on Google’s productivity ecosystem.

Microsoft Copilot provides strong automation for Excel and Microsoft 365 users, with administrative controls that appeal to organisations. Claude focuses on safety and large context windows, allowing it to process lengthy financial documents with more conservative output.

Choosing the most suitable AI tools for personal finance depends on workflow needs, data governance preferences, and privacy considerations. No single platform dominates every use case; each offers strengths across different financial management tasks.

Government IT vulnerabilities revealed by UK public sector cyberattack

A UK public sector cyberattack on Kensington and Chelsea Council has exposed the growing vulnerability of government organisations to data breaches. The council stated that personal details linked to hundreds of thousands of residents may have been compromised after attackers targeted the shared IT infrastructure.

Security experts warn that interconnected systems, while cost-efficient, create systemic risks. Dray Agha, senior manager of security operations at Huntress, said a single breach can quickly spread across partner organisations, disrupting essential services and exposing sensitive information.

Public sector bodies remain attractive targets due to ageing infrastructure and the volume of personal data they hold. Records such as names, addresses, national ID numbers, health information, and login credentials can be exploited for fraud, identity theft, and large-scale scams.

Gregg Hardie, public sector regional vice president at SailPoint, noted that attackers often employ simple, high-volume tactics rather than sophisticated techniques. Compromised credentials allow criminals to blend into regular activity and remain undetected for long periods before launching disruptive attacks.

Hardie said stronger identity security and continuous monitoring are essential to prevent minor intrusions from escalating. Investing in resilient, segmented systems could help reduce the impact of future cyberattacks on the UK public sector and protect critical operations.

Stranger Things fans question AI use in show finale’s script

The creators of Stranger Things have been accused by some fans of using ChatGPT while writing the show’s fifth and final season, following the release of a behind-the-scenes Netflix documentary.

The series ended on New Year’s Eve with a two-hour finale that saw (SPOILER WARNING) Vecna defeated and Eleven apparently sacrificing herself. The ambiguous ending divided viewers, with some disappointed by the lack of closure.

A documentary titled One Last Adventure: The Making Of Stranger Things 5 was released shortly after the finale. One scene showing Matt and Ross Duffer working on scripts drew attention after a screenshot circulated online.

Some viewers claimed a ChatGPT-style tab was visible on a laptop screen. Others questioned the claim, noting the footage may predate the chatbot’s mainstream use.

Netflix has since confirmed two spin-offs are in development, including a new live-action series and an animated project titled Stranger Things: Tales From ’85.

Eli Lilly and NVIDIA invest in AI-driven pharmaceutical innovation

NVIDIA and Eli Lilly have announced a joint AI co-innovation lab aimed at advancing drug discovery by combining AI with pharmaceutical research.

The partnership combines Lilly’s experience in medical development with NVIDIA’s expertise in accelerated computing and AI infrastructure.

The two companies plan to invest up to $1 billion over five years in research capacity, computing resources and specialist talent.

Based in the San Francisco Bay Area, the lab will support large-scale data generation and model development using NVIDIA platforms, instead of relying solely on traditional laboratory workflows.

Beyond early research, the collaboration is expected to explore applications of AI across manufacturing, clinical development and supply chain operations.

Both NVIDIA and Eli Lilly claim the initiative is designed to enhance efficiency and scalability in medical production while fostering long-term innovation in the life sciences sector.

Morocco outlines national AI roadmap to 2030

Morocco is preparing to unveil ‘Maroc IA 2030’, a national AI roadmap designed to structure the country’s AI ecosystem and strengthen digital transformation.

The strategy seeks to modernise public services, improve interoperability across digital systems and enhance economic competitiveness, according to officials ahead of the ‘AI Made in Morocco’ event in Rabat.

A central element of the plan involves the creation of Al Jazari Institutes, a national network of AI centres of excellence connecting academic research with innovation and regional economic needs.

The roadmap prioritises technological autonomy, trusted AI use, skills development, support for local innovation and balanced territorial coverage, rather than fragmented deployment.

The initiative builds on the Digital Morocco 2030 strategy launched in 2024, which places AI at the core of national digital policy.

Authorities expect the combined efforts to generate around 240,000 digital jobs and contribute approximately $10 billion to gross domestic product by 2030, while improving Morocco’s international AI readiness ranking.

Additional measures include the establishment of a General Directorate for AI and Emerging Technologies to oversee public policy and the development of an Arab African regional digital hub in partnership with the United Nations Development Programme.

Their main goal is to support sustainable and responsible digital innovation.
