The Expanding Universe of Generative Models

16 Jan 2024 15:00h - 15:45h

Knowledge Graph of Debate
Session report
Speakers

Disclaimer: This is not an official record of the WEF session. The DiploAI system automatically generates these resources from the audiovisual recording. Resources are presented in their original format, as provided by the AI (e.g. including any spelling mistakes). The accuracy of these resources cannot be guaranteed. The official record of the session can be found on the WEF YouTube channel.

Knowledge Graph of Debate

Session report

Full session report

Daphne Koller

In the analysis, speakers discuss various aspects of artificial intelligence (AI) and its progress. One speaker highlights the significance of data in enhancing AI models and making them more sophisticated. They argue that the availability of more data, including data from self-driving cars, augmented reality, biology, healthcare, and other sources, will contribute to the advancement of AI. This argument is supported by the assertion that progress in the field of AI is primarily attributed to data. Another speaker takes the stance that data, rather than compute power, is the major driver of AI progress. They argue that while compute power and electricity are important, the exclusion of data in AI advancements should not be overlooked. They state that data is the single biggest enabler of AI progress and highlight the increasing availability of diverse data modalities from the real world.

The issue of causality understanding in AI models is also discussed. One speaker points out that current models lack the capacity to understand causality, as they are solely predictive engines that focus on associations. They argue that the incorporation of causality is essential for AI models to interact with the physical world and enable common-sense reasoning in applications such as manufacturing and biology. However, another speaker takes a neutral stance on causality, emphasizing its criticality when interacting with the physical world but not providing a conclusive argument.

The importance of conducting real-world experiments for AI’s growth and understanding of the world is highlighted by one speaker. They argue that machines need to possess the capability to design and learn from experiments in order to surpass human capabilities. These experiments, which teach computers about the complexity of the world, enable them to go beyond what can be taught by a human.

Although AI designs can teach us about building intelligence, they are not equivalent to human intelligence, according to one speaker. They disagree that current AI research is helping us understand human cognition.

The ability of AI to address challenging societal problems is another point of discussion. It is argued that AI can be leveraged to solve problems related to health, agriculture, the environment, and climate change, which may be difficult for humans to solve alone.

In terms of understanding natural phenomena, the analysis highlights that AI models, such as convolutional nets used as models for the visual cortex, improve our understanding but do not accurately replicate the complexity of the natural world. An analogy is drawn to the difference between understanding bird flight through airplanes and comprehending the intricacies of natural flight.

The application of technological advancements to aid in understanding human biology and improving medicine is seen as a positive development. One speaker disagrees with the term “off-ramp” being used to describe these advancements, highlighting the value of applying technology in these areas.

The importance of open-source models is emphasized, with the argument that models constructed by particular companies may not meet the needs of all applications. Open-source models accommodate new ideas and data modalities, enabling the community to build upon a strong foundation.

The role of education, particularly structured thinking, is discussed as a key determinant in the effective use of technology. It is argued that teaching structured thinking from a young age will result in better utilization of technology. This aligns with the goal of promoting quality education (SDG 4).

In conclusion, the analysis provides diverse perspectives on AI and its progress. It underscores the significance of data in enhancing AI models and the need to incorporate causality understanding to bridge the gap between the digital and physical realms. The ability to conduct and learn from real-world experiments is seen as crucial for machines to surpass human capabilities. While AI designs can teach us how to build intelligence, they do not replicate human intelligence entirely. AI’s potential to solve complex societal problems is acknowledged, and the importance of open-source models and structured thinking education is highlighted. The analysis also touches upon the role of AI in understanding natural phenomena and its application in medicine and biology.

Andrew Ng

The analysis of the provided arguments reveals several key points in the field of artificial intelligence (AI) and innovation. Firstly, there is a considerable pace of innovation in the field of scaling and algorithmic evaluations. Despite the increasing difficulty in scaling, continuous innovations are driving the acceleration of the field. The text highlights that numerous algorithmic evaluations and innovations have contributed to this trend. This suggests that the field is witnessing significant advancements fueled by continuous innovation.

Furthermore, the analysis underscores the anticipation of an image processing revolution. Recent technological advances, such as GPT-4V and Gemini Ultra, are cited as examples. These advancements have the potential to revolutionize image processing, transforming how it is currently understood and applied.

In terms of autonomous agents, there is optimism towards their development. The report mentions ongoing work on language models capable of conducting research. While it is noted that this kind of application is not yet fully functional, it highlights the potential for autonomous agents to contribute to various industries and domains.

The analysis also points out the similarities between large language models and humans. Large language models, akin to humans, have the potential to improve if provided with tools, such as a ‘scratchpad’, to work on. The report further mentions that these models are already using various tools to function. This observation underscores the potential for large language models to evolve and improve, ultimately enhancing their capabilities.

In addition to these technical aspects, the report delves into the perspectives of influential figures in the AI community. Andrew Ng, a prominent figure in the field, sees artificial general intelligence (AGI) as a significant goal. The report suggests that the AI community should embrace diverse goals, including AGI, climate change, and life sciences. However, it also notes the challenges posed by defining AGI in terms of human intelligence.

Another noteworthy observation is the value attributed to open source intelligence and AI. These are considered as valuable digital intelligences that contribute to wealth creation. The report suggests that an increase in intelligence, including artificial intelligence and open source intelligence, can lead to a wealthier and better world.

However, the report highlights the existing limitations in the current tech infrastructures. It mentions that today’s foundational tech architectures, such as semiconductors and cloud-based technologies, tend to be closed systems. In contrast, open source technologies are seen as contributing to collective intelligence and potentially sparking innovation and wealth creation. This observation underlines the need to strike a balance between closed systems and open source technologies in order to foster innovation and progress.

Furthermore, the analysis draws attention to the influence of lobbyists on regulations. It is noted that powerful forces are pushing for regulatory proposals that could impose burdensome requirements on open source technologies. This observation suggests the potential risks associated with over-regulation and the potential limitations imposed on open source innovation.

Lastly, the analysis emphasizes the importance of upskilling the workforce for the adoption and effective use of AI in enterprises. The lack of skilled workforce to implement and utilize AI-based tools is identified as a major bottleneck. The report mentions the need to upskill the workforce not only for building AI applications but also for effectively using AI-based tools. This observation underscores the necessity of prioritizing workforce training and education to fully benefit from AI technologies.

To summarize, the analysis highlights the significant pace of innovation in scaling and algorithmic evaluations in the field of AI. The anticipation of an image processing revolution is driven by recent technological advances. There is optimism surrounding the development of autonomous agents, with ongoing work on language models. The potential for large language models to evolve and improve with tools is emphasized. Influential figures like Andrew Ng see AGI as a significant goal but acknowledge the challenges of defining it. The value of open source intelligence and AI in wealth creation is recognized. The limitations of current tech infrastructures and the influence of lobbyists on regulations are noted. Upskilling the workforce is identified as crucial for AI adoption and utilization. These insights provide a comprehensive overview of the current state and future prospects of AI and innovation.

Nicholas Thompson

The panel discussion, moderated by Nicholas Thompson, delved into the future trajectory of artificial intelligence (AI). Thompson’s first question focused on whether the pace of AI advancements will continue to increase, taper, or plateau in the near future. This question aimed to deepen understanding of the speed of innovations in AI.

The discussion highlighted a potential correlation between improved graphics processing units (GPUs) and increased computing capacity, and their impact on AI models. Concerns were raised about the concentration of power in the AI market, as greater compute power is predominantly accessible to a small number of companies. It was suggested that this trend could lead to a less competitive AI market.

Additionally, the panel explored the vast amount of visual data available for AI systems to learn from. It was noted that while the amount of text data available may have limitations, the potential amount of visual data that could be utilized by AI is enormous.

Nicholas Thompson questioned how machines would comprehend video content, considering that even humans struggle to predict the outcomes of certain video scenarios. This raised the question of whether machines can truly understand complex visual information and make accurate predictions.

The discussion drew comparisons between child development and the ability of neural networks to understand causality. Personal anecdotes about raising children were shared, highlighting how young children struggle to comprehend cause and effect. This led to Thompson’s skepticism regarding whether machines could truly understand cause and effect as they are often attributed with more advanced abilities than they possess.

Concerns were expressed about AI models potentially corrupting or polluting themselves when left to learn and create without proper oversight. The worry was that AI systems may evolve in unexpected and potentially harmful ways if they are left unsupervised or if there is a lack of understanding about what the models are learning.

The goal of building machines that surpass human intelligence was questioned. Thompson expressed doubts about whether striving towards creating machines smarter than humans is the ideal goal for AI research. He suggested that AI researchers should focus on how AI can serve specific human needs and biology, similar to how airplanes were designed for flight rather than to replicate birds.

The controversy surrounding open-source AI was also examined. Thompson highlighted fears within the US government regarding the potential misuse of open-source AI and its implications for enabling individuals with malicious intentions to cause harm. The potential negative consequences of multiple countries contributing to highly ranked language models were also discussed.

Thompson appeared to be neutral towards the idea of open-source AI, but invited advocates to present their views and perspectives on the topic. Additionally, concerns were raised about the potential suppression of open-source models through legislation, which may consolidate power within a smaller number of larger companies.

In conclusion, the panel discussion explored various aspects of the future of AI, including the pace of advancements, power consolidation in the industry, the vast amount of visual data available, challenges in understanding video content, the limitations of AI in comprehending causality, concerns about AI models corrupting themselves, the goal of surpassing human intelligence, the controversy surrounding open-source AI, and alternative approaches to reduce income inequality. The discourse provided valuable insights into the current state and potential future directions of AI.

Yann LeCun

Yann LeCun, a prominent figure in the field of artificial intelligence (AI), has raised concerns about the limitations of current autoregressive language models (LLMs). These models, used to generate human-like text, are reaching saturation due to data limits. LeCun argues that there is not enough text data available to continue expanding and improving these models infinitely.

While LLMs have been trained with an extensive amount of text data (approximately 10 trillion tokens), there is a natural limit to their development. LeCun predicts the lack of sufficient text data will eventually hinder progress in LLMs.

LeCun also highlights the untapped potential of visual data. He suggests current methods for training AI models with video inputs are ineffective. There is a need for new breakthroughs to enable AI models to fully utilize sensory input, such as vision, and to teach them intuitive physics, causality, and abstract representations.

Open research and open-source development are key elements advocated by LeCun. He believes these approaches have been pivotal in the rapid progress of AI. Maintaining an open and collaborative environment is crucial for accelerating the development of AI technologies.

LeCun also emphasizes the importance of AI in advancing our understanding of human intelligence and perception. Building AI systems provides insights into human cognition, akin to how airplanes help us comprehend bird flight. However, he cautions against concentration of AI control in the hands of a few private companies. LeCun advocates for diverse and open-source AI systems that cater to various cultures, languages, values, and interests.

In conclusion, Yann LeCun’s perspective highlights the limitations of current autoregressive language models and the need for new breakthroughs in sensory input utilization, open research, and open-source development. He envisions a future where AI surpasses the capabilities of current models, paving the way for more advanced and diverse AI systems. By addressing these challenges and fostering an open and collaborative environment, LeCun believes AI can significantly shape our understanding of human intelligence and perception.

Aidan Gomez

Aidan Gomez argues that enhancing the efficiency of large language models relies on increasing compute power. He asserts that the next generation of GPUs will enable the implementation of more advanced algorithms and methods, unlocking new scale. Doubling compute capacity is anticipated to result in an equivalent increase in model size, indicating the impact that improved hardware platforms can have on large language models’ potential.

However, Gomez acknowledges that the current training strategy for models is approaching its upper limit. The existing approach does not support continuous learning from real-world interactions and human input. While the current strategy performs impressively, it needs to evolve to accommodate models that can learn continuously. This limitation highlights the challenges in training models to surpass the knowledge of an average person.

Furthermore, Gomez emphasizes the importance of models being capable of learning from online experiences and engaging in debate, which is currently unattainable within the existing training strategy. Enabling models to learn autonomously and continuously is essential for their further development and improvement.

Regarding power dynamics, Gomez supports the devolution of power from large tech companies. However, he acknowledges the validity of both open source and closed-source models. Gomez believes that creators and hackers should have access to technology that can challenge the authority of large tech companies, while businesses should also have the right to keep their models closed to maintain a competitive advantage. He suggests a hybrid approach that balances open source and closed-source models.

Additionally, Gomez recognizes the importance of technology being accessible and inclusive. He highlights the significance of understanding different languages and cultures, advocating for technology that can cater to diverse linguistic and cultural backgrounds. Gomez believes that accessible and inclusive technology is crucial for achieving equality and promoting inclusion.

In conclusion, Aidan Gomez’s views revolve around the role of compute power in enhancing large language models, the limitations of the current training strategy, the need for models to learn continuously, the devolution of power from large tech companies, and the importance of accessible and inclusive technology. His insights provide valuable perspectives on the future of language models and the ethical considerations surrounding their development and use.

Kai-Fu Lee

The analysis reveals several insights about the rate of change in artificial intelligence (AI) and its future trajectory. It suggests that while the rate of change in AI will slow down, it will still maintain an impressive pace. This is attributed to the potential for further improvements by increasing compute power, data availability, training methods, and infrastructure.

One notable observation is the optimism expressed regarding the future of AI. The analysis highlights that more entrepreneurs and large companies are entering the AI field, which suggests a growing interest and investment in this technology. This influx of talent and resources is expected to contribute to further advancements in AI.

Regarding scalability and innovation, the analysis suggests that scaling laws will continue to hold in AI, but the pace of innovation will slow down. Despite this, there is recognition of the diminishing returns in any endeavor, implying that the exponential growth seen in the past may not be sustainable in the long term.

Interestingly, the analysis indicates that we have not yet reached the plateau stage in AI development. This suggests that there is still ample room for further growth and improvement in this field. Kai-Fu Lee, a prominent figure in the analysis, also expresses agreement with Yan’s approach to neural networks, adding credibility to the assertion that this method has value and potential for further development.

The commercial value of text-based Large Language Models (LLMs) is emphasized in the analysis. LLMs are seen as valuable tools for generating content, improving productivity, and being deployed widely. This indicates the potential for these models to contribute significantly to various industries and sectors.

Regarding the concept of a world model, the analysis suggests that it is better suited for academic researchers and large company labs to explore. This implies that the development and understanding of a world model require a significant amount of resources and expertise.

AI systems, particularly those based on LLMs, are seen as powerful tools that can be applied in multiple areas, such as office productivity, content creation, and enhancing search engines. This underscores the transformative potential of AI and its ability to revolutionize various aspects of our lives.

The ongoing transformation through AI and LLMs presents numerous opportunities for value production and economic gains. The analysis recognizes the importance of engineering work to improve current AI algorithms, which suggests that there is still much to be done in terms of refining and enhancing AI capabilities.

While the focus on AGI is emphasized, the analysis also suggests that the focus should be on realizing the value of LLMs rather than solely on surpassing human capabilities. This perspective aligns with the notion that AI can complement and enhance human capabilities rather than replace them entirely.

The analysis also touches on the potential risks and challenges associated with the dominance of one or a few companies in the AI field. It highlights the concern that such dominance can lead to significant inequality. It argues for the importance of open-source platforms and education in fostering innovation, creativity, and knowledge-sharing among different individuals and communities.

Overall, the analysis provides valuable insights into the current state and future trajectory of AI. It explores the potential for further advancements, the role of different technologies, and the need for diversity, competition, and collaboration in the AI landscape. These insights can inform decision-making and spark further discussions on the responsible and inclusive development of AI technology.

Speakers

Aidan Gomez

Speech speed

219 words per minute

Speech length

1287 words

Speech time

353 secs

Arguments

Increasing compute power boosts the efficacy of large language models

Supporting facts:

Next generation GPUs will unlock new scale and more expensive algorithms and methods to run

Doubling compute capacity could likely result in doubling of model size

Topics: Artificial Intelligence, Machine Learning, Computing Power, Large Language Models

The current strategy for training models is reaching its upper limit

Supporting facts:

Models currently do not learn continuously from real-world interactions and human input

Despite the extraordinary performance of the existing strategy, it needs evolution to accommodate continuously learning models

Topics: Artificial Intelligence, Machine Learning, Model Training, Autoregressive Models

Humanity is upper limit for training AI models

Supporting facts:

Model training currently requires human input from increasingly specialized domain experts

The current training strategy struggles to progress as models advance beyond the knowledge of an average person

Topics: Artificial Intelligence, Machine Learning, Model Training

Models should be able to learn by themselves.

Supporting facts:

Current strategy of training doesn’t allow continuous learning for models

Challenges include enabling models to learn from online experiences and debate

Topics: Artificial Intelligence, Machine Learning, Self-learning models

Machines, for effective learning and hypothesis formation, need access to the real world for experiments.

Supporting facts:

AI needs to form a hypothesis, test a hypothesis, and experience failure and success, to discover new things.

Topics: Machine Learning, Artificial Intelligence

Aidan believes AGI development is not discrete but continuous

Supporting facts:

Models can be superhuman in particular aspects like a game of Go

the concept of ‘models being better than humans at everything’ is ill-specified

Topics: AGI, AI development

Aidan’s ultimate goal is to create value for the world regardless of achieving full AGI or not

Supporting facts:

A lot can still be done and value can still be created even if full AGI is not achieved

Aidan hopes to build a powerful tool to maximize value

Topics: value creation, AGI

Aidan Gomez supports the devolution of power from large West Coast tech companies but with some considerations.

Supporting facts:

Acknowledges that there are valid reasons for both open source and closed source models.

Believes that hackers and creators should have access to technology that can challenge the authority of large tech companies.

Supports the right of businesses to keep their models closed if they’re building competitive advantage.

Topics: Devolution of Power, West Coast Tech Companies, Policy, Open Source, Closed Source

Access to technology is crucial for equality

Supporting facts:

Aidan Gomez emphasized the importance of technology being accessible and inclusive, in terms of language and culture understanding

Topics: Technology access, Equality, Inclusion

Report

This limitation highlights the challenges in training models to surpass the knowledge of an average person. Furthermore, Gomez emphasizes the importance of models being capable of learning from online experiences and engaging in debate, which is currently unattainable within the existing training strategy.

Enabling models to learn autonomously and continuously is essential for their further development and improvement. Regarding power dynamics, Gomez supports the devolution of power from large tech companies. However, he acknowledges the validity of both open source and closed-source models.

Gomez believes that creators and hackers should have access to technology that can challenge the authority of large tech companies, while businesses should also have the right to keep their models closed to maintain a competitive advantage. He suggests a hybrid approach that balances open source and closed-source models.

His insights provide valuable perspectives on the future of language models and the ethical considerations surrounding their development and use.

Andrew Ng

Speech speed

236 words per minute

Speech length

1076 words

Speech time

273 secs

Arguments

Scaling is becoming increasingly difficult, yet the field appears to be accelerating due to continuous innovations

Supporting facts:

There are numerous algorithmic evaluations and innovations

The pace of innovation continues to increase as new technologies develop

Topics: Scaling, Innovation, AI Development

An image processing revolution is anticipated

Supporting facts:

The text revolution happened recently

GPT-4V and Gemini Ultra are examples of recent advances

Topics: Image Processing, AI, GPT-4V, Gemini Ultra

Edge AI will become more common due to open source and other factors

Supporting facts:

We are used to using OMs in the cloud

Jan Metz has done significant work on this topic

Topics: Edge AI, Open Source, Jan Metz

Large language models are like humans, and could improve with use of tools.

Supporting facts:

Large language models often fail at complex tasks like large number multiplications

But like humans, these models could also improve if given something like a ‘scratchpad’ to work on

These models are already using various tools to function

Topics: Artificial Intelligence, Machine Learning, Language Models

Andrew Ng sees AGI as a great goal

Topics: Artificial General Intelligence, AI development goals

AGI (Artificial General Intelligence) is defined by its comparison to human intelligence

Supporting facts:

Andrew Ng mentions that AGI is benchmarked against a biological path

Topics: AGI, Artificial Intelligence, Human intelligence

AI is already creating valuable digital intelligences

Supporting facts:

According to Andrew Ng, even without reaching AGI, AI is building some incredibly valuable digital intelligences

Topics: AI, Digital intelligence

Andrew Ng has not abandoned the science of progress towards AI

Topics: Artificial Intelligence, AI Development

The world becomes wealthier and better with more intelligence, including artificial intelligence and open source intelligence

Supporting facts:

Open source intelligence is accessible by more people adding to the pool of collective intelligence and potentially sparking innovation and wealth creation

Topics: Artificial Intelligence, Open Source Intelligence, Wealth Creation

Infrastructure doesn’t necessarily gravitate towards open source; today’s foundational tech architectures including semiconductors and cloud-based technologies are closed.

Supporting facts:

Today’s tech infrastructure like NVIDIA, AMD, and Intel semiconductors are closed systems

Topics: Infrastructure, Open Source, Semiconductors, Cloud Computing

There are currently powerful forces pushing towards regulatory proposals that could impose burdensome requirements on open source

Supporting facts:

Lobbyists are influencing regulations in Washington, DC, to potentially limit and control open source technologies in favor of closed systems

Topics: Regulation, Open Source

Training is a major bottleneck in the adoption of AI in enterprises

Supporting facts:

Many corporations have the ideas and options for AI but lack a skilled workforce to implement and use them

Usage of AI-based tools like chatbots can lead to quick productivity gains

Topics: Artificial Intelligence, Workforce Training, Productivity

Report

The text highlights that numerous algorithmic evaluations and innovations have contributed to this trend. This suggests that the field is witnessing significant advancements fueled by continuous innovation. Furthermore, the analysis underscores the anticipation of an image processing revolution. Recent technological advances, such as GPT-4V and Gemini Ultra, are cited as examples.

These advancements have the potential to revolutionize image processing, transforming how it is currently understood and applied. In terms of autonomous agents, there is optimism towards their development. The report mentions ongoing work on language models capable of conducting research.

While it is noted that this kind of application is not yet fully functional, it highlights the potential for autonomous agents to contribute to various industries and domains. The analysis also points out the similarities between large language models and humans.

Large language models, akin to humans, have the potential to improve if provided with tools, such as a ‘scratchpad’, to work on. The report further mentions that these models are already using various tools to function. This observation underscores the potential for large language models to evolve and improve, ultimately enhancing their capabilities.

However, it also notes the challenges posed by defining AGI in terms of human intelligence. Another noteworthy observation is the value attributed to open source intelligence and AI. These are considered as valuable digital intelligences that contribute to wealth creation.

The report suggests that an increase in intelligence, including artificial intelligence and open source intelligence, can lead to a wealthier and better world. However, the report highlights the existing limitations in the current tech infrastructures. It mentions that today’s foundational tech architectures, such as semiconductors and cloud-based technologies, tend to be closed systems.

In contrast, open source technologies are seen as contributing to collective intelligence and potentially sparking innovation and wealth creation. This observation underlines the need to strike a balance between closed systems and open source technologies in order to foster innovation and progress.

This observation underscores the necessity of prioritizing workforce training and education to fully benefit from AI technologies. To summarize, the analysis highlights the significant pace of innovation in scaling and algorithmic evaluations in the field of AI. The anticipation of an image processing revolution is driven by recent technological advances.

There is optimism surrounding the development of autonomous agents, with ongoing work on language models. The potential for large language models to evolve and improve with tools is emphasized. Influential figures like Andrew Ng see AGI as a significant goal but acknowledge the challenges of defining it.

The value of open source intelligence and AI in wealth creation is recognized. The limitations of current tech infrastructures and the influence of lobbyists on regulations are noted. Upskilling the workforce is identified as crucial for AI adoption and utilization.

These insights provide a comprehensive overview of the current state and future prospects of AI and innovation.

Daphne Koller

Speech speed

220 words per minute

Speech length

1367 words

Speech time

374 secs

Arguments

More data will be available for models, making them more sophisticated.

Supporting facts:

Progress in the AI field is attributed largely to data.

The models will have access to new types of data such as those from self-driving cars, augmented reality, biology, healthcare, etc.

Topics: Artificial Intelligence, Data, Model training

Current models lack the capacity for causality understanding

Supporting facts:

Babies learn the notion of cause and effect by intervening in the world.

LLMs are entirely predictive engines only doing associations.

Topics: Artificial Intelligence, Machine Learning, Causality

We do not have the ability at this point to create an in silico model of the world.

Supporting facts:

The world is really complicated.

Topics: AI Learning, World Simulation

The ability to experiment with the world and learn from that is critical to intelligence.

Topics: Machine learning, Experimentation

Giving machines the ability to design experiments might help them understand the world better.

Supporting facts:

Computers need to generate data from experiments to continue growing and develop.

Such experiments could range from simple ones like what happens when you drop the pen to more complex ones involving chemicals in a cell.

Topics: AI Learning, Real World Experiments

Current AI designs are not teaching about human intelligence but a form of intelligence.

Supporting facts:

Designing AI systems is teaching about how to build an intelligence, but not human intelligence

Topics: AI, Human Intelligence

AI can be leveraged to solve difficult societal problems that are hard for humans to solve.

Supporting facts:

Building computers capable of addressing really hard societal problems in health, agriculture, environment, and climate change.

Topics: AI, Societal Problems

Understanding of natural phenomena is improved by models but they’re not the same

Supporting facts:

Thousands of papers in neuroscience use convolutional nets as a model for the visual cortex

Topics: Neural Networks, Artificial Intelligence, Neuroscience, Human perception

Applying technological advancements to aid in understanding human biology and improving medicine is not an off-ramp

Supporting facts:

Daphne Koller disagrees on the term off-ramp being used to describe technological advancements being used for understanding human biology and improving medicine

Topics: Technology and Medicine, Human Biology

A strong foundation of open-source models are needed for community to build on.

Supporting facts:

Models constructed by particular companies may not meet needs of all applications

Open-source models can accommodate new ideas and data modalities

Topics: Open-source, AI models, Machine Learning

Education, specifically structured thinking, is a key determinant in effectively using technology

Supporting facts:

Structured thinking is not well taught to children

She believes that teaching structured thinking from a young age will lead to better usage of technology

Topics: Education, Structured Thinking, Technology Use

Report

This argument is supported by the assertion that progress in the field of AI is primarily attributed to data. Another speaker takes the stance that data, rather than compute power, is the major driver of AI progress. They argue that while compute power and electricity are important, the exclusion of data in AI advancements should not be overlooked.

They state that data is the single biggest enabler of AI progress and highlight the increasing availability of diverse data modalities from the real world. The issue of causality understanding in AI models is also discussed. One speaker points out that current models lack the capacity to understand causality, as they are solely predictive engines that focus on associations.

They argue that the incorporation of causality is essential for AI models to interact with the physical world and enable common-sense reasoning in applications such as manufacturing and biology. However, another speaker takes a neutral stance on causality, emphasizing its criticality when interacting with the physical world but not providing a conclusive argument.

These experiments, which teach computers about the complexity of the world, enable them to go beyond what can be taught by a human. Although AI designs can teach us about building intelligence, they are not equivalent to human intelligence, according to one speaker.

They disagree that current AI research is helping us understand human cognition. The ability of AI to address challenging societal problems is another point of discussion. It is argued that AI can be leveraged to solve problems related to health, agriculture, the environment, and climate change, which may be difficult for humans to solve alone.

While AI designs can teach us how to build intelligence, they do not replicate human intelligence entirely. AI’s potential to solve complex societal problems is acknowledged, and the importance of open-source models and structured thinking education is highlighted. The analysis also touches upon the role of AI in understanding natural phenomena and its application in medicine and biology.

Kai-Fu Lee

Speech speed

184 words per minute

Speech length

1278 words

Speech time

417 secs

Arguments

The rate of change in AI will slow down but will remain impressive

Supporting facts:

Two years ago the MMLU was in the 40s, 50s. Now it’s 90

Improvement by adding more compute, data, training, infrastructure

Topics: Artificial Intelligence, Rate of Change, AI Improvement

Scaling laws will hold and innovations will occur, but they will slow down

Topics: scaling laws, innovations

We are not at the plateau

Topics: plateau, scaling laws

Kai-Fu Lee agrees with Yan’s approach to neural networks

Supporting facts:

Kai-Fu Lee states, ‘No, Yan is always right.’

Topics: Neural Network, Artificial Intelligence

Lee sees tremendous commercial value in text-based Large Language Models (LLMs)

Supporting facts:

He mentions that they provide a pretense of logical reasoning, can generate content, improve productivity and are being deployed widely.

Topics: Commercial Value, Large Language Models (LLMs), Artificial Intelligence

Lee believes the concept of a world model is something for academic researchers and large company labs to work on

Supporting facts:

He expresses that as a startup company, this is a concept they would prefer academia and large company research labs to explore first.

Topics: World Model, Academic Research, Company Labs, Artificial Intelligence

Even with text-based large language models, the world is changing rapidly.

Supporting facts:

AI systems have the capacity to generate content, emulate people, create experiences, and improve search engines.

These can be applied in areas such as office productivity, creating PowerPoint presentations, producing content.

Topics: Technology, AI, Large Language Models

There is much engineering work to be done to improve current AI algorithms

Supporting facts:

Issues with current AI algorithms can be patched

Companies have tried to glue RAG search engine and Wolfram Alpha to AI

More engineering gluings can cover up a lot of issues with AI

Topics: Artificial Intelligence, Algorithm Development

Kai-Fu Lee sees AGI as a great platform for value creation

Supporting facts:

Kai-Fu Lee had AGI as his dream when he was 18

He compares current state of AI with the invention stage of the automobile by Henry Ford

Topics: AGI, Value Creation, AI Development

Dominance by one or few companies creates tremendous inequality

Topics: Monopolies, Open-source, Innovation

Professors, researchers, students, entrepreneurs, hobbyists depend on open source to learn and innovate

Topics: Open-source, Education, Innovation

Kai-Fu Lee doubts the viability of giving away a perfect AGI machine for free to reduce income inequality.

Supporting facts:

Sam Altman’s suggestion at the Innovators’ dinner of Open AI building a perfect machine exceeding AGI and giving it away for free.

Topics: open AI, Artificial General Intelligence, technology, income inequality

Global competition is necessary to prevent dominance by one company or country and to ensure diverse ideologies, values, and biases

Supporting facts:

This is the first time a technological platform comes with its own ideology, values, and biases

Topics: Global competition, Technological dominance, Cultural diversity

Report

Kai-Fu Lee’s vision of artificial general intelligence (AGI) as a platform for value creation is highlighted. This perspective positions AGI as a significant milestone with the potential to revolutionize industries and create new opportunities. While the focus on AGI is emphasized, the analysis also suggests that the focus should be on realizing the value of LLMs rather than solely on surpassing human capabilities.

This perspective aligns with the notion that AI can complement and enhance human capabilities rather than replace them entirely. The analysis also touches on the potential risks and challenges associated with the dominance of one or a few companies in the AI field.

It highlights the concern that such dominance can lead to significant inequality. It argues for the importance of open-source platforms and education in fostering innovation, creativity, and knowledge-sharing among different individuals and communities. Overall, the analysis provides valuable insights into the current state and future trajectory of AI.

It explores the potential for further advancements, the role of different technologies, and the need for diversity, competition, and collaboration in the AI landscape. These insights can inform decision-making and spark further discussions on the responsible and inclusive development of AI technology.

Nicholas Thompson

Speech speed

240 words per minute

Speech length

2228 words

Speech time

558 secs

Arguments

Nicholas Thompson wants to understand the rate of change in AI in the upcoming years

Supporting facts:

Nicholas Thompson is moderating a panel of industry-leading AI experts

Questions are pertaining to the future trajectory of AI

Topics: Artificial Intelligence, Machine Learning, Technology progress

Nicholas Thompson addresses his first specific question to Kai-Fu Lee

Supporting facts:

Kai-Fu Lee is asked to provide his perspective on the future rate of change in AI

Topics: Artificial Intelligence, Kai-Fu Lee

The correlation between better GPUs and more compute capacity with better models can lead to power consolidation

Supporting facts:

Better GPUs and more compute capacity improve AI models

Greater compute power is mostly accessible to a small number of companies

The AI market could become less competitive if there’s a plateau

Topics: AI market competitiveness, Access to advanced technology

The amount of text data that’s available will grow, but not infinitely. However, the amount of visual data that we could potentially put into these machines is massive

Supporting facts:

16,000 hours of video is approximately 30 minutes of uploads on YouTube

Topics: Artificial Intelligence, Data growth, Visual data

The new AI architecture that will be needed to utilize video inputs is yet unknown

Topics: Artificial Intelligence, Machine Learning, New technologies

Nicholas Thompson is questioning how machines would understand video when even humans cannot predict the outcome

Supporting facts:

Nicholas Thompson was responding to Yann LeCun’s explanation on how machines could learn intuitive physics

Topics: AI, machine learning, video understanding

Nicholas Thompson doesn’t fully understand how to make a neural network, but he comments on what he understands according to his own experiences raising children.

Supporting facts:

He raised three children and recalls incidents of them not fully understanding causality at young age.

Topics: neural network, AI, child development

Concern about AI models corrupting or polluting themselves when left on their own

Supporting facts:

Models are moving toward communicating and learning from each other.

There’s apprehension about losing control or understanding of what the models are learning or creating.

Topics: Artificial Intelligence, AI Models, Machine Learning, Synthetic Data

Nicholas Thompson questions if the goal of building machines that are smarter than humans is the proper goal for AI research.

Supporting facts:

OpenAI and FAIR at Facebook have the goal of building machines smarter than humans.

Topics: AI research, Artificial General Intelligence, machine intelligence, OpenAI, FAIR at Facebook

Nicholas Thompson suggests AI researchers to focus on how AI can serve human biology instead of creating a superior human mind.

Supporting facts:

He stresses that to advance AI, it should focus on serving specific human needs, like how airplanes were specially designed for flight, not to mimic birds.

Topics: AI, Human Biology, Technological Advancement

Nicholas Thompson highlights the controversy around open-source AI

Supporting facts:

Nicholas has noted the fear within the US government around open-source AI, stating that should it be misused, it may enable individuals with bad intentions to do harm

He raises an example of a potential outcome where multiple countries contribute to the creation of a language model that ranks highly within the AI field, which some may view as a negative occurrence

Topics: AI, Open Source, Government Regulation

Nicholas Thompson points out that legislation could inadvertently suppress open-source models

Supporting facts:

The Biden administration passed an executive order requiring legal documentation that might be beyond the reach of smaller, open-source companies, thereby consolidating power within a small number of larger companies.

Topics: Open-Source Models, Legislation, AI Consolidation

One doesn’t necessarily need open source to reduce income inequality

Supporting facts:

Sam Altman’s argument suggesting that Open AI could build a perfect machine that exceeds AGI and give it away for free to reduce income inequality

Topics: Open source, Income inequality

Report

It was suggested that this trend could lead to a less competitive AI market. Additionally, the panel explored the vast amount of visual data available for AI systems to learn from. It was noted that while the amount of text data available may have limitations, the potential amount of visual data that could be utilized by AI is enormous.

Personal anecdotes about raising children were shared, highlighting how young children struggle to comprehend cause and effect. This led to Thompson’s skepticism regarding whether machines could truly understand cause and effect as they are often attributed with more advanced abilities than they possess.

Finally, the discussion questioned whether open-source AI is the only way to address and reduce income inequality. Thompson considered arguments proposing alternative methods to tackle this issue without solely relying on open-source AI. In conclusion, the panel discussion explored various aspects of the future of AI, including the pace of advancements, power consolidation in the industry, the vast amount of visual data available, challenges in understanding video content, the limitations of AI in comprehending causality, concerns about AI models corrupting themselves, the goal of surpassing human intelligence, the controversy surrounding open-source AI, and alternative approaches to reduce income inequality.

The discourse provided valuable insights into the current state and potential future directions of AI.

Yann LeCun

Speech speed

202 words per minute

Speech length

2392 words

Speech time

711 secs

Arguments

Current Autoregressive LLMs are reaching saturation due to data limits

Supporting facts:

LLMs are being trained with around 10 trillion tokens, which equals about 10 to the 13 bytes of training data

It’s predicted that there will not be enough text data to infinitely grow these models

Topics: Artificial Intelligence, Machine Learning, Language Models

A child’s visual perception data intake far surpasses that of current LLMs

Supporting facts:

A four-year-old child has seen 50 times more information than the biggest LLMs

A child accumulates knowledge at an enormous rate from the visual information about how the world works

Topics: Artificial Intelligence, Machine Learning, Neuroscience, Child Development

New scientific and technological breakthroughs are needed for AI to fully utilise sensory input

Supporting facts:

There’s no known method yet for how to make machines learn from video

Topics: Artificial Intelligence, Machine Learning

Large language models are not optimized for video inputs

Supporting facts:

Large language models are trained by corrupting a piece of text and training a neural net to reconstruct the full text

Attempt to perform similar processes on images or videos has not been successful

Efforts towards predicting future actions in a video based on prior sequences have also not been successful

Topics: Neural Networks, Video Processing, AI Models

Machines need to be taught intuitive physics like a baby

Supporting facts:

Babies learn that an unsupported object falls in approximately nine months

Machines can’t currently predict object trajectories as humans can

Topics: Artificial Intelligence, Machine Learning, Intuitive Physics

The best models for image recognition are not generative, they don’t reconstruct or predict in pixel space

Supporting facts:

The best image recognition models predict in a space of abstract representation, not in pixel space

Prediction in pixel space have failed so far due to its complexity

Topics: Artificial Intelligence, Image Recognition, Generative Models

Current AI systems lack causal reasoning

Supporting facts:

Current LLMs are merely predictive engines doing associations

Causality is crucial for interaction with the physical world

Current AI models even those embodied with common sense reasoning lack causality

Topics: Artificial Intelligence, Causal reasoning, AI systems

Embodied system in AI is promising

Supporting facts:

Embodied AIs can generate world models predicting the state of world on specific timed action

They can plan, reason and understand causality

Topics: Artificial Intelligence, Embodied AI system, World Model, Reasoning, Causality

Future AI systems will outperform the current LLMs

Supporting facts:

Future AI systems will understand the physical world, plan, reason and understand causality

They will be goal-oriented able to satisfy given goals

Topics: Artificial Intelligence, AI systems, Future of AI

Yann LeCun believes the objective of AI research is discovering the underlying principles behind intelligence and learning.

Supporting facts:

He uses the analogy of building airplanes, to illustrate his point that it’s not about surpassing humans but understanding how intelligence works.

He emphasizes on learning as one of the fundamental elements of intelligence.

Topics: Artificial Intelligence, Machine Learning

Building AI can contribute in understanding human intelligence

Supporting facts:

We understand how birds fly because we built airplanes

Topics: Human Intelligence, Artificial Intelligence

Neural nets helped to understand human perception

Supporting facts:

There are thousands of papers in neuroscience that use convolutional nets as a model for the visual cortex

Topics: Human perception, Neural nets

Conv nets are a good model of visual perception

Supporting facts:

Conv nets are better than the old method, which used template matching

Topics: Artificial Intelligence, Convolutional Neural Networks

The progression of science and technology allows for practical solutions

Topics: science, technology, innovation

Companies capitalize on technological advancements for short-term or long-term gains

Topics: business, technology, industry

Using machines to understand human biology and improve medicine is a valid goal

Supporting facts:

Yann LeCun discusses the use of machines in the field of medicine and biology for progress

Topics: Artificial Intelligence, Medicine, Biology

Open source approach accelerates AI technology development

Supporting facts:

The reason why we’ve seen such a fast progress in AI over the last decade or so is because people practice open research.

As soon as you get a result, you put it on archive and you open source your code and everybody can reuse it.

Common software platforms like PyTorch are used by everyone

Topics: Open source, AI, Technology advancement

Legislation against open source AI can slow down progress

Supporting facts:

If you legislate open source out of existence because of fears, you slow down progress.

Bad guys are gonna get access to it anyway and they’re just gonna catch up with the rest of the world.

Topics: Open source, AI, Legislation

AI systems should not be under the control of a small number of private companies.

Supporting facts:

All of our interaction with the digital world in the somewhat not too distant future is gonna be mediated by AI systems.

They’re gonna live in our intelligent glasses or smartphones or whatever device, and they’re going to be like human assistants, assisting us in our daily lives.

Topics: AI systems, AI influence, private companies, technology control

Open AI does not have a monopoly on good ideas

Supporting facts:

They are using PyTorch

They are using transformers

They are using infrastructure and technologies that was published by many

Topics: Open AI, Artificial General Intelligence

Open AI profits from an open research landscape

Topics: Open AI, Artificial General Intelligence, Open Research

Meta is not limiting publication in growing fields

Topics: Meta, Publication Limits

Open-source dissemination is a big factor for equality

Supporting facts:

Kai-Fu Lee talked about open-source is essential to make the world more equal

Topics: Open-source, Equality

Report

He suggests current methods for training AI models with video inputs are ineffective. There is a need for new breakthroughs to enable AI models to fully utilize sensory input, such as vision, and to teach them intuitive physics, causality, and abstract representations.

Building AI systems provides insights into human cognition, akin to how airplanes help us comprehend bird flight. However, he cautions against concentration of AI control in the hands of a few private companies. LeCun advocates for diverse and open-source AI systems that cater to various cultures, languages, values, and interests.

By addressing these challenges and fostering an open and collaborative environment, LeCun believes AI can significantly shape our understanding of human intelligence and perception.

The Expanding Universe of Generative Models

Table of contents

Knowledge Graph of Debate

Session report

Full session report

Daphne Koller

Andrew Ng

Nicholas Thompson

Yann LeCun

Aidan Gomez

Kai-Fu Lee

Speakers

Aidan Gomez

Arguments

Increasing compute power boosts the efficacy of large language models

The current strategy for training models is reaching its upper limit

Humanity is upper limit for training AI models

Models should be able to learn by themselves.

Machines, for effective learning and hypothesis formation, need access to the real world for experiments.

Aidan believes AGI development is not discrete but continuous

Aidan’s ultimate goal is to create value for the world regardless of achieving full AGI or not

Aidan Gomez supports the devolution of power from large West Coast tech companies but with some considerations.

Access to technology is crucial for equality

Report

Andrew Ng

Arguments

Scaling is becoming increasingly difficult, yet the field appears to be accelerating due to continuous innovations

An image processing revolution is anticipated

Edge AI will become more common due to open source and other factors

Large language models are like humans, and could improve with use of tools.

Andrew Ng sees AGI as a great goal

AGI (Artificial General Intelligence) is defined by its comparison to human intelligence

AI is already creating valuable digital intelligences

Andrew Ng has not abandoned the science of progress towards AI

The world becomes wealthier and better with more intelligence, including artificial intelligence and open source intelligence

Infrastructure doesn’t necessarily gravitate towards open source; today’s foundational tech architectures including semiconductors and cloud-based technologies are closed.

There are currently powerful forces pushing towards regulatory proposals that could impose burdensome requirements on open source

Training is a major bottleneck in the adoption of AI in enterprises

Report

Daphne Koller

Arguments

More data will be available for models, making them more sophisticated.

Current models lack the capacity for causality understanding

We do not have the ability at this point to create an in silico model of the world.

The ability to experiment with the world and learn from that is critical to intelligence.

Giving machines the ability to design experiments might help them understand the world better.

Current AI designs are not teaching about human intelligence but a form of intelligence.

AI can be leveraged to solve difficult societal problems that are hard for humans to solve.

Understanding of natural phenomena is improved by models but they’re not the same

Applying technological advancements to aid in understanding human biology and improving medicine is not an off-ramp

A strong foundation of open-source models are needed for community to build on.

Education, specifically structured thinking, is a key determinant in effectively using technology

Report

Kai-Fu Lee

Arguments

The rate of change in AI will slow down but will remain impressive

Scaling laws will hold and innovations will occur, but they will slow down

We are not at the plateau

Kai-Fu Lee agrees with Yan’s approach to neural networks

Lee sees tremendous commercial value in text-based Large Language Models (LLMs)

Lee believes the concept of a world model is something for academic researchers and large company labs to work on

Even with text-based large language models, the world is changing rapidly.

There is much engineering work to be done to improve current AI algorithms

Kai-Fu Lee sees AGI as a great platform for value creation

Dominance by one or few companies creates tremendous inequality

Professors, researchers, students, entrepreneurs, hobbyists depend on open source to learn and innovate

Kai-Fu Lee doubts the viability of giving away a perfect AGI machine for free to reduce income inequality.

Global competition is necessary to prevent dominance by one company or country and to ensure diverse ideologies, values, and biases

Report

Nicholas Thompson

Arguments

Nicholas Thompson wants to understand the rate of change in AI in the upcoming years

Nicholas Thompson addresses his first specific question to Kai-Fu Lee

The correlation between better GPUs and more compute capacity with better models can lead to power consolidation

The amount of text data that’s available will grow, but not infinitely. However, the amount of visual data that we could potentially put into these machines is massive

The new AI architecture that will be needed to utilize video inputs is yet unknown

Nicholas Thompson is questioning how machines would understand video when even humans cannot predict the outcome

Nicholas Thompson doesn’t fully understand how to make a neural network, but he comments on what he understands according to his own experiences raising children.

Concern about AI models corrupting or polluting themselves when left on their own

Nicholas Thompson questions if the goal of building machines that are smarter than humans is the proper goal for AI research.