Reasoning and Planning: New Frontiers for AI

6 November 2024
Provost's Chair Professor, Computer Science
Professor, Computer Science

If artificial intelligence (AI) were a person, it would be an adolescent who’s just gone through a growth spurt and come of age. AI can now detect tumours with great accuracy, draft emails and essays from scratch, create custom soundtracks and original works of art — a far cry from its early days a decade ago when Siri and Roomba (a type of virtual assistant and robot vacuum, respectively) were gaining popularity.

These advances are the result of AI researchers overcoming, for the most part, an obstacle that has long plagued the field: how to gather and manage large amounts of knowledge. “What has changed is this knowledge acquisition bottleneck,” says Computing’s Professor Lee Wee Sun, whose research focuses on machine learning. “Previously our AI didn’t know much about the world, but now it does. It learns from huge amounts of data, essentially the whole Internet.” 

Still, there’s room for improvement and growth. “Now we want AI to do a bit more,” he says. “We want it to reason, we want it to plan.” 

The former refers to how intelligent systems can use logic to derive new information from existing data, while the latter denotes how they can develop strategies, or sequences of actions, to achieve specific goals. Both are critical components of AI that empower machines to display cognitive abilities akin to those of humans, allowing them to interpret complex situations and make informed decisions, so as to execute increasingly sophisticated tasks.  

But presently, AIs struggle when asked to reason or plan. “That’s because these are complex, multi-step processes,” says Lee.


Narrowing options and providing common sense

To tackle this problem, Lee teamed up with fellow NUS Computing Professor and AI expert David Hsu, alongside PhD students Kang Liwei and Zhao Zirui. Together, they proposed decomposition as a way to understand and design reasoning and planning methods. “We hypothesised that performance can be substantially improved by decomposing complex problems into smaller, solvable components,” explains Lee. “This helps to lower the complexity of learning to reason and plan.”

“Each of these smaller components can then be reliably solved using techniques such as large language models,” he adds. “We can then compose the solutions together using reasoning and planning algorithms.”

Large language models, or LLMs, are advanced AI systems such as ChatGPT that are capable of understanding and generating text. They are particularly helpful in developing decomposition methods. That’s because planning is similar to word prediction in many ways, says Lee. “Essentially, what almost all language models do is try to predict the next word, given all the words they have seen in the past. But predicting just one word ahead is often not sufficient.”

“Similarly, when you think about planning, you think about all the possible things that can happen — for example, if A happens, then what should I do? And if B happens after A, what should I do?” he says. “You reason through many steps in the future and list out all the possible outcomes.”
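The contrast Lee draws — predicting one word ahead versus reasoning through many steps — can be sketched in miniature. In this toy example, `next_token_scores` is a hypothetical stand-in for a language model, returning hand-coded candidate continuations; a real LLM does the same thing over natural language at vastly greater scale.

```python
# Toy stand-in for a language model: given a partial sequence, return
# candidate next tokens with scores. (Hand-coded here; a real LLM
# computes this distribution over its whole vocabulary.)
def next_token_scores(prefix):
    table = {
        (): {"go": 0.6, "wait": 0.4},
        ("go",): {"left": 0.3, "right": 0.7},
        ("wait",): {"go": 0.9, "stop": 0.1},
    }
    return table.get(tuple(prefix), {"stop": 1.0})

def predict_one_step(prefix):
    """One-word-ahead prediction: pick the single most likely next token."""
    scores = next_token_scores(prefix)
    return max(scores, key=scores.get)

def lookahead(prefix, depth):
    """Planning-style reasoning: enumerate whole sequences several steps
    ahead and keep the one with the highest joint score."""
    if depth == 0:
        return [], 1.0
    best_seq, best_score = [], 0.0
    for tok, p in next_token_scores(prefix).items():
        seq, score = lookahead(prefix + [tok], depth - 1)
        if p * score > best_score:
            best_seq, best_score = [tok] + seq, p * score
    return best_seq, best_score
```

Note that the greedy single step and the multi-step search can disagree: one-step prediction commits to the locally best token, while lookahead weighs entire sequences of outcomes, which is the essence of planning.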

Specifically, LLMs help with decomposition in two ways. One, they provide a ‘commonsense world model’ or a means for AI systems to predict what they will encounter in the world around them. Two, they help reduce the vast array of options available, which in turn “much reduces the effort you need for your planning,” says Lee.

Take, for instance, the example of asking a robot helper to fetch an apple. The robot has to reason: where am I likely to find an apple? “It could go to the kitchen, bedroom, bathroom, and so on. But common sense tells the robot it’s the first option,” he says. “So we use LLMs to reduce the options it has. Thinking further ahead, we then use the LLM to predict what we are likely to find in the kitchen, e.g. a fridge which may contain fruits. In this way, we are using the LLM to create a model of the world, allowing it to predict what it will encounter ahead of time.”
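A minimal sketch of the apple-fetching example can make these two roles concrete. Both `commonsense_score` and `likely_contents` are hypothetical stand-ins: in practice each would be a query to an LLM, which supplies the plausibility judgements and the world-model predictions.

```python
# Hypothetical stand-in for an LLM's commonsense judgement of how
# plausible it is to find a target object at a location.
def commonsense_score(target, location):
    plausibility = {
        ("apple", "kitchen"): 0.9,
        ("apple", "bedroom"): 0.05,
        ("apple", "bathroom"): 0.01,
    }
    return plausibility.get((target, location), 0.0)

def prune_options(target, locations, threshold=0.1):
    """Role one: shrink the set of options the planner must explore,
    keeping only locations common sense says are worth searching."""
    return [loc for loc in locations
            if commonsense_score(target, loc) >= threshold]

def likely_contents(location):
    """Role two: a commonsense world model — predict what the robot is
    likely to encounter at a location before it gets there."""
    model = {"kitchen": ["fridge", "counter", "fruit bowl"]}
    return model.get(location, [])
```

With the bedroom and bathroom pruned away up front, the planner only has to reason about the kitchen and what the world model predicts it will find there.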

In 2023, Lee and his fellow researchers demonstrated how combining LLMs with a search algorithm helps achieve better-reasoned decision-making and improves efficiency when solving complex task-planning problems. These included tasks such as mapping out a flight itinerary between two cities, as well as rearranging items in a simulated home environment.


Creating guidelines

In another paper, published earlier this year, the team examined two decomposition methods — tree of thought versus chain of thought — that can be used to simplify reasoning and planning tasks. They applied the contrasting approaches to six case studies, including grade-school mathematics, the Game of 24, and the Blocksworld robotic planning domain. The aim? To compare each method’s performance so as to come up with a set of guidelines to help AI practitioners along.

The chain of thought technique is characterised by a linear sequence of reasoning where each piece of information builds directly on the previous one. In the case studies, the researchers found that this approach was especially useful when applied to relatively simple tasks, where predicting the next step is easy. 
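The linear structure can be sketched as a pipeline in which each reasoning step consumes the previous step's output. The toy steps below solve a grade-school-style word problem ("Sam has 3 bags of 4 apples and eats 2; how many remain?") deterministically; in an actual chain-of-thought setup, each step would be text generated by an LLM rather than a hand-written function.

```python
# Each step reads the running state left by earlier steps and adds to it.
def step_count_apples(state):
    # "3 bags of 4 apples is 12 apples in total."
    state["total"] = state["bags"] * state["per_bag"]
    return state

def step_subtract_eaten(state):
    # "Eating 2 leaves 12 - 2 = 10."
    state["answer"] = state["total"] - state["eaten"]
    return state

def chain_of_thought(state, steps):
    """Strictly linear reasoning: one step after another, no branching
    and no backtracking — each piece builds directly on the last."""
    for step in steps:
        state = step(state)
    return state["answer"]
```

The defining property is that there is exactly one path through the reasoning, which is why the approach shines when the next step is easy to predict.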

Moreover, explicitly annotating necessary information, while leaving out unimportant variables, further improves chain of thought performance. For example, when instructing a robot to pick something up, Lee says “you need to tell it that its hands must first be empty” before doing so. “If you don’t annotate this, the robot has to figure it out for itself by searching through all the possible things that can happen and determine what hidden causes may lead it to fail in the task, which is much harder for it to do.”

By contrast, for more complex tasks, he and his co-authors recommend using the tree of thought approach — a branched structure of reasoning that allows for the simultaneous exploration of multiple pathways of thought. These are instances where “short-chain solutions are computationally harder to find, for example in tasks like the Game of 24 and Blocksworld,” says Lee.
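The branching structure suits the Game of 24 because many first moves look equally plausible and most lead nowhere, so the search must explore and abandon whole subtrees. The sketch below is a plain exhaustive tree search over that reasoning tree — an LLM-based tree of thought would instead ask the model to propose and evaluate the branches, but the shape of the search is the same.

```python
from itertools import combinations

def expand(numbers):
    """All child states reachable by picking two numbers and combining
    them with +, -, * or / — each child is one branch of the tree."""
    children = []
    for i, j in combinations(range(len(numbers)), 2):
        a, b = numbers[i], numbers[j]
        rest = [numbers[k] for k in range(len(numbers)) if k not in (i, j)]
        results = {a + b, a - b, b - a, a * b}
        if a:
            results.add(b / a)
        if b:
            results.add(a / b)
        children.extend(rest + [r] for r in results)
    return children

def solve_24(numbers):
    """Explore the branching reasoning tree: a state with one number
    left is a leaf, and it succeeds if that number is 24."""
    if len(numbers) == 1:
        return abs(numbers[0] - 24) < 1e-6
    return any(solve_24(child) for child in expand(numbers))
```

Unlike the chain-of-thought pipeline, several incompatible lines of reasoning are kept alive at once (every child in `expand`), and dead ends are simply discarded when no branch beneath them reaches 24.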

The team are now working to tackle other challenges that remain in the field of AI reasoning and planning. “One issue we have is that the instructions to do something tend to involve a very long sequence of actions,” explains Lee. Another hurdle is figuring out how to incorporate other modalities, such as vision and sensing, into the reasoning processes: “When robots touch things, you further get tactile sensing in addition to audio-visual sensing,” he says. “So these are the challenges we hope to make progress on in the future.”
