AIPython: Python Code for understanding AI

David Poole and Alan Mackworth

This runnable pseudo-code in Python implements AI algorithms from Artificial Intelligence: foundations of computational agents, third edition. All code is copyright © David Poole and Alan Mackworth 2017-2024, and licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.

DOWNLOAD LATEST VERSION

Latest version: 0.9.17 (2025-07-07). This is candidate release for Version 1.0, so comments please!
Recent Updates:

Code simplified and documentation expanded
Includes graphical user interfaces (GUIs) to the search, constraint satisfaction, belief networks, decision-theoretic planning (MDPs) and reinforcement learning. These implements most of solve mode of the search, consistency, and belief AIspace apps, which are no longer supported.
Relational Learning updates, including collaborative filtering and relational probabilistic models with graphical display.
Improved causal and counterfactual reasoning.
Now we have Datalog and logic programming top-down proof procedure (Ch 15) and triple store (ch 16) implemented and integrated together.

We have followed the following principles:

The code should be simple and as close the pseudo-code as possible. We have chosen readability over efficiency: we have tried to keep the asymptotic complexity as good as possible (except in some cases where the more efficient code is an exercise), but have not optimized the constant factors.
The code should work, but it does not include all possible functionality. There are parts missing that could be used as exercises.
We make extensive use of list comprehensions, sets, and dictionaries. The only library we use is matplotlib; students don't need to understand multiple libraries. We tried to write the sort of code that a student could write from the psudo-code without extensive searching through libraries.
It is designed for Python 3.9 and beyond. You may as well use the latest version of Python in the distribution you prefer.
We use a simple method for tracing the code, using a method display, which is like print but includes a integer display level. The user can then set a maximum display level for the corresponding object (or class), and so has control over the amount of detail printed. We try to not litter the code with too many tracing statements. The use of display enables a graphical user interface (GUI) without changing the algorithm code. We have multiple GUIs to let students explore the algorithms.

This is not polished code. It is code to play with, change, and explore different problems. It will probably never be polished (or when it is polished, it is probably time to throw it away and start again), and should no be relied on. We make no warranty of any kind, expressed or implied, with regard to these programs or the documentation.

Main Source Code Document and Distribution

The code document, aipython.pdf contains and explains all of the code. The whole code base can be downloaded from aipython.zip.

Note that the document and the code are derived from the same source, and so should be consistent (even up to the same line numbers).

What is in the distribution?

The above zip file and pdf is probably what you want. For the brave, you can get all of the latex sources, but requires knowledge of both latex and Python, which are integrated together. extract_code.py can be used to extract the Python code from the latex sources.

The following is a list of the files. Please refer to aipython.pdf and aipython.zip for the documentation and source code.

1. Python for Artificial Intelligence

pythonDemo.py some tricky cases in Python for those familiar with other languages.
utilities.py some useful utilities for argmax, flip etc. Also all unit tests.
display.py simple code for displaying intermediate results. This can be overridden for fancier displaying in a GUI.

2. Agent Architectures and Hierarchical Control

agents.py defines a simple infrastructure for agents and environments
agentBuying.py A paper buying agent and environment, that is of the form that might be used in a smart home.
agentEnv.py, agentMiddle.py, agentTop.py define a hierarchical agent. See also the reinforcement learning agents. Also agentFollowTarget.py a GUI where the user can mover the locations during acting.

3. Searching for Solutions

searchProblem.py defines a search problem in terms of the start nodes, a predicate to test if a node is a goal, the neighbors function, and an optional heuristic function. This is the interface that algorithms that search use.
searchExample.py contains some example graphs.
searchGeneric.py the generic search algorithm that implements both depth-first search and A^* search. It does not do multi-path pruning or cycle checking. It is easy to modify to make other search algorithms.
searchMPP.py does A^* and other search algorithms with multiple path pruning.
searchGUI.py contains a GUI for stepping through search algorithms
searchBranchAndBound.py defines depth-first branch-and-bound search search. It does not do multi-path pruning or cycle checking.
searchGrid.py and searchTest.py contain code to enable students to understand particular problems that are explored in exercises.

4. Reasoning with Constraints

variable.py defines variables that are used for constraints satisfaction problems (CSPs), probabilistic models, and decision networks.
cspProblem.py provides the primitives to define constraint satisfaction problems (CSPs). cspExamples.py defines some example CSPs.
cspDFS.py solves CSPs with depth-first search
cspSearch.py converts CSPs into search problems for which any of the search methods will work.
cspConsistency.py uses domain splitting and generalized arc consistency to solve CSPs.
cspConsistencyGUI.py lets users understand arc consistency and domain splitting. Users select arcs to process and select nodes to split. It provides animation of domain pruning.
cspSLS.py uses stochastic local search, in particular a probabilistic mix of the variable with the most conflicts, any-conflict and a random variable, to solve CSPs. It only maintain the data structures needed for the algorithm (e.g., a priority queue when we need the best variable, but not when we do an any-conflict). Each step is at most logarithmic in the number of variables (to maintain the priority queue), but depends on the number of neighbors. It also plots runtime distributions (for number of steps only).
cspSoft.py implements optimization with soft constraints.

5. Propositions and Inference

logicProblem.py defines definite clauses
logicBottomUp.py bottom-up inference for definite clauses
logicTopDown.py top-down inference for definite clauses (including ask-the-user)
logicExplain.py provides an interactive interface to explore proofs.
logicAssumables.py Horn clauses for assumables, including consistency-based diagnosis.
logicNegation.py negation-as-failure.

6. Deterministic Planning

stripsProblem.py defines planning problems in STRIPS.
stripsForwardPlanner.py defines a forward planner that can use any of the search methods, including A* or the branch-and-bound searchers.
stripsHeuristic.py defines heuristics for the running example

stripsRegressionPlanner.py defines a regression planner that can use any of the search methods.

stripsCSPPlanner.py creates a CSP from a description of a planning problem, which can then be solved with any of the CSP solvers.

stripsPOP.py implements a partial order planner that can use any of the search methods.

7. Supervised Machine Learning

learnProblem.py representing learning problems.

learnNoInputs.py making predictions and learning probabilities (learning with no inputs).

learnDT.py decision tree learning.

learnCrossValidation.py cross validation and parameter tuning.

learnLinear.py linear classifier and regression and logistic regression.

learnBoosting.py boosting and functional gradient boosting.

8. Neural Networks and Deep Learning

learnNN.py deep neural network learning.

Including feedforward networks, momentum, RMS-Prop and Adam optimizers. This code is very inefficient compared to the state-of-the-art; if you want to train deep networks for non-trivial cases we suggest using Keras or PyTorch. The naming of parameters in the AIPython code is consistent with Keras.

9. Reasoning with Uncertainty

probFactors.py defines factors and multiple representations for conditional probabilities.

probGraphicalModels.py defines graphical models, and belief (Bayesian) networks as special cases.

probExamples.py provides a suite of example belief networks.

probRC.py implements recursive conditioning for graphical models (and so for belief networks).

probVE.py implements variable elimination for graphical models.

probStochSim.py implementations of multiple stochastic simulation algorithms for belief networks.

probHMM.py implementation of hidden Markov models. It defines both exact inference for filtering and particle filtering.

probLocalization.py provides a GUI for an example of probabilistic localization.

probDBN.py defines dynamic belief networks and uses the variable elimination code for filtering.

10. Learning with Uncertainty

learnBayesian.py demonstrates beta distribution as a belief network and as plots.

learnKMeans.py k-means for unsupervised learning.

learnEM.py EM for unsupervised learning.

11. Causality

probDo.py enables queries with "do" as well as observations

probCounterfactual.py allows users to explore counterfactual reasoning

12. Planning with Uncertainty

decnNetworks.py defines and solves decision networks (influence diagrams).

mdpProblem.py defines agent-environment problem domains that can be used for MDPs and reinforcement learning. It also defined value iteration and asynchronous value iteration.

mdpExamples.py defines some example problem domains.

mdpGUI.py defines a GUI to explore value iteration.

13. Reinforcement Learning

Reinforcement learning environments:

rlProblem.py defines reinforcement learning agents and environments, constructing an environment from an MDP and agent-environment problem domains of the previous chapter. It also implements exploration strategies, simulates learning agents and plots performance

rlExamples.py defines multiple reinforcement leaning domains, including a simple game and a monster game.

Learners:

rlQLearner.py Q-learner

rlQExperienceReplay.py Q-learner with experience replay

rlModelLearner.py Model-based reinforcement learner,

rlFeatures.py Feature-based reinforcement learner. rlGameFeatures.py defines features for some environments, including the monster game.

rlGUI.py defines a graphical user interface (GUI) for exploring how the value function, the Q-function and optimal policy change as an agent acts.

14. Multiagent Systems

masProblem.py defines two-player zero-sum games.

masMiniMax.py minimax with alpha-beta pruning.

masLearn.py reinforcement learning with multiple agents.

15. Individuals and Relations

logicRelation.py Implementations of Datalog and Logic Programs, with a top-down proof procedure. Not nearly as efficient as Prolog.

16. Knowledge Graphs and Ontologies

knowledgeGraph.py implements a triple store with efficient indexing of triple no matter which of the subject, verb or object is specified.

knowledgeReasoning.py Integrates the triple store with logicRelation.py.

17. Relational Learning

relnCollFilt.py collaborative filtering.

relnProbModels.py represents relational probabilistic models (plate models), and grounds them into graphical models that can use any of the inference methods.

Copyright © David Poole and Alan Mackworth, 2024. This web page and the code are released under a Creative Commons Attribution-Noncommercial-Share Alike license.

Artificial Intelligence 3e

AIPython: Python Code for understanding AI

David Poole and Alan Mackworth

DOWNLOAD LATEST VERSION

Main Source Code Document and Distribution

What is in the distribution?

1. Python for Artificial Intelligence

2. Agent Architectures and Hierarchical Control

3. Searching for Solutions

4. Reasoning with Constraints

5. Propositions and Inference

6. Deterministic Planning

7. Supervised Machine Learning

8. Neural Networks and Deep Learning

9. Reasoning with Uncertainty

10. Learning with Uncertainty

11. Causality

12. Planning with Uncertainty

13. Reinforcement Learning

14. Multiagent Systems

15. Individuals and Relations

16. Knowledge Graphs and Ontologies

17. Relational Learning

Artificial
Intelligence 3e