AIPython: Python Code for understanding AI
David Poole and Alan Mackworth
This runnable pseudo-code in Python implements AI algorithms from Artificial Intelligence: foundations of computational agents, third edition. All code is copyright © David Poole and Alan Mackworth 2017-2024, and licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.
DOWNLOAD LATEST VERSION
- Latest version: 0.9.15 (2024-12-17). This is candidate release for Version 1.0, so comments please!
- Recent Updates:
- Code simplified and documentation expanded
- Includes graphical user interfaces (GUIs) to the search, constraint satisfaction, belief networks, decision-theoretic planning (MDPs) and reinforcement learning. These implements most of solve mode of the search, consistency, and belief AIspace apps, which are no longer supported.
- Relational Learning updates, including collaborative filtering and relational probabilistic models with graphical display.
- Improved causal and counterfactual reasoning.
- Now we have Datalog and logic programming top-down proof procedure (Ch 15) and triple store (ch 16) implemented and integrated together.
We have followed the following principles:
- The code should be simple and as close the pseudo-code as possible. We have chosen readability over efficiency: we have tried to keep the asymptotic complexity as good as possible (except in some cases where the more efficient code is an exercise), but have not optimized the constant factors.
- The code should work, but it does not include all possible functionality. There are parts missing that could be used as exercises.
- We make extensive use of list comprehensions, sets, and dictionaries. The only library we use is matplotlib; students don't need to understand multiple libraries. We tried to write the sort of code that a student could write from the psudo-code without extensive searching through libraries.
- It is designed for Python 3.9 and beyond. You may as well use the latest version of Python in the distribution you prefer.
- We use a simple method for tracing the code, using a method
display
, which is likeprint
but includes a integer display level. The user can then set a maximum display level for the corresponding object (or class), and so has control over the amount of detail printed. We try to not litter the code with too many tracing statements. The use ofdisplay
enables a graphical user interface (GUI) without changing the algorithm code. We have multiple GUIs to let students explore the algorithms.
This is not polished code. It is code to play with, change, and explore different problems. It will probably never be polished (or when it is polished, it is probably time to throw it away and start again), and should no be relied on. We make no warranty of any kind, expressed or implied, with regard to these programs or the documentation.
Main Source Code Document and Distribution
The code document, aipython.pdf contains and explains all of the code. The whole code base can be downloaded from aipython.zip.
Note that the document and the code are derived from the same source, and so should be consistent (even up to the same line numbers).
What is in the distribution?
The above zip file and pdf is probably what you want. For the brave, you can get all of the latex sources, but requires knowledge of both latex and Python, which are integrated together. extract_code.py can be used to extract the Python code from the latex sources.
The following is a list of the files. Please refer to aipython.pdf and aipython.zip for the documentation and source code.
1. Python for Artificial Intelligence
- pythonDemo.py some tricky cases in Python for those familiar with other languages.
- utilities.py some useful utilities for argmax, flip etc. Also all unit tests.
- display.py simple code for displaying intermediate results. This can be overridden for fancier displaying in a GUI.
2. Agent Architectures and Hierarchical Control
- agents.py defines a simple infrastructure for agents and environments
- agentBuying.py A paper buying agent and environment, that is of the form that might be used in a smart home.
- agentEnv.py, agentMiddle.py, agentTop.py define a hierarchical agent. See also the reinforcement learning agents. Also agentFollowTarget.py a GUI where the user can mover the locations during acting.
3. Searching for Solutions
- searchProblem.py defines a search problem in terms of the start nodes, a predicate to test if a node is a goal, the neighbors function, and an optional heuristic function. This is the interface that algorithms that search use.
- searchExample.py contains some example graphs.
- searchGeneric.py the generic search algorithm that implements both depth-first search and A* search. It does not do multi-path pruning or cycle checking. It is easy to modify to make other search algorithms.
- searchMPP.py does A* and other search algorithms with multiple path pruning.
- searchGUI.py contains a GUI for stepping through search algorithms
- searchBranchAndBound.py defines depth-first branch-and-bound search search. It does not do multi-path pruning or cycle checking.
- searchGrid.py and searchTest.py contain code to enable students to understand particular problems that are explored in exercises.
4. Reasoning with Constraints
- variable.py defines variables that are used for constraints satisfaction problems (CSPs), probabilistic models, and decision networks.
- cspProblem.py provides the primitives to define constraint satisfaction problems (CSPs). cspExamples.py defines some example CSPs.
- cspDFS.py solves CSPs with depth-first search
- cspSearch.py converts CSPs into search problems for which any of the search methods will work.
- cspConsistency.py uses domain splitting and generalized arc consistency to solve CSPs.
- cspConsistencyGUI.py lets users understand arc consistency and domain splitting. Users select arcs to process and select nodes to split. It provides animation of domain pruning.
- cspSLS.py uses stochastic local search, in particular a probabilistic mix of the variable with the most conflicts, any-conflict and a random variable, to solve CSPs. It only maintain the data structures needed for the algorithm (e.g., a priority queue when we need the best variable, but not when we do an any-conflict). Each step is at most logarithmic in the number of variables (to maintain the priority queue), but depends on the number of neighbors. It also plots runtime distributions (for number of steps only).
- cspSoft.py implements optimization with soft constraints.
5. Propositions and Inference
- logicProblem.py defines definite clauses
- logicBottomUp.py bottom-up inference for definite clauses
- logicTopDown.py top-down inference for definite clauses (including ask-the-user)
- logicExplain.py provides an interactive interface to explore proofs.
- logicAssumables.py Horn clauses for assumables, including consistency-based diagnosis.
- logicNegation.py negation-as-failure.
6. Deterministic Planning
- stripsProblem.py defines planning problems in STRIPS.
- stripsForwardPlanner.py defines a forward planner that can use any of the search methods, including A* or the branch-and-bound searchers.
- stripsHeuristic.py defines heuristics for the running example
- stripsRegressionPlanner.py defines a regression planner that can use any of the search methods.
- stripsCSPPlanner.py creates a CSP from a description of a planning problem, which can then be solved with any of the CSP solvers.
- stripsPOP.py implements a partial order planner that can use any of the search methods.
7. Supervised Machine Learning
- learnProblem.py representing learning problems.
- learnNoInputs.py making predictions and learning probabilities (learning with no inputs).
- learnDT.py decision tree learning.
- learnCrossValidation.py cross validation and parameter tuning.
- learnLinear.py linear classifier and regression and logistic regression.
- learnBoosting.py boosting and functional gradient boosting.
8. Neural Networks and Deep Learning
- learnNN.py deep neural network learning.
- Including feedforward networks, momentum, RMS-Prop and Adam optimizers. This code is very inefficient compared to the state-of-the-art; if you want to train deep networks for non-trivial cases we suggest using Keras or PyTorch. The naming of parameters in the AIPython code is consistent with Keras.
9. Reasoning with Uncertainty
- probFactors.py defines factors and multiple representations for conditional probabilities.
- probGraphicalModels.py defines graphical models, and belief (Bayesian) networks as special cases.
- probExamples.py provides a suite of example belief networks.
- probRC.py implements recursive conditioning for graphical models (and so for belief networks).
- probVE.py implements variable elimination for graphical models.
- probStochSim.py implementations of multiple stochastic simulation algorithms for belief networks.
- probHMM.py implementation of hidden Markov models. It defines both exact inference for filtering and particle filtering.
- probLocalization.py provides a GUI for an example of probabilistic localization.
- probDBN.py defines dynamic belief networks and uses the variable elimination code for filtering.
10. Learning with Uncertainty
- learnBayesian.py demonstrates beta distribution as a belief network and as plots.
- learnKMeans.py k-means for unsupervised learning.
- learnEM.py EM for unsupervised learning.
11. Causality
- probDo.py enables queries with "do" as well as observations
- probCounterfactual.py allows users to explore counterfactual reasoning
12. Planning with Uncertainty
- decnNetworks.py defines and solves decision networks (influence diagrams).
- mdpProblem.py defines agent-environment problem domains that can be used for MDPs and reinforcement learning. It also defined value iteration and asynchronous value iteration.
- mdpExamples.py defines some example problem domains.
- mdpGUI.py defines a GUI to explore value iteration.
13. Reinforcement Learning
- Reinforcement learning environments:
- rlProblem.py defines reinforcement learning agents and environments, constructing an environment from an MDP and agent-environment problem domains of the previous chapter. It also implements exploration strategies, simulates learning agents and plots performance
- rlExamples.py defines multiple reinforcement leaning domains, including a simple game and a monster game.
- Learners:
- rlQLearner.py Q-learner
- rlQExperienceReplay.py Q-learner with experience replay
- rlModelLearner.py Model-based reinforcement learner,
- rlFeatures.py Feature-based reinforcement learner. rlGameFeatures.py defines features for some environments, including the monster game.
- rlGUI.py defines a graphical user interface (GUI) for exploring how the value function, the Q-function and optimal policy change as an agent acts.
14. Multiagent Systems
- masProblem.py defines two-player zero-sum games.
- masMiniMax.py minimax with alpha-beta pruning.
- masLearn.py reinforcement learning with multiple agents.
15. Individuals and Relations
- logicRelation.py Implementations of Datalog and Logic Programs, with a top-down proof procedure. Not nearly as efficient as Prolog.
16. Knowledge Graphs and Ontologies
- knowledgeGraph.py implements a triple store with efficient indexing of triple no matter which of the subject, verb or object is specified.
- knowledgeReasoning.py Integrates the triple store with logicRelation.py.
17. Relational Learning
- relnCollFilt.py collaborative filtering.
- relnProbModels.py represents relational probabilistic models (plate models), and grounds them into graphical models that can use any of the inference methods.
Copyright © David Poole and Alan Mackworth, 2024. This web page and the code are released under a Creative Commons Attribution-Noncommercial-Share Alike license.