Glossary of artificial intelligence

This glossary of artificial intelligence is a list of definitions of terms and concepts relevant to the study of artificial intelligence (AI), its subdisciplines, and related fields. Related glossaries include Glossary of computer science, Glossary of robotics, and Glossary of machine vision.

Pronounced "A-star".

A graph traversal and pathfinding algorithm which is used in many fields of computer science due to its completeness, optimality, and optimal efficiency.
abductive logic programming (ALP)
A high-level knowledge-representation framework that can be used to solve problems declaratively based on abductive reasoning. It extends normal logic programming by allowing some predicates to be incompletely defined, declared as abducible predicates.
abductive reasoning

Also abduction.

A form of logical inference which starts with an observation or set of observations then seeks to find the simplest and most likely explanation. This process, unlike deductive reasoning, yields a plausible conclusion but does not positively verify it.[1] abductive inference,[1] or retroduction[2]
ablation
The removal of a component of an AI system. An ablation study aims to determine the contribution of a component to an AI system by removing the component, and then analyzing the resultant performance of the system.[3]
abstract data type
A mathematical model for data types, where a data type is defined by its behavior (semantics) from the point of view of a user of the data, specifically in terms of possible values, possible operations on data of this type, and the behavior of these operations.
abstraction
The process of removing physical, spatial, or temporal details[4] or attributes in the study of objects or systems in order to more closely attend to other details of interest[5]
accelerating change
A perceived increase in the rate of technological change throughout history, which may suggest faster and more profound change in the future and may or may not be accompanied by equally profound social and cultural change.
action language
A language for specifying state transition systems, and is commonly used to create formal models of the effects of actions on the world.[6] Action languages are commonly used in the artificial intelligence and robotics domains, where they describe how actions affect the states of systems over time, and may be used for automated planning.
action model learning
An area of machine learning concerned with creation and modification of software agent's knowledge about effects and preconditions of the actions that can be executed within its environment. This knowledge is usually represented in logic-based action description language and used as the input for automated planners.
action selection
A way of characterizing the most basic problem of intelligent systems: what to do next. In artificial intelligence and computational cognitive science, "the action selection problem" is typically associated with intelligent agents and animats—artificial systems that exhibit complex behaviour in an agent environment.
activation function
In artificial neural networks, the activation function of a node defines the output of that node given an input or set of inputs.
adaptive algorithm
An algorithm that changes its behavior at the time it is run, based on a priori defined reward mechanism or criterion.
adaptive neuro fuzzy inference system (ANFIS)

Also adaptive network-based fuzzy inference system.

A kind of artificial neural network that is based on Takagi–Sugeno fuzzy inference system. The technique was developed in the early 1990s.[7][8] Since it integrates both neural networks and fuzzy logic principles, it has potential to capture the benefits of both in a single framework. Its inference system corresponds to a set of fuzzy IF–THEN rules that have learning capability to approximate nonlinear functions.[9] Hence, ANFIS is considered to be a universal estimator.[10] For using the ANFIS in a more efficient and optimal way, one can use the best parameters obtained by genetic algorithm.[11][12]
admissible heuristic
In computer science, specifically in algorithms related to pathfinding, a heuristic function is said to be admissible if it never overestimates the cost of reaching the goal, i.e. the cost it estimates to reach the goal is not higher than the lowest possible cost from the current point in the path.[13]
affective computing

Also artificial emotional intelligence or emotion AI.

The study and development of systems and devices that can recognize, interpret, process, and simulate human affects. Affective computing is an interdisciplinary field spanning computer science, psychology, and cognitive science.[14][15]
agent architecture
A blueprint for software agents and intelligent control systems, depicting the arrangement of components. The architectures implemented by intelligent agents are referred to as cognitive architectures.[16]
AI accelerator
A class of microprocessor[17] or computer system[18] designed as hardware acceleration for artificial intelligence applications, especially artificial neural networks, machine vision, and machine learning.
AI-complete
In the field of artificial intelligence, the most difficult problems are informally known as AI-complete or AI-hard, implying that the difficulty of these computational problems is equivalent to that of solving the central artificial intelligence problem—making computers as intelligent as people, or strong AI.[19] To call a problem AI-complete reflects an attitude that it would not be solved by a simple specific algorithm.
algorithm
An unambiguous specification of how to solve a class of problems. Algorithms can perform calculation, data processing, and automated reasoning tasks.
algorithmic efficiency
A property of an algorithm which relates to the number of computational resources used by the algorithm. An algorithm must be analyzed to determine its resource usage, and the efficiency of an algorithm can be measured based on usage of different resources. Algorithmic efficiency can be thought of as analogous to engineering productivity for a repeating or continuous process.
algorithmic probability
In algorithmic information theory, algorithmic probability, also known as Solomonoff probability, is a mathematical method of assigning a prior probability to a given observation. It was invented by Ray Solomonoff in the 1960s.[20]
AlphaGo
A computer program that plays the board game Go.[21] It was developed by Alphabet Inc.'s Google DeepMind in London. AlphaGo has several versions including AlphaGo Zero, AlphaGo Master, AlphaGo Lee, etc.[22] In October 2015, AlphaGo became the first computer Go program to beat a human professional Go player without handicaps on a full-sized 19×19 board.[23][24]
ambient intelligence (AmI)
Electronic environments that are sensitive and responsive to the presence of people.
analysis of algorithms
The determination of the computational complexity of algorithms, that is the amount of time, storage and/or other resources necessary to execute them. Usually, this involves determining a function that relates the length of an algorithm's input to the number of steps it takes (its time complexity) or the number of storage locations it uses (its space complexity).
analytics
The discovery, interpretation, and communication of meaningful patterns in data.
answer set programming (ASP)
A form of declarative programming oriented towards difficult (primarily NP-hard) search problems. It is based on the stable model (answer set) semantics of logic programming. In ASP, search problems are reduced to computing stable models, and answer set solvers—programs for generating stable models—are used to perform search.
ant colony optimization (ACO)
A probabilistic technique for solving computational problems that can be reduced to finding good paths through graphs.
anytime algorithm
An algorithm that can return a valid solution to a problem even if it is interrupted before it ends.
application programming interface (API)
A set of subroutine definitions, communication protocols, and tools for building software. In general terms, it is a set of clearly defined methods of communication among various components. A good API makes it easier to develop a computer program by providing all the building blocks, which are then put together by the programmer. An API may be for a web-based system, operating system, database system, computer hardware, or software library.
approximate string matching

Also fuzzy string searching.

The technique of finding strings that match a pattern approximately (rather than exactly). The problem of approximate string matching is typically divided into two sub-problems: finding approximate substring matches inside a given string and finding dictionary strings that match the pattern approximately.
approximation error
The discrepancy between an exact value and some approximation to it.
argumentation framework

Also argumentation system.

A way to deal with contentious information and draw conclusions from it. In an abstract argumentation framework,[25] entry-level information is a set of abstract arguments that, for instance, represent data or a proposition. Conflicts between arguments are represented by a binary relation on the set of arguments. In concrete terms, you represent an argumentation framework with a directed graph such that the nodes are the arguments, and the arrows represent the attack relation. There exist some extensions of the Dung's framework, like the logic-based argumentation frameworks[26] or the value-based argumentation frameworks.[27]
artificial general intelligence (AGI)
A type of AI that matches or surpasses human cognitive capabilities across a wide range of cognitive tasks.
artificial immune system (AIS)
A class of computationally intelligent, rule-based machine learning systems inspired by the principles and processes of the vertebrate immune system. The algorithms are typically modeled after the immune system's characteristics of learning and memory for use in problem-solving.
artificial intelligence (AI)

Also machine intelligence.

Any intelligence demonstrated by machines, in contrast to the natural intelligence displayed by humans and other animals. In computer science, AI research is defined as the study of "intelligent agents": any device that perceives its environment and takes actions that maximize its chance of successfully achieving its goals.[28] Colloquially, the term "artificial intelligence" is applied when a machine mimics "cognitive" functions that humans associate with other human minds, such as "learning" and "problem solving".[29]
Artificial Intelligence Markup Language
An XML dialect for creating natural language software agents.
artificial neural network (ANN)

Also connectionist system.

Artificial neural networks (ANNs), also shortened to neural networks (NNs) or neural nets, are a branch of machine learning models that are built using principles of neuronal organization discovered by connectionism in the biological neural networks constituting animal brains.[30][31]
Association for the Advancement of Artificial Intelligence (AAAI)
An international, nonprofit, scientific society devoted to promote research in, and responsible use of, artificial intelligence. AAAI also aims to increase public understanding of artificial intelligence (AI), improve the teaching and training of AI practitioners, and provide guidance for research planners and funders concerning the importance and potential of current AI developments and future directions.[32]
asymptotic computational complexity
In computational complexity theory, asymptotic computational complexity is the usage of asymptotic analysis for the estimation of computational complexity of algorithms and computational problems, commonly associated with the usage of the big O notation.
attention mechanism
Machine learning-based attention is a mechanism mimicking cognitive attention. It calculates "soft" weights for each word, more precisely for its embedding, in the context window. It can do it either in parallel (such as in transformers) or sequentially (such as recursive neural networks). "Soft" weights can change during each runtime, in contrast to "hard" weights, which are (pre-)trained and fine-tuned and remain frozen afterwards. Multiple attention heads are used in transformer-based large language models.
attributional calculus
A logic and representation system defined by Ryszard S. Michalski. It combines elements of predicate logic, propositional calculus, and multi-valued logic. Attributional calculus provides a formal language for natural induction, an inductive learning process whose results are in forms natural to people.
augmented reality (AR)
An interactive experience of a real-world environment where the objects that reside in the real-world are "augmented" by computer-generated perceptual information, sometimes across multiple sensory modalities, including visual, auditory, haptic, somatosensory, and olfactory.[33]
autoencoder
A type of artificial neural network used to learn efficient codings of unlabeled data (unsupervised learning). A common implementation is the variational autoencoder (VAE).
automata theory
The study of abstract machines and automata, as well as the computational problems that can be solved using them. It is a theory in theoretical computer science and discrete mathematics (a subject of study in both mathematics and computer science).
automated machine learning (AutoML)
A field of machine learning which aims to automatically configure a machine learning system to maximize its performance (e.g, classification accuracy).
automated planning and scheduling

Also simply AI planning.

A branch of artificial intelligence that concerns the realization of strategies or action sequences, typically for execution by intelligent agents, autonomous robots and unmanned vehicles. Unlike classical control and classification problems, the solutions are complex and must be discovered and optimized in multidimensional space. Planning is also related to decision theory.[34]
automated reasoning
An area of computer science and mathematical logic dedicated to understanding different aspects of reasoning. The study of automated reasoning helps produce computer programs that allow computers to reason completely, or nearly completely, automatically. Although automated reasoning is considered a sub-field of artificial intelligence, it also has connections with theoretical computer science, and even philosophy.
autonomic computing (AC)
The self-managing characteristics of distributed computing resources, adapting to unpredictable changes while hiding intrinsic complexity to operators and users. Initiated by IBM in 2001, this initiative ultimately aimed to develop computer systems capable of self-management, to overcome the rapidly growing complexity of computing systems management, and to reduce the barrier that complexity poses to further growth.[35]
autonomous car

Also self-driving car, robot car, and driverless car.

A vehicle that is capable of sensing its environment and moving with little or no human input.[36][37][38]
autonomous robot
A robot that performs behaviors or tasks with a high degree of autonomy. Autonomous robotics is usually considered to be a subfield of artificial intelligence, robotics, and information engineering.[39]
backpropagation
A method used in artificial neural networks to calculate a gradient that is needed in the calculation of the weights to be used in the network.[40] Backpropagation is shorthand for "the backward propagation of errors", since an error is computed at the output and distributed backwards throughout the network's layers. It is commonly used to train deep neural networks,[41] a term referring to neural networks with more than one hidden layer.[42]
backpropagation through time (BPTT)
A gradient-based technique for training certain types of recurrent neural networks. It can be used to train Elman networks. The algorithm was independently derived by numerous researchers.[43][44][45]
backward chaining

Also backward reasoning.

An inference method described colloquially as working backward from the goal. It is used in automated theorem provers, inference engines, proof assistants, and other artificial intelligence applications.[46]
bag-of-words model
A simplifying representation used in natural language processing and information retrieval (IR). In this model, a text (such as a sentence or a document) is represented as the bag (multiset) of its words, disregarding grammar and even word order but keeping multiplicity. The bag-of-words model has also been used for computer vision.[47] The bag-of-words model is commonly used in methods of document classification where the (frequency of) occurrence of each word is used as a feature for training a classifier.[48]
bag-of-words model in computer vision
In computer vision, the bag-of-words model (BoW model) can be applied to image classification, by treating image features as words. In document classification, a bag of words is a sparse vector of occurrence counts of words; that is, a sparse histogram over the vocabulary. In computer vision, a bag of visual words is a vector of occurrence counts of a vocabulary of local image features.
batch normalization
A technique for improving the performance and stability of artificial neural networks. It is a technique to provide any layer in a neural network with inputs that are zero mean/unit variance.[49] Batch normalization was introduced in a 2015 paper.[50][51] It is used to normalize the input layer by adjusting and scaling the activations.
Bayesian programming
A formalism and a methodology for having a technique to specify probabilistic models and solve problems when less than the necessary information is available.
bees algorithm
A population-based search algorithm which was developed by Pham, Ghanbarzadeh and et al. in 2005.[52] It mimics the food foraging behaviour of honey bee colonies. In its basic version the algorithm performs a kind of neighborhood search combined with global search, and can be used for both combinatorial optimization and continuous optimization. The only condition for the application of the bees algorithm is that some measure of distance between the solutions is defined. The effectiveness and specific abilities of the bees algorithm have been proven in a number of studies.[53][54][55][56]
behavior informatics (BI)
The informatics of behaviors so as to obtain behavior intelligence and behavior insights.[57]
behavior tree (BT)
A mathematical model of plan execution used in computer science, robotics, control systems and video games. They describe switchings between a finite set of tasks in a modular fashion. Their strength comes from their ability to create very complex tasks composed of simple tasks, without worrying how the simple tasks are implemented. BTs present some similarities to hierarchical state machines with the key difference that the main building block of a behavior is a task rather than a state. Its ease of human understanding make BTs less error-prone and very popular in the game developer community. BTs have shown to generalize several other control architectures.[58][59]
belief–desire–intention software model (BDI)
A software model developed for programming intelligent agents. Superficially characterized by the implementation of an agent's beliefs, desires and intentions, it actually uses these concepts to solve a particular problem in agent programming. In essence, it provides a mechanism for separating the activity of selecting a plan (from a plan library or an external planner application) from the execution of currently active plans. Consequently, BDI agents are able to balance the time spent on deliberating about plans (choosing what to do) and executing those plans (doing it). A third activity, creating the plans in the first place (planning), is not within the scope of the model, and is left to the system designer and programmer.
bias–variance tradeoff
In statistics and machine learning, the bias–variance tradeoff is the property of a set of predictive models whereby models with a lower bias in parameter estimation have a higher variance of the parameter estimates across samples, and vice versa.
big data
A term used to refer to data sets that are too large or complex for traditional data-processing application software to adequately deal with. Data with many cases (rows) offer greater statistical power, while data with higher complexity (more attributes or columns) may lead to a higher false discovery rate.[60]
Big O notation
A mathematical notation that describes the limiting behavior of a function when the argument tends towards a particular value or infinity. It is a member of a family of notations invented by Paul Bachmann,[61] Edmund Landau,[62] and others, collectively called Bachmann–Landau notation or asymptotic notation.
binary tree
A tree data structure in which each node has at most two children, which are referred to as the left child and the right child. A recursive definition using just set theory notions is that a (non-empty) binary tree is a tuple (L, S, R), where L and R are binary trees or the empty set and S is a singleton set.[63] Some authors allow the binary tree to be the empty set as well.[64]
blackboard system
An artificial intelligence approach based on the blackboard architectural model,[65][66][67][68] where a common knowledge base, the "blackboard", is iteratively updated by a diverse group of specialist knowledge sources, starting with a problem specification and ending with a solution. Each knowledge source updates the blackboard with a partial solution when its internal constraints match the blackboard state. In this way, the specialists work together to solve the problem.
Boltzmann machine

Also stochastic Hopfield network with hidden units.

A type of stochastic recurrent neural network and Markov random field.[69] Boltzmann machines can be seen as the stochastic, generative counterpart of Hopfield networks.
Boolean satisfiability problem

Also propositional satisfiability problem; abbreviated SATISFIABILITY or SAT.

The problem of determining if there exists an interpretation that satisfies a given Boolean formula. In other words, it asks whether the variables of a given Boolean formula can be consistently replaced by the values TRUE or FALSE in such a way that the formula evaluates to TRUE. If this is the case, the formula is called satisfiable. On the other hand, if no such assignment exists, the function expressed by the formula is FALSE for all possible variable assignments and the formula is unsatisfiable. For example, the formula "a AND NOT b" is satisfiable because one can find the values a = TRUE and b = FALSE, which make (a AND NOT b) = TRUE. In contrast, "a AND NOT a" is unsatisfiable.
boosting
A machine learning ensemble metaheuristic for primarily reducing bias (as opposed to variance), by training models sequentially, each one correcting the errors of its predecessor.
bootstrap aggregating

Also bagging or bootstrapping.

A machine learning ensemble metaheuristic for primarily reducing variance (as opposed to bias), by training multiple models independently and averaging their predictions.
brain technology

Also self-learning know-how system.

A technology that employs the latest findings in neuroscience. The term was first introduced by the Artificial Intelligence Laboratory in Zurich, Switzerland, in the context of the ROBOY project.[70] Brain Technology can be employed in robots,[71] know-how management systems[72] and any other application with self-learning capabilities. In particular, Brain Technology applications allow the visualization of the underlying learning architecture often coined as "know-how maps".
branching factor
In computing, tree data structures, and game theory, the number of children at each node, the outdegree. If this value is not uniform, an average branching factor can be calculated.

Also exhaustive search or generate and test.

A very general problem-solving technique and algorithmic paradigm that consists of systematically enumerating all possible candidates for the solution and checking whether each candidate satisfies the problem's statement.
capsule neural network (CapsNet)
A machine learning system that is a type of artificial neural network (ANN) that can be used to better model hierarchical relationships. The approach is an attempt to more closely mimic biological neural organization.[73]
case-based reasoning (CBR)
Broadly construed, the process of solving new problems based on the solutions of similar past problems.
chatbot

Also smartbot, talkbot, chatterbot, bot, IM bot, interactive agent, conversational interface, or artificial conversational entity.

A computer program or an artificial intelligence which conducts a conversation via auditory or textual methods.[74]
cloud robotics
A field of robotics that attempts to invoke cloud technologies such as cloud computing, cloud storage, and other Internet technologies centred on the benefits of converged infrastructure and shared services for robotics. When connected to the cloud, robots can benefit from the powerful computation, storage, and communication resources of modern data center in the cloud, which can process and share information from various robots or agent (other machines, smart objects, humans, etc.). Humans can also delegate tasks to robots remotely through networks. Cloud computing technologies enable robot systems to be endowed with powerful capability whilst reducing costs through cloud technologies. Thus, it is possible to build lightweight, low cost, smarter robots have intelligent "brain" in the cloud. The "brain" consists of data center, knowledge base, task planners, deep learning, information processing, environment models, communication support, etc.[75][76][77][78]
cluster analysis

Also clustering.

The task of grouping a set of objects in such a way that objects in the same group (called a cluster) are more similar (in some sense) to each other than to those in other groups (clusters). It is a main task of exploratory data mining, and a common technique for statistical data analysis, used in many fields, including machine learning, pattern recognition, image analysis, information retrieval, bioinformatics, data compression, and computer graphics.
Cobweb
An incremental system for hierarchical conceptual clustering. COBWEB was invented by Professor Douglas H. Fisher, currently at Vanderbilt University.[79][80] COBWEB incrementally organizes observations into a classification tree. Each node in a classification tree represents a class (concept) and is labeled by a probabilistic concept that summarizes the attribute-value distributions of objects classified under the node. This classification tree can be used to predict missing attributes or the class of a new object.[81]
cognitive architecture
The Institute of Creative Technologies defines cognitive architecture as: "hypothesis about the fixed structures that provide a mind, whether in natural or artificial systems, and how they work together – in conjunction with knowledge and skills embodied within the architecture – to yield intelligent behavior in a diversity of complex environments."[82]
cognitive computing
In general, the term cognitive computing has been used to refer to new hardware and/or software that mimics the functioning of the human brain[83][84][85][86][87][88] and helps to improve human decision-making.[89] In this sense, CC is a new type of computing with the goal of more accurate models of how the human brain/mind senses, reasons, and responds to stimulus.
cognitive science
The interdisciplinary scientific study of the mind and its processes.[90]
combinatorial optimization
In Operations Research, applied mathematics and theoretical computer science, combinatorial optimization is a topic that consists of finding an optimal object from a finite set of objects.[91]
committee machine
A type of artificial neural network using a divide and conquer strategy in which the responses of multiple neural networks (experts) are combined into a single response.[92] The combined response of the committee machine is supposed to be superior to those of its constituent experts. Compare ensembles of classifiers.
commonsense knowledge
In artificial intelligence research, commonsense knowledge consists of facts about the everyday world, such as "Lemons are sour", that all humans are expected to know. The first AI program to address common sense knowledge was Advice Taker in 1959 by John McCarthy.[93]
commonsense reasoning
A branch of artificial intelligence concerned with simulating the human ability to make presumptions about the type and essence of ordinary situations they encounter every day.[94]
computational chemistry
A branch of chemistry that uses computer simulation to assist in solving chemical problems.
computational complexity theory
Focuses on classifying computational problems according to their inherent difficulty, and relating these classes to each other. A computational problem is a task solved by a computer. A computation problem is solvable by mechanical application of mathematical steps, such as an algorithm.
computational creativity

Also artificial creativity, mechanical creativity, creative computing, or creative computation.

A multidisciplinary endeavour that includes the fields of artificial intelligence, cognitive psychology, philosophy, and the arts.
computational cybernetics
The integration of cybernetics and computational intelligence techniques.
computational humor
A branch of computational linguistics and artificial intelligence which uses computers in humor research.[95]
computational intelligence (CI)
Usually refers to the ability of a computer to learn a specific task from data or experimental observation.
computational learning theory
In computer science, computational learning theory (or just learning theory) is a subfield of artificial intelligence devoted to studying the design and analysis of machine learning algorithms.[96]
computational linguistics
An interdisciplinary field concerned with the statistical or rule-based modeling of natural language from a computational perspective, as well as the study of appropriate computational approaches to linguistic questions.
computational mathematics
The mathematical research in areas of science where computing plays an essential role.
computational neuroscience

Also theoretical neuroscience or mathematical neuroscience.

A branch of neuroscience which employs mathematical models, theoretical analysis and abstractions of the brain to understand the principles that govern the development, structure, physiology, and cognitive abilities of the nervous system.[97][98][99][100]
computational number theory

Also algorithmic number theory.

The study of algorithms for performing number theoretic computations.
computational problem
In theoretical computer science, a computational problem is a mathematical object representing a collection of questions that computers might be able to solve.
computational statistics

Also statistical computing.

The interface between statistics and computer science.
computer-automated design (CAutoD)
Design automation usually refers to electronic design automation, or Design Automation which is a Product Configurator. Extending Computer-Aided Design (CAD), automated design and computer-automated design[101][102][103] are concerned with a broader range of applications, such as automotive engineering, civil engineering,[104][105][106][107] composite material design, control engineering,[108] dynamic system identification and optimization,[109] financial systems, industrial equipment, mechatronic systems, steel construction,[110] structural optimisation,[111] and the invention of novel systems. More recently, traditional CAD simulation is seen to be transformed to CAutoD by biologically inspired machine learning,[112] including heuristic search techniques such as evolutionary computation,[113][114] and swarm intelligence algorithms.[115]
computer audition (CA)
See machine listening.
computer science
The theory, experimentation, and engineering that form the basis for the design and use of computers. It involves the study of algorithms that process, store, and communicate digital information. A computer scientist specializes in the theory of computation and the design of computational systems.[116]
computer vision
An interdisciplinary scientific field that deals with how computers can be made to gain high-level understanding from digital images or videos. From the perspective of engineering, it seeks to automate tasks that the human visual system can do.[117][118][119]
concept drift
In predictive analytics and machine learning, the concept drift means that the statistical properties of the target variable, which the model is trying to predict, change over time in unforeseen ways. This causes problems because the predictions become less accurate as time passes.
connectionism
An approach in the fields of cognitive science, that hopes to explain mental phenomena using artificial neural networks.[120]
consistent heuristic
In the study of path-finding problems in artificial intelligence, a heuristic function is said to be consistent, or monotone, if its estimate is always less than or equal to the estimated distance from any neighboring vertex to the goal, plus the cost of reaching that neighbor.
constrained conditional model (CCM)
A machine learning and inference framework that augments the learning of conditional (probabilistic or discriminative) models with declarative constraints.
constraint logic programming
A form of constraint programming, in which logic programming is extended to include concepts from constraint satisfaction. A constraint logic program is a logic program that contains constraints in the body of clauses. An example of a clause including a constraint is A(X,Y) :- X+Y>0, B(X), C(Y). In this clause, X+Y>0 is a constraint; A(X,Y), B(X), and C(Y) are literals as in regular logic programming. This clause states one condition under which the statement A(X,Y) holds: X+Y is greater than zero and both B(X) and C(Y) are true.
constraint programming
A programming paradigm wherein relations between variables are stated in the form of constraints. Constraints differ from the common primitives of imperative programming languages in that they do not specify a step or sequence of steps to execute, but rather the properties of a solution to be found.
constructed language

Also conlang.

A language whose phonology, grammar, and vocabulary are consciously devised, instead of having developed naturally. Constructed languages may also be referred to as artificial, planned, or invented languages.[121]
control theory
In control systems engineering is a subfield of mathematics that deals with the control of continuously operating dynamical systems in engineered processes and machines. The objective is to develop a control model for controlling such systems using a control action in an optimum manner without delay or overshoot and ensuring control stability.
convolutional neural network
In deep learning, a convolutional neural network (CNN, or ConvNet) is a class of deep neural networks, most commonly applied to analyzing visual imagery. CNNs use a variation of multilayer perceptrons designed to require minimal preprocessing.[122] They are also known as shift invariant or space invariant artificial neural networks (SIANN), based on their shared-weights architecture and translation invariance characteristics.[123][124]
crossover

Also recombination.

In genetic algorithms and evolutionary computation, a genetic operator used to combine the genetic information of two parents to generate new offspring. It is one way to stochastically generate new solutions from an existing population, and analogous to the crossover that happens during sexual reproduction in biological organisms. Solutions can also be generated by cloning an existing solution, which is analogous to asexual reproduction. Newly generated solutions are typically mutated before being added to the population.
Darkforest
A computer go program developed by Facebook, based on deep learning techniques using a convolutional neural network. Its updated version Darkfores2 combines the techniques of its predecessor with Monte Carlo tree search.[125][126] The MCTS effectively takes tree search methods commonly seen in computer chess programs and randomizes them.[127] With the update, the system is known as Darkfmcts3.[128]
Dartmouth workshop
The Dartmouth Summer Research Project on Artificial Intelligence was the name of a 1956 summer workshop now considered by many[129][130] (though not all[131]) to be the seminal event for artificial intelligence as a field.
data augmentation
Data augmentation in data analysis are techniques used to increase the amount of data. It helps reduce overfitting when training a learning algorithm.
data fusion
The process of integrating multiple data sources to produce more consistent, accurate, and useful information than that provided by any individual data source.[132]
data integration
The process of combining data residing in different sources and providing users with a unified view of them.[133] This process becomes significant in a variety of situations, which include both commercial (such as when two similar companies need to merge their databases) and scientific (combining research results from different bioinformatics repositories, for example) domains. Data integration appears with increasing frequency as the volume (that is, big data) and the need to share existing data explodes.[134] It has become the focus of extensive theoretical work, and numerous open problems remain unsolved.
data mining
The process of discovering patterns in large data sets involving methods at the intersection of machine learning, statistics, and database systems.
data science
An interdisciplinary field that uses scientific methods, processes, algorithms and systems to extract knowledge and insights from data in various forms, both structured and unstructured,[135][136] similar to data mining. Data science is a "concept to unify statistics, data analysis, machine learning and their related methods" in order to "understand and analyze actual phenomena" with data.[137] It employs techniques and theories drawn from many fields within the context of mathematics, statistics, information science, and computer science.
data set

Also dataset.

A collection of data. Most commonly a data set corresponds to the contents of a single database table, or a single statistical data matrix, where every column of the table represents a particular variable, and each row corresponds to a given member of the data set in question. The data set lists values for each of the variables, such as height and weight of an object, for each member of the data set. Each value is known as a datum. The data set may comprise data for one or more members, corresponding to the number of rows.
data warehouse (DW or DWH)

Also enterprise data warehouse (EDW).

A system used for reporting and data analysis.[138] DWs are central repositories of integrated data from one or more disparate sources. They store current and historical data in one single place[139]
Datalog
A declarative logic programming language that syntactically is a subset of Prolog. It is often used as a query language for deductive databases. In recent years, Datalog has found new application in data integration, information extraction, networking, program analysis, security, and cloud computing.[140]
decision boundary
In the case of backpropagation-based artificial neural networks or perceptrons, the type of decision boundary that the network can learn is determined by the number of hidden layers the network has. If it has no hidden layers, then it can only learn linear problems. If it has one hidden layer, then it can learn any continuous function on compact subsets of Rn as shown by the Universal approximation theorem, thus it can have an arbitrary decision boundary.
decision support system (DSS)
Aan information system that supports business or organizational decision-making activities. DSSs serve the management, operations and planning levels of an organization (usually mid and higher management) and help people make decisions about problems that may be rapidly changing and not easily specified in advance—i.e. unstructured and semi-structured decision problems. Decision support systems can be either fully computerized or human-powered, or a combination of both.
decision theory

Also theory of choice.

The study of the reasoning underlying an agent's choices.[141] Decision theory can be broken into two branches: normative decision theory, which gives advice on how to make the best decisions given a set of uncertain beliefs and a set of values, and descriptive decision theory which analyzes how existing, possibly irrational agents actually make decisions.
decision tree learning
Uses a decision tree (as a predictive model) to go from observations about an item (represented in the branches) to conclusions about the item's target value (represented in the leaves). It is one of the predictive modeling approaches used in statistics, data mining and machine learning.
declarative programming
A programming paradigm—a style of building the structure and elements of computer programs—that expresses the logic of a computation without describing its control flow.[142]
deductive classifier
A type of artificial intelligence inference engine. It takes as input a set of declarations in a frame language about a domain such as medical research or molecular biology. For example, the names of classes, sub-classes, properties, and restrictions on allowable values.
Deep Blue
was a chess-playing computer developed by IBM. It is known for being the first computer chess-playing system to win both a chess game and a chess match against a reigning world champion under regular time controls.
deep learning

Also deep structured learning or hierarchical learning.

Part of a broader family of machine learning methods based on learning data representations, as opposed to task-specific algorithms. Learning can be supervised, semi-supervised, or unsupervised.[143][144][145]
DeepMind Technologies
A British artificial intelligence company founded in September 2010, currently owned by Alphabet Inc. The company is based in London, with research centres in Canada,[146] France,[147] and the United States. Acquired by Google in 2014, the company has created a neural network that learns how to play video games in a fashion similar to that of humans,[148] as well as a neural Turing machine,[149] or a neural network that may be able to access an external memory like a conventional Turing machine, resulting in a computer that mimics the short-term memory of the human brain.[150][151] The company made headlines in 2016 after its AlphaGo program beat human professional Go player Lee Sedol, the world champion, in a five-game match, which was the subject of a documentary film.[152] A more general program, AlphaZero, beat the most powerful programs playing Go, chess, and shogi (Japanese chess) after a few days of play against itself using reinforcement learning.[153]
default logic
A non-monotonic logic proposed by Raymond Reiter to formalize reasoning with default assumptions.
Density-based spatial clustering of applications with noise (DBSCAN)
A clustering algorithm proposed by Martin Ester, Hans-Peter Kriegel, Jörg Sander, and Xiaowei Xu in 1996.[154]
description logic (DL)
A family of formal knowledge representation languages. Many DLs are more expressive than propositional logic but less expressive than first-order logic. In contrast to the latter, the core reasoning problems for DLs are (usually) decidable, and efficient decision procedures have been designed and implemented for these problems. There are general, spatial, temporal, spatiotemporal, and fuzzy descriptions logics, and each description logic features a different balance between DL expressivity and reasoning complexity by supporting different sets of mathematical constructors.[155]
developmental robotics (DevRob)

Also epigenetic robotics.

A scientific field which aims at studying the developmental mechanisms, architectures, and constraints that allow lifelong and open-ended learning of new skills and new knowledge in embodied machines.
diagnosis
Concerned with the development of algorithms and techniques that are able to determine whether the behaviour of a system is correct. If the system is not functioning correctly, the algorithm should be able to determine, as accurately as possible, which part of the system is failing, and which kind of fault it is facing. The computation is based on observations, which provide information on the current behaviour.
dialogue system

Also conversational agent (CA).

A computer system intended to converse with a human with a coherent structure. Dialogue systems have employed text, speech, graphics, haptics, gestures, and other modes for communication on both the input and output channel.
diffusion model
In machine learning, diffusion models, also known as diffusion probabilistic models or score-based generative models, are a class of latent variable models. They are Markov chains trained using variational inference.[156] The goal of diffusion models is to learn the latent structure of a dataset by modeling the way in which data points diffuse through the latent space. In computer vision, this means that a neural network is trained to denoise images blurred with Gaussian noise by learning to reverse the diffusion process.[157][158] It mainly consists of three major components: the forward process, the reverse process, and the sampling procedure.[159] Three examples of generic diffusion modeling frameworks used in computer vision are denoising diffusion probabilistic models, noise conditioned score networks, and stochastic differential equations.[160]
Dijkstra's algorithm
An algorithm for finding the shortest paths between nodes in a weighted graph, which may represent, for example, road networks.
dimensionality reduction

Also dimension reduction.

The process of reducing the number of random variables under consideration[161] by obtaining a set of principal variables. It can be divided into feature selection and feature extraction.[162]
discrete system
Any system with a countable number of states. Discrete systems may be contrasted with continuous systems, which may also be called analog systems. A final discrete system is often modeled with a directed graph and is analyzed for correctness and complexity according to computational theory. Because discrete systems have a countable number of states, they may be described in precise mathematical models. A computer is a finite state machine that may be viewed as a discrete system. Because computers are often used to model not only other discrete systems but continuous systems as well, methods have been developed to represent real-world continuous systems as discrete systems. One such method involves sampling a continuous signal at discrete time intervals.
distributed artificial intelligence (DAI)

Also decentralized artificial intelligence.

A subfield of artificial intelligence research dedicated to the development of distributed solutions for problems. DAI is closely related to and a predecessor of the field of multi-agent systems.[163]
dropout

Also dilution.

A regularization technique for reducing overfitting in artificial neural networks by preventing complex co-adaptations on training data.
dynamic epistemic logic (DEL)
A logical framework dealing with knowledge and information change. Typically, DEL focuses on situations involving multiple agents and studies how their knowledge changes when events occur.
eager learning
A learning method in which the system tries to construct a general, input-independent target function during training of the system, as opposed to lazy learning, where generalization beyond the training data is delayed until a query is made to the system.[164]
early stopping
A regularization technique often used when training a machine learning model with an iterative method such as gradient descent.
Ebert test
A test which gauges whether a computer-based synthesized voice[165][166] can tell a joke with sufficient skill to cause people to laugh.[167] It was proposed by film critic Roger Ebert at the 2011 TED conference as a challenge to software developers to have a computerized voice master the inflections, delivery, timing, and intonations of a speaking human.[165] The test is similar to the Turing test proposed by Alan Turing in 1950 as a way to gauge a computer's ability to exhibit intelligent behavior by generating performance indistinguishable from a human being.[168]
echo state network (ESN)
A recurrent neural network with a sparsely connected hidden layer (with typically 1% connectivity). The connectivity and weights of hidden neurons are fixed and randomly assigned. The weights of output neurons can be learned so that the network can (re)produce specific temporal patterns. The main interest of this network is that although its behaviour is non-linear, the only weights that are modified during training are for the synapses that connect the hidden neurons to output neurons. Thus, the error function is quadratic with respect to the parameter vector and can be differentiated easily to a linear system.[169][170]
embodied agent

Also interface agent.

An intelligent agent that interacts with the environment through a physical body within that environment. Agents that are represented graphically with a body, for example a human or a cartoon animal, are also called embodied agents, although they have only virtual, not physical, embodiment.[171]
embodied cognitive science
An interdisciplinary field of research, the aim of which is to explain the mechanisms underlying intelligent behavior. It comprises three main methodologies: 1) the modeling of psychological and biological systems in a holistic manner that considers the mind and body as a single entity, 2) the formation of a common set of general principles of intelligent behavior, and 3) the experimental use of robotic agents in controlled environments.
error-driven learning
A sub-area of machine learning concerned with how an agent ought to take actions in an environment so as to minimize some error feedback. It is a type of reinforcement learning.
ensemble learning
The use of multiple machine learning algorithms to obtain better predictive performance than could be obtained from any of the constituent learning algorithms alone.[172][173][174]
epoch
In machine learning, particularly in the creation of artificial neural networks, an epoch is training the model for one cycle through the full training dataset. Small models are typically trained for as many epochs as it takes to reach the best performance on the validation dataset. The largest models may train for only one epoch.
ethics of artificial intelligence
The part of the ethics of technology specific to artificial intelligence.
evolutionary algorithm (EA)
A subset of evolutionary computation,[175] a generic population-based metaheuristic optimization algorithm. An EA uses mechanisms inspired by biological evolution, such as reproduction, mutation, recombination, and selection. Candidate solutions to the optimization problem play the role of individuals in a population, and the fitness function determines the quality of the solutions (see also loss function). Evolution of the population then takes place after the repeated application of the above operators.
evolutionary computation
A family of algorithms for global optimization inspired by biological evolution, and the subfield of artificial intelligence and soft computing studying these algorithms. In technical terms, they are a family of population-based trial and error problem solvers with a metaheuristic or stochastic optimization character.
evolving classification function (ECF)
Evolving classification functions are used for classifying and clustering in the field of machine learning and artificial intelligence, typically employed for data stream mining tasks in dynamic and changing environments.
existential risk
The hypothesis that substantial progress in artificial general intelligence (AGI) could someday result in human extinction or some other unrecoverable global catastrophe.[176][177][178]
expert system
A computer system that emulates the decision-making ability of a human expert.[179] Expert systems are designed to solve complex problems by reasoning through bodies of knowledge, represented mainly as if–then rules rather than through conventional procedural code.[180]
fast-and-frugal trees
A type of classification tree. Fast-and-frugal trees can be used as decision-making tools which operate as lexicographic classifiers, and, if required, associate an action (decision) to each class or category.[181]
feature
An individual measurable property or characteristic of a phenomenon.[182] In computer vision and image processing, a feature is a piece of information about the content of an image; typically about whether a certain region of the image has certain properties. Features may be specific structures in an image (such as points, edges, or objects), or the result of a general neighborhood operation or feature detection applied to the image.
feature extraction
In machine learning, pattern recognition, and image processing, feature extraction starts from an initial set of measured data and builds derived values (features) intended to be informative and non-redundant, facilitating the subsequent learning and generalization steps, and in some cases leading to better human interpretations.
feature learning
In machine learning, feature learning or representation learning[143] is a set of techniques that allows a system to automatically discover the representations needed for feature detection or classification from raw data. This replaces manual feature engineering and allows a machine to both learn the features and use them to perform a specific task.
feature selection
In machine learning and statistics, feature selection, also known as variable selection, attribute selection or variable subset selection, is the process of selecting a subset of relevant features (variables, predictors) for use in model construction.
federated learning
A type of machine learning that allows for training on multiple devices with decentralized data, thus helping preserve the privacy of individual users and their data.
first-order logic

Also first-order predicate calculus or predicate logic.

A collection of formal systems used in mathematics, philosophy, linguistics, and computer science. First-order logic uses quantified variables over non-logical objects and allows the use of sentences that contain variables, so that rather than propositions such as Socrates is a man one can have expressions in the form "there exists X such that X is Socrates and X is a man" and there exists is a quantifier while X is a variable.[183] This distinguishes it from propositional logic, which does not use quantifiers or relations.[184]
fluent
A condition that can change over time. In logical approaches to reasoning about actions, fluents can be represented in first-order logic by predicates having an argument that depends on time.
formal language
A set of words whose letters are taken from an alphabet and are well-formed according to a specific set of rules.
forward chaining

Also forward reasoning.

One of the two main methods of reasoning when using an inference engine and can be described logically as repeated application of modus ponens. Forward chaining is a popular implementation strategy for expert systems, businesses and production rule systems. The opposite of forward chaining is backward chaining. Forward chaining starts with the available data and uses inference rules to extract more data (from an end user, for example) until a goal is reached. An inference engine using forward chaining searches the inference rules until it finds one where the antecedent (If clause) is known to be true. When such a rule is found, the engine can conclude, or infer, the consequent (Then clause), resulting in the addition of new information to its data.[185]
frame
An artificial intelligence data structure used to divide knowledge into substructures by representing "stereotyped situations". Frames are the primary data structure used in artificial intelligence frame language.
frame language
A technology used for knowledge representation in artificial intelligence. Frames are stored as ontologies of sets and subsets of the frame concepts. They are similar to class hierarchies in object-oriented languages although their fundamental design goals are different. Frames are focused on explicit and intuitive representation of knowledge whereas objects focus on encapsulation and information hiding. Frames originated in AI research and objects primarily in software engineering. However, in practice the techniques and capabilities of frame and object-oriented languages overlap significantly.
frame problem
The problem of finding adequate collections of axioms for a viable description of a robot environment.[186]
friendly artificial intelligence

Also friendly AI or FAI.

A hypothetical artificial general intelligence (AGI) that would have a positive effect on humanity. It is a part of the ethics of artificial intelligence and is closely related to machine ethics. While machine ethics is concerned with how an artificially intelligent agent should behave, friendly artificial intelligence research is focused on how to practically bring about this behaviour and ensuring it is adequately constrained.
futures studies
The study of postulating possible, probable, and preferable futures and the worldviews and myths that underlie them.[187]
fuzzy control system
A control system based on fuzzy logic—a mathematical system that analyzes analog input values in terms of logical variables that take on continuous values between 0 and 1, in contrast to classical or digital logic, which operates on discrete values of either 1 or 0 (true or false, respectively).[188][189]
fuzzy logic
A simple form for the many-valued logic, in which the truth values of variables may have any degree of "Truthfulness" that can be represented by any real number in the range between 0 (as in Completely False) and 1 (as in Completely True) inclusive. Consequently, It is employed to handle the concept of partial truth, where the truth value may range between completely true and completely false. In contrast to Boolean logic, where the truth values of variables may have the integer values 0 or 1 only.
fuzzy rule
A rule used within fuzzy logic systems to infer an output based on input variables.
fuzzy set
In classical set theory, the membership of elements in a set is assessed in binary terms according to a bivalent condition — an element either belongs or does not belong to the set. By contrast, fuzzy set theory permits the gradual assessment of the membership of elements in a set; this is described with the aid of a membership function valued in the real unit interval [0, 1]. Fuzzy sets generalize classical sets, since the indicator functions (aka characteristic functions) of classical sets are special cases of the membership functions of fuzzy sets, if the latter only take values 0 or 1.[190] In fuzzy set theory, classical bivalent sets are usually called crisp sets. The fuzzy set theory can be used in a wide range of domains in which information is incomplete or imprecise, such as bioinformatics.[191]
game theory
The study of mathematical models of strategic interaction between rational decision-makers.[192]
general game playing (GGP)
General game playing is the design of artificial intelligence programs to be able to run and play more than one game successfully.[193][194][195]
generative adversarial network (GAN)
A class of machine learning systems. Two neural networks contest with each other in a zero-sum game framework.
generative artificial intelligence
Generative artificial intelligence is artificial intelligence capable of generating text, images, or other media in response to prompts.[196][197] Generative AI models learn the patterns and structure of their input training data and then generate new data that has similar characteristics, typically using Transformer-based deep neural networks.[198][199]
genetic algorithm (GA)
A metaheuristic inspired by the process of natural selection that belongs to the larger class of evolutionary algorithms (EA). Genetic algorithms are commonly used to generate high-quality solutions to optimization and search problems by relying on bio-inspired operators such as mutation, crossover and selection.[200]
genetic operator
An operator used in genetic algorithms to guide the algorithm towards a solution to a given problem. There are three main types of operators (mutation, crossover and selection), which must work in conjunction with one another in order for the algorithm to be successful.
generative pretrained transformer (GPT)
A large language model based on the transformer architecture that generates text. It is first pretrained to predict the next token in texts (a token is typically a word, subword, or punctuation). After their pretraining, GPT models can generate human-like text by repeatedly predicting the token that they would expect to follow. GPT models are usually also fine-tuned, for example with reinforcement learning from human feedback to reduce hallucination or harmful behaviour, or to format the output in a conversationnal format.[201]
glowworm swarm optimization
A swarm intelligence optimization algorithm based on the behaviour of glowworms (also known as fireflies or lightning bugs).
gradient boosting
A machine learning technique based on boosting in a functional space, where the target is pseudo-residuals instead of residuals as in traditional boosting.
graph (abstract data type)
In computer science, a graph is an abstract data type that is meant to implement the undirected graph and directed graph concepts from mathematics; specifically, the field of graph theory.
graph (discrete mathematics)
In mathematics, and more specifically in graph theory, a graph is a structure amounting to a set of objects in which some pairs of the objects are in some sense "related". The objects correspond to mathematical abstractions called vertices (also called nodes or points) and each of the related pairs of vertices is called an edge (also called an arc or line).[202]
graph database (GDB)
A database that uses graph structures for semantic queries with nodes, edges, and properties to represent and store data. A key concept of the system is the graph (or edge or relationship), which directly relates data items in the store a collection of nodes of data and edges representing the relationships between the nodes. The relationships allow data in the store to be linked together directly, and in many cases retrieved with one operation. Graph databases hold the relationships between data as a priority. Querying relationships within a graph database is fast because they are perpetually stored within the database itself. Relationships can be intuitively visualized using graph databases, making it useful for heavily inter-connected data.[203][204]
graph theory
The study of graphs, which are mathematical structures used to model pairwise relations between objects.
graph traversal

Also graph search.

The process of visiting (checking and/or updating) each vertex in a graph. Such traversals are classified by the order in which the vertices are visited. Tree traversal is a special case of graph traversal.
hallucination
A response generated by AI that contains false or misleading information presented as fact.
heuristic
A technique designed for solving a problem more quickly when classic methods are too slow, or for finding an approximate solution when classic methods fail to find any exact solution. This is achieved by trading optimality, completeness, accuracy, or precision for speed. In a way, it can be considered a shortcut. A heuristic function, also called simply a heuristic, is a function that ranks alternatives in search algorithms at each branching step based on available information to decide which branch to follow. For example, it may approximate the exact solution.[205]
hidden layer
A layer of neurons in an artificial neural network that is neither an input layer nor an output layer.
hyper-heuristic
A heuristic search method that seeks to automate the process of selecting, combining, generating, or adapting several simpler heuristics (or components of such heuristics) to efficiently solve computational search problems, often by the incorporation of machine learning techniques. One of the motivations for studying hyper-heuristics is to build systems which can handle classes of problems rather than solving just one problem.[206][207][208]
hyperparameter
A parameter that can be set in order to define any configurable part of a machine learning model's learning process.
hyperparameter optimization
The process of choosing a set of optimal hyperparameters for a learning algorithm.
hyperplane
A decision boundary in machine learning classifiers that partitions the input space into two or more sections, with each section corresponding to a unique class label.
IEEE Computational Intelligence Society
A professional society of the Institute of Electrical and Electronics Engineers (IEEE) focussing on "the theory, design, application, and development of biologically and linguistically motivated computational paradigms emphasizing neural networks, connectionist systems, genetic algorithms, evolutionary programming, fuzzy systems, and hybrid intelligent systems in which these paradigms are contained".[209]
incremental learning
A method of machine learning, in which input data is continuously used to extend the existing model's knowledge i.e. to further train the model. It represents a dynamic technique of supervised learning and unsupervised learning that can be applied when training data becomes available gradually over time or its size is out of system memory limits. Algorithms that can facilitate incremental learning are known as incremental machine learning algorithms.
inference engine
A component of the system that applies logical rules to the knowledge base to deduce new information.
information integration (II)
The merging of information from heterogeneous sources with differing conceptual, contextual and typographical representations. It is used in data mining and consolidation of data from unstructured or semi-structured resources. Typically, information integration refers to textual representations of knowledge but is sometimes applied to rich-media content. Information fusion, which is a related term, involves the combination of information into a new set of information towards reducing redundancy and uncertainty.[132]
Information Processing Language (IPL)
A programming language that includes features intended to help with programs that perform simple problem solving actions such as lists, dynamic memory allocation, data types, recursion, functions as arguments, generators, and cooperative multitasking. IPL invented the concept of list processing, albeit in an assembly-language style.
intelligence amplification (IA)

Also cognitive augmentation, machine augmented intelligence, and enhanced intelligence.

The effective use of information technology in augmenting human intelligence.
intelligence explosion
A possible outcome of humanity building artificial general intelligence (AGI). AGI would be capable of recursive self-improvement leading to rapid emergence of ASI (artificial superintelligence), the limits of which are unknown, at the time of the technological singularity.
intelligent agent (IA)
An autonomous entity which acts, directing its activity towards achieving goals (i.e. it is an agent), upon an environment using observation through sensors and consequent actuators (i.e. it is intelligent). Intelligent agents may also learn or use knowledge to achieve their goals. They may be very simple or very complex.
intelligent control
A class of control techniques that use various artificial intelligence computing approaches like neural networks, Bayesian probability, fuzzy logic, machine learning, reinforcement learning, evolutionary computation and genetic algorithms.[210]
intelligent personal assistant

Also virtual assistant or personal digital assistant.

A software agent that can perform tasks or services for an individual based on verbal commands. Sometimes the term "chatbot" is used to refer to virtual assistants generally or specifically accessed by online chat (or in some cases online chat programs that are exclusively for entertainment purposes). Some virtual assistants are able to interpret human speech and respond via synthesized voices. Users can ask their assistants questions, control home automation devices and media playback via voice, and manage other basic tasks such as email, to-do lists, and calendars with verbal commands.[211]
interpretation
An assignment of meaning to the symbols of a formal language. Many formal languages used in mathematics, logic, and theoretical computer science are defined in solely syntactic terms, and as such do not have any meaning until they are given some interpretation. The general study of interpretations of formal languages is called formal semantics.
intrinsic motivation
An intelligent agent is intrinsically motivated to act if the information content alone, of the experience resulting from the action, is the motivating factor. Information content in this context is measured in the information theory sense as quantifying uncertainty. A typical intrinsic motivation is to search for unusual (surprising) situations, in contrast to a typical extrinsic motivation such as the search for food. Intrinsically motivated artificial agents display behaviours akin to exploration and curiosity.[212]
issue tree

Also logic tree.

A graphical breakdown of a question that dissects it into its different components vertically and that progresses into details as it reads to the right.[213]: 47  Issue trees are useful in problem solving to identify the root causes of a problem as well as to identify its potential solutions. They also provide a reference point to see how each piece fits into the whole picture of a problem.[214]
junction tree algorithm

Also Clique Tree.

A method used in machine learning to extract marginalization in general graphs. In essence, it entails performing belief propagation on a modified graph called a junction tree. The graph is called a tree because it branches into different sections of data; nodes of variables are the branches.[215]
kernel method
In machine learning, kernel methods are a class of algorithms for pattern analysis, whose best known member is the support vector machine (SVM). The general task of pattern analysis is to find and study general types of relations (e.g., cluster analysis, rankings, principal components, correlations, classifications) in datasets.
KL-ONE
A well-known knowledge representation system in the tradition of semantic networks and frames; that is, it is a frame language. The system is an attempt to overcome semantic indistinctness in semantic network representations and to explicitly represent conceptual information as a structured inheritance network.[216][217][218]
k-nearest neighbors
A non-parametric supervised learning method first developed by Evelyn Fix and Joseph Hodges in 1951,[219] and later expanded by Thomas Cover.[215] It is used for classification and regression.
knowledge acquisition
The process used to define the rules and ontologies required for a knowledge-based system. The phrase was first used in conjunction with expert systems to describe the initial tasks associated with developing an expert system, namely finding and interviewing domain experts and capturing their knowledge via rules, objects, and frame-based ontologies.
knowledge-based system (KBS)
A computer program that reasons and uses a knowledge base to solve complex problems. The term is broad and refers to many different kinds of systems. The one common theme that unites all knowledge based systems is an attempt to represent knowledge explicitly and a reasoning system that allows it to derive new knowledge. Thus, a knowledge-based system has two distinguishing features: a knowledge base and an inference engine.
knowledge distillation
The process of transferring knowledge from a large machine learning model to a smaller one.
knowledge engineering (KE)
All technical, scientific, and social aspects involved in building, maintaining, and using knowledge-based systems.
knowledge extraction
The creation of knowledge from structured (relational databases, XML) and unstructured (text, documents, images) sources. The resulting knowledge needs to be in a machine-readable and machine-interpretable format and must represent knowledge in a manner that facilitates inferencing. Although it is methodically similar to information extraction and ETL, the main criterion is that the extraction result goes beyond the creation of structured information or the transformation into a relational schema. It requires either the reuse of existing formal knowledge (reusing identifiers or ontologies) or the generation of a schema based on the source data.
knowledge Interchange Format (KIF)
A computer language designed to enable systems to share and re-use information from knowledge-based systems. KIF is similar to frame languages such as KL-ONE and LOOM but unlike such language its primary role is not intended as a framework for the expression or use of knowledge but rather for the interchange of knowledge between systems. The designers of KIF likened it to PostScript. PostScript was not designed primarily as a language to store and manipulate documents but rather as an interchange format for systems and devices to share documents. In the same way KIF is meant to facilitate sharing of knowledge across different systems that use different languages, formalisms, platforms, etc.
knowledge representation and reasoning (KR² or KR&R)
The field of artificial intelligence dedicated to representing information about the world in a form that a computer system can utilize to solve complex tasks such as diagnosing a medical condition or having a dialog in a natural language. Knowledge representation incorporates findings from psychology[220] about how humans solve problems and represent knowledge in order to design formalisms that will make complex systems easier to design and build. Knowledge representation and reasoning also incorporates findings from logic to automate various kinds of reasoning, such as the application of rules or the relations of sets and subsets.[221] Examples of knowledge representation formalisms include semantic nets, systems architecture, frames, rules, and ontologies. Examples of automated reasoning engines include inference engines, theorem provers, and classifiers.
k-means clustering
A method of vector quantization, originally from signal processing, that aims to partition n observations into k clusters in which each observation belongs to the cluster with the nearest mean (cluster centers or cluster centroid), serving as a prototype of the cluster.
language model
A probabilistic model that manipulates natural language.
large language model (LLM)
A language model with a large number of parameters (typically at least a billion) that are adjusted during training. Due to its size, it requires a lot of data and computing capability to train. Large language models are usually based on the transformer architecture.[222]
lazy learning
In machine learning, lazy learning is a learning method in which generalization of the training data is, in theory, delayed until a query is made to the system, as opposed to in eager learning, where the system tries to generalize the training data before receiving queries.
Lisp (programming language) (LISP)
A family of programming languages with a long history and a distinctive, fully parenthesized prefix notation.[223]
logic programming
A type of programming paradigm which is largely based on formal logic. Any program written in a logic programming language is a set of sentences in logical form, expressing facts and rules about some problem domain. Major logic programming language families include Prolog, answer set programming (ASP), and Datalog.
long short-term memory (LSTM)
An artificial recurrent neural network architecture[224] used in the field of deep learning. Unlike standard feedforward neural networks, LSTM has feedback connections that make it a "general purpose computer" (that is, it can compute anything that a Turing machine can).[225] It can not only process single data points (such as images), but also entire sequences of data (such as speech or video).
machine vision (MV)
The technology and methods used to provide imaging-based automatic inspection and analysis for such applications as automatic inspection, process control, and robot guidance, usually in industry. Machine vision is a term encompassing a large number of technologies, software and hardware products, integrated systems, actions, methods and expertise. Machine vision as a systems engineering discipline can be considered distinct from computer vision, a form of computer science. It attempts to integrate existing technologies in new ways and apply them to solve real world problems. The term is the prevalent one for these functions in industrial automation environments but is also used for these functions in other environments such as security and vehicle guidance.
Markov chain
A stochastic model describing a sequence of possible events in which the probability of each event depends only on the state attained in the previous event.[226][227]
Markov decision process (MDP)
A discrete time stochastic control process. It provides a mathematical framework for modeling decision making in situations where outcomes are partly random and partly under the control of a decision maker. MDPs are useful for studying optimization problems solved via dynamic programming and reinforcement learning.
mathematical optimization

Also mathematical programming.

In mathematics, computer science, and operations research, the selection of a best element (with regard to some criterion) from some set of available alternatives.[228]
machine learning (ML)
The scientific study of algorithms and statistical models that computer systems use in order to perform a specific task effectively without using explicit instructions, relying on patterns and inference instead.
machine listening

Also computer audition (CA).

A general field of study of algorithms and systems for audio understanding by machine.[229][230]
machine perception
The capability of a computer system to interpret data in a manner that is similar to the way humans use their senses to relate to the world around them.[231][232][233]
mechanism design
A field in economics and game theory that takes an engineering approach to designing economic mechanisms or incentives, toward desired objectives, in strategic settings, where players act rationally. Because it starts at the end of the game, then goes backwards, it is also called reverse game theory. It has broad applications, from economics and politics (markets, auctions, voting procedures) to networked-systems (internet interdomain routing, sponsored search auctions).
mechatronics

Also mechatronic engineering.

A multidisciplinary branch of engineering that focuses on the engineering of both electrical and mechanical systems, and also includes a combination of robotics, electronics, computer, telecommunications, systems, control, and product engineering.[234][235]
metabolic network reconstruction and simulation
Allows for an in-depth insight into the molecular mechanisms of a particular organism. In particular, these models correlate the genome with molecular physiology.[236]
metaheuristic
In computer science and mathematical optimization, a metaheuristic is a higher-level procedure or heuristic designed to find, generate, or select a heuristic (partial search algorithm) that may provide a sufficiently good solution to an optimization problem, especially with incomplete or imperfect information or limited computation capacity.[237][238] Metaheuristics sample a set of solutions which is too large to be completely sampled.
model checking
In computer science, model checking or property checking is, for a given model of a system, exhaustively and automatically checking whether this model meets a given specification. Typically, one has hardware or software systems in mind, whereas the specification contains safety requirements such as the absence of deadlocks and similar critical states that can cause the system to crash. Model checking is a technique for automatically verifying correctness properties of finite-state systems.
modus ponens
In propositional logic, modus ponens is a rule of inference.[239] It can be summarized as "P implies Q and P is asserted to be true, therefore Q must be true."
modus tollens
In propositional logic, modus tollens is a valid argument form and a rule of inference. It is an application of the general truth that if a statement is true, then so is its contrapositive. The inference rule modus tollens asserts that the inference from P implies Q to the negation of Q implies the negation of P is valid.
In computer science, Monte Carlo tree search (MCTS) is a heuristic search algorithm for some kinds of decision processes.
multi-agent system (MAS)

Also self-organized system.

A computerized system composed of multiple interacting intelligent agents. Multi-agent systems can solve problems that are difficult or impossible for an individual agent or a monolithic system to solve. Intelligence may include methodic, functional, procedural approaches, algorithmic search or reinforcement learning.
multi-swarm optimization
A variant of particle swarm optimization (PSO) based on the use of multiple sub-swarms instead of one (standard) swarm. The general approach in multi-swarm optimization is that each sub-swarm focuses on a specific region while a specific diversification method decides where and when to launch the sub-swarms. The multi-swarm framework is especially fitted for the optimization on multi-modal problems, where multiple (local) optima exist.
mutation
A genetic operator used to maintain genetic diversity from one generation of a population of genetic algorithm chromosomes to the next. It is analogous to biological mutation. Mutation alters one or more gene values in a chromosome from its initial state. In mutation, the solution may change entirely from the previous solution. Hence GA can come to a better solution by using mutation. Mutation occurs during evolution according to a user-definable mutation probability. This probability should be set low. If it is set too high, the search will turn into a primitive random search.
Mycin
An early backward chaining expert system that used artificial intelligence to identify bacteria causing severe infections, such as bacteremia and meningitis, and to recommend antibiotics, with the dosage adjusted for patient's body weight – the name derived from the antibiotics themselves, as many antibiotics have the suffix "-mycin". The MYCIN system was also used for the diagnosis of blood clotting diseases.
naive Bayes classifier
In machine learning, naive Bayes classifiers are a family of simple probabilistic classifiers based on applying Bayes' theorem with strong (naive) independence assumptions between the features.
naive semantics
An approach used in computer science for representing basic knowledge about a specific domain, and has been used in applications such as the representation of the meaning of natural language sentences in artificial intelligence applications. In a general setting the term has been used to refer to the use of a limited store of generally understood knowledge about a specific domain in the world, and has been applied to fields such as the knowledge based design of data schemas.[240]
name binding
In programming languages, name binding is the association of entities (data and/or code) with identifiers.[241] An identifier bound to an object is said to reference that object. Machine languages have no built-in notion of identifiers, but name-object bindings as a service and notation for the programmer is implemented by programming languages. Binding is intimately connected with scoping, as scope determines which names bind to which objects – at which locations in the program code (lexically) and in which one of the possible execution paths (temporally). Use of an identifier id in a context that establishes a binding for id is called a binding (or defining) occurrence. In all other occurrences (e.g., in expressions, assignments, and subprogram calls), an identifier stands for what it is bound to; such occurrences are called applied occurrences.
named-entity recognition (NER)

Also entity identification, entity chunking, and entity extraction.

A subtask of information extraction that seeks to locate and classify named entity mentions in unstructured text into pre-defined categories such as the person names, organizations, locations, medical codes, time expressions, quantities, monetary values, percentages, etc.
named graph
A key concept of Semantic Web architecture in which a set of Resource Description Framework statements (a graph) are identified using a URI,[242] allowing descriptions to be made of that set of statements such as context, provenance information or other such metadata. Named graphs are a simple extension of the RDF data model[243] through which graphs can be created but the model lacks an effective means of distinguishing between them once published on the Web at large.
natural language generation (NLG)
A software process that transforms structured data into plain-English content. It can be used to produce long-form content for organizations to automate custom reports, as well as produce custom content for a web or mobile application. It can also be used to generate short blurbs of text in interactive conversations (a chatbot) which might even be read out loud by a text-to-speech system.
natural language processing (NLP)
A subfield of computer science, information engineering, and artificial intelligence concerned with the interactions between computers and human (natural) languages, in particular how to program computers to process and analyze large amounts of natural language data.
natural language programming
An ontology-assisted way of programming in terms of natural-language sentences, e.g. English.[244]
network motif
All networks, including biological networks, social networks, technological networks (e.g., computer networks and electrical circuits) and more, can be represented as graphs, which include a wide variety of subgraphs. One important local property of networks are so-called network motifs, which are defined as recurrent and statistically significant sub-graphs or patterns.
neural machine translation (NMT)
An approach to machine translation that uses a large artificial neural network to predict the likelihood of a sequence of words, typically modeling entire sentences in a single integrated model.
neural network

Also artificial neural network.

A neural network can refer to either a neural circuit of biological neurons (sometimes also called a biological neural network), or a network of artificial neurons or nodes in the case of an artificial neural network.[245] Artificial neural networks are used for solving artificial intelligence (AI) problems; they model connections of biological neurons as weights between nodes. A positive weight reflects an excitatory connection, while negative values mean inhibitory connections. All inputs are modified by a weight and summed. This activity is referred to as a linear combination. Finally, an activation function controls the amplitude of the output. For example, an acceptable range of output is usually between 0 and 1, or it could be −1 and 1.
neural Turing machine (NTM)
A recurrent neural network model. NTMs combine the fuzzy pattern matching capabilities of neural networks with the algorithmic power of programmable computers. An NTM has a neural network controller coupled to external memory resources, which it interacts with through attentional mechanisms. The memory interactions are differentiable end-to-end, making it possible to optimize them using gradient descent.[246] An NTM with a long short-term memory (LSTM) network controller can infer simple algorithms such as copying, sorting, and associative recall from examples alone.[247]
neuro-fuzzy
Combinations of artificial neural networks and fuzzy logic.
neurocybernetics

Also brain–computer interface (BCI), neural-control interface (NCI), mind-machine interface (MMI), direct neural interface (DNI), or brain–machine interface (BMI).

A direct communication pathway between an enhanced or wired brain and an external device. BCI differs from neuromodulation in that it allows for bidirectional information flow. BCIs are often directed at researching, mapping, assisting, augmenting, or repairing human cognitive or sensory-motor functions.[248]
neuromorphic engineering

Also neuromorphic computing.

A concept describing the use of very-large-scale integration (VLSI) systems containing electronic analog circuits to mimic neuro-biological architectures present in the nervous system.[249] In recent times, the term neuromorphic has been used to describe analog, digital, mixed-mode analog/digital VLSI, and software systems that implement models of neural systems (for perception, motor control, or multisensory integration). The implementation of neuromorphic computing on the hardware level can be realized by oxide-based memristors,[250] spintronic memories,[251] threshold switches, and transistors.[252][253][254][255]
node
A basic unit of a data structure, such as a linked list or tree data structure. Nodes contain data and also may link to other nodes. Links between nodes are often implemented by pointers.
nondeterministic algorithm
An algorithm that, even for the same input, can exhibit different behaviors on different runs, as opposed to a deterministic algorithm.
nouvelle AI
Nouvelle AI differs from classical AI by aiming to produce robots with intelligence levels similar to insects. Researchers believe that intelligence can emerge organically from simple behaviors as these intelligences interacted with the "real world", instead of using the constructed worlds which symbolic AIs typically needed to have programmed into them.[256]
NP
In computational complexity theory, NP (nondeterministic polynomial time) is a complexity class used to classify decision problems. NP is the set of decision problems for which the problem instances, where the answer is "yes", have proofs verifiable in polynomial time.[257][Note 1]
NP-completeness
In computational complexity theory, a problem is NP-complete when it can be solved by a restricted class of brute force search algorithms and it can be used to simulate any other problem with a similar algorithm. More precisely, each input to the problem should be associated with a set of solutions of polynomial length, whose validity can be tested quickly (in polynomial time[258]), such that the output for any input is "yes" if the solution set is non-empty and "no" if it is empty.
NP-hardness

Also non-deterministic polynomial-time hardness.

In computational complexity theory, the defining property of a class of problems that are, informally, "at least as hard as the hardest problems in NP". A simple example of an NP-hard problem is the subset sum problem.
Occam's razor

Also Ockham's razor or Ocham's razor.

The problem-solving principle that states that when presented with competing hypotheses that make the same predictions, one should select the solution with the fewest assumptions;[259] the principle is not meant to filter out hypotheses that make different predictions. The idea is attributed to the English Franciscan friar William of Ockham (c. 1287–1347), a scholastic philosopher and theologian.
offline learning
A machine learning training approach in which a model is trained on a fixed dataset that is not updated during the learning process.
online machine learning
A method of machine learning in which data becomes available in a sequential order and is used to update the best predictor for future data at each step, as opposed to batch learning techniques which generate the best predictor by learning on the entire training data set at once. Online learning is a common technique used in areas of machine learning where it is computationally infeasible to train over the entire dataset, requiring the need of out-of-core algorithms. It is also used in situations where it is necessary for the algorithm to dynamically adapt to new patterns in the data, or when the data itself is generated as a function of time.
ontology learning

Also ontology extraction, ontology generation, or ontology acquisition.

The automatic or semi-automatic creation of ontologies, including extracting the corresponding domain's terms and the relationships between the concepts that these terms represent from a corpus of natural language text, and encoding them with an ontology language for easy retrieval.
OpenAI
The for-profit corporation OpenAI LP, whose parent organization is the non-profit organization OpenAI Inc[260] that conducts research in the field of artificial intelligence (AI) with the stated aim to promote and develop friendly AI in such a way as to benefit humanity as a whole.
OpenCog
A project that aims to build an open-source artificial intelligence framework. OpenCog Prime is an architecture for robot and virtual embodied cognition that defines a set of interacting components designed to give rise to human-equivalent artificial general intelligence (AGI) as an emergent phenomenon of the whole system.[261]
Open Mind Common Sense
An artificial intelligence project based at the Massachusetts Institute of Technology (MIT) Media Lab whose goal is to build and utilize a large commonsense knowledge base from the contributions of many thousands of people across the Web.
open-source software (OSS)
A type of computer software in which source code is released under an license in which the copyright holder grants users the rights to study, change, and distribute the software to anyone and for any purpose.[262] Open-source software may be developed in an collaborative public manner. Open-source software is a prominent example of open collaboration.[263]
overfitting
"The production of an analysis that corresponds too closely or exactly to a particular set of data, and may therefore fail to fit to additional data or predict future observations reliably".[264] In other words, an overfitted model memorizes training data details but cannot generalize to new data. Conversely, an underfitted model is too simple to capture the complexity of the training data.
partial order reduction
A technique for reducing the size of the state-space to be searched by a model checking or automated planning and scheduling algorithm. It exploits the commutativity of concurrently executed transitions, which result in the same state when executed in different orders.
partially observable Markov decision process (POMDP)
A generalization of a Markov decision process (MDP). A POMDP models an agent decision process in which it is assumed that the system dynamics are determined by an MDP, but the agent cannot directly observe the underlying state. Instead, it must maintain a probability distribution over the set of possible states, based on a set of observations and observation probabilities, and the underlying MDP.
particle swarm optimization (PSO)
A computational method that optimizes a problem by iteratively trying to improve a candidate solution with regard to a given measure of quality. It solves a problem by having a population of candidate solutions, here dubbed particles, and moving these particles around in the search-space according to simple mathematical formulae over the particle's position and velocity. Each particle's movement is influenced by its local best known position, but is also guided toward the best known positions in the search-space, which are updated as better positions are found by other particles. This is expected to move the swarm toward the best solutions.
pathfinding

Also pathing.

The plotting, by a computer application, of the shortest route between two points. It is a more practical variant on solving mazes. This field of research is based heavily on Dijkstra's algorithm for finding a shortest path on a weighted graph.
pattern recognition
Concerned with the automatic discovery of regularities in data through the use of computer algorithms and with the use of these regularities to take actions such as classifying the data into different categories.[265]
perceptron
An algorithm for supervised learning of binary classifiers.
predicate logic

Also first-order logic, predicate logic, and first-order predicate calculus.

A collection of formal systems used in mathematics, philosophy, linguistics, and computer science. First-order logic uses quantified variables over non-logical objects and allows the use of sentences that contain variables, so that rather than propositions such as Socrates is a man one can have expressions in the form "there exists x such that x is Socrates and x is a man" and there exists is a quantifier while x is a variable.[183] This distinguishes it from propositional logic, which does not use quantifiers or relations;[266] in this sense, propositional logic is the foundation of first-order logic.
predictive analytics
A variety of statistical techniques from data mining, predictive modelling, and machine learning, that analyze current and historical facts to make predictions about future or otherwise unknown events.[267][268]
principal component analysis (PCA)
A statistical procedure that uses an orthogonal transformation to convert a set of observations of possibly correlated variables (entities each of which takes on various numerical values) into a set of values of linearly uncorrelated variables called principal components. This transformation is defined in such a way that the first principal component has the largest possible variance (that is, accounts for as much of the variability in the data as possible), and each succeeding component, in turn, has the highest variance possible under the constraint that it is orthogonal to the preceding components. The resulting vectors (each being a linear combination of the variables and containing n observations) are an uncorrelated orthogonal basis set. PCA is sensitive to the relative scaling of the original variables.
principle of rationality

Also rationality principle.

A principle coined by Karl R. Popper in his Harvard Lecture of 1963, and published in his book Myth of Framework.[269] It is related to what he called the 'logic of the situation' in an Economica article of 1944/1945, published later in his book The Poverty of Historicism.[270] According to Popper's rationality principle, agents act in the most adequate way according to the objective situation. It is an idealized conception of human behavior which he used to drive his model of situational logic.
probabilistic programming (PP)
A programming paradigm in which probabilistic models are specified and inference for these models is performed automatically.[271] It represents an attempt to unify probabilistic modeling and traditional general-purpose programming in order to make the former easier and more widely applicable.[272][273] It can be used to create systems that help make decisions in the face of uncertainty. Programming languages used for probabilistic programming are referred to as "Probabilistic programming languages" (PPLs).
production system
A computer program typically used to provide some form of AI, which consists primarily of a set of rules about behavior, but also includes the mechanism necessary to follow those rules as the system responds to states of the world.
programming language
A formal language, which comprises a set of instructions that produce various kinds of output. Programming languages are used in computer programming to implement algorithms.
Prolog
A logic programming language associated with artificial intelligence and computational linguistics.[274][275][276] Prolog has its roots in first-order logic, a formal logic, and unlike many other programming languages, Prolog is intended primarily as a declarative programming language: the program logic is expressed in terms of relations, represented as facts and rules. A computation is initiated by running a query over these relations.[277]
propositional calculus

Also propositional logic, statement logic, sentential calculus, sentential logic, and zeroth-order logic.

A branch of logic which deals with propositions (which can be true or false) and argument flow. Compound propositions are formed by connecting propositions by logical connectives. The propositions without logical connectives are called atomic propositions. Unlike first-order logic, propositional logic does not deal with non-logical objects, predicates about them, or quantifiers. However, all the machinery of propositional logic is included in first-order logic and higher-order logics. In this sense, propositional logic is the foundation of first-order logic and higher-order logic.
proximal policy optimization (PPO)
A reinforcement learning algorithm for training an intelligent agent's decision function to accomplish difficult tasks.
Python
An interpreted, high-level, general-purpose programming language created by Guido van Rossum and first released in 1991. Python's design philosophy emphasizes code readability with its notable use of significant whitespace. Its language constructs and object-oriented approach aim to help programmers write clear, logical code for small and large-scale projects.[278]
PyTorch
A machine learning library based on the Torch library,[279][280][281] used for applications such as computer vision and natural language processing,[282] originally developed by Meta AI and now part of the Linux Foundation umbrella.[283][284][285][286]
Q-learning
A model-free reinforcement learning algorithm for learning the value of an action in a particular state.
qualification problem
In philosophy and artificial intelligence (especially knowledge-based systems), the qualification problem is concerned with the impossibility of listing all of the preconditions required for a real-world action to have its intended effect.[287][288] It might be posed as how to deal with the things that prevent me from achieving my intended result. It is strongly connected to, and opposite the ramification side of, the frame problem.[287]
quantifier
In logic, quantification specifies the quantity of specimens in the domain of discourse that satisfy an open formula. The two most common quantifiers mean "for all" and "there exists". For example, in arithmetic, quantifiers allow one to say that the natural numbers go on forever, by writing that for all n (where n is a natural number), there is another number (say, the successor of n) which is one bigger than n.
quantum computing
The use of quantum-mechanical phenomena such as superposition and entanglement to perform computation. A quantum computer is used to perform such computation, which can be implemented theoretically or physically.[289]: I-5 
query language
Query languages or data query languages (DQLs) are computer languages used to make queries in databases and information systems. Broadly, query languages can be classified according to whether they are database query languages or information retrieval query languages. The difference is that a database query language attempts to give factual answers to factual questions, while an information retrieval query language attempts to find documents containing information that is relevant to an area of inquiry.
R programming language
A programming language and free software environment for statistical computing and graphics supported by the R Foundation for Statistical Computing.[290] The R language is widely used among statisticians and data miners for developing statistical software[291] and data analysis.[292]
radial basis function network
In the field of mathematical modeling, a radial basis function network is an artificial neural network that uses radial basis functions as activation functions. The output of the network is a linear combination of radial basis functions of the inputs and neuron parameters. Radial basis function networks have many uses, including function approximation, time series prediction, classification, and system control. They were first formulated in a 1988 paper by Broomhead and Lowe, both researchers at the Royal Signals and Radar Establishment.[293][294][295]
random forest

Also random decision forest.

An ensemble learning method for classification, regression, and other tasks that operates by constructing a multitude of decision trees at training time and outputting the class that is the mode of the classes (classification) or mean prediction (regression) of the individual trees.[296][297] Random decision forests correct for decision trees' habit of overfitting to their training set.[298]
reasoning system
In information technology a reasoning system is a software system that generates conclusions from available knowledge using logical techniques such as deduction and induction. Reasoning systems play an important role in the implementation of artificial intelligence and knowledge-based systems.
recurrent neural network (RNN)
A class of artificial neural networks where connections between nodes form a directed graph along a temporal sequence. This allows it to exhibit temporal dynamic behavior. Unlike feedforward neural networks, RNNs can use their internal state (memory) to process sequences of inputs. This makes them applicable to tasks such as unsegmented, connected handwriting recognition[299] or speech recognition.[300][301]
regression analysis
A set of statistical processes for estimating the relationships between a dependent variable (often called the outcome or response variable, or label in machine learning) and one or more error-free independent variables (often called regressors, predictors, covariates, explanatory variables, or features). The most common form of regression analysis is linear regression, in which one finds the line (or a more complex linear combination) that most closely fits the data according to a specific mathematical criterion.
regularization
A set of techniques such as dropout, early stopping, and L1 and L2 regularization to reduce overfitting and underfitting when training a learning algorithm.
reinforcement learning (RL)
An area of machine learning concerned with how software agents ought to take actions in an environment so as to maximize some notion of cumulative reward. Reinforcement learning is one of three basic machine learning paradigms, alongside supervised learning and unsupervised learning. It differs from supervised learning in that labelled input/output pairs need not be presented, and sub-optimal actions need not be explicitly corrected. Instead the focus is finding a balance between exploration (of uncharted territory) and exploitation (of current knowledge).[302]
reinforcement learning from human feedback (RLHF)
A technique that involve training a "reward model" to predict how humans rate the quality of generated content, and then training a generative AI model to satisfy this reward model via reinforcement learning. It can be used for example to make the generative AI model more truthful or less harmful.[303]
reservoir computing
A framework for computation that may be viewed as an extension of neural networks.[304] Typically an input signal is fed into a fixed (random) dynamical system called a reservoir and the dynamics of the reservoir map the input to a higher dimension. Then a simple readout mechanism is trained to read the state of the reservoir and map it to the desired output. The main benefit is that training is performed only at the readout stage and the reservoir is fixed. Liquid-state machines[305] and echo state networks[306] are two major types of reservoir computing.[307]
Resource Description Framework (RDF)
A family of World Wide Web Consortium (W3C) specifications[308] originally designed as a metadata data model. It has come to be used as a general method for conceptual description or modeling of information that is implemented in web resources, using a variety of syntax notations and data serialization formats. It is also used in knowledge management applications.
restricted Boltzmann machine (RBM)
A generative stochastic artificial neural network that can learn a probability distribution over its set of inputs.
Rete algorithm
A pattern matching algorithm for implementing rule-based systems. The algorithm was developed to efficiently apply many rules or patterns to many objects, or facts, in a knowledge base. It is used to determine which of the system's rules should fire based on its data store, its facts.
robotics
An interdisciplinary branch of science and engineering that includes mechanical engineering, electronic engineering, information engineering, computer science, and others. Robotics deals with the design, construction, operation, and use of robots, as well as computer systems for their control, sensory feedback, and information processing.
rule-based system
In computer science, a rule-based system is used to store and manipulate knowledge to interpret information in a useful way. It is often used in artificial intelligence applications and research. Normally, the term rule-based system is applied to systems involving human-crafted or curated rule sets. Rule-based systems constructed using automatic rule inference, such as rule-based machine learning, are normally excluded from this system type.
satisfiability
In mathematical logic, satisfiability and validity are elementary concepts of semantics. A formula is satisfiable if it is possible to find an interpretation (model) that makes the formula true.[309] A formula is valid if all interpretations make the formula true. The opposites of these concepts are unsatisfiability and invalidity, that is, a formula is unsatisfiable if none of the interpretations make the formula true, and invalid if some such interpretation makes the formula false. These four concepts are related to each other in a manner exactly analogous to Aristotle's square of opposition.
search algorithm
Any algorithm which solves the search problem, namely, to retrieve information stored within some data structure, or calculated in the search space of a problem domain, either with discrete or continuous values.
selection
The stage of a genetic algorithm in which individual genomes are chosen from a population for later breeding (using the crossover operator).
self-management
The process by which computer systems manage their own operation without human intervention.
semantic network

Also frame network.

A knowledge base that represents semantic relations between concepts in a network. This is often used as a form of knowledge representation. It is a directed or undirected graph consisting of vertices, which represent concepts, and edges, which represent semantic relations between concepts,[310] mapping or connecting semantic fields.
semantic reasoner

Also reasoning engine, rules engine, or simply reasoner.

A piece of software able to infer logical consequences from a set of asserted facts or axioms. The notion of a semantic reasoner generalizes that of an inference engine, by providing a richer set of mechanisms to work with. The inference rules are commonly specified by means of an ontology language, and often a description logic language. Many reasoners use first-order predicate logic to perform reasoning; inference commonly proceeds by forward chaining and backward chaining.
semantic query
Allows for queries and analytics of associative and contextual nature. Semantic queries enable the retrieval of both explicitly and implicitly derived information based on syntactic, semantic and structural information contained in data. They are designed to deliver precise results (possibly the distinctive selection of one single piece of information) or to answer more fuzzy and wide-open questions through pattern matching and digital reasoning.
semantics
In programming language theory, semantics is the field concerned with the rigorous mathematical study of the meaning of programming languages. It does so by evaluating the meaning of syntactically valid strings defined by a specific programming language, showing the computation involved. In such a case that the evaluation would be of syntactically invalid strings, the result would be non-computation. Semantics describes the processes a computer follows when executing a program in that specific language. This can be shown by describing the relationship between the input and output of a program, or an explanation of how the program will be executed on a certain platform, hence creating a model of computation.
sensor fusion
The combining of sensory data or data derived from disparate sources such that the resulting information has less uncertainty than would be possible when these sources were used individually.
separation logic
An extension of Hoare logic, a way of reasoning about programs. The assertion language of separation logic is a special case of the logic of bunched implications (BI).[311]
similarity learning
An area of supervised machine learning in artificial intelligence. It is closely related to classification and regression, but the goal is to learn from a similarity function that measures how similar or related two objects are. It has applications in ranking, in recommendation systems, visual identity tracking, face verification, and speaker verification.
simulated annealing (SA)
A probabilistic technique for approximating the global optimum of a given function. Specifically, it is a metaheuristic to approximate global optimization in a large search space for an optimization problem.
situated approach
In artificial intelligence research, the situated approach builds agents that are designed to behave effectively successfully in their environment. This requires designing AI "from the bottom-up" by focussing on the basic perceptual and motor skills required to survive. The situated approach gives a much lower priority to abstract reasoning or problem-solving skills.
situation calculus
A logic formalism designed for representing and reasoning about dynamical domains.
Selective Linear Definite clause resolution

Also simply SLD resolution.

The basic inference rule used in logic programming. It is a refinement of resolution, which is both sound and refutation complete for Horn clauses.
software
A collection of data or computer instructions that tell the computer how to work. This is in contrast to physical hardware, from which the system is built and actually performs the work. In computer science and software engineering, computer software is all information processed by computer systems, programs and data. Computer software includes computer programs, libraries and related non-executable data, such as online documentation or digital media.
software engineering
The application of engineering to the development of software in a systematic method.[312][313][314]
spatial-temporal reasoning
An area of artificial intelligence which draws from the fields of computer science, cognitive science, and cognitive psychology. The theoretic goal—on the cognitive side—involves representing and reasoning spatial-temporal knowledge in mind. The applied goal—on the computing side—involves developing high-level control systems of automata for navigating and understanding time and space.
SPARQL
An RDF query language—that is, a semantic query language for databases—able to retrieve and manipulate data stored in Resource Description Framework (RDF) format.[315][316]
sparse dictionary learning

Also sparse coding or SDL.

A feature learning method aimed at finding a sparse representation of the input data in the form of a linear combination of basic elements as well as those basic elements themselves.
speech recognition
An interdisciplinary subfield of computational linguistics that develops methodologies and technologies that enables the recognition and translation of spoken language into text by computers. It is also known as automatic speech recognition (ASR), computer speech recognition or speech to text (STT). It incorporates knowledge and research in the linguistics, computer science, and electrical engineering fields.
spiking neural network (SNN)
An artificial neural network that more closely mimics a natural neural network.[317] In addition to neuronal and synaptic state, SNNs incorporate the concept of time into their Operating Model.
state
In information technology and computer science, a program is described as stateful if it is designed to remember preceding events or user interactions;[318] the remembered information is called the state of the system.
statistical classification
In machine learning and statistics, classification is the problem of identifying to which of a set of categories (sub-populations) a new observation belongs, on the basis of a training set of data containing observations (or instances) whose category membership is known. Examples are assigning a given email to the "spam" or "non-spam" class, and assigning a diagnosis to a given patient based on observed characteristics of the patient (sex, blood pressure, presence or absence of certain symptoms, etc.). Classification is an example of pattern recognition.
state–action–reward–state–action (SARSA)
A reinforcement learning algorithm for learning a Markov decision process policy.
statistical relational learning (SRL)
A subdiscipline of artificial intelligence and machine learning that is concerned with domain models that exhibit both uncertainty (which can be dealt with using statistical methods) and complex, relational structure.[319][320] Note that SRL is sometimes called Relational Machine Learning (RML) in the literature. Typically, the knowledge representation formalisms developed in SRL use (a subset of) first-order logic to describe relational properties of a domain in a general manner (universal quantification) and draw upon probabilistic graphical models (such as Bayesian networks or Markov networks) to model the uncertainty; some also build upon the methods of inductive logic programming.
stochastic optimization (SO)
Any optimization method that generates and uses random variables. For stochastic problems, the random variables appear in the formulation of the optimization problem itself, which involves random objective functions or random constraints. Stochastic optimization methods also include methods with random iterates. Some stochastic optimization methods use random iterates to solve stochastic problems, combining both meanings of stochastic optimization.[321] Stochastic optimization methods generalize deterministic methods for deterministic problems.
stochastic semantic analysis
An approach used in computer science as a semantic component of natural language understanding. Stochastic models generally use the definition of segments of words as basic semantic units for the semantic models, and in some cases involve a two layered approach.[322]
Stanford Research Institute Problem Solver (STRIPS)
An automated planner developed by Richard Fikes and Nils Nilsson in 1971 at SRI International.
subject-matter expert (SME)
A person who has accumulated great knowledge in a particular field or topic, demonstrated by the person's degree, licensure, and/or through years of professional experience with the subject.
superintelligence
A hypothetical agent that possesses intelligence far surpassing that of the brightest and most gifted human minds. Superintelligence may also refer to a property of problem-solving systems (e.g., superintelligent language translators or engineering assistants) whether or not these high-level intellectual competencies are embodied in agents that act within the physical world. A superintelligence may or may not be created by an intelligence explosion and be associated with a technological singularity.
supervised learning
The machine learning task of learning a function that maps an input to an output based on example input-output pairs.[323] It infers a function from labeled training data consisting of a set of training examples.[324] In supervised learning, each example is a pair consisting of an input object (typically a vector) and a desired output value (also called the supervisory signal). A supervised learning algorithm analyzes the training data and produces an inferred function, which can be used for mapping new examples. An optimal scenario will allow for the algorithm to correctly determine the class labels for unseen instances. This requires the learning algorithm to generalize from the training data to unseen situations in a "reasonable" way (see inductive bias).
support vector machines
In machine learning, support vector machines (SVMs, also support vector networks[325]) are supervised learning models with associated learning algorithms that analyze data used for classification and regression.
swarm intelligence (SI)
The collective behavior of decentralized, self-organized systems, either natural or artificial. The expression was introduced in the context of cellular robotic systems.[326]
symbolic artificial intelligence
The term for the collection of all methods in artificial intelligence research that are based on high-level "symbolic" (human-readable) representations of problems, logic, and search.
synthetic intelligence (SI)
An alternative term for artificial intelligence which emphasizes that the intelligence of machines need not be an imitation or in any way artificial; it can be a genuine form of intelligence.[327][328]
systems neuroscience
A subdiscipline of neuroscience and systems biology that studies the structure and function of neural circuits and systems. It is an umbrella term, encompassing a number of areas of study concerned with how nerve cells behave when connected together to form neural pathways, neural circuits, and larger brain networks.
technological singularity

Also simply the singularity.

A hypothetical point in the future when technological growth becomes uncontrollable and irreversible, resulting in unfathomable changes to human civilization.[329][330][331]
temporal difference learning
A class of model-free reinforcement learning methods which learn by bootstrapping from the current estimate of the value function. These methods sample from the environment, like Monte Carlo methods, and perform updates based on current estimates, like dynamic programming methods.[332]
tensor network theory
A theory of brain function (particularly that of the cerebellum) that provides a mathematical model of the transformation of sensory space-time coordinates into motor coordinates and vice versa by cerebellar neuronal networks. The theory was developed as a geometrization of brain function (especially of the central nervous system) using tensors.[333][334]
TensorFlow
A free and open-source software library for dataflow and differentiable programming across a range of tasks. It is a symbolic math library, and is also used for machine learning applications such as neural networks.[335]
theoretical computer science (TCS)
A subset of general computer science and mathematics that focuses on more mathematical topics of computing and includes the theory of computation.
theory of computation
In theoretical computer science and mathematics, the theory of computation is the branch that deals with how efficiently problems can be solved on a model of computation, using an algorithm. The field is divided into three major branches: automata theory and languages, computability theory, and computational complexity theory, which are linked by the question: "What are the fundamental capabilities and limitations of computers?".[336]
Thompson sampling
A heuristic for choosing actions that addresses the exploration-exploitation dilemma in the multi-armed bandit problem. It consists in choosing the action that maximizes the expected reward with respect to a randomly drawn belief.[337][338]
time complexity
The computational complexity that describes the amount of time it takes to run an algorithm. Time complexity is commonly estimated by counting the number of elementary operations performed by the algorithm, supposing that each elementary operation takes a fixed amount of time to perform. Thus, the amount of time taken and the number of elementary operations performed by the algorithm are taken to differ by at most a constant factor.
transformer
A type of deep learning architecture that exploits a multi-head attention mechanism. Transformers address some of the limitations of long short-term memory, and became widely used in natural language processing, although it can also process other types of data such as images in the case of vision transformers.[339]
transhumanism

Abbreviated H+ or h+.

An international philosophical movement that advocates for the transformation of the human condition by developing and making widely available sophisticated technologies to greatly enhance human intellect and physiology.[340][341]
transition system
In theoretical computer science, a transition system is a concept used in the study of computation. It is used to describe the potential behavior of discrete systems. It consists of states and transitions between states, which may be labeled with labels chosen from a set; the same label may appear on more than one transition. If the label set is a singleton, the system is essentially unlabeled, and a simpler definition that omits the labels is possible.
tree traversal

Also tree search.

A form of graph traversal and refers to the process of visiting (checking and/or updating) each node in a tree data structure, exactly once. Such traversals are classified by the order in which the nodes are visited.
true quantified Boolean formula
In computational complexity theory, the language TQBF is a formal language consisting of the true quantified Boolean formulas. A (fully) quantified Boolean formula is a formula in quantified propositional logic where every variable is quantified (or bound), using either existential or universal quantifiers, at the beginning of the sentence. Such a formula is equivalent to either true or false (since there are no free variables). If such a formula evaluates to true, then that formula is in the language TQBF. It is also known as QSAT (Quantified SAT).
Turing machine
A mathematical model of computation describing an abstract machine[342] that manipulates symbols on a strip of tape according to a table of rules.[343] Despite the model's simplicity, it is capable of implementing any algorithm.[344]
Turing test
A test of a machine's ability to exhibit intelligent behaviour equivalent to, or indistinguishable from, that of a human, developed by Alan Turing in 1950. Turing proposed that a human evaluator would judge natural language conversations between a human and a machine designed to generate human-like responses. The evaluator would be aware that one of the two partners in conversation is a machine, and all participants would be separated from one another. The conversation would be limited to a text-only channel such as a computer keyboard and screen so the result would not depend on the machine's ability to render words as speech.[345] If the evaluator cannot reliably tell the machine from the human, the machine is said to have passed the test. The test results do not depend on the machine's ability to give correct answers to questions, only how closely its answers resemble those a human would give.
type system
In programming languages, a set of rules that assigns a property called type to the various constructs of a computer program, such as variables, expressions, functions, or modules.[346] These types formalize and enforce the otherwise implicit categories the programmer uses for algebraic data types, data structures, or other components (e.g. "string", "array of float", "function returning boolean"). The main purpose of a type system is to reduce possibilities for bugs in computer programs[347] by defining interfaces between different parts of a computer program, and then checking that the parts have been connected in a consistent way. This checking can happen statically (at compile time), dynamically (at run time), or as a combination of static and dynamic checking. Type systems have other purposes as well, such as expressing business rules, enabling certain compiler optimizations, allowing for multiple dispatch, providing a form of documentation, etc.
unsupervised learning
A type of self-organized Hebbian learning that helps find previously unknown patterns in data set without pre-existing labels. It is also known as self-organization and allows modeling probability densities of given inputs.[348] It is one of the main three categories of machine learning, along with supervised and reinforcement learning. Semi-supervised learning has also been described and is a hybridization of supervised and unsupervised techniques.
vision processing unit (VPU)
A type of microprocessor designed to accelerate machine vision tasks.[349][350]
Value-alignment complete
Analogous to an AI-complete problem, a value-alignment complete problem is a problem where the AI control problem needs to be fully solved to solve it.[citation needed]
Watson
A question-answering computer system capable of answering questions posed in natural language,[351] developed in IBM's DeepQA project by a research team led by principal investigator David Ferrucci.[352] Watson was named after IBM's first CEO, industrialist Thomas J. Watson.[353][354]
weak AI

Also narrow AI.

Artificial intelligence that is focused on one narrow task.[355][356][357]
word embedding
A representation of a word in natural language processing. Typically, the representation is a real-valued vector that encodes the meaning of the word in such a way that words that are closer in the vector space are expected to be similar in meaning.[358]
XGBoost
Short for eXtreme Gradient Boosting, XGBoost[359] is an open-source software library which provides a regularizing gradient boosting framework for multiple programming languages.

References

edit
  1. ^ a b For example: Josephson, John R.; Josephson, Susan G., eds. (1994). Abductive Inference: Computation, Philosophy, Technology. Cambridge, UK; New York: Cambridge University Press. doi:10.1017/CBO9780511530128. ISBN 978-0521434614. OCLC 28149683.
  2. ^ "Retroduction". Commens – Digital Companion to C. S. Peirce. Mats Bergman, Sami Paavola & João Queiroz. Archived from the original on 5 July 2022. Retrieved 24 August 2014.
  3. ^ Sheikholeslami, Sina (2019). Ablation Programming for Machine Learning.
  4. ^ Colburn, Timothy; Shute, Gary (5 June 2007). "Abstraction in Computer Science". Minds and Machines. 17 (2): 169–184. doi:10.1007/s11023-007-9061-7. ISSN 0924-6495. S2CID 5927969.
  5. ^ Kramer, Jeff (1 April 2007). "Is abstraction the key to computing?". Communications of the ACM. 50 (4): 36–42. CiteSeerX 10.1.1.120.6776. doi:10.1145/1232743.1232745. ISSN 0001-0782. S2CID 12481509.
  6. ^ Michael Gelfond, Vladimir Lifschitz (1998) "Action Languages", Linköping Electronic Articles in Computer and Information Science, vol 3, nr 16.
  7. ^ Jang, Jyh-Shing R (1991). Fuzzy Modeling Using Generalized Neural Networks and Kalman Filter Algorithm (PDF). Proceedings of the 9th National Conference on Artificial Intelligence, Anaheim, CA, USA, July 14–19. Vol. 2. pp. 762–767.
  8. ^ Jang, J.-S.R. (1993). "ANFIS: adaptive-network-based fuzzy inference system". IEEE Transactions on Systems, Man, and Cybernetics. 23 (3): 665–685. doi:10.1109/21.256541. S2CID 14345934.
  9. ^ Abraham, A. (2005), "Adaptation of Fuzzy Inference System Using Neural Learning", in Nedjah, Nadia; De Macedo Mourelle, Luiza (eds.), Fuzzy Systems Engineering: Theory and Practice, Studies in Fuzziness and Soft Computing, vol. 181, Germany: Springer Verlag, pp. 53–83, CiteSeerX 10.1.1.161.6135, doi:10.1007/11339366_3, ISBN 978-3-540-25322-8
  10. ^ Jang, Sun, Mizutani (1997) – Neuro-Fuzzy and Soft Computing – Prentice Hall, pp 335–368, ISBN 0-13-261066-3
  11. ^ Tahmasebi, P. (2012). "A hybrid neural networks-fuzzy logic-genetic algorithm for grade estimation". Computers & Geosciences. 42: 18–27. Bibcode:2012CG.....42...18T. doi:10.1016/j.cageo.2012.02.004. PMC 4268588. PMID 25540468.
  12. ^ Tahmasebi, P. (2010). "Comparison of optimized neural network with fuzzy logic for ore grade estimation". Australian Journal of Basic and Applied Sciences. 4: 764–772.
  13. ^ Russell, S.J.; Norvig, P. (2002). Artificial Intelligence: A Modern Approach. Prentice Hall. ISBN 978-0-13-790395-5.
  14. ^ Tao, Jianhua; Tieniu Tan (2005). "Affective Computing: A Review". Affective Computing and Intelligent Interaction. Vol. LNCS 3784. Springer. pp. 981–995. doi:10.1007/11573548.
  15. ^ El Kaliouby, Rana (November–December 2017). "We Need Computers with Empathy". Technology Review. Vol. 120, no. 6. p. 8. Archived from the original on 7 July 2018. Retrieved 6 November 2018.
  16. ^ Comparison of Agent Architectures Archived August 27, 2008, at the Wayback Machine
  17. ^ "Intel unveils Movidius Compute Stick USB AI Accelerator". 21 July 2017. Archived from the original on 11 August 2017. Retrieved 28 November 2018.
  18. ^ "Inspurs unveils GX4 AI Accelerator". 21 June 2017.
  19. ^ Shapiro, Stuart C. (1992). Artificial Intelligence In Stuart C. Shapiro (Ed.), Encyclopedia of Artificial Intelligence (Second Edition, pp. 54–57). New York: John Wiley. (Section 4 is on "AI-Complete Tasks".)
  20. ^ Solomonoff, R., "A Preliminary Report on a General Theory of Inductive Inference", Report V-131, Zator Co., Cambridge, Ma. (Nov. 1960 revision of the Feb. 4, 1960 report).
  21. ^ "Artificial intelligence: Google's AlphaGo beats Go master Lee Se-dol". BBC News. 12 March 2016. Retrieved 17 March 2016.
  22. ^ "AlphaGo | DeepMind". DeepMind.
  23. ^ "Research Blog: AlphaGo: Mastering the ancient game of Go with Machine Learning". Google Research Blog. 27 January 2016.
  24. ^ "Google achieves AI 'breakthrough' by beating Go champion". BBC News. 27 January 2016.
  25. ^ See Dung (1995)
  26. ^ See Besnard and Hunter (2001)
  27. ^ see Bench-Capon (2002)
  28. ^ Definition of AI as the study of intelligent agents:
  29. ^ Russell & Norvig 2009, p. 2.
  30. ^ Hardesty, Larry (14 April 2017). "Explained: Neural networks". MIT News Office. Retrieved 2 June 2022.
  31. ^ Yang, Z.R.; Yang, Z. (2014). Comprehensive Biomedical Physics. Karolinska Institute, Stockholm, Sweden: Elsevier. p. 1. ISBN 978-0-444-53633-4. Archived from the original on 28 July 2022. Retrieved 28 July 2022.
  32. ^ "AAAI Corporate Bylaws".
  33. ^ Cipresso, Pietro; Giglioli, Irene Alice Chicchi; Raya, iz; Riva, Giuseppe (7 December 2011). "The Past, Present, and Future of Virtual and Augmented Reality Research: A Network and Cluster Analysis of the Literature". Frontiers in Psychology. 9: 2086. doi:10.3389/fpsyg.2018.02086. PMC 6232426. PMID 30459681.
  34. ^ Ghallab, Malik; Nau, Dana S.; Traverso, Paolo (2004), Automated Planning: Theory and Practice, Morgan Kaufmann, ISBN 978-1-55860-856-6
  35. ^ Kephart, J.O.; Chess, D.M. (2003), "The vision of autonomic computing", Computer, 36: 41–52, CiteSeerX 10.1.1.70.613, doi:10.1109/MC.2003.1160055
  36. ^ Gehrig, Stefan K.; Stein, Fridtjof J. (1999). Dead reckoning and cartography using stereo vision for an automated car. IEEE/RSJ International Conference on Intelligent Robots and Systems. Vol. 3. Kyongju. pp. 1507–1512. doi:10.1109/IROS.1999.811692. ISBN 0-7803-5184-3.
  37. ^ "Self-driving Uber car kills Arizona woman crossing street". Reuters. 20 March 2018.
  38. ^ Thrun, Sebastian (2010). "Toward Robotic Cars". Communications of the ACM. 53 (4): 99–106. doi:10.1145/1721654.1721679. S2CID 207177792.
  39. ^ "Information Engineering Main/Home Page". University of Oxford. Archived from the original on 3 July 2022. Retrieved 3 October 2018.
  40. ^ Goodfellow, Ian; Bengio, Yoshua; Courville, Aaron (2016) Deep Learning. MIT Press. p. 196. ISBN 9780262035613
  41. ^ Nielsen, Michael A. (2015). "Chapter 6". Neural Networks and Deep Learning. Archived from the original on 8 August 2022. Retrieved 5 July 2022.
  42. ^ "Deep Networks: Overview - Ufldl". ufldl.stanford.edu. Archived from the original on 16 March 2022. Retrieved 4 August 2017.
  43. ^ Mozer, M. C. (1995). "A Focused Backpropagation Algorithm for Temporal Pattern Recognition". In Chauvin, Y.; Rumelhart, D. (eds.). Backpropagation: Theory, architectures, and applications. Hillsdale, NJ: Lawrence Erlbaum Associates. pp. 137–169. Retrieved 21 August 2017.
  44. ^ Robinson, A. J. & Fallside, F. (1987). The utility driven dynamic error propagation network (Technical report). Cambridge University, Engineering Department. CUED/F-INFENG/TR.1.
  45. ^ Werbos, Paul J. (1988). "Generalization of backpropagation with application to a recurrent gas market model". Neural Networks. 1 (4): 339–356. doi:10.1016/0893-6080(88)90007-x.
  46. ^ Feigenbaum, Edward (1988). The Rise of the Expert Company. Times Books. p. 317. ISBN 978-0-8129-1731-4.
  47. ^ Sivic, Josef (April 2009). "Efficient visual search of videos cast as text retrieval" (PDF). IEEE Transactions on Pattern Analysis and Machine Intelligence. 31 (4): 591–605. CiteSeerX 10.1.1.174.6841. doi:10.1109/TPAMI.2008.111. PMID 19229077. S2CID 9899337. Archived from the original (PDF) on 31 March 2022. Retrieved 5 July 2022.
  48. ^ McTear et al 2016, p. 167.
  49. ^ "Understanding the backward pass through Batch Normalization Layer". kratzert.github.io. Retrieved 24 April 2018.
  50. ^ Ioffe, Sergey; Szegedy, Christian (2015). "Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift". arXiv:1502.03167 [cs.LG].
  51. ^ "Glossary of Deep Learning: Batch Normalisation". medium.com. 27 June 2017. Retrieved 24 April 2018.
  52. ^ Pham DT, Ghanbarzadeh A, Koc E, Otri S, Rahim S and Zaidi M. The Bees Algorithm. Technical Note, Manufacturing Engineering Centre, Cardiff University, UK, 2005.
  53. ^ Pham, D.T., Castellani, M. (2009), The Bees Algorithm – Modelling Foraging Behaviour to Solve Continuous Optimisation Problems Archived 9 November 2016 at the Wayback Machine. Proc. ImechE, Part C, 223(12), 2919-2938.
  54. ^ Pham, D. T.; Castellani, M. (2014). "Benchmarking and comparison of nature-inspired population-based continuous optimisation algorithms". Soft Computing. 18 (5): 871–903. doi:10.1007/s00500-013-1104-9. S2CID 35138140.
  55. ^ Pham, Duc Truong; Castellani, Marco (2015). "A comparative study of the Bees Algorithm as a tool for function optimisation". Cogent Engineering. 2. doi:10.1080/23311916.2015.1091540.
  56. ^ Nasrinpour, H. R.; Massah Bavani, A.; Teshnehlab, M. (2017). "Grouped Bees Algorithm: A Grouped Version of the Bees Algorithm". Computers. 6 (1): 5. doi:10.3390/computers6010005.
  57. ^ Cao, Longbing (2010). "In-depth Behavior Understanding and Use: the Behavior Informatics Approach". Information Science. 180 (17): 3067–3085. arXiv:2007.15516. doi:10.1016/j.ins.2010.03.025. S2CID 7400761.
  58. ^ Colledanchise Michele, and Ögren Petter 2016. How Behavior Trees Modularize Hybrid Control Systems and Generalize Sequential Behavior Compositions, the Subsumption Architecture, and Decision Trees. In IEEE Transactions on Robotics vol.PP, no.99, pp.1-18 (2016)
  59. ^ Colledanchise, Michele; Ögren, Petter (2018). Behavior Trees in Robotics and AI. arXiv:1709.00084. doi:10.1201/9780429489105. ISBN 9780429950902. S2CID 27470659.
  60. ^ Breur, Tom (July 2016). "Statistical Power Analysis and the contemporary "crisis" in social sciences". Journal of Marketing Analytics. 4 (2–3): 61–65. doi:10.1057/s41270-016-0001-3. ISSN 2050-3318.
  61. ^ Bachmann, Paul (1894). Analytische Zahlentheorie [Analytic Number Theory] (in German). Vol. 2. Leipzig: Teubner.
  62. ^ Landau, Edmund (1909). Handbuch der Lehre von der Verteilung der Primzahlen [Handbook on the theory of the distribution of the primes] (in German). Leipzig: B. G. Teubner. p. 883.
  63. ^ John, Taylor (2009). Garnier, Rowan (ed.). Discrete Mathematics: Proofs, Structures and Applications, Third Edition. CRC Press. p. 620. ISBN 978-1-4398-1280-8.
  64. ^ Skiena, Steven S (2009). The Algorithm Design Manual. Springer Science & Business Media. p. 77. ISBN 978-1-84800-070-4.
  65. ^ Erman, L. D.; Hayes-Roth, F.; Lesser, V. R.; Reddy, D. R. (1980). "The Hearsay-II Speech-Understanding System: Integrating Knowledge to Resolve Uncertainty". ACM Computing Surveys. 12 (2): 213. doi:10.1145/356810.356816. S2CID 118556.
  66. ^ Corkill, Daniel D. (September 1991). "Blackboard Systems" (PDF). AI Expert. 6 (9): 40–47. Archived from the original (PDF) on 16 April 2012. Retrieved 5 July 2022.
  67. ^ * Nii, H. Yenny (1986). Blackboard Systems (PDF) (Technical report). Department of Computer Science, Stanford University. STAN-CS-86-1123. Retrieved 12 April 2013.
  68. ^ Hayes-Roth, B. (1985). "A blackboard architecture for control". Artificial Intelligence. 26 (3): 251–321. doi:10.1016/0004-3702(85)90063-3.
  69. ^ Hinton, Geoffrey E. (24 May 2007). "Boltzmann machine". Scholarpedia. 2 (5): 1668. Bibcode:2007SchpJ...2.1668H. doi:10.4249/scholarpedia.1668. ISSN 1941-6016.
  70. ^ NZZ- Die Zangengeburt eines möglichen Stammvaters. Website Neue Zürcher Zeitung. Seen 16. August 2013.
  71. ^ Official Homepage Roboy Archived 2013-08-03 at the Wayback Machine. Website Roboy. Seen 16. August 2013.
  72. ^ Official Homepage Starmind. Website Starmind. Seen 16. August 2013.
  73. ^ Sabour, Sara; Frosst, Nicholas; Hinton, Geoffrey E. (26 October 2017). "Dynamic Routing Between Capsules". arXiv:1710.09829 [cs.CV].
  74. ^ "What is a chatbot?". techtarget.com. Retrieved 30 January 2017.
  75. ^ Civera, Javier; Ciocarlie, Matei; Aydemir, Alper; Bekris, Kostas; Sarma, Sanjay (2015). "Guest Editorial Special Issue on Cloud Robotics and Automation". IEEE Transactions on Automation Science and Engineering. 12 (2): 396–397. doi:10.1109/TASE.2015.2409511. S2CID 16080778.
  76. ^ "Robo Earth - Tech News". Robo Earth.
  77. ^ Goldberg, Ken. "Cloud Robotics and Automation".
  78. ^ Li, R. "Cloud Robotics-Enable cloud computing for robots". Retrieved 7 December 2014.
  79. ^ Fisher, Douglas (1987). "Knowledge acquisition via incremental conceptual clustering". Machine Learning. 2 (2): 139–172. doi:10.1007/BF00114265.
  80. ^ Fisher, Douglas H. (July 1987). "Improving inference through conceptual clustering". Proceedings of the 1987 AAAI Conferences. AAAI Conference. Seattle Washington. pp. 461–465.
  81. ^ Iba, William; Langley, Pat (27 January 2011). "Cobweb models of categorization and probabilistic concept formation". In Pothos, Emmanuel M.; Wills, Andy J. (eds.). Formal approaches in categorization. Cambridge: Cambridge University Press. pp. 253–273. ISBN 9780521190480.
  82. ^ Refer to the ICT website: https://cogarch.ict.usc.edu/
  83. ^ "Hewlett Packard Labs". Archived from the original on 30 October 2016. Retrieved 5 July 2022.
  84. ^ Terdiman, Daniel (2014) .IBM's TrueNorth processor mimics the human brain.https://cnet.com/news/ibms-truenorth-processor-mimics-the-human-brain/
  85. ^ Knight, Shawn (2011). IBM unveils cognitive computing chips that mimic human brain TechSpot: August 18, 2011, 12:00 PM
  86. ^ Hamill, Jasper (2013). Cognitive computing: IBM unveils software for its brain-like SyNAPSE chips The Register: August 8, 2013
  87. ^ Denning., P.J. (2014). "Surfing Toward the Future". Communications of the ACM. 57 (3): 26–29. doi:10.1145/2566967. S2CID 20681733.
  88. ^ Ludwig, Lars (2013). Extended Artificial Memory: Toward an integral cognitive theory of memory and technology (pdf) (Thesis). Technical University of Kaiserslautern. Retrieved 7 February 2017.
  89. ^ "Research at HP Labs". Archived from the original on 7 March 2022. Retrieved 5 July 2022.
  90. ^ Cognitive science is an interdisciplinary field of researchers from Linguistics, psychology, neuroscience, philosophy, computer science, and anthropology that seek to understand the mind. How We Learn: Ask the Cognitive Scientist
  91. ^ Schrijver, Alexander (February 1, 2006). A Course in Combinatorial Optimization (PDF), page 1.
  92. ^ HAYKIN, S. Neural Networks - A Comprehensive Foundation. Second edition. Pearson Prentice Hall: 1999.
  93. ^ "PROGRAMS WITH COMMON SENSE". www-formal.stanford.edu. Retrieved 11 April 2018.
  94. ^ Davis, Ernest; Marcus, Gary (2015). "Commonsense reasoning". Communications of the ACM. Vol. 58, no. 9. pp. 92–103. doi:10.1145/2701413.
  95. ^ Hulstijn, J, and Nijholt, A. (eds.). Proceedings of the International Workshop on Computational Humor. Number 12 in Twente Workshops on Language Technology, Enschede, Netherlands. University of Twente, 1996.
  96. ^ "ACL - Association for Computational Learning".
  97. ^ Trappenberg, Thomas P. (2002). Fundamentals of Computational Neuroscience. United States: Oxford University Press Inc. p. 1. ISBN 978-0-19-851582-1.
  98. ^ What is computational neuroscience? Patricia S. Churchland, Christof Koch, Terrence J. Sejnowski. in Computational Neuroscience pp.46-55. Edited by Eric L. Schwartz. 1993. MIT Press "Computational Neuroscience Edited by Eric L. Schwartz". Archived from the original on 4 June 2011. Retrieved 11 June 2009.
  99. ^ "Theoretical Neuroscience". The MIT Press. Archived from the original on 31 May 2018. Retrieved 24 May 2018.
  100. ^ Gerstner, W.; Kistler, W.; Naud, R.; Paninski, L. (2014). Neuronal Dynamics. Cambridge, UK: Cambridge University Press. ISBN 9781107447615.
  101. ^ Kamentsky, L.A.; Liu, C.-N. (1963). "Computer-Automated Design of Multifont Print Recognition Logic". IBM Journal of Research and Development. 7 (1): 2. doi:10.1147/rd.71.0002. Archived from the original on 3 March 2016. Retrieved 5 July 2022.
  102. ^ Brncick, M (2000). "Computer automated design and computer automated manufacture". Phys Med Rehabil Clin N Am. 11 (3): 701–13. doi:10.1016/s1047-9651(18)30806-4. PMID 10989487.
  103. ^ Li, Y.; et al. (2004). "CAutoCSD - Evolutionary search and optimisation enabled computer automated control system design". International Journal of Automation and Computing. 1 (1): 76–88. doi:10.1007/s11633-004-0076-8. S2CID 55417415.
  104. ^ Kramer, GJE; Grierson, DE (1989). "Computer automated design of structures under dynamic loads". Computers & Structures. 32 (2): 313–325. doi:10.1016/0045-7949(89)90043-6.
  105. ^ Moharrami, H; Grierson, DE (1993). "Computer-Automated Design of Reinforced Concrete Frameworks". Journal of Structural Engineering. 119 (7): 2036–2058. doi:10.1061/(asce)0733-9445(1993)119:7(2036).
  106. ^ Xu, L; Grierson, DE (1993). "Computer-Automated Design of Semirigid Steel Frameworks". Journal of Structural Engineering. 119 (6): 1740–1760. doi:10.1061/(asce)0733-9445(1993)119:6(1740).
  107. ^ Barsan, GM; Dinsoreanu, M, (1997). Computer-automated design based on structural performance criteria, Mouchel Centenary Conference on Innovation in Civil and Structural Engineering, Aug 19-21, Cambridge England, Innovation in Civil and Structural Engineering, 167-172
  108. ^ Li, Yun (1996). "Genetic algorithm automated approach to the design of sliding mode control systems". International Journal of Control. 63 (4): 721–739. doi:10.1080/00207179608921865.
  109. ^ Li, Yun; Chwee Kim, Ng; Chen Kay, Tan (1995). "Automation of Linear and Nonlinear Control Systems Design by Evolutionary Computation" (PDF). IFAC Proceedings Volumes. 28 (16): 85–90. doi:10.1016/S1474-6670(17)45158-5.
  110. ^ Barsan, GM, (1995) Computer-automated design of semirigid steel frameworks according to EUROCODE-3, Nordic Steel Construction Conference 95, JUN 19-21, 787-794
  111. ^ Gray, Gary J.; Murray-Smith, David J.; Li, Yun; et al. (1998). "Nonlinear model structure identification using genetic programming" (PDF). Control Engineering Practice. 6 (11): 1341–1352. doi:10.1016/s0967-0661(98)00087-2.
  112. ^ Zhang, Jun; Zhan, Zhi-hui; Lin, Ying; Chen, Ni; Gong, Yue-Jiao; Zhong, Jing-hui; Chung, Henry S.H.; Li, Yun; Shi, Yu-hui (2011). "Evolutionary Computation Meets Machine Learning: A Survey". IEEE Computational Intelligence Magazine. 6 (4): 68–75. doi:10.1109/MCI.2011.942584. S2CID 6760276.
  113. ^ Gregory S. Hornby (2003). Generative Representations for Computer-Automated Design Systems, NASA Ames Research Center, Mail Stop 269-3, Moffett Field, CA 94035-1000
  114. ^ J. Clune and H. Lipson (2011). Evolving three-dimensional objects with a generative encoding inspired by developmental biology. Proceedings of the European Conference on Artificial Life. 2011.
  115. ^ Zhan, Z.H.; et al. (2009). "Adaptive Particle Swarm Optimization" (PDF). IEEE Transactions on Systems, Man, and Cybernetics - Part B: Cybernetics. 39 (6): 1362–1381. doi:10.1109/tsmcb.2009.2015956. PMID 19362911. S2CID 11191625.
  116. ^ "WordNet Search—3.1". Wordnetweb.princeton.edu. Archived from the original on 14 January 2013. Retrieved 14 May 2012.
  117. ^ Dana H. Ballard; Christopher M. Brown (1982). Computer Vision. Prentice Hall. ISBN 0-13-165316-4.
  118. ^ Huang, T. (1996-11-19). Vandoni, Carlo, E, ed. Computer Vision : Evolution And Promise (PDF). 19th CERN School of Computing. Geneva: CERN. pp. 21–25. doi:10.5170/CERN-1996-008.21. ISBN 978-9290830955.
  119. ^ Milan Sonka; Vaclav Hlavac; Roger Boyle (2008). Image Processing, Analysis, and Machine Vision. Thomson. ISBN 0-495-08252-X.
  120. ^ Garson, James (27 November 2018). Zalta, Edward N. (ed.). The Stanford Encyclopedia of Philosophy. Metaphysics Research Lab, Stanford University – via Stanford Encyclopedia of Philosophy.
  121. ^ "Ishtar for Belgium to Belgrade". European Broadcasting Union. Retrieved 19 May 2013.
  122. ^ LeCun, Yann. "LeNet-5, convolutional neural networks". Retrieved 16 November 2013.
  123. ^ Zhang, Wei (1988). "Shift-invariant pattern recognition neural network and its optical architecture". Proceedings of annual conference of the Japan Society of Applied Physics.
  124. ^ Zhang, Wei (1990). "Parallel distributed processing model with local space-invariant interconnections and its optical architecture". Applied Optics. 29 (32): 4790–7. Bibcode:1990ApOpt..29.4790Z. doi:10.1364/AO.29.004790. PMID 20577468.
  125. ^ Tian, Yuandong; Zhu, Yan (2015). "Better Computer Go Player with Neural Network and Long-term Prediction". arXiv:1511.06410v1 [cs.LG].
  126. ^ "How Facebook's AI Researchers Built a Game-Changing Go Engine". MIT Technology Review. 4 December 2015. Retrieved 3 February 2016.
  127. ^ "Facebook AI Go Player Gets Smarter With Neural Network And Long-Term Prediction To Master World's Hardest Game". Tech Times. 28 January 2016. Retrieved 24 April 2016.
  128. ^ "Facebook's artificially intelligent Go player is getting smarter". VentureBeat. 27 January 2016. Retrieved 24 April 2016.
  129. ^ Solomonoff, R.J.The Time Scale of Artificial Intelligence; Reflections on Social Effects, Human Systems Management, Vol 5 1985, Pp 149-153
  130. ^ Moor, J., The Dartmouth College Artificial Intelligence Conference: The Next Fifty years, AI Magazine, Vol 27, No., 4, Pp. 87-9, 2006
  131. ^ Kline, Ronald R., Cybernetics, Automata Studies and the Dartmouth Conference on Artificial Intelligence, IEEE Annals of the History of Computing, October–December, 2011, IEEE Computer Society
  132. ^ a b Haghighat, Mohammad; Abdel-Mottaleb, Mohamed; Alhalabi, Wadee (2016). "Discriminant Correlation Analysis: Real-Time Feature Level Fusion for Multimodal Biometric Recognition". IEEE Transactions on Information Forensics and Security. 11 (9): 1984–1996. doi:10.1109/TIFS.2016.2569061. S2CID 15624506.
  133. ^ Lenzerini, Maurizio (2002). "Data Integration: A Theoretical Perspective" (PDF). PODS 2002. pp. 233–246. Archived from the original (PDF) on 27 October 2021. Retrieved 5 July 2022.
  134. ^ Lane, Frederick (2006). "IDC: World Created 161 Billion Gigs of Data in 2006".
  135. ^ Dhar, V. (2013). "Data science and prediction". Communications of the ACM. 56 (12): 64–73. doi:10.1145/2500499. S2CID 6107147.
  136. ^ Leek, Jeff (12 December 2013). "The key word in 'Data Science' is not Data, it is Science". Simply Statistics. Archived from the original on 2 January 2014. Retrieved 11 November 2018.
  137. ^ Hayashi, Chikio (1 January 1998). "What is Data Science ? Fundamental Concepts and a Heuristic Example". In Hayashi, Chikio; Yajima, Keiji; Bock, Hans-Hermann; Ohsumi, Noboru; Tanaka, Yutaka; Baba, Yasumasa (eds.). Data Science, Classification, and Related Methods. Studies in Classification, Data Analysis, and Knowledge Organization. Springer Japan. pp. 40–51. doi:10.1007/978-4-431-65950-1_3. ISBN 9784431702085.
  138. ^ Dedić, Nedim; Stanier, Clare (2016). Hammoudi, Slimane; Maciaszek, Leszek; Missikoff, Michele M. Missikoff; Camp, Olivier; Cordeiro, José (eds.). An Evaluation of the Challenges of Multilingualism in Data Warehouse Development. International Conference on Enterprise Information Systems, 25–28 April 2016, Rome, Italy (PDF). Proceedings of the 18th International Conference on Enterprise Information Systems (ICEIS 2016). Vol. 1. SciTePress. pp. 196–206. doi:10.5220/0005858401960206. ISBN 978-989-758-187-8.
  139. ^ "9 Reasons Data Warehouse Projects Fail". blog.rjmetrics.com. 4 December 2014. Retrieved 30 April 2017.
  140. ^ Huang; Green; Loo, "Datalog and Emerging applications", SIGMOD 2011 (PDF), UC Davis, archived from the original (PDF) on 1 July 2022, retrieved 5 July 2022.
  141. ^ Steele, Katie and Stefánsson, H. Orri, "Decision Theory", The Stanford Encyclopedia of Philosophy (Winter 2015 Edition), Edward N. Zalta (ed.), URL = [1]
  142. ^ Lloyd, J.W., Practical Advantages of Declarative Programming
  143. ^ a b Bengio, Y.; Courville, A.; Vincent, P. (2013). "Representation Learning: A Review and New Perspectives". IEEE Transactions on Pattern Analysis and Machine Intelligence. 35 (8): 1798–1828. arXiv:1206.5538. doi:10.1109/tpami.2013.50. PMID 23787338. S2CID 393948.
  144. ^ Schmidhuber, J (2015). "Deep Learning in Neural Networks: An Overview". Neural Networks. 61: 85–117. arXiv:1404.7828. doi:10.1016/j.neunet.2014.09.003. PMID 25462637. S2CID 11715509.
  145. ^ Bengio, Yoshua; LeCun, Yann; Hinton, Geoffrey (2015). "Deep Learning" (PDF). Nature. 521 (7553): 436–444. Bibcode:2015Natur.521..436L. doi:10.1038/nature14539. PMID 26017442. S2CID 3074096.
  146. ^ "About Us | DeepMind". DeepMind.
  147. ^ "A return to Paris | DeepMind". DeepMind. 29 March 2018.
  148. ^ "The Last AI Breakthrough DeepMind Made Before Google Bought It". The Physics arXiv Blog. 29 January 2014. Retrieved 12 October 2014.
  149. ^ Graves, Alex; Wayne, Greg; Danihelka, Ivo (2014). "Neural Turing Machines". arXiv:1410.5401 [cs.NE].
  150. ^ Best of 2014: Google's Secretive DeepMind Startup Unveils a "Neural Turing Machine" Archived 4 December 2015 at the Wayback Machine, MIT Technology Review
  151. ^ Graves, Alex; Wayne, Greg; Reynolds, Malcolm; Harley, Tim; Danihelka, Ivo; Grabska-Barwińska, Agnieszka; Colmenarejo, Sergio Gómez; Grefenstette, Edward; Ramalho, Tiago (12 October 2016). "Hybrid computing using a neural network with dynamic external memory". Nature. 538 (7626): 471–476. Bibcode:2016Natur.538..471G. doi:10.1038/nature20101. ISSN 1476-4687. PMID 27732574. S2CID 205251479.
  152. ^ Kohs, Greg (29 September 2017), AlphaGo, Ioannis Antonoglou, Lucas Baker, Nick Bostrom, retrieved 9 January 2018
  153. ^ Silver, David; Hubert, Thomas; Schrittwieser, Julian; Antonoglou, Ioannis; Lai, Matthew; Guez, Arthur; Lanctot, Marc; Sifre, Laurent; Kumaran, Dharshan; Graepel, Thore; Lillicrap, Timothy; Simonyan, Karen; Hassabis, Demis (5 December 2017). "Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm". arXiv:1712.01815 [cs.AI].
  154. ^ Ester, Martin; Kriegel, Hans-Peter; Sander, Jörg; Xu, Xiaowei (1996). Simoudis, Evangelos; Han, Jiawei; Fayyad, Usama M. (eds.). A density-based algorithm for discovering clusters in large spatial databases with noise (PDF). Proceedings of the Second International Conference on Knowledge Discovery and Data Mining (KDD-96). AAAI Press. pp. 226–231. CiteSeerX 10.1.1.121.9220. ISBN 1-57735-004-9.
  155. ^ Sikos, Leslie F. (2017). Description Logics in Multimedia Reasoning. Cham: Springer International Publishing. doi:10.1007/978-3-319-54066-5. ISBN 978-3-319-54066-5. S2CID 3180114.
  156. ^ Ho, Jonathan; Jain, Ajay; Abbeel, Pieter (19 June 2020). Denoising Diffusion Probabilistic Models. arXiv:2006.11239.
  157. ^ Song, Yang; Sohl-Dickstein, Jascha; Kingma, Diederik P.; Kumar, Abhishek; Ermon, Stefano; Poole, Ben (10 February 2021). "Score-Based Generative Modeling through Stochastic Differential Equations". arXiv:2011.13456 [cs.LG].
  158. ^ Gu, Shuyang; Chen, Dong; Bao, Jianmin; Wen, Fang; Zhang, Bo; Chen, Dongdong; Yuan, Lu; Guo, Baining (2021). "Vector Quantized Diffusion Model for Text-to-Image Synthesis". arXiv:2111.14822 [cs.CV].
  159. ^ Chang, Ziyi; Koulieris, George Alex; Shum, Hubert P. H. (2023). "On the Design Fundamentals of Diffusion Models: A Survey". arXiv:2306.04542 [cs.LG].
  160. ^ Croitoru, Florinel-Alin; Hondru, Vlad; Ionescu, Radu Tudor; Shah, Mubarak (2023). "Diffusion Models in Vision: A Survey". IEEE Transactions on Pattern Analysis and Machine Intelligence. 45 (9): 10850–10869. arXiv:2209.04747. doi:10.1109/TPAMI.2023.3261988. PMID 37030794. S2CID 252199918.
  161. ^ Roweis, S. T.; Saul, L. K. (2000). "Nonlinear Dimensionality Reduction by Locally Linear Embedding". Science. 290 (5500): 2323–2326. Bibcode:2000Sci...290.2323R. CiteSeerX 10.1.1.111.3313. doi:10.1126/science.290.5500.2323. PMID 11125150. S2CID 5987139.
  162. ^ Pudil, P.; Novovičová, J. (1998). "Novel Methods for Feature Subset Selection with Respect to Problem Knowledge". In Liu, Huan; Motoda, Hiroshi (eds.). Feature Extraction, Construction and Selection. pp. 101. doi:10.1007/978-1-4615-5725-8_7. ISBN 978-1-4613-7622-4.
  163. ^ Demazeau, Yves, and J-P. Müller, eds. Decentralized Ai. Vol. 2. Elsevier, 1990.
  164. ^ Hendrickx, Iris; Van den Bosch, Antal (October 2005). "Hybrid algorithms with Instance-Based Classification". Machine Learning: ECML2005. Springer. pp. 158–169. ISBN 9783540292432.
  165. ^ a b Ostrow, Adam (5 March 2011). "Roger Ebert's Inspiring Digital Transformation". Mashable Entertainment. Retrieved 12 September 2011. With the help of his wife, two colleagues and the Alex-equipped MacBook that he uses to generate his computerized voice, famed film critic Roger Ebert delivered the final talk at the TED conference on Friday in Long Beach, California....
  166. ^ Lee, Jennifer (7 March 2011). "Roger Ebert Tests His Vocal Cords, and Comedic Delivery". The New York Times. Retrieved 12 September 2011. Now perhaps, there is the Ebert Test, a way to see if a synthesized voice can deliver humor with the timing to make an audience laugh.... He proposed the Ebert Test as a way to gauge the humanness of a synthesized voice.
  167. ^ "Roger Ebert's Inspiring Digital Transformation". Tech News. 5 March 2011. Archived from the original on 25 March 2011. Retrieved 12 September 2011. Meanwhile, the technology that enables Ebert to "speak" continues to see improvements – for example, adding more realistic inflection for question marks and exclamation points. In a test of that, which Ebert called the "Ebert test" for computerized voices,
  168. ^ Pasternack, Alex (18 April 2011). "A MacBook May Have Given Roger Ebert His Voice, But An iPod Saved His Life (Video)". Motherboard. Archived from the original on 6 September 2011. Retrieved 12 September 2011. He calls it the "Ebert Test," after Turing's AI standard...
  169. ^ Jaeger, Herbert; Haas, Harald (2004). "Harnessing Nonlinearity: Predicting Chaotic Systems and Saving Energy in Wireless Communication" (PDF). Science. 304 (5667): 78–80. Bibcode:2004Sci...304...78J. doi:10.1126/science.1091277. PMID 15064413. S2CID 2184251. Archived from the original (PDF) on 1 September 2022. Retrieved 5 July 2022.
  170. ^ Herbert Jaeger (2007) Echo State Network. Archived 28 June 2022 at the Wayback Machine Scholarpedia.
  171. ^ Serenko, Alexander; Bontis, Nick; Detlor, Brian (2007). "End-user adoption of animated interface agents in everyday work applications" (PDF). Behaviour and Information Technology. 26 (2): 119–132. doi:10.1080/01449290500260538. S2CID 2175427.
  172. ^ Opitz, D.; Maclin, R. (1999). "Popular ensemble methods: An empirical study". Journal of Artificial Intelligence Research. 11: 169–198. arXiv:1106.0257. doi:10.1613/jair.614.
  173. ^ Polikar, R. (2006). "Ensemble based systems in decision making". IEEE Circuits and Systems Magazine. 6 (3): 21–45. doi:10.1109/MCAS.2006.1688199. S2CID 18032543.
  174. ^ Rokach, L. (2010). "Ensemble-based classifiers". Artificial Intelligence Review. 33 (1–2): 1–39. doi:10.1007/s10462-009-9124-7. hdl:11323/1748. S2CID 11149239.
  175. ^ Vikhar, P. A. (2016). "Evolutionary algorithms: A critical review and its future prospects". 2016 International Conference on Global Trends in Signal Processing, Information Computing and Communication (ICGTSPICC). Jalgaon, 2016, pp. 261-265. pp. 261–265. doi:10.1109/ICGTSPICC.2016.7955308. ISBN 978-1-5090-0467-6. S2CID 22100336.
  176. ^ Russell, Stuart; Norvig, Peter (2009). "26.3: The Ethics and Risks of Developing Artificial Intelligence". Artificial Intelligence: A Modern Approach. Prentice Hall. ISBN 978-0-13-604259-4.
  177. ^ Bostrom, Nick (2002). "Existential risks". Journal of Evolution and Technology. 9 (1): 1–31.
  178. ^ "Your Artificial Intelligence Cheat Sheet". Slate. 1 April 2016. Retrieved 16 May 2016.
  179. ^ Jackson, Peter (1998), Introduction To Expert Systems (3 ed.), Addison Wesley, p. 2, ISBN 978-0-201-87686-4
  180. ^ "Conventional programming". PC Magazine. Archived from the original on 14 October 2012. Retrieved 15 September 2013.
  181. ^ Martignon, Laura; Vitouch, Oliver; Takezawa, Masanori; Forster, Malcolm. "Naive and Yet Enlightened: From Natural Frequencies to Fast and Frugal Decision Trees", published in Thinking : Psychological perspectives on reasoning, judgement and decision making (David Hardman and Laura Macchi; editors), Chichester: John Wiley & Sons, 2003.
  182. ^ Bishop, Christopher (2006). Pattern recognition and machine learning. Berlin: Springer. ISBN 0-387-31073-8.
  183. ^ a b Hodgson, Dr. J. P. E., "First Order Logic" Archived 21 September 2019 at the Wayback Machine, Saint Joseph's University, Philadelphia, 1995.
  184. ^ Hughes, G. E., & Cresswell, M. J., A New Introduction to Modal Logic (London: Routledge, 1996), p.161.
  185. ^ Feigenbaum, Edward (1988). The Rise of the Expert Company. Times Books. p. 318. ISBN 978-0-8129-1731-4.
  186. ^ Hayes, Patrick (1981). "The Frame Problem and Related Problems in Artificial Intelligence" (PDF). Readings in Artificial Intelligence. University of Edinburgh: 223–230. doi:10.1016/B978-0-934613-03-3.50020-9. ISBN 9780934613033. S2CID 141711662. Archived from the original (PDF) on 3 December 2013. Retrieved 9 March 2019.
  187. ^ Sardar, Z (2010). "The Namesake: Futures; futures studies; futurology; futuristic; Foresight -- What's in a name?". Futures. 42 (3): 177–184. doi:10.1016/j.futures.2009.11.001.
  188. ^ Pedrycz, Witold (1993). Fuzzy control and fuzzy systems (2 ed.). Research Studies Press Ltd.
  189. ^ Hájek, Petr (1998). Metamathematics of fuzzy logic (4 ed.). Springer Science & Business Media.
  190. ^ D. Dubois and H. Prade (1988) Fuzzy Sets and Systems. Academic Press, New York.
  191. ^ Liang, Lily R.; Lu, Shiyong; Wang, Xuena; Lu, Yi; Mandal, Vinay; Patacsil, Dorrelyn; Kumar, Deepak (2006). "FM-test: A fuzzy-set-theory-based approach to differential gene expression data analysis". BMC Bioinformatics. 7 (Suppl 4): S7. doi:10.1186/1471-2105-7-S4-S7. PMC 1780132. PMID 17217525.
  192. ^ Myerson, Roger B. (1991). Game Theory: Analysis of Conflict, Harvard University Press, p. 1. Chapter-preview links, pp. vii–xi.
  193. ^ Pell, Barney (1992). H. van den Herik; L. Allis (eds.). "Metagame: a new challenge for games and learning" [Heuristic programming in artificial intelligence 3–the third computerolympiad] (PDF). Ellis-Horwood. Archived from the original (PDF) on 17 February 2020. Retrieved 13 June 2020.
  194. ^ Pell, Barney (1996). "A Strategic Metagame Player for General Chess-Like Games". Computational Intelligence. 12 (1): 177–198. doi:10.1111/j.1467-8640.1996.tb00258.x. ISSN 1467-8640. S2CID 996006.
  195. ^ Genesereth, Michael; Love, Nathaniel; Pell, Barney (15 June 2005). "General Game Playing: Overview of the AAAI Competition". AI Magazine. 26 (2): 62. doi:10.1609/aimag.v26i2.1813. ISSN 2371-9621.
  196. ^ Griffith, Erin; Metz, Cade (27 January 2023). "Anthropic Said to Be Closing In on $300 Million in New A.I. Funding". The New York Times. Retrieved 14 March 2023.
  197. ^ Lanxon, Nate; Bass, Dina; Davalos, Jackie (10 March 2023). "A Cheat Sheet to AI Buzzwords and Their Meanings". Bloomberg News. Retrieved 14 March 2023.
  198. ^ Pasick, Adam (27 March 2023). "Artificial Intelligence Glossary: Neural Networks and Other Terms Explained". The New York Times. ISSN 0362-4331. Retrieved 22 April 2023.
  199. ^ Andrej Karpathy; Pieter Abbeel; Greg Brockman; Peter Chen; Vicki Cheung; Yan Duan; Ian Goodfellow; Durk Kingma; Jonathan Ho; Rein Houthooft; Tim Salimans; John Schulman; Ilya Sutskever; Wojciech Zaremba (16 June 2016). "Generative models". OpenAI.
  200. ^ Mitchell 1996, p. 2.
  201. ^ Smith, Craig S. (15 March 2023). "ChatGPT-4 Creator Ilya Sutskever on AI Hallucinations and AI Democracy". Forbes. Retrieved 25 December 2023.
  202. ^ Trudeau, Richard J. (1993). Introduction to Graph Theory (Corrected, enlarged republication. ed.). New York: Dover Pub. p. 19. ISBN 978-0-486-67870-2. Retrieved 8 August 2012. A graph is an object consisting of two sets called its vertex set and its edge set.
  203. ^ Yoon, Byoung-Ha; Kim, Seon-Kyu; Kim, Seon-Young (March 2017). "Use of Graph Database for the Integration of Heterogeneous Biological Data". Genomics & Informatics. 15 (1): 19–27. doi:10.5808/GI.2017.15.1.19. ISSN 1598-866X. PMC 5389944. PMID 28416946.
  204. ^ Bourbakis, Nikolaos G. (1998). Artificial Intelligence and Automation. World Scientific. p. 381. ISBN 9789810226374. Retrieved 20 April 2018.
  205. ^ Pearl, Judea (1984). Heuristics: intelligent search strategies for computer problem solving. United States: Addison-Wesley Pub. Co., Inc., Reading, MA. p. 3. Bibcode:1985hiss.book.....P. OSTI 5127296.
  206. ^ E. K. Burke, E. Hart, G. Kendall, J. Newall, P. Ross, and S. Schulenburg, Hyper-heuristics: An emerging direction in modern search technology, Handbook of Metaheuristics (F. Glover and G. Kochenberger, eds.), Kluwer, 2003, pp. 457–474.
  207. ^ P. Ross, Hyper-heuristics, Search Methodologies: Introductory Tutorials in Optimization and Decision Support Techniques (E. K. Burke and G. Kendall, eds.), Springer, 2005, pp. 529-556.
  208. ^ Ozcan, E.; Bilgin, B.; Korkmaz, E. E. (2008). "A Comprehensive Analysis of Hyper-heuristics". Intelligent Data Analysis. 12 (1): 3–23. doi:10.3233/ida-2008-12102.
  209. ^ "IEEE CIS Scope". Archived from the original on 4 June 2016. Retrieved 18 March 2019.
  210. ^ "Control of Machining Processes - Purdue ME Manufacturing Laboratories". engineering.purdue.edu.
  211. ^ Hoy, Matthew B. (2018). "Alexa, Siri, Cortana, and More: An Introduction to Voice Assistants". Medical Reference Services Quarterly. 37 (1): 81–88. doi:10.1080/02763869.2018.1404391. PMID 29327988. S2CID 30809087.
  212. ^ Oudeyer, Pierre-Yves; Kaplan, Frederic (2008). "How can we define intrinsic motivation?". Proc. of the 8th Conf. on Epigenetic Robotics. Vol. 5. pp. 29–31.
  213. ^ Chevallier, Arnaud (2016). "Strategic thinking in complex problem solving". Oxford Scholarship Online. Oxford; New York: Oxford University Press. doi:10.1093/acprof:oso/9780190463908.001.0001. ISBN 9780190463908. OCLC 940455195. S2CID 157255130.
  214. ^ "Strategy survival guide: Issue trees". London: Government of the United Kingdom. July 2004. Archived from the original on 17 February 2012. Retrieved 6 October 2018. Also available in PDF format.
  215. ^ a b Paskin, Mark. "A Short Course on Graphical Models" (PDF). Stanford.
  216. ^ Woods, W. A.; Schmolze, J. G. (1992). "The KL-ONE family". Computers & Mathematics with Applications. 23 (2–5): 133. doi:10.1016/0898-1221(92)90139-9.
  217. ^ Brachman, R. J.; Schmolze, J. G. (1985). "An Overview of the KL-ONE Knowledge Representation System" (PDF). Cognitive Science. 9 (2): 171. doi:10.1207/s15516709cog0902_1.[permanent dead link]
  218. ^ Duce, D.A.; Ringland, G.A. (1988). Approaches to Knowledge Representation, An Introduction. Research Studies Press, Ltd. ISBN 978-0-86380-064-1.
  219. ^ Fix, Evelyn; Hodges, Joseph L. (1951). Discriminatory Analysis. Nonparametric Discrimination: Consistency Properties (PDF) (Report). USAF School of Aviation Medicine, Randolph Field, Texas. Archived (PDF) from the original on 26 September 2020.
  220. ^ Schank, Roger; Robert Abelson (1977). Scripts, Plans, Goals, and Understanding: An Inquiry Into Human Knowledge Structures. Lawrence Erlbaum Associates, Inc.
  221. ^ "Knowledge Representation in Neural Networks - deepMinds". deepMinds. 16 August 2018. Archived from the original on 17 August 2018. Retrieved 16 August 2018.
  222. ^ Kerner, Sean Michael. "What are Large Language Models?". TechTarget. Retrieved 28 January 2024.
  223. ^ Reilly, Edwin D. (2003). Milestones in computer science and information technology. Greenwood Publishing Group. pp. 156–157. ISBN 978-1-57356-521-9.
  224. ^ Hochreiter, Sepp; Schmidhuber, Jürgen (1997). "Long short-term memory". Neural Computation. 9 (8): 1735–1780. doi:10.1162/neco.1997.9.8.1735. PMID 9377276. S2CID 1915014.
  225. ^ Siegelmann, Hava T.; Sontag, Eduardo D. (1992). "On the computational power of neural nets". Proceedings of the fifth annual workshop on Computational learning theory. Vol. COLT '92. pp. 440–449. doi:10.1145/130385.130432. ISBN 978-0897914970. S2CID 207165680. {{cite book}}: |work= ignored (help)
  226. ^ "Markov chain | Definition of Markov chain in US English by Oxford Dictionaries". Oxford Dictionaries | English. Archived from the original on 15 December 2017. Retrieved 14 December 2017.
  227. ^ Definition at Brilliant.org "Brilliant Math and Science Wiki". Retrieved 12 May 2019
  228. ^ "The Nature of Mathematical Programming Archived 2014-03-05 at the Wayback Machine," Mathematical Programming Glossary, INFORMS Computing Society.
  229. ^ Wang, Wenwu (1 July 2010). Machine Audition: Principles, Algorithms and Systems. IGI Global. ISBN 9781615209194 – via igi-global.com.
  230. ^ "Machine Audition: Principles, Algorithms and Systems" (PDF).
  231. ^ Malcolm Tatum (October 3, 2012). "What is Machine Perception".
  232. ^ Alexander Serov (January 29, 2013). "Subjective Reality and Strong Artificial Intelligence" (PDF).
  233. ^ "Machine Perception & Cognitive Robotics Laboratory". ccs.fau.edu. Retrieved 18 June 2016.
  234. ^ "What is Mechatronics Engineering?". Prospective Student Information. University of Waterloo. Archived from the original on 6 October 2011. Retrieved 30 May 2011.
  235. ^ "Mechatronics (Bc., Ing., PhD.)". Archived from the original on 15 August 2016. Retrieved 15 April 2011.
  236. ^ Franke; Siezen, Teusink (2005). "Reconstructing the metabolic network of a bacterium from its genome". Trends in Microbiology. 13 (11): 550–558. doi:10.1016/j.tim.2005.09.001. PMID 16169729.
  237. ^ Balamurugan, R.; Natarajan, A.M.; Premalatha, K. (2015). "Stellar-Mass Black Hole Optimization for Biclustering Microarray Gene Expression Data". Applied Artificial Intelligence. 29 (4): 353–381. doi:10.1080/08839514.2015.1016391. S2CID 44624424.
  238. ^ Bianchi, Leonora; Dorigo, Marco; Maria Gambardella, Luca; Gutjahr, Walter J. (2009). "A survey on metaheuristics for stochastic combinatorial optimization" (PDF). Natural Computing. 8 (2): 239–287. doi:10.1007/s11047-008-9098-4. S2CID 9141490.
  239. ^ Herbert B. Enderton, 2001, A Mathematical Introduction to Logic Second Edition Enderton:110, Harcourt Academic Press, Burlington MA, ISBN 978-0-12-238452-3.
  240. '^ "Naive Semantics to Support Automated Database Design", IEEE Transactions on Knowledge and Data Engineering, Volume 14, issue 1 (January 2002) by V. C. Storey, R. C. Goldstein and H. Ullrich
  241. ^ Using early binding and late binding in Automation, Microsoft, 11 May 2007, retrieved 11 May 2009
  242. ^ strictly speaking a URIRef
  243. ^ https://w3.org/TR/PR-rdf-syntax/ "Resource Description Framework (RDF) Model and Syntax Specification"
  244. ^ Miller, Lance A. "Natural language programming: Styles, strategies, and contrasts." IBM Systems Journal 20.2 (1981): 184–215.
  245. ^ Hopfield, J. J. (1982). "Neural networks and physical systems with emergent collective computational abilities". Proc. Natl. Acad. Sci. U.S.A. 79 (8): 2554–2558. Bibcode:1982PNAS...79.2554H. doi:10.1073/pnas.79.8.2554. PMC 346238. PMID 6953413.
  246. ^ "Deep Minds: An Interview with Google's Alex Graves & Koray Kavukcuoglu". Retrieved 17 May 2016.
  247. ^ Graves, Alex; Wayne, Greg; Danihelka, Ivo (2014). "Neural Turing Machines". arXiv:1410.5401 [cs.NE].
  248. ^ Krucoff, Max O.; Rahimpour, Shervin; Slutzky, Marc W.; Edgerton, V. Reggie; Turner, Dennis A. (1 January 2016). "Enhancing Nervous System Recovery through Neurobiologics, Neural Interface Training, and Neurorehabilitation". Frontiers in Neuroscience. 10: 584. doi:10.3389/fnins.2016.00584. PMC 5186786. PMID 28082858.
  249. ^ Mead, Carver (1990). "Neuromorphic electronic systems" (PDF). Proceedings of the IEEE. 78 (10): 1629–1636. CiteSeerX 10.1.1.161.9762. doi:10.1109/5.58356. S2CID 1169506.
  250. ^ Maan, A. K.; Jayadevi, D. A.; James, A. P. (1 January 2016). "A Survey of Memristive Threshold Logic Circuits". IEEE Transactions on Neural Networks and Learning Systems. PP (99): 1734–1746. arXiv:1604.07121. doi:10.1109/TNNLS.2016.2547842. ISSN 2162-237X. PMID 27164608. S2CID 1798273.
  251. ^ "A Survey of Spintronic Architectures for Processing-in-Memory and Neural Networks", JSA, 2018
  252. ^ Zhou, You; Ramanathan, S. (1 August 2015). "Mott Memory and Neuromorphic Devices". Proceedings of the IEEE. 103 (8): 1289–1310. doi:10.1109/JPROC.2015.2431914. ISSN 0018-9219. S2CID 11347598.
  253. ^ Monroe, D. (2014). "Neuromorphic computing gets ready for the (really) big time". Communications of the ACM. 57 (6): 13–15. doi:10.1145/2601069. S2CID 20051102.
  254. ^ Zhao, W. S.; Agnus, G.; Derycke, V.; Filoramo, A.; Bourgoin, J. -P.; Gamrat, C. (2010). "Nanotube devices based crossbar architecture: Toward neuromorphic computing". Nanotechnology. 21 (17): 175202. Bibcode:2010Nanot..21q5202Z. doi:10.1088/0957-4484/21/17/175202. PMID 20368686. S2CID 16253700. Archived from the original on 10 April 2021. Retrieved 2 December 2019.
  255. ^ The Human Brain Project SP 9: Neuromorphic Computing Platform on YouTube
  256. ^ Copeland, Jack (May 2000). "What is Artificial Intelligence?". AlanTuring.net. Archived from the original on 9 November 2015. Retrieved 7 November 2015.
  257. ^ Kleinberg, Jon; Tardos, Éva (2006). Algorithm Design (2nd ed.). Addison-Wesley. p. 464. ISBN 0-321-37291-3.
  258. ^ Cobham, Alan (1965). "The intrinsic computational difficulty of functions". Proc. Logic, Methodology, and Philosophy of Science II. North Holland.
  259. ^ "What is Occam's Razor?". math.ucr.edu. Retrieved 1 June 2019.
  260. ^ "OpenAI shifts from nonprofit to 'capped-profit' to attract capital". TechCrunch. Retrieved 2019-05-10.
  261. ^ "OpenCog: Open-Source Artificial General Intelligence for Virtual Worlds". CyberTech News. 6 March 2009. Archived from the original on 6 March 2009. Retrieved 1 October 2016.
  262. ^ St. Laurent, Andrew M. (2008). Understanding Open Source and Free Software Licensing. O'Reilly Media. p. 4. ISBN 9780596553951.
  263. ^ Levine, Sheen S.; Prietula, Michael J. (30 December 2013). "Open Collaboration for Innovation: Principles and Performance". Organization Science. 25 (5): 1414–1433. arXiv:1406.7541. doi:10.1287/orsc.2013.0872. ISSN 1047-7039. S2CID 6583883.
  264. ^ Definition of "overfitting" at OxfordDictionaries.com: this definition is specifically for statistics.
  265. ^ Bishop, Christopher M. (2006). Pattern Recognition and Machine Learning (PDF). Springer. p. vii. Pattern recognition has its origins in engineering, whereas machine learning grew out of computer science. However, these activities can be viewed as two facets of the same field, and together they have undergone substantial development over the past ten years.
  266. ^ Hughes, G. E., & Cresswell, M. J., A New Introduction to Modal Logic (London: Routledge, 1996), p.161.
  267. ^ Nyce, Charles (2007), Predictive Analytics White Paper (PDF), American Institute for Chartered Property Casualty Underwriters/Insurance Institute of America, p. 1
  268. ^ Eckerson, Wayne (10 May 2007), Extending the Value of Your Data Warehousing Investment, The Data Warehouse Institute
  269. ^ Karl R. Popper, The Myth of Framework, London (Routledge) 1994, chap. 8.
  270. ^ Karl R. Popper, The Poverty of Historicism, London (Routledge) 1960, chap. iv, sect. 31.
  271. ^ "Probabilistic programming does in 50 lines of code what used to take thousands". phys.org. 13 April 2015. Retrieved 13 April 2015.
  272. ^ "Probabilistic Programming". probabilistic-programming.org. Archived from the original on 10 January 2016. Retrieved 31 July 2019.
  273. ^ Pfeffer, Avrom (2014), Practical Probabilistic Programming, Manning Publications. p.28. ISBN 978-1 6172-9233-0
  274. ^ Clocksin, William F.; Mellish, Christopher S. (2003). Programming in Prolog. Berlin; New York: Springer-Verlag. ISBN 978-3-540-00678-7.
  275. ^ Bratko, Ivan (2012). Prolog programming for artificial intelligence (4th ed.). Harlow, England; New York: Addison Wesley. ISBN 978-0-321-41746-6.
  276. ^ Covington, Michael A. (1994). Natural language processing for Prolog programmers. Englewood Cliffs, N.J.: Prentice Hall. ISBN 978-0-13-629213-5.
  277. ^ Lloyd, J. W. (1984). Foundations of logic programming. Berlin: Springer-Verlag. ISBN 978-3-540-13299-8.
  278. ^ Kuhlman, Dave. "A Python Book: Beginning Python, Advanced Python, and Python Exercises". Section 1.1. Archived from the original (PDF) on 23 June 2012.
  279. ^ Yegulalp, Serdar (19 January 2017). "Facebook brings GPU-powered machine learning to Python". InfoWorld. Retrieved 11 December 2017.
  280. ^ Lorica, Ben (3 August 2017). "Why AI and machine learning researchers are beginning to embrace PyTorch". O'Reilly Media. Retrieved 11 December 2017.
  281. ^ Ketkar, Nikhil (2017). "Introduction to PyTorch". Deep Learning with Python. Apress, Berkeley, CA. pp. 195–208. doi:10.1007/978-1-4842-2766-4_12. ISBN 9781484227657.
  282. ^ Moez Ali (June 2023). "NLP with PyTorch: A Comprehensive Guide". datacamp.com. Retrieved 1 April 2024.
  283. ^ Patel, Mo (7 December 2017). "When two trends fuse: PyTorch and recommender systems". O'Reilly Media. Retrieved 18 December 2017.
  284. ^ Mannes, John. "Facebook and Microsoft collaborate to simplify conversions from PyTorch to Caffe2". TechCrunch. Retrieved 18 December 2017. FAIR is accustomed to working with PyTorch – a deep learning framework optimized for achieving state of the art results in research, regardless of resource constraints. Unfortunately in the real world, most of us are limited by the computational capabilities of our smartphones and computers.
  285. ^ Arakelyan, Sophia (29 November 2017). "Tech giants are using open source frameworks to dominate the AI community". VentureBeat. Retrieved 18 December 2017.
  286. ^ "PyTorch strengthens its governance by joining the Linux Foundation". pytorch.org. Retrieved 13 September 2022.
  287. ^ a b Reiter, Raymond (2001). Knowledge in Action: Logical Foundations for Specifying and Implementing Dynamical Systems. Cambridge, Massachusetts: The MIT Press. pp. 20–22. ISBN 9780262527002.
  288. ^ Thielscher, Michael (September 2001). "The Qualification Problem: A solution to the problem of anomalous models". Artificial Intelligence. 131 (1–2): 1–37. doi:10.1016/S0004-3702(01)00131-X.
  289. ^ Grumbling, Emily; Horowitz, Mark, eds. (2019). Quantum Computing : Progress and Prospects (2018). Washington, DC: National Academies Press. p. I-5. doi:10.17226/25196. ISBN 978-0-309-47969-1. OCLC 1081001288. S2CID 125635007.
  290. ^ R language and environment Hornik, Kurt (4 October 2017). "R FAQ". The Comprehensive R Archive Network. 2.1 What is R?. Retrieved 6 August 2018. R Foundation Hornik, Kurt (4 October 2017). "R FAQ". The Comprehensive R Archive Network. 2.13 What is the R Foundation?. Retrieved 6 August 2018. The R Core Team asks authors who use R in their data analysis to cite the software using: R Core Team (2016). R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. URL https://R-project.org/.
  291. ^ widely used Fox, John & Andersen, Robert (January 2005). "Using the R Statistical Computing Environment to Teach Social Statistics Courses" (PDF). Department of Sociology, McMaster University. Retrieved 6 August 2018. Vance, Ashlee (6 January 2009). "Data Analysts Captivated by R's Power". The New York Times. Retrieved 6 August 2018. R is also the name of a popular programming language used by a growing number of data analysts inside corporations and academia. It is becoming their lingua franca...
  292. ^ Vance, Ashlee (6 January 2009). "Data Analysts Captivated by R's Power". The New York Times. Retrieved 6 August 2018. R is also the name of a popular programming language used by a growing number of data analysts inside corporations and academia. It is becoming their lingua franca...
  293. ^ Broomhead, D. S.; Lowe, David (1988). Radial basis functions, multi-variable functional interpolation and adaptive networks (PDF) (Technical report). RSRE. 4148. Archived from the original on 9 April 2013.
  294. ^ Broomhead, D. S.; Lowe, David (1988). "Multivariable functional interpolation and adaptive networks" (PDF). Complex Systems. 2: 321–355.
  295. ^ Schwenker, Friedhelm; Kestler, Hans A.; Palm, Günther (2001). "Three learning phases for radial-basis-function networks". Neural Networks. 14 (4–5): 439–458. doi:10.1016/s0893-6080(01)00027-2. PMID 11411631.
  296. ^ Ho, Tin Kam (1995). Random Decision Forests (PDF). Proceedings of the 3rd International Conference on Document Analysis and Recognition, Montreal, QC, 14–16 August 1995. pp. 278–282. Archived from the original (PDF) on 17 April 2016. Retrieved 5 June 2016.
  297. ^ Ho, TK (1998). "The Random Subspace Method for Constructing Decision Forests". IEEE Transactions on Pattern Analysis and Machine Intelligence. 20 (8): 832–844. doi:10.1109/34.709601. S2CID 206420153.
  298. ^ Hastie, Trevor; Tibshirani, Robert; Friedman, Jerome(2008). The Elements of Statistical Learning (2nd ed.). Springer. ISBN 0-387-95284-5.
  299. ^ Graves, A.; Liwicki, M.; Fernandez, S.; Bertolami, R.; Bunke, H.; Schmidhuber, J. (2009). "A Novel Connectionist System for Improved Unconstrained Handwriting Recognition" (PDF). IEEE Transactions on Pattern Analysis and Machine Intelligence. 31 (5): 855–868. CiteSeerX 10.1.1.139.4502. doi:10.1109/tpami.2008.137. PMID 19299860. S2CID 14635907.
  300. ^ Sak, Hasim; Senior, Andrew; Beaufays, Francoise (2014). "Long Short-Term Memory recurrent neural network architectures for large scale acoustic modeling" (PDF). Archived from the original (PDF) on 24 April 2018. Retrieved 6 August 2019.
  301. ^ Li, Xiangang; Wu, Xihong (15 October 2014). "Constructing Long Short-Term Memory based Deep Recurrent Neural Networks for Large Vocabulary Speech Recognition". arXiv:1410.4281 [cs.CL].
  302. ^ Kaelbling, Leslie P.; Littman, Michael L.; Moore, Andrew W. (1996). "Reinforcement Learning: A Survey". Journal of Artificial Intelligence Research. 4: 237–285. arXiv:cs/9605103. doi:10.1613/jair.301. S2CID 1708582. Archived from the original on 20 November 2001. Retrieved 5 July 2022.
  303. ^ Patrizio, Andy. "What is reinforcement learning from human feedback (RLHF)?". TechTarget. Retrieved 28 January 2024.
  304. ^ Schrauwen, Benjamin, David Verstraeten, and Jan Van Campenhout. "An overview of reservoir computing: theory, applications, and implementations." Proceedings of the European Symposium on Artificial Neural Networks ESANN 2007, pp. 471-482.
  305. ^ Mass, Wolfgang; Nachtschlaeger, T.; Markram, H. (2002). "Real-time computing without stable states: A new framework for neural computation based on perturbations". Neural Computation. 14 (11): 2531–2560. doi:10.1162/089976602760407955. PMID 12433288. S2CID 1045112.
  306. ^ Jaeger, Herbert, "The echo state approach to analyzing and training recurrent neural networks." Technical Report 154 (2001), German National Research Center for Information Technology.
  307. ^ Jaeger, Herbert (2007). "Echo state network". Scholarpedia. 2 (9): 2330. Bibcode:2007SchpJ...2.2330J. doi:10.4249/scholarpedia.2330.
  308. ^ "XML and Semantic Web W3C Standards Timeline" (PDF). 4 February 2012. Archived from the original (PDF) on 6 July 2022. Retrieved 5 July 2022.
  309. ^ See, for example, Boolos and Jeffrey, 1974, chapter 11.
  310. ^ Sowa, John F. (1987). "Semantic Networks". In Shapiro, Stuart C (ed.). Encyclopedia of Artificial Intelligence. Retrieved 29 April 2008.
  311. ^ O'Hearn, P. W.; Pym, D. J. (June 1999). "The Logic of Bunched Implications". Bulletin of Symbolic Logic. 5 (2): 215–244. CiteSeerX 10.1.1.27.4742. doi:10.2307/421090. JSTOR 421090. S2CID 2948552.
  312. ^ Abran et al. 2004, pp. 1–1
  313. ^ "Computing Degrees & Careers". ACM. 2007. Archived from the original on 17 June 2011. Retrieved 23 November 2010.
  314. ^ Laplante, Phillip (2007). What Every Engineer Should Know about Software Engineering. Boca Raton: CRC. ISBN 978-0-8493-7228-5. Retrieved 21 January 2011.
  315. ^ Rapoza, Jim (2 May 2006). "SPARQL Will Make the Web Shine". eWeek. Retrieved 17 January 2007.
  316. ^ Segaran, Toby; Evans, Colin; Taylor, Jamie (2009). Programming the Semantic Web. O'Reilly Media, Inc., 1005 Gravenstein Highway North, Sebastopol, CA 95472. p. 84. ISBN 978-0-596-15381-6.
  317. ^ Maass, Wolfgang (1997). "Networks of spiking neurons: The third generation of neural network models". Neural Networks. 10 (9): 1659–1671. doi:10.1016/S0893-6080(97)00011-7. ISSN 0893-6080.
  318. ^ "What is stateless? - Definition from WhatIs.com". techtarget.com.
  319. ^ Lise Getoor and Ben Taskar: Introduction to statistical relational learning, MIT Press, 2007
  320. ^ Ryan A. Rossi, Luke K. McDowell, David W. Aha, and Jennifer Neville, "Transforming Graph Data for Statistical Relational Learning. Archived 6 January 2018 at the Wayback Machine" Journal of Artificial Intelligence Research (JAIR), Volume 45 (2012), pp. 363-441.
  321. ^ Spall, J. C. (2003). Introduction to Stochastic Search and Optimization. Wiley. ISBN 978-0-471-33052-3.
  322. ^ Language Understanding Using Two-Level Stochastic Models by F. Pla, et al, 2001, Springer Lecture Notes in Computer Science ISBN 978-3-540-42557-1
  323. ^ Stuart J. Russell, Peter Norvig (2010) Artificial Intelligence: A Modern Approach, Third Edition, Prentice Hall ISBN 9780136042594.
  324. ^ Mehryar Mohri, Afshin Rostamizadeh, Ameet Talwalkar (2012) Foundations of Machine Learning, The MIT Press ISBN 9780262018258.
  325. ^ Cortes, Corinna; Vapnik, Vladimir N (1995). "Support vector networks". Machine Learning. 20 (3): 273–297. doi:10.1007/BF00994018.
  326. ^ Beni, G.; Wang, J. (1993). "Swarm Intelligence in Cellular Robotic Systems". Proceed. NATO Advanced Workshop on Robots and Biological Systems, Tuscany, Italy, June 26–30 (1989). pp. 703–712. doi:10.1007/978-3-642-58069-7_38. ISBN 978-3-642-63461-1.
  327. ^ Haugeland 1985, p. 255.
  328. ^ Poole, Mackworth & Goebel 1998, p. 1.
  329. ^ "Collection of sources defining "singularity"". singularitysymposium.com. Retrieved 17 April 2019.
  330. ^ Eden, Amnon H.; Moor, James H. (2012). Singularity hypotheses: A Scientific and Philosophical Assessment. Dordrecht: Springer. pp. 1–2. ISBN 9783642325601.
  331. ^ Cadwalladr, Carole (2014). "Are the robots about to rise? Google's new director of engineering thinks so..." The Guardian. Guardian News and Media Limited.
  332. ^ Sutton, Richard & Andrew Barto (1998). Reinforcement Learning. MIT Press. ISBN 978-0-585-02445-5. Archived from the original on 30 March 2017.
  333. ^ Pellionisz, A.; Llinás, R. (1980). "Tensorial Approach To The Geometry Of Brain Function: Cerebellar Coordination Via A Metric Tensor" (PDF). Neuroscience. 5 (7): 1125––1136. doi:10.1016/0306-4522(80)90191-8. PMID 6967569. S2CID 17303132.[dead link]
  334. ^ Pellionisz, A.; Llinás, R. (1985). "Tensor Network Theory Of The Metaorganization Of Functional Geometries In The Central Nervous System". Neuroscience. 16 (2): 245–273. doi:10.1016/0306-4522(85)90001-6. PMID 4080158. S2CID 10747593.
  335. ^ "TensorFlow: Open source machine learning" "It is machine learning software being used for various kinds of perceptual and language understanding tasks" — Jeffrey Dean, minute 0:47 / 2:17 from YouTube clip
  336. ^ Sipser, Michael (2013). Introduction to the Theory of Computation 3rd. Cengage Learning. ISBN 978-1-133-18779-0. central areas of the theory of computation: automata, computability, and complexity. (Page 1)
  337. ^ Thompson, William R (1933). "On the likelihood that one unknown probability exceeds another in view of the evidence of two samples". Biometrika. 25 (3–4): 285–294. doi:10.1093/biomet/25.3-4.285.
  338. ^ Russo, Daniel J.; Van Roy, Benjamin; Kazerouni, Abbas; Osband, Ian; Wen, Zheng (2018). "A Tutorial on Thompson Sampling". Foundations and Trends in Machine Learning. 11 (1): 1–96. arXiv:1707.02038. doi:10.1561/2200000070. S2CID 3929917.
  339. ^ Dickson, Ben (2 May 2022). "Machine learning: What is the transformer architecture?". TechTarget. Retrieved 2 May 2022.
  340. ^ Mercer, Calvin. Religion and Transhumanism: The Unknown Future of Human Enhancement. Praeger.
  341. ^ Bostrom, Nick (2005). "A history of transhumanist thought" (PDF). Journal of Evolution and Technology. Retrieved 21 February 2006.
  342. ^ Minsky 1967:107 "In his 1936 paper, A. M. Turing defined the class of abstract machines that now bear his name. A Turing machine is a finite-state machine associated with a special kind of environment -- its tape -- in which it can store (and later recover) sequences of symbols," also Stone 1972:8 where the word "machine" is in quotation marks.
  343. ^ Stone 1972:8 states "This "machine" is an abstract mathematical model", also cf. Sipser 2006:137ff that describes the "Turing machine model". Rogers 1987 (1967):13 refers to "Turing's characterization", Boolos Burgess and Jeffrey 2002:25 refers to a "specific kind of idealized machine".
  344. ^ Sipser 2006:137 "A Turing machine can do everything that a real computer can do".
  345. ^ Turing originally suggested a teleprinter, one of the few text-only communication systems available in 1950. (Turing 1950, p. 433)
  346. ^ Pierce 2002, p. 1: "A type system is a tractable syntactic method for proving the absence of certain program behaviors by classifying phrases according to the kinds of values they compute."
  347. ^ Cardelli 2004, p. 1: "The fundamental purpose of a type system is to prevent the occurrence of execution errors during the running of a program."
  348. ^ Hinton, Jeffrey; Sejnowski, Terrence (1999). Unsupervised Learning: Foundations of Neural Computation. MIT Press. ISBN 978-0262581684.
  349. ^ Colaner, Seth; Humrick, Matthew (3 January 2016). "A third type of processor for AR/VR: Movidius' Myriad 2 VPU". Tom's Hardware.
  350. ^ Banerje, Prasid (28 March 2016). "The rise of VPUs: Giving Eyes to Machines". Digit.in. Archived from the original on 11 December 2018. Retrieved 5 July 2022.
  351. ^ "DeepQA Project: FAQ". IBM. Archived from the original on 5 November 2015. Retrieved 11 February 2011.
  352. ^ Ferrucci, David; Levas, Anthony; Bagchi, Sugato; Gondek, David; Mueller, Erik T. (1 June 2013). "Watson: Beyond Jeopardy!". Artificial Intelligence. 199: 93–105. doi:10.1016/j.artint.2012.06.009.
  353. ^ Hale, Mike (8 February 2011). "Actors and Their Roles for $300, HAL? HAL!". The New York Times. Retrieved 11 February 2011.
  354. ^ "The DeepQA Project". IBM Research. Archived from the original on 21 January 2013. Retrieved 18 February 2011.
  355. ^ io9.com mentions narrow AI. Published 1 April 2013. Retrieved 16 February 2014: https://io9.com/how-much-longer-before-our-first-ai-catastrophe-464043243
  356. ^ AI researcher Ben Goertzel explains why he became interested in AGI instead of narrow AI. Published 18 Oct 2013. Retrieved 16 February 2014. https://intelligence.org/2013/10/18/ben-goertzel/
  357. ^ TechCrunch discusses AI App building regarding Narrow AI. Published 16 Oct 2015. Retrieved 17 Oct 2015. https://techcrunch.com/2015/10/15/machine-learning-its-the-hard-problems-that-are-valuable/
  358. ^ Jurafsky, Daniel; H. James, Martin (2000). Speech and language processing: an introduction to natural language processing, computational linguistics, and speech recognition. Upper Saddle River, N.J.: Prentice Hall. ISBN 978-0-13-095069-7.
  359. ^ "GitHub project webpage". GitHub. June 2022. Archived from the original on 1 April 2021. Retrieved 5 April 2016.

Works cited

edit

Notes

edit
  1. ^ polynomial time refers to how quickly the number of operations needed by an algorithm, relative to the size of the problem, grows. It is therefore a measure of efficiency of an algorithm.