NMBU REALTEK, DAT121, 2023 (August block): Glossary

Agent

Definition: An agent is a system that interacts with its surroundings. It receives percepts through sensors and can carry out actions through actuators.

See also: Influence diagram, knowledge base, Pareto optimality, rationality.

Argument passing

Definition: Argument passing is the process of handing over data items to a procedure (e.g., a function in Python) when that procedure is called.

See also: Object reference, procedural programming.

Competency question

(under construction)

See also: Ontology, triple.

Concept (class, entity type)

Definition: A concept is a universal that is only instantiated by individuals.

See also: Foundational ontology, individual, object-oriented programming, ontology, relation, relationship, resource.

Dark data

(under construction)

See also: Ontology, regression analysis, reproducibility.

Decorrelation time (autocorrelation time)

(under construction)

See also: Regression analysis, reproducibility.

Dictionary (hash)

Definition and translation left open for discussion on 14th/15th August.

See also: Dynamic array, object reference, static array.

Dynamic array

Definition: A dynamic array is a dynamic data structure; specifically, it is an array with a dynamically adjustable size, usually reserving free memory capacity at its end or beginning (or both) for additional future elements. Therein, an array is a variable referring to a contiguous region in memory that can hold the content of multiple elementary variables or objects.

See also: Dictionary, static array.

Dynamic typing

Definition: In a program or programming language using dynamic typing, variables do not need a declaration (and their type does not need to be explicitly specified) before being used; instead, it is determined at runtime.

See also: Object reference, script language.

Foundational ontology (top-level ontology)

(under construction)

See also: Concept, ontology.

Global variable

Definition: A global variable is a variable that can be accessed through a name with an unrestricted scope. It has a name that resolves everywhere in the code.

See also: Scope, script language.

Hypothesis

Definition: In machine learning, a hypothesis is a function y = f(x0, x1, …) that predicts an outcome variable y on the basis of values of one or multiple independent variables x0, x1, …

See also: Residual quantity, validation and testing.

Individual (entity, object)

Definition: Anything about which it can be meaningfully asked what concepts it instantiates is an individual.

See also: Concept, knowledge graph, object-oriented programming, object reference, persistent identifier, relationship, resource.

Influence diagram

(under construction)

See also: Agent, optimization parameter, p value, rationality, regression analysis, reproducibility, residual quantity.

Knowledge base

Definition: A knowledge base, given by K = (T, A), consists of an ontology T, describing universals, and a set of assertions A describing concrete instances of these universals.

See also: Agent, knowledge graph, ontology, resource.

Knowledge graph (ABox)

Definition left open for discussion on 18th, 21st or 22nd August.

See also: Individual, knowledge base, persistent identifier, relationship, triple.

Object-oriented programming

Definition left open for discussion on 17th August.

See also: Concept, individual, object reference, procedural programming.

Object reference

Definition: A reference is an alias for data stored at a certain memory address. An object reference is a reference to an object; the memory address remains hidden from the programmer, who can use the reference as if it was the object itself.

See also: Argument passing, dictionary, dynamic typing, individual, object-oriented programming, scope.

Ontology (TBox)

Definition: An ontology is a semantic artefact that formulates a conceptual scheme; it specifies, for a certain domain of knowledge and according to a certain paradigm within that domain, what kinds of entities there can be and how they can relate to each other.

See also: Competency question, concept, dark data, foundational ontology, knowledge base.

Optimization objective

Definition: An optimization objective is a quantity that is used to formulate preferences for the outcome of a decision making scenario. In case of a maximization objective, greater values are preferred, and in case of a minimization objective, smaller values are preferred.

See also: Pareto optimality, rationality.

Optimization parameter

Definition: An optimization parameter is a quantity over which the decision maker has direct control; a parameter value (or parameterization) is selected in order to obtain the best possible outcome for the optimization objective(s).

See also: Influence diagram, Pareto optimality.

Pareto optimality

Definition: Within the framework of multicriteria optimization (MCO), a point in objective space is Pareto optimal if it is accessible and no other accessible point in objective space dominates it.

See also: Agent, optimization parameter, optimization objective, rationality.

Persistent identifier

(under construction)

See also: Individual, knowledge graph.

Procedural programming

Definition: Procedural programming is the programming paradigm where procedures are employed as the highest-level device for structuring code and the program control flow.

See also: Argument passing, object-oriented programming, scope.

p value

(under construction)

See also: Influence diagram, regression analysis.

Rationality

Definition: Tendency toward minimizing a cost function or maximizing a performance measure. In particular, rational preferences, or decisions and choices made by a rational agent, must satisfy the following constraints (Russell & Norvig 2021, p. 520):

For a more complete and more mathematically oriented discussion of rational choice, cf. Russell & Norvig (2021, p. 520f.).

See also: Agent, influence diagram, optimization objective, Pareto optimality.

Regression analysis

Regression is a method or process in supervised learning. The learning problem consists in finding out how an outcome variable y (also called the dependent variable) depends on the values of one or multiple independent variables.

Regression analysis can refer to a discussion of regression methodology (e.g., ordinary least squares fits based on the root mean square deviation) or to analysing the outcome of a regression, such as assessing the confidence in the model. Standardized techniques and concepts for analysing the regression outcome are particularly widespread for linear regression.

See also: Dark data, decorrelation time, influence diagram, p value, reproducibility, residual quantity, validation and testing.

Relation (object property, relationship type)

Definition left open for discussion on 17th August.

See also: Concept, relationship, resource.

Relationship

(under construction)

See also: Concept, individual, knowledge graph, relation, triple.

Reproducibility

(Reserved for discussion on 25th August.)

See also: Dark data, decorrelation time, influence diagram, regression analysis.

Residual quantity

(under construction)

See also: Hypothesis, influence diagram, regression analysis.

Resource

Definition: In RDF, there are three kinds of resources, namely, concepts, relations, and individuals.

See also: Concept, individual, knowledge base, relation, triple.

Scope

Definition: The scope of a name (e.g., for the name of an object reference) is the region within the source code within which that name can be resolved.

See also: Global variable, object reference, procedural programming.

Script language (interpreted language)

Definition: A script language is a language that is most typically used for writing scripts, i.e., programs that require a run-time environment, such as an interpreter or shell, going through the code and executing it step by step.

See also: Dynamic typing, global variable.

Static array

Definition: An array is a variable referring to a contiguous region in memory that can hold the content of multiple elementary variables or objects. A static array is just that, without any special additional functionality, as opposed to a dynamic array which provides the additional functionality that it can be resized.

See also: Dictionary, dynamic array.

Triple

Definition: An RDF triple consists of a subject, a predicate, and an object, all of which need to be resources.

See also: Competency question, knowledge graph, relationship, resource.

Validation and testing

In supervised learning, it is often unclear what hypothesis is the best for modelling the phenomena underlying a given data set. In that case, it is common practice to develop multiple candidate models based on different hypotheses (e.g., a linear, quadratic, and cubic model), compare them to each other by validation, and finally assess the accuracy of the selected model by testing.

For this purpose, the overall data set can be split up into three parts:

The split between training and validation data is helpful to prevent overfitting. The split between validation and test data prevents a selection bias: Since the validation data are used to choose the best hypothesis, the performance of the selected hypothesis will usually tend to be overestimated slightly.

See also: Hypothesis, regression analysis.

Referenced literature

Index