Glossary

A B C D E F J M N P R S U

A

Agentic AI

Refers to artificial intelligence that can act independently to achieve goals, rather than just responding to instructions. It makes decisions, takes actions, and can adapt based on its environment, showing a level of “agency.”

API

Stands for Application Programming Interface. It is a set of functions that allow to access to data from an application or an operating system.

Artificial intelligence

Field of science that refers to computer systems that perform tasks typically performed by human intelligence, such as speech recognition, problem-solving, or decision-making.

B

Big Data

Term that refers to large and complex datasets from various sources that can't be analysed by traditional tools.

C

CSV

A file format in which the records are arranged in a table, with each item of information separated by a comma.

D

Darsmouth

A city in New Hampshire, USA, where the Dartmouth Summer Research Project on Artificial Intelligence was held in 1956, laying the foundations for AI

Data Ethics

Data ethics is concerned with ethical questions, considerations, and debates around data, including the collection, management, analysis, interpretation, sharing, and use (and misuse) of data, and the impact of data on individuals and the society

Data Fabric

Architectural approach that integrates and manages access to data from multiple sources within the organisation.

Data Hub

Centralised platform that provides a single point of access to data from multiple sources and simplifies collaboration and information sharing.

Data Integration

The process of transforming data from multiple sources to have a consolidated view for analysis.

Data Lake

Data storage that holds large amounts of unstructured, structured or semi-structured raw data. It is typically used for machine learning processing or big data analysis.

Data Mesh

Decentralised architectural approach that organises data within domains according to its use.

Data Mining

Process of analysing large amounts of data raws from different angles to identify potential connections between databases.

Data Set

Structured collection of data records, generally organised in a tabular form with rows and columns.

Data Warehouse

Data storage that contains structured data records designed to support business decision-making through data analysis with a consolidated view.

Deep Learning

Subset of Machine Learning that incorporates the use of neural networks inspired by the human brain to learn from large amounts of data.

E

ELT

Stands for Extraction Load Transform .It is a data integration process that loads data after extraction before transforming it directly into the data storage

ETL

Stands for Extraction Transform Load. It is a data integration process that transforms data after extraction before loading it into the data storage.

F

Fine-Tuning

Process of programming a pre-trained model for specific tasks.

FTP

Protocol to transfer files between computer systems

J

JSON

A file format in which records are organised as objects. It is inspired by the JavaScript programming language.

M

Machine Learning

The field of artificial intelligence that involves developing computer systems that can learn and adapt without explicit instruction.

Metadata

Data that describes data information, such as relational databases and details.

N

NLP

Stands for Neural Language Processing. It's an AI field that uses machine learning techniques to help computer systems understand natural language. It is particularly used in the context of generative AI.

P

Python

Programming language generally used to perform analysis using algorithms.

R

Raw Data

Data that has not been manipulated and is in its native format.

S

Semi-Structured Data

Data with some organisational levels without a generally defined structure.

SQL

Acronym of Structured Query Language. It is a programming language used to query, manipulate and interact with databases for analysis

Structured Data

Data that is organised and stored in a specific format to ease the readability for data analysis

U

Unstructured Data

Data that have not a predefined format and lacks a fixed schema, making it difficult to store in traditional relational databases