Skip to main content
Department of Information Technology

Data science

Creating value through data


Data Science is about extracting knowledge from digital data. Given the ubiquitous availability of digital data, Data Science has a wide range of applications, from supporting scientific discoveries in the life sciences to understanding the mechanisms through which disinformation spreads in social media.

The volume, variety, and velocity often characterising contemporary digital data requires the application and development of methods rooted in multiple disciplines, such as databases, distributed computing, machine learning, data mining, and visualisation. This makes Data Science a multi-disciplinary field. For example, the processing of very large data requires efficient data management, leading to the frequent application of methods from databases, artificial intelligence, and cloud computing. Contemporary digital data also exist in a variety of forms, such as images, text, and graphs. This requires the application and development of specialised algorithms, for example from image analysis, natural language processing, and social network analysis. In addition, data often varies in time. Therefore, data streams and algorithms for longitudinal and temporal data analysis are often important elements in Data Science applications.

In Data Science, technical topics such as data models and algorithms are often influenced by ethical and legal considerations. An example is our research on decentralised and privacy-preserving data analysis.

Research Topics

  • Databases (DB): big data, keyword search and ranking, (semi) structured data, (attributed) graphs, semantic data, spatial data, online (geo) social networks.
  • Distributed computing infrastructures (DCI): performance, efficiency, and optimisation of large-scale computational resources.
  • Data Mining (DM): clustering algorithms.
  • Federated Machine learning (FML).
  • Network Science (NS) and Social Network Analysis (SNA): multilayer networks, temporal text networks, probabilistic networks.
  • Social Data Science (SDS): analysis of online communication, analysis of register data, online information disorder.
Updated  2022-05-11 11:27:33 by Victor Kuismin.