Data Science is about extracting knowledge from digital data. Given the ubiquitous availability of digital data, Data Science has a wide range of applications, from supporting scientific discoveries in the life sciences to understanding the mechanisms through which disinformation spreads in social media.
The volume, variety, and velocity often characterising contemporary digital data requires the application and development of methods rooted in multiple disciplines, such as databases, distributed computing, machine learning, data mining, and visualisation. This makes Data Science a multi-disciplinary field. For example, the processing of very large data requires efficient data management, leading to the frequent application of methods from databases, artificial intelligence, and cloud computing. Contemporary digital data also exist in a variety of forms, such as images, text, and graphs. This requires the application and development of specialised algorithms, for example from image analysis, natural language processing, and social network analysis. In addition, data often varies in time. Therefore, data streams and algorithms for longitudinal and temporal data analysis are often important elements in Data Science applications.
In Data Science, technical topics such as data models and algorithms are often influenced by ethical and legal considerations. An example is our research on decentralised and privacy-preserving data analysis.
- Databases (DB): big data, keyword search and ranking, (semi) structured data, (attributed) graphs, semantic data, spatial data, online (geo) social networks.
- Distributed computing infrastructures (DCI): performance, efficiency, and optimisation of large-scale computational resources.
- Data Mining (DM): clustering algorithms.
- Federated Machine learning (FML).
- Network Science (NS) and Social Network Analysis (SNA): multilayer networks, temporal text networks, probabilistic networks.
- Social Data Science (SDS): analysis of online communication, analysis of register data, online information disorder.
- Integrative Scalable Computing Laboratory: DCI, FML
- InfoLab – Uppsala University Information Laboratory: DM, NS, SDS, SNA
- UDBL – Uppsala Database Laboratory: DB
- eSSENCE – an e-Science inter-university collaboration: DB, DCI, DM, NS
- NORDIS – Nordic Observatory for Digital Media and Information Disorder: SDS
Most of the data science courses are part of the following programmes:
- Master's programme in Data Science, spec. in Data Engineering.
- Master's programme in Data Science, spec. in Machine Learning and Statistics.
- Master's programme in Image Analysis and Machine Learning.
Other courses in Data Science not part of the programmes above:
- 1DL007: Introduction to Computational Social Science (SDS, SNA)