IT Technical reports

Technical report 2024-001: Supplement - Capital in Computing Education: Investigating Factors Underlying Participation

Thom Kunkeler and Aletta Nylén — 2024-03-01

Abstract: This document provides the supplementing material for the following publica- tion: Thom Kunkeler and Aletta Nylén. Capital in Computing Education: Investigating Factors Underlying Participation. 2024. In Proceedings of the 2024 Conference on Innovation and Technology in Computer Science Education (Milan, Italy, 2024-07-08) (ITiCSE'24). In this publication, we developed a validated survey instrument to measure capital in computing education. Capital refers to the legitimate, valuable and exchangeable resources that individuals use to generate social advantage within specific fields. In computing education, a theoretical model has been developed highlighting the forms of capital which influence participation and success in the field. This study assessed the theoretical model through careful survey design and Confirmatory Factor Analaysis (CFA). The hypothesised survey structure was assessed in terms of model fit to the observed data, and adjusted to achieve a survey with high internal consistency among the items and factors (robust: X2p = 0.119; CFI/TLI = 0.97/0.95; RMSEA = 0.06, SRMR = 0.041). This document contains a detailed presentation of the pre- and post-validated survey instrument, in addition to the factor analysis diagram.

Technical report 2023-003: Relations Between Prediction Error and Maximum Likelihood Methods in an Error-in-Variables Setting. Extended version with full proofs

Torsten Söderström — 2023-10-01

Abstract: Prediction error (PE) and maximum likelihood (ML) methods are often treated as synonyms when identifying linear dynamic systems from Gaussian data. It is shown how these methods differ when specifically dealing with errors-in-variables problems. These problems can modeled using multivariable times series with a specific internal structure. In such situations the ML estimates have lower variances than the PE estimates. Explicit expressions for the covariance matrices of the estimates are given and analyzed. For the special case when the unperturbed input is white noise it is shown that the PE estimate is not identifiable, while the ML estimates still have quite small variances. Another special case concerns non-Gaussian data. In that case a pseudo-ML estimate (using the ML criterion as if the data were Gaussian) will no longer be superior to the PE estimate in terms of error variances.

Technical report 2023-002: Using gender equality indicators to support gender mainstreaming work at the Department of Information Technology

Ginevra Castellano, Gunilla Kreiss, Robin Strand, and Lina von Sydow — 2023-04-01

Abstract: Previous research has shown that gender statistics can be a powerful tool to raise organizational awareness of gender issues. This report presents the results of a project that investigated how Uppsala Universityâs gender equality indicators can be used to monitor the gender distribution of research resources and funding at the Department of Information Technology and how they can be used in a long-term perspective to improve gender mainstreaming work at the department. Results show that gender differences exist and they are sometimes in favour of females and sometimes in favour of males. This analysis raises several questions of relevance to future gender mainstreaming work at the department.

Technical report 2023-001: Preconditioning of Discrete State- and Control-Constrained Optimal Control Convection-Diffusion Problems

Ivo Dravins and Maya Neytcheva — 2023-02-01

Abstract: We consider the iterative solution of algebraic systems, arising in optimal control problems, constrained by a partial differential equation, with additional box constraints on the state and the control variables, and sparsity imposed on the control. A nonsymmetric two-by-two block preconditioner is analysed and tested for a wide range of problem, regularization and discretization parameters. The constraint equation characterizes convection-diffusion processes.

Technical report 2022-009: Analyzing the Parameter Bias when an ARMAX Model is Fitted to Noise-Corrupted Data

Torsten Söderström and Umberto Soverini — 2022-10-01

Abstract: When an ARMAX model is fitted to noise-corrupted data using the prediction error method, biased estimates are obtained. The bias is examined, with emphasis on the situation when the system is almost non-identifiable. In contrast to the case of using an output error model, no general results on the size of the bias seem to apply.

Technical report 2022-008: Analyzing the Parameter Bias when an Instrumental Variable Method is Used with Noise-Corrupted Data

Torsten Söderström and Umberto Soverini — 2022-10-01

Abstract: When an output error model is fitted to data with noise-corrupted inputs using a prediction error method, a bias occurs. It was previously shown that the bias is of order O(1/delta) for a small pole-zero separation delta. These notes examine the same problem when an instrumental variable model is fitted. A similar result is shown to hold for the instrumental variable case.

Technical report 2022-007: Faster Functional Warming with Cache Merging

Gustaf Borgström, Christian Rohner, and David Black-Schaffer — 2022-08-01

Abstract: SMARTS-like sampled hardware simulation techniques achieve good accuracy by simulating many small portions of an application in detail. However, while this reduces the detailed simulation time, it results in extensive cache warming times, as each of the many simulation points requires warming the whole memory hierarchy. Adaptive Cache Warming reduces this time by iteratively increasing warming until achieving sufficient accuracy. Unfortunately, each time the warming increases, the previous warming must be redone, nearly doubling the required warming. We address re-warming by developing a technique to merge the cache states from the previous and additional warming iterations. We address re-warming by developing a technique to merge the cache states from the previous and additional warming iterations. We demonstrate our merging approach on multi-level LRU cache hierarchy and evaluate and address the introduced errors. By removing warming redundancy, we expect an ideal 2x warming speedup when using our Cache Merging solution together with Adaptive Cache Warming. Experiments show that Cache Merging delivers an average speedup of 1.44x, 1.84x, and 1.87x for 128kB, 2MB, and 8MB L2 caches, respectively, with 95-percentile absolute IPC errors of only 0.029, 0.015, and 0.006, respectively. These results demonstrate that Cache Merging yields significantly higher simulation speed with minimal losses.

Technical report 2022-006: A Robust Multi-Goal Exploration Aided Tracking Policy

Ruoqi Zhang, Per Mattsson, and Torbjörn Wigren — 2022-06-01

Abstract: Set-point control aims at finding a policy that can track a set point that varies over time. Such control objectives are central in industry, yet multi-goal Reinforcement Learning methods are typically evaluated on other environments. The paper therefore proposes the use of a combination of feedback based amplitude aided exploration, simulated ensemble model training, together with policy optimization also over integrated errors, to arrive at a trained multi-goal policy that can be directly deployed to real-world nonlinear set-point control systems. The claim is supported by experiments with a real-world nonlinear cascaded tank process and a simulated strongly non-linear pH-control system.

Technical report 2022-005: Consistency Study of a Reconstructed Genotype Probability Distribution via Clustered Bootstrapping in NORB Pooling Blocks

Camille Clouard and Carl Nettelblad — 2022-06-01

Abstract: For applications with biallelic genetic markers, group testing techniques, synonymous to pooling techniques, are usually applied for decreasing the cost of large-scale testing as e.g. when detecting carriers of rare genetic variants. In some configurations, the results of the grouped tests cannot be decoded and the pooled items are missing. Inference of these missing items can be performed with specific statistical methods that are for example related to the Expectation-Maximization algorithm. Pooling has also been applied for determining the genotype of markers in large populations. The particularity of full genotype data for diploid organisms in the context of group testing are the ternary outcomes (two homozygous genotypes and one heterozygous), as well as the distribution of these three outcomes in a population, which is often ruled by the Hardy-Weinberg Equilibrium and depends on the allele frequency in such situation. When using a nonoverlapping repeated block pooling design, the missing items are only observed in particular arrangements. Overall, a data set of pooled genotypes can be described as an inference problem in Missing Not At Random data with nonmonotone missingness patterns. This study presents a preliminary investigation of the consistency of various iterative methods estimating the most likely genotype probabilities of the missing items in pooled data. We use the Kullback-Leibler divergence and the L2 distance between the genotype distribution computed from our estimates and a simulated empirical distribution as a measure of the distributional consistency.

Technical report 2022-004: Stage-Parallel Preconditioners for Implicit Runge-Kutta Methods of Arbitrary High Order. Linear problems

Owe Axelsson, Ivo Dravins, and Maya Neytcheva — 2022-04-01

Abstract: Fully implicit Runge-Kutta methods offer the possibility to use high order accurate time discretization to match space discretization accuracy, an issue of significant importance for many large scale problems of current interest, where we may have fine space resolution with many millions of spatial degrees of freedom and long time intervals. In this work we consider strongly A-stable implicit Runge-Kutta methods of arbitrary order of accuracy, based on Radau quadratures. For the arising large algebraic systems we introduce an efficient preconditioner, that allows for fully stage-parallel solution. We analyse the spectrum of the corresponding preconditioned system and illustrate the performance of the solution method with numerical experiments using MPI. In this work we consider only linear problems.

Technical report 2022-003: Implicit Summation by Parts Operators for Finite Difference Approximations of First and Second Derivatives

Ken Mattsson and Ylva Ljungberg Rydin — 2022-01-01

Abstract: Implicit finite difference approximations are derived for both the first and second derivates. The boundary closures are based on the banded-norm summation-by-parts framework and the boundary conditions are imposed using a weak (penalty) enforcement. Up to 8th order global convergence is achieved. The finite difference approximations lead to implicit ODE systems. Spectral resolution characteristics are achieved by proper tuning of the internal difference stencils. The accuracy and stability properties are demonstrated for linear hyperbolic problems in 1D and the 2D compressible Euler equations.

Technical report 2022-002: MATLAB Software for Nonlinear and Delayed Recursive Identification - Revision 2

Torbjörn Wigren — 2022-01-01

Abstract: This report is the user's manual for a package of MATLAB scripts and functions, developed for recursive prediction error identification of nonlinear state space systems. The identified state space model incorporates delay, which allows a treatment of general nonlinear networked identification, as well as of general nonlinear systems with delay. The core of the package is an implementation of two output error identification algorithms. The algorithms are based on a continuous time, structured black box state space model of a nonlinear system. The present revision adds a new algorithm, where also the output is determined via a parameterized measurement equation in the states and inputs. The software can only be run off-line, i.e. no true real time operation is possible. The algorithms are however implemented so that true on-line operation can be obtained by extraction of the main algorithmic loop. The user must then provide the real time environment. The software package contains scripts and functions that allow the user to either input live measurements or to generate test data by simulation. The scripts and functions for the setup and execution of the identification algorithms are somewhat more general than what is described in the references. The functionality for display of results include scripts for plotting of e.g. data, parameters, prediction errors, eigenvalues and the condition number of the Hessian. The estimated model obtained at the end of a run can be simulated and the model output plotted, alone or together with the data used for identification. Model validation is supported by two methods apart from the display functionality. First, a calculation of the RPEM loss function can be performed, using parameters obtained at the end of an identification run. Secondly, the accuracy as a function of the output signal amplitude can be assessed.

Technical report 2022-001: Sjuksköterskors upplevelse av att jobba med IT-system: sammanfattning

Diane Golay and Åsa Cajander — 2022-01-01

Abstract: Denna rapport sammanfattar resultaten från en kvalitativ studie om sjuksköterskors upplevelse av IT-system på jobbet. De känslor och uppfattningar som sjuksköterskor upplevde i samband med IT-användning på jobbet presenteras och implikationerna för design och implementering av IT-system och IT-stödda processer i sjukhusmiljö diskuteras.

Technical report 2021-008: MATLAB Software for Recursive Identification and Scaling Using a Structured Nonlinear Black-box Model - Revision 7

Torbjörn Wigren — 2021-12-01

Abstract: This reports is intended as a users manual for a package of MATLAB scripts and functions, developed for recursive prediction error identification of nonlinear state space systems and nonlinear static systems. The core of the package is the implementation of three output error identification and scaling algorithms. The first algorithm is based on a continuous time, structured black box state space model of a nonlinear system. An RPEM algorithm for recursive identification of nonlinear static systems, that re-uses the parameterization of the nonlinear ODE model, is also included in the software package. The present revision adds a third algorithm, where also the output is determined via a parameterized measurement equation in the states and inputs. The software can only be run off-line, i.e. no true real time operation is possible. The algorithm is however implemented so that true on-line operation can be obtained by extraction of the main algorithmic loop. The user must then provide the real time environment. The software package contains scripts and functions that allow the user to either input live measurements or to generate test data by simulation. The scripts and functions for the setup and execution of the identification algorithms are somewhat more general than what is described in the references. There is e.g. support for automatic re-initiation of the algorithms using the parameters obtained at the end of a previous identification run. This allows for multiple runs through a set of data, something that is useful for data sets that are too short to allow complete convergence. The re-initiation step also allows the user to modify the degrees of the polynomial model structure and to specify terms that are to be excluded from the model. This makes it possible to iteratively refine the estimated model using multiple runs. The functionality for display of results include scripts for plotting of data, parameters, prediction errors, eigenvalues and the condition number of the Hessian. The estimated model obtained at the end of a run can be simulated and the model output plotted, alone or together with the data used for identification. Model validation is supported by two methods apart from the display functionality. First, calculation of the RPEM loss function can be performed, using parameters obtained at the end of an identification run. Secondly, the accuracy as a function of the output signal amplitude can be assessed.