Research
I'm a member of the UART group, specialized in computer architecture. I'm currently working in the analysis of resource contention in runtime systems, within the Resource Sharing Modeling project.
Publications
-
Sampled Simulation of Task-Based Programs
. In IEEE Transactions on Computers, volume 68, number 2, pp 255-269, IEEE COMPUTER SOC, 2019. (DOI
).
-
Understanding Task Parallelism: Providing insight into scheduling, memory, and performance for CPUs and Graphics
. Ph.D. thesis, Digital Comprehensive Summaries of Uppsala Dissertations from the Faculty of Science and Technology nr 1737, Acta Universitatis Upsaliensis, Uppsala, 2018. (fulltext
, preview image
).
-
Tail-PASS: Resource-based Cache Management for Tiled Graphics Rendering Hardware
. In Proc. 16th International Conference on Parallel and Distributed Processing with Applications, pp 55-63, IEEE, 2018. (DOI
).
-
Behind the Scenes: Memory Analysis of Graphical Workloads on Tile-based GPUs
. In Proc. International Symposium on Performance Analysis of Systems and Software: ISPASS 2018, pp 1-11, IEEE Computer Society, 2018. (DOI
, fulltext:preprint
).
-
Analyzing performance variation of task schedulers with TaskInsight
. In Parallel Computing, volume 75, pp 11-27, 2018. (DOI
).
-
Analyzing Graphics Workloads on Tile-based GPUs
. In Proc. 20th International Symposium on Workload Characterization, pp 108-109, IEEE, 2017. (DOI
).
-
Exploring scheduling effects on task performance with TaskInsight
. In Supercomputing frontiers and innovations, volume 4, number 3, pp 91-98, 2017. (DOI
, Fulltext
).
-
Understanding the interplay between task scheduling, memory and performance
. In Proc. Companion 8th ACM International Conference on Systems, Programming, Languages, and Applications: Software for Humanity, pp 21-23, ACM Press, New York, 2017. (DOI
).
-
How to make tasks faster: Revealing the complex interactions of tasks in the memory system
. In Proc. Companion 8th ACM International Conference on Systems, Programming, Languages, and Applications: Software for Humanity, pp 1-3, ACM Press, New York, 2017. (DOI
).
-
Modeling the interactions between tasks and the memory system
. Licentiate thesis, IT licentiate theses / Uppsala University, Department of Information Technology nr 2017-002, Uppsala University, 2017. (fulltext
).
-
TaskInsight: Understanding task schedules effects on memory and performance
. In Proc. 8th International Workshop on Programming Models and Applications for Multicores and Manycores, pp 11-20, ACM Press, New York, 2017. (DOI
, Fulltext
).
-
Characterizing Task Scheduling Performance Based on Data Reuse
. In Proc. 9th Nordic Workshop on Multi-Core Computing, 2016. (fulltext:print
).
-
Formalizing data locality in task parallel applications
. In Algorithms and Architectures for Parallel Processing, volume 10049 of Lecture Notes in Computer Science, pp 43-61, Springer, 2016. (DOI
).
-
Spatial and Temporal Cache Sharing Analysis in Tasks
. In , Timisoara, Romania, 2016. (Proceedings
, fulltext:print
).
-
StatTask: Reuse distance analysis for task-based applications
. In Proc. 7th Workshop on Rapid Simulation and Performance Evaluation: Methods and Tools, pp 1-7, ACM Press, New York, 2015. (DOI
).
-
Shared Resource Sensitivity in Task-Based Runtime Systems
. In Proc. 6th Swedish Workshop on Multi-Core Computing, Halmstad University Press, 2013. (fulltext:postprint
).
- Adaptive Mesh Refinement for Large Scale Physical Simulations
. Germán Ceballos, Oscar Reula
and Carlos Bederián. In Proceedings of EST 2013, 42nd JAIIO. FaMAF, Universidad Nacional de Córdoba, Córdoba, Argentina. Sep 2013. (Conference Website
, fulltext
)
- MSc. Thesis:
- A GPU-powered Parallel Adaptive Mesh Refinement Framework for Large Scale Physical Simulations
(fulltext sp
|en (soon)). Germán Ceballos, Oscar Reula
and Carlos Bederián. FaMAF, Universidad Nacional de Córdoba, Córdoba, Argentina. March 2013.
- A GPU-powered Parallel Adaptive Mesh Refinement Framework for Large Scale Physical Simulations
Teaching
Semester | Course code | Course | What? |
---|---|---|---|
Autumn 2015 | 1DT032 | Advanced Computer Science Studies in Sweden | Lab Instructor |
Autumn 2015 | 1DT086 | Introduction to Studies in Embedded Systems | Lab Instructor |
Spring 2015 | 1DT024 | Advanced Computer Architecture | Teaching Assistant, Labs |
Autumn 2014 | 1DT016 | Computer Architecture I (Distance) | Instructor |
Autumn 2014 | 1DT038 | Computer Architecture I | Teaching Assistant, Labs |
Autumn 2014 | 1DL560 | Parallel Programming for Efficiency![]() |
Teaching Assistant, Labs |
Spring 2014 | 1DT089 | Operating Systems and Multicore Programming | Teaching Assistant, Labs |
Autumn 2013 | 1DT016 | Computer Architecture I (Distance) | Teaching Assistant, Labs |
Autumn 2013 | 1DT038 | Computer Architecture I | Teaching Assistant, Labs |