Edited by Suparerk Angkawattanawit and Raazesh Sainudiin.
Peer-reviewed by project authors according to these instructions.
Introduction
A total of 22 PhD Student Groups did Projects of their choosing in Scalable Data Science and Distributed Machine Learning, a mandatory course of The WASP Graduate School AI-track in 2020-2021. See ScaDaMaLe Course Pathways to appreciate the pre-requisite modules 000_1 through 000_9 for the union of all 23 projects, including a voluntary one from Masters thesis students.
Best Group Project: The Group Project named MixUp and Generalization by by Olof Zetterqvist, Jimmy Aronsson and Fredrik Hellström of Chalmers University won the Best Group-Project Prize on the basis of peer-review. The prize was donated kindly by the Databricks University Alliance under Rob Reed.
Table of Contents
- The Two Cultures by Daniel Ahlsén, Martin Andersson, Niklas Gunnarsson and Jonathan Styrud.
- Exploring the GQA Scene Graph Dataset Structure and Properties by Adam Dahlgren, Pavlo Melnyk and Emanuel Sanchez Aimar.
- Signed Triads in Social Media by Guangyi Zhang.
- Distributed Linear Algebra by Måns Williamson and Jonatan Vallin.
- Wikipedia analysis using Latent Dirichlet Allocation (LDA) by Axel Berg, Johan Grönqvist and Jens Gulin.
- Unsupervised clustering of particle physics data with distributed training by Karl Bengtsson Bernander, Colin Desmarais, Daniel Gedon and Olga Sunneborn Gudnadottir.
- Motif Finding by Adam Lindhe, Petter Restadh and Francesca Tombari.
- Distributed Ensemble by Amanda Olmin, Amirhossein Ahmadian and Jakob Lindqvist.
- Topic Modeling with SARS-Cov-2 Genome by Hugo Werner and Gizem Çaylak.
- Twitter Streaming Using Geolocation and Emoji Based Sentiment Analysis by Georg Bökman and Rasmus Kjær Høier.
- Anomaly Detection with Iterative Quantile Estimation and T-digest by Alexander Karlsson, Alvin Jin and George Osipov.
- Analysis and Prediction of COVID-19 Data by Chi Zhang, Shuangshuang Chen and Magnus Tarle.
- Genomics Analysis with Glow and Spark by Karin Stacke and Milda Pocoviciute.
- Distributed Combinatorial Bandits by Niklas Åkerblom, Jonas Nordlöf and Emilio Jorge.
- Reinforcement Learning for Intraday Trading by Fabian Sinzinger, Karl Bäckström and Rita Laezza.
- Intrusion Detection by MohamedReza Faridghasemnia, Javad Forough, Quantao Yang and Arman Rahbar.
- Density Estimation via Voronoi Diagrams in High Dimensions by Robert Gieselmann and Vladislav Polianskii.
- Recommender System by Ines De Miranda De Matos Lourenço, Yassir Jedra and Filippo Vannella.
- Fundamental Matrix by Linn Öström, Patrik Persson, Johan Oxenstierna and Alexander Dürr.
- MixUp and Generalization by Olof Zetterqvist, Jimmy Aronsson and Fredrik Hellström.
- Graph Spectral Analysis by Ciwan Ceylan and Hanna Hultin.
- SWAP With DDP by Christos Matsoukas, Emir Konuk, Johan Fredin Haslum and Miquel Marti.
- Distributed Deep Learning by William Anzén and Christian von Koch.