2025
von Rohrscheidt, J.C. ; Rieck, B.
In: (42nd International Conference on Machine Learning, ICML 2025, 13-19 July 2025, Vancouver). 2025. 61790-61809 ( ; 267)
The Euler Characteristic Transform (ECT) is an efficiently-computable geometrical-topological invariant that characterizes the global shape of data. In this paper, we introduce the Local Euler Characteristic Transform (ℓ-ECT), a novel extension of the ECT particularly designed to enhance expressivity and interpretability in graph representation learning. Unlike traditional graph neural networks (GNNs), which may lose critical local details through aggregation, the ℓ-ECT provides a lossless representation of local neighborhoods. This approach addresses key limitations in GNNs by preserving local structures while maintaining global interpretability. Moreover, we construct a rotation-invariant metric based on ℓ-ECTs for spatial alignment of data spaces. Our method exhibits superior performance compared to standard GNNs on a variety of benchmark node-classification tasks, while also offering theoretical guarantees that demonstrate its effectiveness.
Coupette, C. ; Wayland, J.D. ; Simons, E. ; Rieck, B.
In: (42nd International Conference on Machine Learning, ICML 2025, 13-19 July 2025, Vancouver). 2025. 11405-11434 ( ; 267)
Benchmark datasets have proved pivotal to the success of graph learning, and good benchmark datasets are crucial to guide the development of the field. Recent research has highlighted prob-lems with graph-learning datasets and benchmark-ing practices revealing, for example, that meth-ods which ignore the graph structure can outper-form graph-based approaches. Such findings raise two questions: (1) What makes a good graph-learning dataset, and (2) how can we evaluate dataset quality in graph learning? Our work ad-dresses these questions. As the classic evalua-tion setup uses datasets to evaluate models, it does not apply to dataset evaluation. Hence, we start from first principles. Observing that graph-learning datasets uniquely combine two modes graph structure and node features, we introduce RINGS, a flexible and extensible mode-perturbation framework to assess the quality of graph-learning datasets based on dataset abla-tions ie., quantifying differences between the original dataset and its perturbed representations. Within this framework, we propose two mea-sures performance separability and mode com-plementarity as evaluation tools, each assess-ing the capacity of a graph dataset to benchmark the power and efficacy of graph-learning meth-ods from a distinct angle. We demonstrate the utility of our framework for dataset evaluation via extensive experiments on graph-level tasks and derive actionable recommendations for im-proving the evaluation of graph-learning methods. Our work opens new research directions in data-centric graph learning, and it constitutes a step toward the systematic evaluation of evaluations.
Ballester, R. ; Röell, E. ; Schmid, D.B. ; Alain, M. ; Casacuberta, C. ; Escalera, S. ; Rieck, B.
In: (13th International Conference on Learning Representations Iclr 2025, 24 - 28 April 2025, Singapur). 2025. 94750-94779
The rising interest in leveraging higher-order interactions present in complex systems has led to a surge in more expressive models exploiting higher-order structures in the data, especially in topological deep learning (TDL), which designs neural networks on higher-order domains such as simplicial complexes. However, progress in this field is hindered by the scarcity of datasets for benchmarking these architectures. To address this gap, we introduce MANTRA, the first large-scale, diverse, and intrinsically higher-order dataset for benchmarking higher-order models, comprising over 43,000 and 250,000 triangulations of surfaces and three-dimensional manifolds, respectively. With MANTRA, we assess several graph- and simplicial complex-based models on three topological classification tasks. We demonstrate that while simplicial complex-based neural networks generally outperform their graph-based counterparts in capturing simple topological invariants, they also struggle, suggesting a rethink of TDL. Thus, MANTRA serves as a benchmark for assessing and advancing topological methods, paving the way towards more effective higher-order models.
Wayland, J.D. ; Funk, R.J. ; Rieck, B.
In: (Pediatric and Lifespan Data Science). 2025. 1-16 (Comm. Comp. Info. Sci. ; 2386 CCIS)
Identifying (a) systemic barriers to quality healthcare access and (b) key indicators of care efficacy in the United States remains a significant challenge. To improve our understanding of regional disparities in care delivery, we introduce a novel application of curvature, a geometrical-topological property of networks, to Physician Referral Networks. Our initial findings reveal that Forman-Ricci and Ollivier-Ricci curvature measures, which are known for their expressive power in characterizing network structure, offer promising indicators for detecting variations in healthcare efficacy while capturing a range of significant regional demographic features. We also present apparent, an open-source tool that leverages Ricci curvature and other network features to examine correlations between regional Physician Referral Networks structure, local census data, healthcare effectiveness, and patient outcomes.
Zhang, Y. ; Mezrag, L. ; Sun, X. ; Xu, C. ; MacDonald, K. ; Bhaskar, D. ; Krishnaswamy, S. ; Wolf, G. ; Rieck, B.
In: (ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, 6-11 April 2025, Hyderabad). 345 E 47th St, New York, Ny 10017 Usa: Ieee, 2025. DOI: 10.1109/ICASSP49660.2025.10888433
The rapidly growing field of single-cell transcriptomic sequencing (scRNAseq) presents challenges for data analysis due to its massive datasets. A common method in manifold learning consists in hypothesizing that datasets lie on a lower dimensional manifold. This allows to study the geometry of point clouds by extracting meaningful descriptors like curvature. In this work, we will present Adaptive Local PCA (AdaL-PCA), a data-driven method for accurately estimating various notions of intrinsic curvature on data manifolds, in particular principal curvatures for surfaces. The model relies on local PCA to estimate the tangent spaces. The evaluation of AdaL-PCA on sampled surfaces shows state-of-the-art results. Combined with a PHATE embedding, the model applied to single-cell RNA sequencing data allows us to identify key variations in the cellular differentiation.
Hetzel, L. ; Sommer, J. ; Rieck, B. ; Theis, F.J. ; Günnemann, S.
In: (The Thirteenth International Conference on Learning Representations (ICLR 2025 Spotlight)). 2025. accepted
Recent advances in machine learning for molecules exhibit great potential for facilitating drug discovery from in silico predictions. Most models for molecule generation rely on the decomposition of molecules into frequently occurring substructures (motifs), from which they generate novel compounds. While motif representations greatly aid in learning molecular distributions, such methods fail to represent substructures beyond their known motif set, posing a fundamental limitation for discovering novel compounds. To address this limitation and enhance structural expressivity, we propose to separate structure from features by abstracting motifs to scaffolds and, subsequently, allocating atom and bond types. To this end, we introduce a novel factorisation of the molecules' data distribution that considers the entire molecular context and facilitates learning adequate assignments of atoms and bonds to scaffolds. Complementary to this, we propose MAGNet, the first model to freely learn motifs. Importantly, we demonstrate that MAGNet's improved expressivity leads to molecules with more structural diversity and, at the same time, diverse atom and bond assignments.
2024
von Rohrscheidt, J.C. ; Rieck, B. ; Schmon, S.M.
Trans. Machine Learn. Res. 2024, accepted (2024)
Computational topology recently started to emerge as a novel paradigm for characterising the ‘shape’ of high-dimensional data, leading to powerful algorithms in (un)supervised representation learning. While capable of capturing prominent features at multiple scales, topological methods cannot readily be used for Bayesian inference. We develop a novel approach that bridges this gap, making it possible to perform parameter estimation in a Bayesian framework, using topology-based loss functions. Our method affords easy integration into topological machine learning algorithms. We demonstrate its efficacy for parameter estimation in different simulation settings.
Wissenschaftlicher Artikel
Scientific Article
Papamarkou, T. ; Birdal, T. ; Bronstein, M.D. ; Carlsson, G. ; Curry, J. ; Gao, Y. ; Hajij, M. ; Kwitt, R. ; Liò, P. ; Di Lorenzo, P. ; Maroulas, V. ; Miolane, N. ; Nasrin, F. ; Ramamurthy, K.N. ; Rieck, B. ; Scardapane, S. ; Schaub, M.T. ; Veličković, P. ; Wang, B. ; Wang, Y. ; Wei, G.W. ; Zamzmi, G.
In: (41st International Conference on Machine Learning, 21-27 July 2024, Vienna). 2024. 39529-39555 ( ; 235)
Topological deep learning (TDL) is a rapidly evolving field that uses topological features to understand and design deep learning models. This paper posits that TDL is the new frontier for relational learning. TDL may complement graph representation learning and geometric deep learning by incorporating topological concepts, and can thus provide a natural choice for various machine learning settings. To this end, this paper discusses open problems in TDL, ranging from practical benefits to theoretical foundations. For each problem, it outlines potential solutions and future research opportunities. At the same time, this paper serves as an invitation to the scientific community to actively participate in TDL research to unlock the potential of this emerging field.
Adamer, M.F. ; de Brouwer, E. ; O’Bray, L. ; Rieck, B.
J. Appl. Comput. Topol., DOI: 10.1007/s41468-024-00182-9 (2024)
The magnitude of a finite metric space has recently emerged as a novel invariant quantity, allowing to measure the effective size of a metric space. Despite encouraging first results demonstrating the descriptive abilities of the magnitude, such as being able to detect the boundary of a metric space, the potential use cases of magnitude remain under-explored. In this work, we investigate the properties of the magnitude on images, an important data modality in many machine learning applications. By endowing each individual image with its own metric space, we are able to define the concept of magnitude on images and analyse the individual contribution of each pixel with the magnitude vector. In particular, we theoretically show that the previously known properties of boundary detection translate to edge detection abilities in images. Furthermore, we demonstrate practical use cases of magnitude for machine learning applications and propose a novel magnitude model that consists of a computationally efficient magnitude computation and a learnable metric. By doing so, we address one computational hurdle that used to make magnitude impractical for many applications and open the way for the adoption of magnitude in machine learning research.
Wissenschaftlicher Artikel
Scientific Article
Bock, C. ; Walter, J.E. ; Rieck, B. ; Strebel, I. ; Rumora, K. ; Schaefer, I.K. ; Zellweger, M.J. ; Borgwardt, K. ; Müller, C.
Nat. Commun. 15:5034 (2024)
Functionally relevant coronary artery disease (fCAD) can result in premature death or nonfatal acute myocardial infarction. Its early detection is a fundamentally important task in medicine. Classical detection approaches suffer from limited diagnostic accuracy or expose patients to possibly harmful radiation. Here we show how machine learning (ML) can outperform cardiologists in predicting the presence of stress-induced fCAD in terms of area under the receiver operating characteristic (AUROC: 0.71 vs. 0.64, p = 4.0E-13). We present two ML approaches, the first using eight static clinical variables, whereas the second leverages electrocardiogram signals from exercise stress testing. At a target post-test probability for fCAD of <15%, ML facilitates a potential reduction of imaging procedures by 15-17% compared to the cardiologist's judgement. Predictive performance is validated on an internal temporal data split as well as externally. We also show that combining clinical judgement with conventional ML and deep learning using logistic regression results in a mean AUROC of 0.74.
Wissenschaftlicher Artikel
Scientific Article
Levenson, R.M. ; Singh, Y. ; Rieck, B. ; Hathaway, Q.A. ; Farrelly, C. ; Rozenblit, J. ; Prasanna, P. ; Erickson, B. ; Choudhary, A. ; Carlsson, G. ; Deepa, D.
Lab. Invest. 104:102060 (2024)
Precision medicine aims to provide personalized care based on individual patient characteristics, rather than guideline-directed therapies for groups of diseases or patient demographics. Images-both radiology- and pathology-derived-are a major source of information on presence, type, and status of disease. Exploring the mathematical relationship of pixels in medical imaging ("radiomics") and cellular-scale structures in digital pathology slides ("pathomics") offers powerful tools for extracting both qualitative, and increasingly, quantitative data. These analytical approaches, however, may be significantly enhanced by applying additional methods arising from fields of mathematics such as differential geometry and algebraic topology that remain underexplored in this context. Geometry's strength lies in its ability to provide precise local measurements, such as curvature, that can be crucial for identifying abnormalities at multiple spatial levels. These measurements can augment the quantitative features extracted in conventional radiomics, leading to more nuanced diagnostics. By contrast, topology serves as a robust shape descriptor, capturing essential features such as connected components and holes. The field of topological data analysis was initially founded to explore the shape of data, with functional network connectivity in the brain being a prominent example. Increasingly, its tools are now being used to explore organizational patterns of physical structures in medical images and digitized pathology slides. By leveraging tools from both differential geometry and algebraic topology, researchers and clinicians may be able obtain a more comprehensive, multi-layered understanding of medical images and contribute to precision medicine's armamentarium.
Review
Review
Coupette, C. ; Vreeken, J. ; Rieck, B.
Digit. Scholarsh. Humanit. 39, 74-96 (2024)
We introduce HYPERBARD, a dataset of diverse relational data representations derived from Shakespeare's plays. Our representations range from simple graphs capturing character co-occurrence in single scenes to hypergraphs encoding complex communication settings and character contributions as hyperedges with edge-specific node weights. By making multiple intuitive representations readily available for experimentation, we facilitate rigorous representation robustness checks in graph learning, graph mining, and network analysis, highlighting the advantages and drawbacks of specific representations. Leveraging the data released in HYPERBARD, we demonstrate that many solutions to popular graph mining problems are highly dependent on the representation choice, thus calling current graph curation practices into question. As an homage to our data source, and asserting that science can also be art, we present our points in the form of a play.
Wissenschaftlicher Artikel
Scientific Article
2023
Coupette, C. ; Dalleiger, S. ; Rieck, B.
In: (11th International Conference on Learning Representations, ICLR 2023, 1-5 May 2023, Kigali). 2023.
Southern, J. ; Wayland, J.D. ; Bronstein, M.D. ; Rieck, B.
In: (37th Conference on Neural Information Processing Systems (NeurIPS), 10-16 December 2023, New Orleans, LA). 10010 North Torrey Pines Rd, La Jolla, California 92037 Usa: Neural Information Processing Systems (nips), 2023. 26
Graph generative model evaluation necessitates understanding differences between graphs on the distributional level. This entails being able to harness salient attributes of graphs in an efficient manner. Curvature constitutes one such property that has recently proved its utility in characterising graphs. Its expressive properties, stability, and practical utility in model evaluation remain largely unexplored, however. We combine graph curvature descriptors with emerging methods from topological data analysis to obtain robust, expressive descriptors for evaluating graph generative models.
Morris, C. ; Lipman, Y. ; Maron, H. ; Rieck, B. ; Kriege, N.M. ; Grohe, M. ; Fey, M. ; Borgwardt, K.
J. Mach. Learn. Res. 24:333 (2023)
In recent years, algorithms and neural architectures based on the Weisfeiler-Leman algorithm, a well-known heuristic for the graph isomorphism problem, have emerged as a powerful tool for machine learning with graphs and relational data. Here, we give a comprehensive overview of the algorithm's use in a machine-learning setting, focusing on the supervised regime. We discuss the theoretical background, show how to use it for supervised graph and node representation learning, discuss recent extensions, and outline the algorithm's connection to (permutation-)equivariant neural architectures. Moreover, we give an overview of current applications and future directions to stimulate further research.
Review
Review
Huguet, G. ; Tong, A.Z. ; Rieck, B. ; Huang, J. ; Kuchroo, M. ; Hirn, M. ; Wolf, G. ; Krishnaswamy, S.
SIAM J. Math. Data Sci. 5, 346-372 (2023)
Diffusion condensation is a dynamic process that yields a sequence of multiscale data representations that aim to encode meaningful abstractions. It has proven effective for manifold learning, denoising, clustering, and visualization of high-dimensional data. Diffusion condensation is constructed as a time-inhomogeneous process where each step first computes a diffusion operator and then applies it to the data. We theoretically analyze the convergence and evolution of this process from geometric, spectral, and topological perspectives. From a geometric perspective, we obtain convergence bounds based on the smallest transition probability and the radius of the data, whereas from a spectral perspective, our bounds are based on the eigenspectrum of the diffusion kernel. Our spectral results are of particular interest since most of the literature on data diffusion is focused on homogeneous processes. From a topological perspective, we show that diffusion condensation generalizes centroid-based hierarchical clustering. We use this perspective to obtain a bound based on the number of data points, independent of their location. To understand the evolution of the data geometry beyond convergence, we use topological data analysis. We show that the condensation process itself defines an intrinsic condensation homology. We use this intrinsic topology, as well as the ambient persistent homology, of the condensation process to study how the data changes over diffusion time. We demonstrate both types of topological information in well-understood toy examples. Our work gives theoretical insight into the convergence of diffusion condensation and shows that it provides a link between topological and geometric data analysis.
Wissenschaftlicher Artikel
Scientific Article
Doster, T. ; Emerson, T. ; Kvinge, H. ; Miolane, N. ; Papillon, M. ; Rieck, B. ; Sanborn, S.
In: (Proceedings of Machine Learning Research). 2023. 1-2 ( ; 221)
Andreeva, R. ; Limbeck, K. ; Rieck, B. ; Sarkar, R.
In: (Proceedings of Machine Learning Research). 2023. 242-253 ( ; 221)
Deep learning models have seen significant successes in numerous applications, but their inner workings remain elusive. The purpose of this work is to quantify the learning process of deep neural networks through the lens of a novel topological invariant called magnitude. Magnitude is an isometry invariant; its properties are an active area of research as it encodes many known invariants of a metric space. We use magnitude to study the internal representations of neural networks and propose a new method for determining their generalisation capabilities. Moreover, we theoretically connect magnitude dimension and the generalisation error, and demonstrate experimentally that the proposed framework can be a good indicator of the latter.
Giunti, B. ; Rieck, B.
Not. Amer. Math. Soc. 70, 1640-1644 (2023)
Wissenschaftlicher Artikel
Scientific Article
Alhoori, H. ; Fox, E.A. ; Frommholz, I. ; Liu, H. ; Coupette, C. ; Rieck, B. ; Ghosal, T. ; Wu, J.
In: (Proceedings of the ACM/IEEE Joint Conference on Digital Libraries, 26-30 June 2023, Santa Fe, USA). 2023. 319-320 ( ; 2023-June)
With millions of research articles published yearly, the peer review process is in danger of collapsing, especially in 'hot' areas with popular conferences. Challenges arise from the large number of manuscripts submitted, skyrocketing use of preprint archives and institutional repositories, problems regarding the identification and availability of experts, conflicts of interest, and bias in reviewing. Such issues can affect the integrity of the reviewing process as well as the timeliness, quality, credibility, and reproducibility of research articles. Several solutions and systems have been suggested, but none work well, and neither authors nor editors are happy with how long it takes to complete reviewing the submitted research. This panel addresses these challenges and potential solutions, including digital libraries that recommend reviewers, as well as broader issues like opportunities for identifying peer reviewers for scholarly journals by engaging doctoral students and postdocs, as well as those who recently completed their Ph.D.
von Rohrscheidt, J.C. ; Rieck, B.
In: (Proceedings of Machine Learning Research). 2023. 35175-35197 ( ; 202)
The manifold hypothesis, which assumes that data lies on or close to an unknown manifold of low intrinsic dimension, is a staple of modern machine learning research. However, recent work has shown that real-world data exhibits distinct non-manifold structures, i.e. singularities, that can lead to erroneous findings. Detecting such singularities is therefore crucial as a precursor to interpolation and inference tasks. We address this issue by developing a topological framework that (i) quantifies the local intrinsic dimension, and (ii) yields a Euclidicity score for assessing the 'manifoldness' of a point along multiple scales. Our approach identifies singularities of complex spaces, while also capturing singular structures and local geometric complexity in image data.
Waibel, D.J.E. ; Röell, E. ; Rieck, B. ; Giryes, R. ; Marr, C.
In: (Proceedings - International Symposium on Biomedical Imaging, 18-21 April 2023, Cartagena, Colombia). 345 E 47th St, New York, Ny 10017 Usa: Ieee, 2023. 5 ( ; 2023-April)
Diffusion models are a special type of generative model, capable of synthesising new data from a learnt distribution. We introduce DISPR, a diffusion-based model for solving the inverse problem of three-dimensional (3D) cell shape prediction from two-dimensional (2D) single cell microscopy images. Using the 2D microscopy image as a prior, DISPR is conditioned to predict realistic 3D shape reconstructions. To showcase the applicability of DISPR as a data augmentation tool in a feature-based single cell classification task, we extract morphological features from the red blood cells grouped into six highly imbalanced classes. Adding features from the DISPR predictions to the three minority classes improved the macro F1 score from F1macro = 55.2 ± 4.6% to F1macro = 72.2 ± 4.9%. We thus demonstrate that diffusion models can be successfully applied to inverse biomedical problems, and that they learn to reconstruct 3D shapes with realistic morphological features from 2D microscopy images.
Nadimpalli, K.V. ; Chattopadhyay, A. ; Rieck, B.
In: (IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops). 2023. 571-579 ( ; 2023-June)
The computer vision task of reconstructing 3D images, i.e., shapes, from their single 2D image slices is extremely challenging, more so in the regime of limited data. Deep learning models typically optimize geometric loss functions, which may lead to poor reconstructions as they ignore the structural properties of the shape. To tackle this, we propose a novel topological loss function based on the Euler Characteristic Transform. This loss can be used as an inductive bias to aid the optimization of any neural network toward better reconstructions in the regime of limited data. We show the effectiveness of the proposed loss function by incorporating it into SHAPR, a state-of-the-art shape reconstruction model, and test it on two benchmark datasets, viz., Red Blood Cells and Nuclei datasets. We also show a favourable property, namely injectivity and discuss the stability of the topological loss function based on the Euler Characteristic Transform.
Moore, J.L. ; Bhaskar, D. ; Gao, F. ; Matte-Martone, C. ; Du, S. ; Lathrop, E. ; Ganesan, S. ; Shao, L. ; Norris, R. ; Campamà Sanz, N. ; Annusver, K. ; Kasper, M. ; Cox, A. ; Hendry, C. ; Rieck, B. ; Krishnaswamy, S. ; Greco, V.
J. Cell Biol. 222:30 (2023)
Skin homeostasis is maintained by stem cells, which must communicate to balance their regenerative behaviors. Yet, how adult stem cells signal across regenerative tissue remains unknown due to challenges in studying signaling dynamics in live mice. We combined live imaging in the mouse basal stem cell layer with machine learning tools to analyze patterns of Ca2+ signaling. We show that basal cells display dynamic intercellular Ca2+ signaling among local neighborhoods. We find that these Ca2+ signals are coordinated across thousands of cells and that this coordination is an emergent property of the stem cell layer. We demonstrate that G2 cells are required to initiate normal levels of Ca2+ signaling, while connexin43 connects basal cells to orchestrate tissue-wide coordination of Ca2+ signaling. Lastly, we find that Ca2+ signaling drives cell cycle progression, revealing a communication feedback loop. This work provides resolution into how stem cells at different cell cycle stages coordinate tissue-wide signaling during epidermal regeneration.
Wissenschaftlicher Artikel
Scientific Article
Yoneoka, D. ; Rieck, B.
Entropy 25:691 (2023)
We study selection bias in meta-analyses by assuming the presence of researchers (meta-analysts) who intentionally or unintentionally cherry-pick a subset of studies by defining arbitrary inclusion and/or exclusion criteria that will lead to their desired results. When the number of studies is sufficiently large, we theoretically show that a meta-analysts might falsely obtain (non)significant overall treatment effects, regardless of the actual effectiveness of a treatment. We analyze all theoretical findings based on extensive simulation experiments and practical clinical examples. Numerical evaluations demonstrate that the standard method for meta-analyses has the potential to be cherry-picked.
Wissenschaftlicher Artikel
Scientific Article
2022
Liu, R. ; Cantürk, S. ; Wenkel, F. ; McGuire, S. ; Wang, X. ; Little, A. ; O'Bray, L. ; Perlmutter, M. ; Rieck, B. ; Hirn, M. ; Wolf, G. ; Rampášek, L.
In: (Proceedings of Machine Learning Research). 2022. accepted ( ; 198)
Graph Neural Networks (GNNs) extend the success of neural networks to graph-structured data by accounting for their intrinsic geometry. While extensive research has been done on developing GNN models with superior performance according to a collection of graph representation learning benchmarks, it is currently not well understood what aspects of a given model are probed by them. For example, to what extent do they test the ability of a model to leverage graph structure vs. node features? Here, we develop a principled approach to taxonomize benchmarking datasets according to a sensitivity profile that is based on how much GNN performance changes due to a collection of graph perturbations. Our data-driven analysis provides a deeper understanding of which benchmarking data characteristics are leveraged by GNNs. Consequently, our taxonomy can aid in selection and development of adequate graph benchmarks, and better informed evaluation of future GNN methods. Finally, our approach and implementation in GTaxoGym package1 are extendable to multiple graph prediction task types and future datasets.
Hacker, C. ; Rieck, B.
In: (Proceedings of Machine Learning Research). 2022. 142-151 ( ; 196)
Graph embedding techniques are a staple of modern graph learning research. When using embeddings for downstream tasks such as classification, information about their stability and robustness, i.e., their susceptibility to sources of noise, stochastic effects, or specific parameter choices, becomes increasingly important. As one of the most prominent graph embedding schemes, we focus on node2vec and analyse its embedding quality from multiple perspectives. Our findings indicate that embedding quality is unstable with respect to parameter choices, and we propose strategies to remedy this in practice.
Cloninger, A. ; Doster, T. ; Emerson, T. ; Kaul, M.G. ; Ktena, I. ; Kvinge, H. ; Miolane, N. ; Rieck, B. ; Tymochko, S. ; Wolf, G.
In: (Proceedings of Machine Learning Research). 2022. 1-5 ( ; 196)
Gräf, F. ; Zeng, S. ; Rieck, B. ; Niethammer, M. ; Kwitt, R.
In: (Advances in Neural Information Processing Systems). 2022. ( ; 35)
We study the excess capacity of deep networks in the context of supervised classification. That is, given a capacity measure of the underlying hypothesis class - in our case, empirical Rademacher complexity - to what extent can we (a priori) constrain this class while retaining an empirical error on a par with the unconstrained regime? To assess excess capacity in modern architectures (such as residual networks), we extend and unify prior Rademacher complexity bounds to accommodate function composition and addition, as well as the structure of convolutions. The capacity-driving terms in our bounds are the Lipschitz constants of the layers and an (2, 1) group norm distance to the initializations of the convolution weights. Experiments on benchmark datasets of varying task difficulty indicate that (1) there is a substantial amount of excess capacity per task, and (2) capacity can be kept at a surprisingly similar level across tasks. Overall, this suggests a notion of compressibility with respect to weight norms, complementary to classic compression via weight pruning. Source code is available at https://github.com/rkwitt/excess_capacity.
Bhaskar, D. ; MacDonald, K. ; Fasina, O. ; Thomas, D. ; Rieck, B. ; Adelstein, I. ; Krishnaswamy, S.
In: (Advances in Neural Information Processing Systems). 2022. ( ; 35)
We introduce a new intrinsic measure of local curvature on point-cloud data called diffusion curvature. Our measure uses the framework of diffusion maps, including the data diffusion operator, to structure point cloud data and define local curvature based on the laziness of a random walk starting at a point or region of the data. We show that this laziness directly relates to volume comparison results from Riemannian geometry. We then extend this scalar curvature notion to an entire quadratic form using neural network estimations based on the diffusion map of point-cloud data. We show applications of both estimations on toy data, single-cell data and on estimating local Hessian matrices of neural network loss landscapes.
Waibel, D.J.E. ; Atwell, S. ; Meier, M. ; Marr, C. ; Rieck, B.
Lect. Notes Comput. Sc. 13434 LNCS, 150-159 (2022)
Reconstructing 3D objects from 2D images is both challenging for our brains and machine learning algorithms. To support this spatial reasoning task, contextual information about the overall shape of an object is critical. However, such information is not captured by established loss terms (e.g. Dice loss). We propose to complement geometrical shape information by including multi-scale topological features, such as connected components, cycles, and voids, in the reconstruction loss. Our method uses cubical complexes to calculate topological features of 3D volume data and employs an optimal transport distance to guide the reconstruction process. This topology-aware loss is fully differentiable, computationally efficient, and can be added to any neural network. We demonstrate the utility of our loss by incorporating it into SHAPR, a model for predicting the 3D cell shape of individual cells based on 2D microscopy images. Using a hybrid loss that leverages both geometrical and topological information of single objects to assess their shape, we find that topological information substantially improves the quality of reconstructions, thus highlighting its ability to extract more relevant features from image datasets.
Wissenschaftlicher Artikel
Scientific Article
Horoi, S. ; Huang, J. ; Rieck, B. ; Lajoie, G. ; Wolf, G. ; Krishnaswamy, S.
Lect. Notes Comput. Sc. 13205 LNCS, 171-184 (2022)
Recent work has established clear links between the generalization performance of trained neural networks and the geometry of their loss landscape near the local minima to which they converge. This suggests that qualitative and quantitative examination of the loss landscape geometry could yield insights about neural network generalization performance during training. To this end, researchers have proposed visualizing the loss landscape through the use of simple dimensionality reduction techniques. However, such visualization methods have been limited by their linear nature and only capture features in one or two dimensions, thus restricting sampling of the loss landscape to lines or planes. Here, we expand and improve upon these in three ways. First, we present a novel “jump and retrain” procedure for sampling relevant portions of the loss landscape. We show that the resulting sampled data holds more meaningful information about the network’s ability to generalize. Next, we show that non-linear dimensionality reduction of the jump and retrain trajectories via PHATE, a trajectory and manifold-preserving method, allows us to visualize differences between networks that are generalizing well vs poorly. Finally, we combine PHATE trajectories with a computational homology characterization to quantify trajectory differences.
Wissenschaftlicher Artikel
Scientific Article
Horn, M. ; de Brouwer, E. ; Moor, M. ; Rieck, B. ; Borgwardt, K.
In: (International Conference on Learning Representations, 25–29 April 2022, Virtual). 2022.
Graph neural networks (GNNs) are a powerful architecture for tackling graph learning tasks, yet have been shown to be oblivious to eminent substructures such as cycles. We present TOGL, a novel layer that incorporates global topological information of a graph using persistent homology. TOGL can be easily integrated into any type of GNN and is strictly more expressive (in terms the Weisfeiler–Lehman graph isomorphism test) than message-passing GNNs. Augmenting GNNs with TOGL leads to improved predictive performance for graph and node classification tasks, both on synthetic data sets, which can be classified by humans using their topology but not by ordinary GNNs, and on real-world data.
O'Bray, L. ; Horn, M. ; Rieck, B. ; Borgwardt, K.
In: (International Conference on Learning Representations, 25–29 April 2022, Virtual). 2022.
Graph generative models are a highly active branch of machine learning. Given the steady development of new models of ever-increasing complexity, it is necessary to provide a principled way to evaluate and compare them. In this paper, we enumerate the desirable criteria for such a comparison metric and provide an overview of the status quo of graph generative model comparison in use today,which predominantly relies on the maximum mean discrepancy (MMD). We perform a systematic evaluation of MMD in the context of graph generative model comparison, highlighting some of the challenges and pitfalls researchers inadver-tently may encounter. After conducting a thorough analysis of the behaviour of MMD on synthetically-generated perturbed graphs as well as on recently-proposed graph generative models, we are able to provide a suitable procedure to mitigate these challenges and pitfalls. We aggregate our findings into a list of practical recommendations for researchers to use when evaluating graph generative models.