ASPCS
 
Back to Volume
Paper: “You Might Also Like These Images”: Unsupervised Affine-Transformation-Independent Representation Learning for the ALMA Science Archive
Volume: 541, ADASS XXXIII
Page: 173
Authors: Felix Stoehr
DOI: 10.26624/NVGP3776
Abstract: With the exponential growth of the amount of astronomical data with time, finding the needles in the haystack is getting increasingly difficult. The next frontier for science archives is to also allow searches on the content of the observations themselves. As a step into this direction, we have implemented a prototype of a recommender system for the ALMA Science Archive (ASA). We use self-supervised affine-transformation-independent representation learning of source morphologies for the similarity estimation through contrastive learning with a deep neural network. Once the neural network is trained, the feature vectors for all images - both for continuum images and for peak-flux images of data cubes - are evaluated. In a next step, we compute the similarity matrix holding for each image the corresponding 1000 most similar images, ordered by their pairwise similarity. A kd-tree is used to speed up that computation from O(N2) to O(Nlog(N)). Our prototype interface then shows the most-similar images of which the archival researcher can select the most interesting ones. When they do select an image on the interface, we instantaneously re-compute the combined similarity of all the selected images and reorder the displayed remaining images accordingly. Each selection thus further refines the similarity display. Finally, we use k-means clustering on the feature vectors to provide selectable `source morphology categories’ for a quick-select option. We conclude from the prototype that an image similarity interface can be a valuable asset to science archives.
Back to Volume