Kereső
Bejelentkezés
Kapcsolat
Using Dimension Reduction Methods on the Latent Space of Molecules |
| Tartalom: | http://hdl.handle.net/10890/16870 |
|---|---|
| Archívum: | Műegyetem Digitális Archívum |
| Gyűjtemény: |
1. Tudományos közlemények, publikációk
Konferenciák gyűjteményei BME MIT PhD Minisymposium BME MIT PhD Minisymposium, 2022, 29th |
| Cím: |
Using Dimension Reduction Methods on the Latent Space of Molecules
|
| Létrehozó: |
Józsa, György
Sárközy, Péter
|
| Dátum: |
2022-03-09T10:07:57Z
2022-03-09T10:07:57Z
2022
|
| Tartalmi leírás: |
De novo molecule design is the process of generating novel chemicals based on a dataset of drug-like molecules. This method has gained popularity in recent decades. Developing drug-like molecules is both costly and time-consuming. To speed the process up, machine learning and deep neural networks have been used in the last three decades. A particularly popular method is using a variational autoencoder to generate a latent space of drug-like molecules suitable for targeted searching. Quantifying the quality of such a latent space is vital for effective usage. This task is not trivial however, as the chemical structure of molecules cannot be easily quantized and such latent spaces tend to be high-dimensional, leading to the need for dimension reducing visualization algorithms to be applied. Many dimension reduction and visualization algorithms have been developed in recent decades. In this paper, we evaluate five recent algorithms – PCA, t-SNE, UMAP, TriMAP and PaCMAP – to see how well they perform on a given dataset. We examine each algorithm on its ability to transform a 64-dimensional latent space such that the resulting two-dimensional space is smooth over chemical structure. We optimize the hyperparameters of each algorithm to see how they transform the resulting embedding and perform a linear interpolation test to see how they map the latent space into two dimensions. We examine the invertibility and extensibility of each algorithm, as this can make targeted searching much easier to execute.
|
| Nyelv: |
angol
|
| Típus: |
könyvfejezet
|
| Formátum: |
application/pdf
|
| Azonosító: |