Differential Consistency Analysis: Which Similarity Measures can be Applied in Drug Discovery?

Miranda‐Quintana, Ramón Alain and Bajusz, Dávid and Rácz, Anita and Héberger, Károly (2021) Differential Consistency Analysis: Which Similarity Measures can be Applied in Drug Discovery? MOLECULAR INFORMATICS, 40 (7). No.-2060017. ISSN 1868-1743

[img] Text
Restricted to Registered users only

Download (1MB) | Request a copy


Similarity measures are widely used in various areas from taxonomy to cheminformatics. To this end, a large number of similarity and distance measures (or, collectively, comparative measures) have been introduced, with only a few studies directed to revealing their inner relationships. We present a thorough analytical study of the conditions leading to two comparative measures providing equivalent results over a given set of molecules. A key part of this work is the introduction of a novel way to study the consistency between comparative measures: the differential consistency analysis (DCA). This tool reveals how the consistency can be established in an analytical way with minimal (or no) assumptions. We found that the consensus between Tanimoto and the Cosine coefficients improved by choosing a reference whose similarity to the rest of the molecules varies less, or by representing the molecules in a way that does not depend strongly on their size (i. e. bit frequency in the chosen fingerprint representation). The presented derivations are just some generic examples; DCA can be applied widely and for all binary similarity coefficients introduced so far, independently from the molecular representations.

Item Type: Article
Subjects: Q Science / természettudomány > QD Chemistry / kémia
R Medicine / orvostudomány > RS Pharmacy and materia medica / gyógyszerészet, gyógyászati eszközök
Depositing User: MTMT SWORD
Date Deposited: 21 Sep 2021 02:01
Last Modified: 21 Sep 2021 02:01

Actions (login required)

Edit Item Edit Item