In order to explore known chemical space, we used the Reaxys database, working with a set of ~25 million substances, representing all molecules in the database with molecular weight <1000. We calculated the MA of ~2.5 million of these, with molecules of high complexity and molecular weight not calculated due to computation limitations from the algorithm. The MA of the molecules was saved within a local postgresql database. Python scripts were used to extract/analyze the data and output figures. Further details can be found in the SI.
Do you have any questions about this protocol?
Post your question to gather feedback from the community. We will also invite the authors of this article to respond.