Proteins and Wave Functions

Fixed. Thanks Graham.

2023-01-12T15:44:21.235+01:00

Fixed. Thanks Graham.

Hi Jan, just a note to say that the APC for PeerJ ...

2023-01-11T13:26:36.523+01:00

Hi Jan, just a note to say that the APC for PeerJ Chemistry journals are $1195

Cheers,

Graham

(Graham Smeddle, PeerJ)

2023-01-11T13:25:19.793+01:00

This comment has been removed by the author.

Newly started journal with a fee waiver until the ...

2022-01-13T12:52:21.274+01:00

Newly started journal with a fee waiver until the end of this year:

https://www.journals.elsevier.com/artificial-intelligence-in-the-life-sciences

Fixed

2022-01-05T21:11:29.022+01:00

Fixed

Hi Jan, the link for Chemical Science does not wo...

2022-01-05T10:13:50.890+01:00

Hi Jan,

the link for Chemical Science does not work. Could you fix that?

My unofficial criterion is <$2000

2022-01-04T12:41:25.534+01:00

My unofficial criterion is <$2000

JCheminf? https://jcheminf.biomedcentral.com/

2022-01-04T12:11:29.442+01:00

JCheminf? https://jcheminf.biomedcentral.com/

This is not surprising. AI models are simply high...

2021-03-28T17:07:17.597+02:00

This is not surprising. AI models are simply highly nonlinear (in the parameters) regression models, nothing more and nothing less (except for their sexy name). I see many papers in the literature that fit empirical (thermodynamic and other) models to data and they call it "prediction". This is not prediction, but "correlation". The only possible "predictions" are essentially "interpolations" within the ranges of experimental conditions under which the training set data was collected. Only if you use a model that incorporates basic scientific principles can you make "predictions". An example of a "prediction methodology" is "Accurately Predicting CO$_2$ Reactive Absorption Properties in Aqueous Alkanolamine Solutions by Molecular Simulation Requiring No Solvent Experimental Data", Ind. Eng. Chem. Res., 59, 18254--18268 (2020).

Dear Prof. Jensen, I have some problems that invo...

2020-12-14T13:54:03.493+01:00

Dear Prof. Jensen,

I have some problems that involve atom reordering as well: I partly solved these in a program: https://github.com/conradhuebler/curcuma

And I would be happy, if there were interested researchers with similar problems, that might have a look at my solution.

Thanks and best regards,
Conrad

Hi Jan, I appreciate your candour - to help move ...

2020-02-19T13:16:41.001+01:00

Hi Jan,

I appreciate your candour - to help move things forward here's a unittest file, these are what I think GEDs should be. My view is a unit of GED change should 1 per addition/removal of each atom and 1 per addition or deletion of a bond.

Ed

import unittest
from GED import *

class MyTestCase(unittest.TestCase):

def run_tests(self, d):
for n, p in d.items():
with self.subTest(p=p):
GED = calc_GED(p[0], p[1])
self.assertEqual(GED,p[2], msg='GED not correct for ' + n)

def test_aromatic_aromatic(self):
mol_dict = {
'benzene->pyridine': ('c1ccccc1', 'n1ccccc1',1),
'pyridine->pyridazine': ('n1ccccc1', 'n1ncccc1',1),
'benzene->pyridazine': ('c1ccccc1', 'n1ncccc1', 2)
}
self.run_tests(mol_dict)

def test_aromatic_substituents(self):
mol_dict = {
'benzene->toluene': ('c1ccccc1', 'c1ccccc1C',2),
'benzene->chlorobenzene': ('c1ccccc1', 'c1ccccc1Cl', 2),
'benzene->anisole':('c1ccccc1', 'c1ccccc1OC', 3)
}
self.run_tests(mol_dict)

def test_aliphatic_aromatic(self):
mol_dict = {
'benzene->cyclohexane': ('c1ccccc1', 'C1CCCCC1',6)
}
self.run_tests(mol_dict)

def test_aliphatic_ring_expansion(self):
mol_dict = {
'cyclopropane->cyclobutane': ('C1CC1', 'C1CCC1',4)
}
self.run_tests(mol_dict)

def test_aliphatic_aliphatic(self):
mol_dict = {
'ethane->propane': ('CC', 'CCC',2)
}
self.run_tests(mol_dict)

if __name__ == '__main__':
unittest.main()

The implementation makes two arbitrary choices: 1...

2020-02-16T12:51:43.117+01:00

The implementation makes two arbitrary choices:

1. The structures are "kekulized" meaning that benzene is C1=CC=CC=C1 rather, meaning that 3 double bonds need to be changed to single bonds. I do this so that the distance between hexatriene (C=CC=CC=C) and benzene is 1. But whether that's a good idea is arguable.

2. The Hs are implicit, so when a double bond is changed to a single bond, then the number of Hs on the Cs adjust, and don't contribute to the edit distance.

I should also note that there is a bug in the implementation. The way I compare atoms (nodes) is by defining a "bond to itself". This means that additions of atoms is double counted (both as an edge and a node". The the code predicts the GED between C and CC to be 3, when in fact it is 2.

Really nice, however, can you (or perhaps Noel and...

2020-02-10T14:52:34.077+01:00

Really nice, however, can you (or perhaps Noel and Roger) explain how the distance from benzene to cyclohexane is 3? I might have expected 6, or 12, but 3 is a mystery - my ignorance I suspect.

Here's my snippet of code:

def calc_GED(s1,n, s2, m):
mol1 = Chem.MolFromSmiles(s1)
mol2 = Chem.MolFromSmiles(s2)
G1 = get_graph(mol1)
G2 = get_graph(mol2)
GDE = nx.graph_edit_distance(G1, G2, edge_match=lambda a, b: a['weight'] == b['weight'])
print(n, m, GDE)

mol_dict = {
'benzene':'c1ccccc1',
'pyridine':'n1ccccc1',
'pyridazine':'n1ncccc1',
'cyclohexane':'C1CCCCC1'
}

for n,s1 in mol_dict.items():
for m,s2 in mol_dict.items():
if n != m:
calc_GED(s1,n, s2, m)

Mmm. I love keeping my journal's policy on arX...

2013-01-05T11:59:16.550+01:00

Mmm. I love keeping my journal's policy on arXiv deposition ambiguous. Offers me an additional way to reject the really lousy papers without review...

And, at the risk of sounding snarky, the percentage of papers deposited openly as a function of a given field (e.g., math > physics > chemistry) is inversely proportional to that field's likelihood of producing anything having any useful/economic consequence.

Very cool video!

2013-01-02T14:20:30.452+01:00

Very cool video!

Did you get the right answer though?

2012-12-28T11:12:02.081+01:00

Did you get the right answer though?

Thank you!

2012-12-28T11:11:36.828+01:00

Thank you!

oh one is a carbon and the other is the nitrogen (...

2012-12-28T07:33:11.029+01:00

oh one is a carbon and the other is the nitrogen (amine/methyl). took me a while to figure out what's the difference between A & B molecules

Very nice example!

2012-12-27T14:58:26.610+01:00

Very nice example!

Many people who publish in JCP consider themselves...

2012-12-27T11:05:51.723+01:00

Many people who publish in JCP consider themselves chemists and most chemists have never heard about arXiv. Most of the ones who have think arXiv is exclusively for physics. It would really help if arXiv made a chemistry section. I wrote to them about this, but no reply.

The other, related, issue is whether chemists feel an arXiv submission establishes priority since they live in perpetual fear of being scooped.

The final issue is that most chemistry journals appear to be arXiv-hostile, though some are not when you actually write to them and ask.

By the way, http://www.councilscienceeditors.org/f...

2012-12-26T15:25:30.026+01:00

By the way, http://www.councilscienceeditors.org/files/presentations/2009/Ingoldsby.pdf has some research on what percentage of physics articles are on arXiv. Not surprisingly, J. Chem. Phys. is doing quite bad at around 7%, whereas e.g. Phys. Rev. D has a submission ratio of 97%. It seems there are also some great differences between the different fields in physics.

Money quote: "Of course, all Physical Review Letters papers are contained in the arXiv" (though looking at the numbers, only 55% of all Phys. Rev. Lett. papers are on arXiv)

It is true that if you use Avnir's method you ...

2012-12-25T12:23:46.363+01:00

It is true that if you use Avnir's method you will get a non-zero entropy contribution like $S = R \ln(3.9)$, but it won't be exactly the same as the conformational entropy. $S=-R\sum_i^4 p_i\ln(p_i)$ where $p_i=e^{-(G^\circ_i-G^\circ)/RT}$ and $G^\circ=-RT\sum_i^4 e^{-G^\circ_i/RT}$. $G^\circ$ is the free energy of the R-A complex, which has four different binding modes. If A has four-fold symmetry then $G^\circ_1=G^\circ_2=G^\circ_3=G^\circ_4$, $p_i=\frac{1}{4}$ and $S=R\ln(4)$. If you include the symmetry number when computing the entropy of A then this contribution to the free energy is taking care of that way.

If A is formally $C_1$ then the four binding free energies will be different and the conformational entropy will be a value between $R\ln(4)$ and $R\ln(1)$. This is also true for Avnir's method but, based on his equations, I don't see why the entropies computed in these two ways would be the same. However, Avnir's method might yield a good approximation to the conformational entropy, I don't know.

Notice also that the binding free energies can in principle be similar completely by accident and not to do with symmetry in any way.

Very nice reply. Just some further points: «Howev...

2012-12-24T16:36:07.682+01:00

Very nice reply. Just some further points:

«However, if molecule A is even *slightly* asymmetric then the effect enters as the conformational entropy of the complex.» Is this a «law» according to Quantum Mechanics? Because, if you apply the continuous symmetry arguments of Avnir, a molecule which is «slightly» assymetric (a flexible molecule) will have a «slight» degree of assymetry. In this case, a flexible A, for example, would have S = R ln(3.9).

«Both are formally correct but I would argue the former is more general as it also works for non-symmetric molecules.» I do not fully agree with the reason here, because if for some reason the complex has an higher entropy because the ligand has more possibilities of binding, it means that the ligand itself has some symmetry operation somewhere and it can still be present on the rotational entropy of the ligand.

Good questions! Let me take the last question fir...

2012-12-24T11:45:47.161+01:00

Good questions! Let me take the last question first since it is a bit easier. If I understood your question correctly your talking about the extra degree of freedom due to rotation around the horizontal axis. In that case it depends on whether you view the model as being 2-dimensional or 3-dimensional. I view it as 2D where this degree of freedom is not allowed. However, if you view it as 3D (i.e. flat molecules in a 3D world) then you are correct.

Now to your first question. Yes you can view it like that if the molecule is perfectly symmetric. The this effect is included in the rotational entropy of free A via the symmetry number (see this post). However, if molecule A is even *slightly* asymmetric then the effect enters as the conformational entropy of the complex. So I think in the symmetric case the effect can also be ascribed to a conformational entropy of the complex.

One could argue that the entropy of A can be measured to settle this issue. However, I don't think that is so straightforward - even conceptually. The entropy at temperature T is measured relative to absolute zero where the entropy is zero by definition. However, this state cannot be reached, and even at very, very low T symmetric molecules will have so-called residual entropy related to their symmetry which will be hard - if not impossible - to measure accurately. But I am not sure.

Bottom line: I think the answer to your question is: ultimately what is measured is an entropy *change* and the higher entropy change for A can be explained *either* as a lower conformational entropy of the complex or a higher rotational entropy of the free ligand. Both are formally correct but I would argue the former is more general as it also works for non-symmetric molecules.

Just want to comment this paragraph: «If you mix e...

2012-12-23T12:17:52.015+01:00

Just want to comment this paragraph: «If you mix equal amounts of A, B, and R you will get more R-A than R-B at equilibrium even though the hydrogen bond strength is the same in the two complexes. This is because molecule A can bind in four different ways while B can only bind one way, i.e. the R-A complex has a degeneracy of four (g=4) and the R-B complex has a degeneracy of one (g=1). Put another way, the R-A complex is more likely because it has a higher entropy (S=Rln(4)) than the R-B complex (S=Rln(1)).»

Isn´t the entropy of complexation of the case A higher, because the rotational entropy of the unbinded molecule A is smaller, rather than being the entropy of the complex itself higher?

Shouldn´t be the degeneracy 8 (A) and 2 (B), instead of 4 (A) and 1 (B)? For molecule A, 4 with the molecule facing up and 4 with the molecule facing down. And for molecule B, one with the molecule facing up and one with the molecule facing down.