Hacker Newsnew | past | comments | ask | show | jobs | submit | aheifets's commentslogin

Atomwise (YC W15) | Infrastructure, DevOps, Machine Learning | San Francisco | Full-time | Onsite | https://www.atomwise.com/careers/

Atomwise Inc. patented the first deep learning technology for structure-based small molecule drug discovery. This AI technology harnesses millions of data points and thousands of protein structures to solve problems that a human chemist would take many lifetimes to solve. Atomwise has partnered with some of the world’s largest pharmaceutical and agrochemical companies, and with more than 50 leading academic institutions and hospitals, to tackle the challenges of discovering and developing better drugs and chemicals. Recently, Atomwise raised $45 million from leading venture capital firms to support the development and application of its AI technology.


Atomwise (YC W15) | Infrastructure, DevOps, Machine Learning | San Francisco | Full-time | Onsite | https://www.atomwise.com/careers/

Atomwise Inc. patented the first deep learning technology for structure-based small molecule drug discovery. This AI technology harnesses millions of data points and thousands of protein structures to solve problems that a human chemist would take many lifetimes to solve. Atomwise has partnered with some of the world’s largest pharmaceutical and agrochemical companies, and with more than 50 leading academic institutions and hospitals, to tackle the challenges of discovering and developing better drugs and chemicals. Recently, Atomwise raised $45 million from leading venture capital firms to support the development and application of its AI technology.


That's a very useful tool! Part of the reason that we launched the Atomwise AIMS program http://www.atomwise.com/aims-awards/ was to address the cost of compounds, in addition to the cost of equipment. We use our deep neural networks to pick 72 compounds (out of millions) that we buy, QC, plate, and ship to the PI for free.


I haven't thought about it closely but, from a cursory read, it sounds like the SquareList data structure: http://www.drdobbs.com/database/the-squarelist-data-structur...


Thanks for the link! I answered some questions about our technique earlier (https://news.ycombinator.com/item?id=9157777 ), which may help elucidate context.


Is there an open source implementation?


AutoDock Smina is referenced in the paper, and its source is available under GPLv2: http://sourceforge.net/projects/smina/

If you're interested in open source for deep learning, there are a number of toolkits here: http://deeplearning.net/software_links/


Atomwise (YC W15) | San Francisco | Full time, ONSITE | Deep Learning, Computational Chemistry

Atomwise uses deep neural networks to help discover new medicines. Our customers are top researchers at institutions such as Merck and the Dana Farber Cancer Institute (see http://www.atomwise.com/clients/). We're backed by science-heavy VCs, including Data Collective, Khosla Ventures, and DFJ. Our work tackles some of the biggest problems of our time: cancer, multiple sclerosis, malaria, ebola, and antibiotics for drug-resistant bugs. We’ve already shown that modern machine learning can set a new bar for predictive accuracy in structure-based drug design, and we want your help in pushing that accuracy even further.

We’re looking both for people with machine learning expertise, and for people with computational biology/chemistry expertise. If you’ve got both, all the better! Please see our full job descriptions here: http://www.atomwise.com/careers/


any remote positions?


Atomwise (YC W15) | San Francisco | Full time, ONSITE | Deep Learning, Computational Chemistry

Atomwise uses deep neural networks to help discover new medicines. Our customers are top researchers at institutions such as Merck and the Dana Farber Cancer Institute (see http://www.atomwise.com/clients/). We're backed by science-heavy VCs, including Data Collective, Khosla Ventures, and DFJ. Our work tackles some of the biggest problems of our time: cancer, multiple sclerosis, malaria, ebola, and antibiotics for drug-resistant bugs. We’ve already shown that modern machine learning can set a new bar for predictive accuracy in structure-based drug design, and we want your help in pushing that accuracy even further.

We’re looking both for people with machine learning expertise, and for people with computational biology/chemistry expertise. If you’ve got both, all the better! Please see our full job descriptions here: http://www.atomwise.com/careers/


Today, those tests are done physically. But, you're right: if you have a good system to tell if a molecule will stick to a given protein, there's no reason to constrain your tests to the protein you want to hit. You can also predict whether the molecule will go around sticking to necessary proteins in the heart (e.g., hERG channel), liver (e.g., cytochrome P450), kidney, brain, etc. Internally, we have a panel of a couple of hundred proteins against which we can predict these kinds of off-target toxicities.


The typical timeline to get an actual cure all the way through the drug discovery pipeline is about 14 years. While we haven't been around long enough for that, we have had our algorithmic predictions validated by follow up physical experiment. This was even for very different diseases, e.g. multiple sclerosis and drug-resistant antibiotics.

Also, as I described in my above answer to et2o, we do large retrospective tests to evaluate our predictive accuracy.


Great questions!

The typical input to the neural network is the 3D structure of the molecule and of the protein. The model works by detecting patterns in the pair of protein and the drug that correlate with binding, e.g. hydrogen bonding, halogen bonding, cation-pi, pi-pi interactions, etc. But these are complicated to encode manually, given all of the factors that affect binding strength: distance, angle, water mediated effects, resonance, (de)stabilizing environmental charges, etc. That's why we need the neural net: you can think of it as the network automatically deriving the best pharmacophoric features to maximally explain which training examples bind and which ones don't, and then the prediction step is looking for the presence or absence of those patterns in new protein-ligand pairs.

We evaluate our models both retrospectively and prospectively. For example, the DUD-E benchmark (http://dude.docking.org/) gives us an assessment of our performance over more than a million individual predictions, comprising many diseases and many biological classes (GPCR, nuclear receptor, enzyme, etc). It begins with 102 disease proteins and, for each one, has a set of molecules that bind to the protein and a set that don't. We shuffle those sets together and ask the neural net to "pick the aces out of the deck". Separately, we perform prospective evaluations, for settings where no one knows the right answer, and run the experiment to confirm the predictions.

I agree with you that the proper selection of targets is critical, as is the mapping between drug target and disease. For us, however, this is easy: we work with smart biologists! If you have any, please send them our way!

Finally, I agree with your point that biology is not designed to be understood by people. That said, molecular binding is fundamental enough that we could think of it as an example of physics rather than biology. And theory works so well for physics that, in many a physics lab, if an experiment disagrees with theory then the first step is to double-check the experiment for errors. The trick is to scale that up to larger systems. Semi-relevant: http://www.smbc-comics.com/?id=2272


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: