50 Million Chemicals and Counting

UPDATE: Sept 8 – Compound 50m in the CS Registry is a novel arylmethylidene heterocycle with analgesic properties called (5Z)-5-[(5-fluoro-2-hydroxyphenyl)methylene]-2-(4-methyl-1-piperazinyl)-4(5H)-thiazolone. (Registry number 1181081-51-5).

According to an email I received from a CAS spokesman, “The number itself represents an important milestone both for researchers and CAS, but even more significant is the pace of scientific discovery around the world.” Roger Schenck, Manager of the Content Planning Department at CAS, adds that, “More scientific literature is being published and we have noticed an explosive growth of patent literature since 1998 that accounts for the rapid growth of substance information available.”

cas-1181081-51-5By contrast, it took 33 years for CAS to register 10 million compounds, a milestone reached in 1990.

It’s intriguing to think that two decades after I wrote a news item (very early in my career) discussing the announcement of that 10 millionth entry for one of the chemistry trade magazines, that CAS should be recording its 50 millionth substance. Indeed, it’s a mere nine months since it announced the 40 millionth.

chemical compounds on CAS

Apparently, the predominant source of this new chemical substance information is the global patent literature. Several years ago, patents accounted for approximately 20 percent of the substance information added to the registry. Today, that number is closer to 70 percent. It was that statement that intrigued me most.

But, I wonder…if they’re scraping patents on such a vast scale, is the addition of a few extra million entries actually representative of technological advance? An alternative explanation is that it simply shows how clever patent attorneys are at working with chemists to couch their claims in such imaginative ways to envelope a whole chemical space in a single sentence.

The increase could be a real indication that researchers increasingly are thinking in terms of monetizing their discoveries, and doing so much earlier in the research process. It could of course be due to increasing research around the world or maybe it’s driven by demand for more advanced electronics and the need for materials for such devices. There are also increasing demands from medical and pharmaceutical research. But, could this have lead to so many million more compounds?

I’m sure it’s not just CAS running a “stamp collecting” business, there has been research demonstrating molecular diversity in the collection.

Schenck confirmed that molecular diversity is something CAS takes seriously. “In regards to molecular diversity in CAS Registry, CAS scientists recently published an article in the Journal of Organic Chemistry on structural diversity among the 24 million organic substances in Registry at the time and may help to answer in-depth diversity questions,” he says.

He also pointed out that CAS monitors the literature as it is published and selects substances in the literature that meet its criteria. To be added the structure must come from a reputable source, including but not limited to patents, journals, chemical catalogues, and selected substance collections on the web. It has to have been described in largely unambiguous terms, characterized by physical methods or described in a patent document example or claim. It also has to be consistent with the laws of atomic covalent organization.

There are also some subtle legislative effects at play too, as Schenck explains:

In the academic community, such activities were greatly enhanced by U.S. legislation passed in 1980, the Bayh-Dole Act, which requires that universities actively seek commercialization for federally-funded research.

The 50-millionth compound will be an interesting milestone. Its identity will not be revealed until tomorrow. It’s probably not going to be a magic bullet for disease or an environmental panacea, but it’s not going to be a trivial compound either. Just how interesting it is will be determined over time, after all there are few compounds without any intrinsic interest.

It would be a happy coincidence if this 50 millionth entry just happened to be something chemically fascinating, to drive innovation from cancer research and nanotechnology to alternative fuel vehicles, cell phones and more. I suspect it will be a little more mundane, but 50 million entries in any collection is a significant milestone regardless.

Research Blogging IconLipkus, A., Yuan, Q., Lucas, K., Funk, S., Bartelt, W., Schenck, R., & Trippe, A. (2008). Structural Diversity of Organic Chemistry. A Scaffold Analysis of the CAS Registry The Journal of Organic Chemistry, 73 (12), 4443-4451 DOI: 10.1021/jo8001276

Pre-eclampsia, NIR, Nano-dots

Pregnant pause for thought – The analytical cousin of magnetic resonance imaging, NMR spectroscopy, has revealed that a chemical compound found in unpasteurised food can be present at unusually high levels in the red blood cells of pregnant women. The compound, the antioxidant ergothioneine, could be used as a biomarker for the potentially fatal condition, pre-eclampsia, which can cause severely raised blood pressure during pregnancy, according to the researchers, although they explain that the compound is probably not the cause of the disorder.

NIR improvements near – Spectroscopy, forensic science and even a future generation of quantum communication devices could benefit from research at the National Institute of Standards and Technology (NIST). There, researchers have developed a new, highly sensitive, and low-cost approach to measuring electromagnetic radiation in the near-infrared range, just beyond visible red light in the spectrum.

Raman joins the dots – Sub-microscopic particles, nanoparticles, can be used to boost the signal in Raman spectroscopy enough to allow researchers to detect several biomarkers for disease simultaneously even deep within tissues of living animals. The discovery could help in disease diagnostics, biomedical research, and potentially cancer treatment.

Biofuels vs Fossil Fuels

Biofuels are not much better than fossil fuels in terms of the impact on atmospheric pollution levels and effects on climate change, according to Mark Jacobson professor of civil and environmental engineering at Stanford University. This is especially true when making claims about the sustainability of biofuels in comparison with hydrogen fuel cells and battery-driven electric vehicles charged up using solar, wind, tidal or other truly renewable energy sources.

To quote from his web page, the main goal of Jacobson’s research is to…

…understand physical, chemical, and dynamical processes in the atmosphere better in order to address atmospheric problems, such as climate change and urban air pollution, with improved scientific insight and more accurate predictive tools. He also evaluates the atmospheric effects of proposed solutions to climate change and air pollution, examines resource availability of renewable energies, and studies optimal methods of combining renewables.

In order to accomplish these important goals Jacobson has developed and applied various models to simulate gas, aerosol, cloud, radiative, and land/ocean-surface processes that could give scientists and engineers a much more overarching perspective on the climate than other simpler models.

Jacobson points out that the use of biofuels, particularly ethanol, has expanded in the last few years, although in South America biofuels have been popular and successful for decades. This more recent and rapid expansion of biofuel use in transport across North America and elsewhere is based on the notion that by replacing fossil fuels with biofuels we may somehow ameliorate global warming and air pollution. After all, he growing plants absorb carbon dioxide from the atmosphere, they are then converted into biofuels, which are burned in modified vehicle internal combustion engines, which releases the carbon dioxide into the atmosphere again, where it is used by the next generation of biofuel crop plants to grow and so on.

This claim is still being hotly debated, especially given the impact on agriculture and the environment of turning over vast tracts of land to biofuel crops rather than growing food. However, Jacobson believes that, “the real comparison should be between biofuels and other emerging technologies.” He reports that corn-E85 (85% ethanol/15% gasoline) and cellulosic-E85 both degrade air quality and climate by up to two orders of magnitude more than electric vehicles ultimately powered by solar photovoltaic cells, wind, geothermal, hydroelectric, wave, or tidal power. “As such, the use of cellulosic or corn ethanol at the expense of the other options will cause certain damage to health, climate, land, and water supply in the future,” he asserts.

Moreover, the land required for cellulosic-E85 may also exceed that of corn-E85 and the land required for both will exceed that required for the footprint on the ground of wind powering battery electric vehicles by a factor of 500,000 to 1 million, adds Jacobson. He suggests that we should be considering very carefully the notion that replacing fossil fuels with biofuels could save us from catastrophic climate change given that this is not only unlikely, but will also have a negative impact on land and water supply relative to genuinely renewable energy sources.

I asked Jacobson for his thoughts on how vested interests might be persuaded that biofuels are no panacea. “The biofuels interests will go out of their way to dismiss or distort results from any study that comes out that makes their product look bad,” he told me. “It is mostly politicians and the public who can be persuaded,” he added. “The best approach is to focus on what is better, such as electric hydrogen fuel-cell vehicles powered by renewables, and to try to push people more in that direction.”

Research Blogging Icon Mark Z. Jacobson (2009). Effects of biofuels vs. other new vehicle technologies on air pollution, global warming, land use and water International Journal of Biotechnology, 11 (1/2), 14-59

Alchemist Checks Oxy Cholesterol Levels

copper-alchemistThe Alchemist this week learns how fluorine chemistry is blooming, how to melt proteins, and how cholesterol is all about the good, the bad, and the oxy. Also this week, a technique borrowed from organic LED fabrication could lead to a new way to manufacture tiny inorganic LEDs for next generation displays, while a conductive flip has been observed with clusters of atoms close to absolute zero. Finally, the American Chemical Society announces this years previously unsung chemical heroes from across the industry.

Previously on ChemWeb, we heard rumors of silicon neurons and the coming cyborg age, he discovers that a compound that leads to ovine Cyclops has now been synthesized for cancer drug research, and how chicken poop down on the shooting range could help solve the problem of lead in the soil. Also, in the news, a new type of fuel cell for truckers that reduces their emissions during rest periods and the increasing cost in water of producing bioethanol. Finally, a major award for a generic pharmacologist.