The RSC’s objective is to advance the chemical sciences, not only at a research level but also to provide tools to train the next generation of chemists. ChemSpider contains a lot of useful information for students learning Chemistry but there is also a lot of information which is not relevant to their studies which might be confusing and distracting. For some time we have been considering the concept of an educational version of ChemSpider, aimed at students (and their teachers or lecturers) in their last years of school, and first years of university (ages 16-19), which restricts the compounds and the properties, spectra and links displayed for each, to those relevant to their studies. As a result, we are pleased to announce the launch of the Learn Chemistry Wiki which not only fulfils this aim, but also takes it further. This project was developed in a collaboration between Dr Martin Walker at the State University of New York at Potsdam, ChemSpider and the Royal Society of Chemistry’s Education team.
The Learn Chemistry Wiki contains over 2000 “substance” pages which correspond to simple compounds that would commonly be encountered during the last years of school and first years of University. Each of these pages corresponds to a ChemSpider compound, from which it dynamically retrieves compound images, a summary of its properties(molecular formula, mass, IUPAC name, appearance, melting and boiling points, solubility, etc.) and links to view safety sheets and spectra. It also contains text from Wikipedia to display in the substance page based on the Wikipedia links in ChemSpider.

The Learn Chemistry Wiki also goes a step further and not only contains compound information in isolation but also contains laboratory experiments (with parallel sections which contain an overview, teachers’ notes and students’ handouts) for each, quizzes, and tutorials which are linked to the compound information to put them into context. The wiki is based on the MediaWiki platform (which allows multiple users to contribute collaboratively since the website is intended to be a community website), but extends it to incorporate functionality similar to that of ChemSpider, invoked via custom-made extensions. For example, it is possible to draw structures using GGA’s Ketcher in order to find structures, or to draw answers to quiz questions (for example to specify the product of a particular reaction). It is also possible to include an interactive spectrum retrieved from ChemSpider in any wiki page, using the ChemDoodle spectrum viewing widget in browsers which support canvases or JSpecView applet in those that don’t.

For an overview and demonstration of the Learn Chemistry Wiki site see the Learn Chemistry Wiki site tour webppage or the Learn Chemistry Wiki overview demo video:

The Learn Chemistry Wiki is part of the new RSC’s new Learn Chemistry platform which provides a central access point and search facility to make it easier to access the various different RSC teaching resources that it provides.

KNIME is an open-source data integration, processing, analysis, and exploration platform which can be used to create workflows to analyse data.

We have experimented with adding a node to a project which would call the ChemSpider webservices to perform a simple search on it and the instructions below outline how to reproduce our experimentation. This was done with KNIME 2.5.0, with the KNIME extension “Generic Webservice Client” installed.

  1. From the Node Repository find the “Generic Webservice Client” under the “Misc” folder and drag it into the Knime project to add a new node
  2. Right-click on this “Generic Webservice Client” and click on the “Configure…” option
  3. The WSDL for each ChemSpider webservice can be found using the link from the page for the appropriate webservice. For example, the WSDL for the Search webservice is at http://www.chemspider.com/Search.asmx. However, if you enter this as the WSDL location you’ll get an error when you click the “Analyze” button (due to a SOAP exception “undefined simple or complext type ‘soapenc:Array’. This is something that we’re looking into addressing in ChemSpider, but for now a workaround is to copy the WSDL, replace the old fashioned soapenc:Array type with tns:ArrayOfString, and save and use this ammended WSDL locally. I have done this with the Search webservice and the resulting WSDL is available for download here. This file should be downloaded, adn extracted somwhere locally. It can then be entered in the “WSDL Location” field of the Generic Webservice client in KNIME (using a location of the form: file:/C:/temp/ChemSpiderSearchWSDL_no_soapencArray.WSDL) which will then be processed correctly on clicking the “Analyze” button
  4. Set the Port, operation, inputs and outputs as required – see screencapture below for settings for my demonstration. Note that you should use your own token as the value for the token input – if you don’t have one already then see the instructions here for instructions.
  5. Add input and output nodes which connect to and from this Generic WebService Client node as required. For example, you could add a FileReader node as the initial node, which reads in the contents of a text file that simply contains a search term as an input (and adapt the Input value accepted as the query input value of the SimpleSearch to map to this column, rather than hardcoding in a value to search for). And the output csid could be written to a csv file using a CSV Writer node.
  6. On executing the workflow, an output csv file is created which contains the ChemSpider ID(s) of any compounds that match the search term. In the case of a search for “benzene” the csid retrieved is 236.

The functionality of electronic lab notebooks (ELNs) and that of ChemSpider overlap to a certain extent – both store chemical information including structures, data, spectra and reactions. However, the focus of most ELNs is to manage, track and audit that data, and that of ChemSpider is to publish and disseminate it to the world. We have been considering how best to use this complementary functionality to integrate an ELN with ChemSpider.

Some ELNs already currently look up information and link to ChemSpider. For example the blog3 Web-logging (“blogging”) engine by Jeremy Frey, Simon Coles and Mark Borkum at Southampton University, which allows links to compounds from the ChemSpider database to be embedded directly into the content of a post. When a link to ChemSpider is detected, blog3 follows the link to retrieve additional information that is relevant to the compound, including: experimental and theoretical data; two- and three- dimensional depictions; and links to papers and journal articles. Another example is the eScience tool that Steven Wan from CSIRO has developed with the University of New South Wales to text mine LabTrove ELN blog posts to identify chemical names and link these to the relevant ChemSpider compounds.

At the meeting “The Smart Laboratory: Towards a national ELN” meeting (organised as part of the Dial-a-Molecule EPSRC Grand Challenge) in August this year, the seeds were sown to take the integration between ELNs and ChemSpider a step further. Cambridge University has the first Chemistry department in the UK to roll out a department-wide Electronic Lab notebook system, and the software that they’re using is IDBS’s E-WorkBook Suite. In collaboration with IDBS and Cambridge’s Chemistry department, we at ChemSpider have made a plug-in which could both dynamically retrieve information from ChemSpider into their ELN, and publish to it the other way. The Chemistry department at Cambridge (Dr Tim Dickens, Dr Brian Brooks, Prof Bobby Glenn and Prof Steven Ley) have been very helpful in granting access to their ELN to write the plug-in, and will be its first users, but the results will be freely available for any existing IDBS E-WorkBook suite user.

About the extension Prof Bobby Glenn has said: “Much of Chemistry is lost, it is simply not published and languishes in forgotten lab notebooks. Capturing novel molecules soon after synthesis on a searchable database like Chemspider is now an effortless process directly from the ELN, which will greatly encourage sharing of compounds, synthetic methods and all the associated data. It’s instant messaging for chemists”. Antony Williams (Vice-President of Strategic Development of ChemSpider) added “The ability to now publish compound data from the IDBS ELN directly to ChemSpider offers a path to direct exposure of novel chemistry as well as the chemist doing the work. This public compound registration capability is the first move towards ultimately exposing synthetic methods and associated experimental data to the community. Our vision is coming to fruition through this collaboration.”

To view the plug-in in action please view the demonstration movie of ChemSpider E-WorkBook Suite Plugin.

Screen capture of launching Publish to ChemSpider plug-in

Compounds can be published to ChemSpider if they have been drawn out in full in an experiment – whether this is as an individual structure or part of a reaction, and whether they are simply uploaded into the experiment as a reaction file, or included in for example a spreadsheet item. Likewise, compound structures can be automatically loaded into a search of ChemSpider if you would like to find out more information about compounds that have been drawn out in full in an experiment, or if you have published a compound to ChemSpider and wish to see the resulting compound pages. The resulting compound pages in ChemSpider will have the data source “IDBS E-WorkBook Suite”. The external ID will show the ID of the experiment from which the structures are from, and the depositor details as defined in the ChemSpider Settings of the ELN.

The ChemSpider IDBS E-WorkBook Suite plug-in is freely available to customers of IDBS E-WorkBook Suite by downloading it from IDBS, and copying it the appropriate place in their IDBS E-WorkBook Suite program files. It is compatible with E-WorkBook Suite versions 9.0 and 9.1.

This plug-in is an initial proof-of-concept to demonstrate that we can pass compound information between ChemSpider and an ELN in both directions. Future versions will allow more of the information within an experiment to be published to ChemSpider – for example to allow reactions along with a description of their methods to be published to ChemSpider SyntheticPages, or to deposit spectra along with compounds to ChemSpider. We would also like to integrate other ELNs with ChemSpider.

Recently I have been programming a java plug-in from which I needed to call the ChemSpider webservices, and I found that this wasn’t as straightforward as I was expecting, so I thought I would post how to do it in case it’s useful for anyone else who wants to do likewise.
The basic method I used was to use Apache Axis2 to generate java code for the WSDL’s of the main ChemSpider webservices. This java code is available here: chemspider_webservices_javasourcecode.zip and I have also made the compiled jar file available here: chemspider_webservices.jar. The ChemSpider webservices can be called from other java code by referencing this jar file (and the other axis library files).
This blog post describes how I generated and used this jar file. I was using the Eclipse IDE, so some of what I describe will be specific to that.
There is a similar jar file of some ChemSpider webservices which is available by downloading MZMine (the file chemspider-api.jar in the lib directory) and an example of its use can be seen by downloading the source code and looking at the file src\net\sf\mzmine\modules\peaklistmethods\identification\dbsearch\databases\ChemSpiderGateway.java). That jar file was generated using the previous version of Axis (just plain Axis, rather than Axis2) compared to this one. The example here may be easier to use as a start point since the full range of ChemSpider webservices are included in the jar file, there is a full description of how it was generated, the code used to generate the jar file is available and there are more examples of its use.

Generating the chemspider_webservices.jar file

To generate the java code from the WSDL of the ChemSpider webservices I used the WSDL2Java functionality of Apache Axis2. This is available in different forms, including an Eclipse plug-in which will directly import the java code generated into a project, but I found various bugs when trying to use the latest version of that, so just used the command line version.
I started off with generating the java code from the WSDL of the ChemSpider MassSpecAPI webservice:

  • I downloaded and unzipped the latest version of the Apache Axis2 binary distribution from their download page. I used version 1.6.1 of Axis2.
  • In the “bin” directory of this download there should be a file called java2wsdl.bat. Running this batch file from a command line saves a lot of time trying to set up the class paths correctly to run Java2WSDL. Before using it you should set up the following two environment variables:
    • AXIS2_HOME: Must point to the top level of the AXIS2 files which you just downloaded
    • JAVA_HOME: Must point at your Java Development Kit installation direcotry (e.g. C:\Program Files\Java\jre6)
  • To see a full list of the options available when running WSDL2Java simply open a command prompt and run the batch file with no options to obtain the Usage options – more information about these can be found in the Apache Axis2 user guide:
    • > axis2-1.6.1\bin\wsdl2java.bat
  • I ran it with options to specify to use the SOAP 1.2 port of the ChemSpider MassSpecAPI webservice (most ChemSpider webservices have the option of SOAP 1.1, SOAP 1.2, HTTP GET or HTTP POST), to generate synchronous code only (not asynchronous), and to use adb databinding (this is the default anyway):
    • > axis2-1.6.1\bin\wsdl2java.bat -uri http://www.chemspider.com/MassSpecAPI.asmx?WSDL -pn MassSpecAPISoap12 -s -d adb
  • This then generated the file MassSpecAPIStub.java which it automatically put in the package com.chemspider.www (so was the appropriate folder structure was created above it accordingly)
  • I repeated this processes with the other 4 main ChemSpider webservices:
    • > axis2-1.6.1\bin\wsdl2java.bat -uri http://www.chemspider.com/Search.asmx?WSDL -pn SearchSoap12 -s -d adb
    • > axis2-1.6.1\bin\wsdl2java.bat -uri http://www.chemspider.com/InChI.asmx?WSDL -pn InChISoap12 -s -d adb
    • > axis2-1.6.1\bin\wsdl2java.bat -uri http://www.chemspider.com/Spectra.asmx?WSDL -pn SpectraSoap12 -s -d adb
    • > axis2-1.6.1\bin\wsdl2java.bat -uri http://www.chemspider.com/OpenBabel.asmx?WSDL -pn OpenBabelWebServiceSoap12 -s -d adb
  • The folders and java class files generated by Java2WSDL (MassSpecAPIStub.java, SearchStub.java, InChIStub.java, SpectraStub.java and OpenBabelWebServiceStub.java) that were generated are available in the zip file chemspider_webservices_javasourcecode.zip for further reference
  • I then started a new Eclipse project, imported this generated File system into it
  • The generated classes rely on the Axis2 library files so these need to be added to the build path – in Eclipse this is done by right-clicking on the project in the Package Explorer, choosing Properties > Java Build Path > Libraries > Add External Jars and selecting all of the lib files in the lib folder of the Axis2 folder.
  • This project was exported as the jar file chemspider_webservices.jar

Using the chemspider_webservices.jar file as an external library jar file

The chemspider_webservices.jar file and all of the Apache Axis2 library jar files need adding to a java project as referenced libraries before it can be called. To do this in Eclipse right-click on the project in the Package Explorer, choose Properties > Java Build Path > Libraries > Add External Jars and select:

  • the chemspider_webservices.jar file (download it from chemspider_webservices.jar and save it locally)
  • all of the lib files in the lib folder of the Axis2 folder.

Once this has been done then the ChemSpider webservices can be called from the project. An example is shown below, and is also downloadable in text format from here. This has been structured into (pretty well self-contained) functions which can be easily called to retrieve the results of a particular operation of a webservice. In the main function these functions are called and the output written out.

Please note that you should put your obtains your own ChemSpider token from ChemSpider to set as the ChemSpiderToken value – to obtain this, register for a ChemSpider account, and look up your token from your user Profile page after logging in. Some tokens require your user account to be associated with the “Service Subscriber” role, which you can request from your user profile page.

package com.chemspider.www.examples;

import java.util.HashMap;
import java.util.Map;

import javax.swing.JOptionPane;

import org.apache.log4j.BasicConfigurator;
import org.apache.log4j.Level;
import org.apache.log4j.Logger;

import com.chemspider.www.*;
import com.chemspider.www.InChIStub.InChIToCSIDResponse;
import com.chemspider.www.SearchStub.GetAsyncSearchResultResponse;
import com.chemspider.www.SearchStub.GetAsyncSearchStatusResponse;
import com.chemspider.www.SearchStub.SimpleSearchResponse;
import com.chemspider.www.MassSpecAPIStub.ArrayOfInt;
import com.chemspider.www.MassSpecAPIStub.ArrayOfString;
import com.chemspider.www.MassSpecAPIStub.ExtendedCompoundInfo;
import com.chemspider.www.MassSpecAPIStub.GetDatabasesResponse;
import com.chemspider.www.MassSpecAPIStub.GetExtendedCompoundInfoArrayResponse;
import com.chemspider.www.MassSpecAPIStub.SearchByMassAsyncResponse;

public class WebServiceExamples {

/**
* @param args
*/

private static final Logger LOG = Logger.getLogger(WebServiceExamples.class.getName());

private static String ChemSpiderToken = "YOU NEED TO INSERT YOUR OWN TOKEN IN HERE";

public static void main(String[] args) {
BasicConfigurator.configure();

JOptionPane.showMessageDialog(null, "The compound with InChI InChI=1S/C6H6/c1-2-4-6-5-3-1/h1-6H has CSID:"+get_InChI_InChIToCSID_Results("InChI=1S/C6H6/c1-2-4-6-5-3-1/h1-6H"));

int[] SimpleSearchResults = get_Search_SimpleSearch_Results("taxol", ChemSpiderToken);
JOptionPane.showMessageDialog(null, "The first of "+SimpleSearchResults.length+" ChemSpider compound(s) returned by a search for Taxol has CSID:"+SimpleSearchResults[0]);

int[] inputCSIDs = new int[2];
inputCSIDs[0] = 236;
inputCSIDs[1] = 238;
Map> GetExtendedCompoundInfoArrayResults = get_MassSpecAPI_GetExtendedCompoundInfoArray_Results(inputCSIDs, ChemSpiderToken);
Map thisCompoundInfo = GetExtendedCompoundInfoArrayResults.get(238);
JOptionPane.showMessageDialog(null, "The Average Mass of the compound with CSID 238 is: "+thisCompoundInfo.get("AverageMass"));

String[] GetDatabaseResults = get_MassSpecAPI_GetDatabases_Results();
JOptionPane.showMessageDialog(null, "The first of "+GetDatabaseResults.length+" datasources in ChemSpider is:"+GetDatabaseResults[0]);

String SearchByMassAsyncResults = get_MassSpecAPI_SearchByMassAsync_Results(1100.0, 0.1,GetDatabaseResults, ChemSpiderToken);
JOptionPane.showMessageDialog(null, "Transaction ID for search on compounds with mass = 1100+/- 0.1 from any data source is" + SearchByMassAsyncResults);
JOptionPane.showMessageDialog(null, "The operation status of the search with this transaction ID is" + get_Search_GetAsyncSearchStatus_Results(SearchByMassAsyncResults, ChemSpiderToken));
int[] GetAsyncSearchResultResults = get_Search_GetAsyncSearchResult_Results(SearchByMassAsyncResults, ChemSpiderToken);
JOptionPane.showMessageDialog(null, "And the first of "+GetAsyncSearchResultResults.length+" ChemSpider compound(s) returned by the search has CSID:"+GetAsyncSearchResultResults[0]);
}

/**
* Function to call the InChIToCSID operation of ChemSpider's InChI SOAP 1.2 webservice (http://www.chemspider.com/InChI.asmx?op=InChIToCSID)
* Convert InChI to ChemSpider ID.
*
* @param inchi: string representing inchi to search ChemSpider for
* @return: string representing CSID returned
*/
public static String get_InChI_InChIToCSID_Results(String inchi) {
String Output = null;
try {

final InChIStub thisInChIstub = new InChIStub();
com.chemspider.www.InChIStub.InChIToCSID InChIToCSIDInput = new com.chemspider.www.InChIStub.InChIToCSID();
InChIToCSIDInput.setInchi(inchi);
final InChIToCSIDResponse thisInChIToCSIDResponse = thisInChIstub.inChIToCSID(InChIToCSIDInput);
Output = thisInChIToCSIDResponse.getInChIToCSIDResult();
} catch (Exception e) {
LOG.log(Level.ERROR, "Problem retrieving ChemSpider webservices", e);
}
return Output;
}

/**
* Function to call the SimpleSearch operation of ChemSpider's Search SOAP 1.2 webservice (http://www.chemspider.com/search.asmx?op=SimpleSearch)
* Search by Name, SMILES, InChI, InChIKey, etc. Returns a list of found CSIDs (first 100 - please use AsyncSimpleSearch instead if you like to get the full list). Security token is required.
*
* @param query: String representing search term (can be Name, SMILES, InChI, InChIKey)
* @param token: string containing your user token (listed at your http://www.chemspider.com/UserProfile.aspx page)
* @return: int[] array containing the ChemSpider IDs. If more than 100 are found then only the first 100 are returned.
*/
public static int[] get_Search_SimpleSearch_Results(String query, String token) {
int[] Output = null;
try {
final SearchStub thisSearchStub = new SearchStub();
com.chemspider.www.SearchStub.SimpleSearch SimpleSearchInput = new com.chemspider.www.SearchStub.SimpleSearch();
SimpleSearchInput.setQuery(query);
SimpleSearchInput.setToken(token);
final SimpleSearchResponse thisSimpleSearchResponse = thisSearchStub.simpleSearch(SimpleSearchInput);
Output = thisSimpleSearchResponse.getSimpleSearchResult().get_int();
} catch (Exception e) {
LOG.log(Level.ERROR, "Problem retrieving ChemSpider webservices", e);
}
return Output;
}

/**
* Function to call the GetDatabases operation of ChemSpider's MassSpecAPI SOAP 1.2 webservice (http://www.chemspider.com/massspecapi.asmx?op=GetDatabases)
* Get the list of datasources in ChemSpider.
*
* @return: the list of datasources in ChemSpider as a String Array
*/
public static String[] get_MassSpecAPI_GetDatabases_Results() {
String[] Output = null;
try {

final MassSpecAPIStub thisMassSpecAPIStub = new MassSpecAPIStub();
com.chemspider.www.MassSpecAPIStub.GetDatabases getDatabaseInput = new com.chemspider.www.MassSpecAPIStub.GetDatabases();
final GetDatabasesResponse thisGetDatabasesResponse = thisMassSpecAPIStub.getDatabases(getDatabaseInput);
Output = thisGetDatabasesResponse.getGetDatabasesResult().getString();
} catch (Exception e) {
LOG.log(Level.ERROR, "Problem retrieving ChemSpider webservices", e);
}
return Output;
}

/**
* Function to call the GetExtendedCompoundInfoArray operation of ChemSpider's MassSpecAPI SOAP 1.2 webservice (http://www.chemspider.com/massspecapi.asmx?op=GetExtendedCompoundInfoArray)
* Get array of extended record details by an array of CSIDs. Security token is required.
*
* @param CSIDs: integer array containing the CSIDs of compounds for which information will be returned
* @param token: string containing your user token (listed at your http://www.chemspider.com/UserProfile.aspx page)
* @return: a Map> containing the results array for each CSID (with Properties CSID, MF, SMILES, InChIKey, AverageMass, MolecularWeight, MonoisotopicMass, NominalMass, ALogP, XLogP, CommonName)
*/
public static Map> get_MassSpecAPI_GetExtendedCompoundInfoArray_Results(int[] CSIDs, String token) {
Map> Output = new HashMap>();
try {
final MassSpecAPIStub thisMassSpecAPIStub = new MassSpecAPIStub();
ArrayOfInt inputCSIDsArrayofInt = new ArrayOfInt();
inputCSIDsArrayofInt.set_int(CSIDs);
com.chemspider.www.MassSpecAPIStub.GetExtendedCompoundInfoArray getGetExtendedCompoundInfoArrayInput = new com.chemspider.www.MassSpecAPIStub.GetExtendedCompoundInfoArray();
getGetExtendedCompoundInfoArrayInput.setCSIDs(inputCSIDsArrayofInt);
getGetExtendedCompoundInfoArrayInput.setToken(token);
final GetExtendedCompoundInfoArrayResponse thisGetExtendedCompoundInfoArrayResponse = thisMassSpecAPIStub.getExtendedCompoundInfoArray(getGetExtendedCompoundInfoArrayInput);
ExtendedCompoundInfo[] thisExtendedCompoundInfo = thisGetExtendedCompoundInfoArrayResponse.getGetExtendedCompoundInfoArrayResult().getExtendedCompoundInfo();
for (int i=0; i Map thisCompoundExtendedCompoundInfoArrayOutput = new HashMap();
thisCompoundExtendedCompoundInfoArrayOutput.put("CSID", Integer.toString(thisExtendedCompoundInfo[i].getCSID()));
thisCompoundExtendedCompoundInfoArrayOutput.put("MF", thisExtendedCompoundInfo[i].getMF());
thisCompoundExtendedCompoundInfoArrayOutput.put("SMILES", thisExtendedCompoundInfo[i].getSMILES());
thisCompoundExtendedCompoundInfoArrayOutput.put("InChI", thisExtendedCompoundInfo[i].getInChI());
thisCompoundExtendedCompoundInfoArrayOutput.put("InChIKey", thisExtendedCompoundInfo[i].getInChIKey());
thisCompoundExtendedCompoundInfoArrayOutput.put("AverageMass", Double.toString(thisExtendedCompoundInfo[i].getAverageMass()));
thisCompoundExtendedCompoundInfoArrayOutput.put("MolecularWeight", Double.toString(thisExtendedCompoundInfo[i].getMolecularWeight()));
thisCompoundExtendedCompoundInfoArrayOutput.put("MonoisotopicMass", Double.toString(thisExtendedCompoundInfo[i].getMonoisotopicMass()));
thisCompoundExtendedCompoundInfoArrayOutput.put("NominalMass", Double.toString(thisExtendedCompoundInfo[i].getNominalMass()));
thisCompoundExtendedCompoundInfoArrayOutput.put("ALogP", Double.toString(thisExtendedCompoundInfo[i].getALogP()));
thisCompoundExtendedCompoundInfoArrayOutput.put("XLogP", Double.toString(thisExtendedCompoundInfo[i].getXLogP()));
thisCompoundExtendedCompoundInfoArrayOutput.put("CommonName", thisExtendedCompoundInfo[i].getCommonName());
Output.put(thisExtendedCompoundInfo[i].getCSID(), thisCompoundExtendedCompoundInfoArrayOutput);
}

} catch (Exception e) {
LOG.log(Level.ERROR, "Problem retrieving ChemSpider webservices", e);
}
return Output;
}

/**
* Function to call the SearchByMass2 operation of ChemSpider's MassSpecAPI SOAP 1.2 webservice (http://www.chemspider.com/massspecapi.asmx?op=SearchByMass2)
* Search ChemSpider by mass +/- range.
*
* @param Mass: The compounds returned have a mass (Double) within the range Mass +/- Range
* @param Range: The compounds returned have a mass (Double) within the range Mass +/- Range
* @return: the ChemSpider IDs of compounds returned (as a String Array)
*/
public static String get_MassSpecAPI_SearchByMassAsync_Results(Double mass, Double range, String[] dbs, String token) {
String Output = null;
try {
final MassSpecAPIStub thisMassSpecAPIStub = new MassSpecAPIStub();
com.chemspider.www.MassSpecAPIStub.SearchByMassAsync getSearchByMassAsyncInput = new com.chemspider.www.MassSpecAPIStub.SearchByMassAsync();
getSearchByMassAsyncInput.setMass(mass);
getSearchByMassAsyncInput.setRange(range);
ArrayOfString inputDBsArrayofString = new ArrayOfString();
inputDBsArrayofString.setString(dbs);
getSearchByMassAsyncInput.setDbs(inputDBsArrayofString);
getSearchByMassAsyncInput.setToken(token);
final SearchByMassAsyncResponse thisSearchByMassAsyncResponse = thisMassSpecAPIStub.searchByMassAsync(getSearchByMassAsyncInput);
Output = thisSearchByMassAsyncResponse.getSearchByMassAsyncResult();
} catch (Exception e) {
LOG.log(Level.ERROR, "Problem retrieving ChemSpider webservices", e);
}
return Output;
}

/**
* Function to call the GetAsyncSearchStatus operation of ChemSpider's Search SOAP 1.2 webservice (http://www.chemspider.com/search.asmx?op=GetAsyncSearchStatus)
* Query asynchronous operation status. Requires transaction ID returned by AsynchSearch operation. Security token is required.
*
* @param rid: String representing transaction ID returned from a previous search
* @param token: string containing your user token (listed at your http://www.chemspider.com/UserProfile.aspx page)
* @return: String describing status of this search - can have values Unknown or Created or Scheduled or Processing or Suspended or PartialResultReady or ResultReady or Failed or TooManyRecords
*/
public static String get_Search_GetAsyncSearchStatus_Results(String rid, String token) {
String Output = null;
try {
final SearchStub thisSearchStub = new SearchStub();
com.chemspider.www.SearchStub.GetAsyncSearchStatus GetAsyncSearchStatusInput = new com.chemspider.www.SearchStub.GetAsyncSearchStatus();
GetAsyncSearchStatusInput.setRid(rid);
GetAsyncSearchStatusInput.setToken(token);
final GetAsyncSearchStatusResponse thisGetAsyncSearchStatusResponse = thisSearchStub.getAsyncSearchStatus(GetAsyncSearchStatusInput);
Output = thisGetAsyncSearchStatusResponse.getGetAsyncSearchStatusResult().toString();
} catch (Exception e) {
LOG.log(Level.ERROR, "Problem retrieving ChemSpider webservices", e);
}
return Output;
}

/**
* Function to call the GetAsyncSearchResult operation of ChemSpider's Search SOAP 1.2 webservice (http://www.chemspider.com/search.asmx?op=GetAsyncSearchResult)
* Returns the list of CSIDs found by AsynchSearch operation. Security token is required.
*
* @param rid: String representing transaction ID returned from a previous search
* @param token: string containing your user token (listed at your http://www.chemspider.com/UserProfile.aspx page)
* @return: int[] array containing the ChemSpider IDs.
*/
public static int[] get_Search_GetAsyncSearchResult_Results(String rid, String token) {
int[] Output = null;
try {
final SearchStub thisSearchStub = new SearchStub();
com.chemspider.www.SearchStub.GetAsyncSearchResult GetAsyncSearchResultInput = new com.chemspider.www.SearchStub.GetAsyncSearchResult();
GetAsyncSearchResultInput.setRid(rid);
GetAsyncSearchResultInput.setToken(token);
final GetAsyncSearchResultResponse thisGetAsyncSearchResultResponse = thisSearchStub.getAsyncSearchResult(GetAsyncSearchResultInput);
Output = thisGetAsyncSearchResultResponse.getGetAsyncSearchResultResult().get_int();
} catch (Exception e) {
LOG.log(Level.ERROR, "Problem retrieving ChemSpider webservices", e);
}
return Output;
}

}

Disclaimer: I’m new to Java programming, so please excuse me if you are a java expert and I’ve said something obvious, offended you with my code or used the wrong terminology anywhere.

In a way this is a taster, as we’re looking at our Search as part of the refresh of ChemSpider, and more detail will follow. Another motivation for posting was a couple of recent requests for ChemSpider functionality which is already available – a great pointer to how we think about offering massive functionality in a clear interface. The two requests? One was that it would be great if a user could do a search from an input image (so to load an image, convert to structure and launch a search). The other wanted a way to just look for compounds with a specific element included. Both of these can be done on ChemSpider – and tragically both came in as amonymous feature requests. So, because I don’t think they’ve even been fully itemised before, let me count the ways by which you can search ChemSpider.

Simple Search on ChemSpider

 

 

 

 

 

 

 

1. Search by name – systematic name, synonym, trade name
2. Search by chemical identifier – InChI, InChIKey, SMILES
3. Search by database identifier – registry number

Structure Search on ChemSpider

 

 

 

 

 

 

 

 

 

 

 

 

4. Search by exact structure drawn, substructure, similarity – exact match, all tautomers, same skeleton (including/excluding H), all isomers
5. Draw an exact structure – in one of several structure drawers

Load a structure file from an image

 

 

 

 

6. Load from mol, sdf, skc, cdx files
7. Load from an image of the structure – gif, png, jpg, tiff – to get an editable/correctable structure for search

Convert an identifier to search ChemSpider

 

 

 

 

8. Convert an identifier or name to a structure, to use or amend in the structure search

Advanced search options in ChemSpider

 

 

 

 

 

 

 

9. Search for compound with/without a particular element or elements
10. Search by properties – molecular formula, mol wt, nominal mass, average mass, monoisotopic mass. Exact match or within a range
11. Search by calculated properties range – ACD/LogP, ACD/LogD (pH 5.5), ACD/LogD (pH 7.4), Rule Of 5, Number of Hydrogen Bond Acceptors, Number of Hydrogen Bond Donors, Number of Freely Rotatable Bonds, Polar Surface Area, Polar Surface Area, Molar Volume, Refractive Index, Boiling Point, Flash Point, Density, Surface Tension
12. Search by data sources – select one or many individual data sources (from the 400 we hold), one or many data source types from Available Chemicals Databases, Biological Properties , Chemical Reactions, Chemical Safety Data , Drugs or Compounds in Development, Imaging Agents , Information Aggregators, Journal Publishers via MeSH , Ligand/binding/crystal Structure Databases, Metabolic Pathways , Molecular Libraries Screening Center Network, Natural Products, NIH Substance Repository, Patents, Personal Collections, Physical Properties (including SAR/QSAR databases), Protein 3D Structures, Publication or Magazine Article, Spectroscopy Databases , Substance Vendors, Theoretical Properties, Toxicology/Environmental Databases, Virtual Library, Web-based Article (blog or commentary)
13. Search by focussed library – Building Blocks, Screening Compounds, Building Stock, D-EXP014, Acetylcholinesterase (AChE), cAMP dependent protein kinase (PKA), Estrogen Receptor (Alpha), Phospholipase A2 (PLA(2)), Test Set for DILI modelling, Test Set for DILI modelling, Training Set for DILI modelling
14. Search by ligand screening – LASSO (Ligand Activity in Surface Similarity Order) similarity
15. Combine search to look for Single- or multi-component structures
16. Combine search to look for, or disregard, isotopically labelled structures
17. Filter results with analytical data

ChemSpider APIs

 

 

18. Use our web services for mass spectrometry to search by molecular mass or elemental composition within ChemSpider or within particular data sources,
19. Use our web services to search by chemical identifier, retrieve information about ChemSpider record, retrieve the chemical structure thumbnail
20. Use our web services for spectra to return all Open Data spectral information from ChemSpider, return spectral information on a compound, return identified spectra
21. You can show all spectra of a particular type on the spectra page

The free ChemSpider mobile app developed in collaboration with Alex Clark (innovator of the Mobile Molecular DataSheet, Reaction101 and Yield101) is now available for download from the iTunes store! The full details of the app, and some associated screenshots, are outlined on the SciMobileApps wiki here. A brief overview is given below…

“ChemSpider Mobile is a free iOS app (iPhone, iPod, iPad) for searching the ChemSpider online chemical database. It provides the ability to search by drawing a chemical structure, or entering a compound name. The app is very straightforward and easy to learn. Search results are shown in a list showing structure and names. Any search result can be examined in more detail by launching the mobile browser and viewing the structure on the ChemSpider web page. Although the ChemSpider web page is designed to work well on mobile browsers, the mobile app is more convenient to use, and is currently the best way to search by structure from a mobile device. The structure drawing capabilities are provided by the embedded version of the Mobile Molecular DataSheet. The app was built by Molecular Materials Informatics, on behalf of the Royal Society of Chemistry.”

We will look at developing an Android app for ChemSpider, taking into account what we learn from the early use of the iOS Mobile app.

A screencast of the functionality of ChemSpider Mobile is available below.

Only two days until the start of this year’s Fall ACS meeting in Denver. The ChemSpider team is busy preparing for the meeting, packing bags, polishing talks and honing workshop skills.

Please drop by and say “Hi!”

We’d like to repeat our invitation to everyone at the conference to drop by the RSC booth (Booth 1100). Where, of course you can chat with the ChemSpider team, get a quick demo (and find out more about our latest features), pick up our hot-off-the-press User Guide or scoop some exclusive ChemSpider goodies!

To celebrate the release of the new iPhone/iPad app* we have a limited number of covers for 3G and 4G iPhones as well as iPads

*The app itself is free to download from the AppStore.

You can also find out about lots of other things that the RSC does: from publishing books and journals to the promotion of chemistry worldwide. We’ll also have lots of information on our new e-membership option, which is making its’ debut at this meeting. Also keep an eye out for members of our Editorial staff from journals including: OBC, MedChemComm, PCCP, Soft Matter and RSC Advances, who will be scouring the conference in search of lots of new and exciting research.

Natural Product & Synthetic Chemists

I’d like to make an extra special invitation to any Synthetic chemists and Natural products chemists – from PhD students to Professors (please pass this on to all your friends and colleagues who will be at the meeting). The ChemSpider team really wants to hear about your research. Tell us about your latest publication or the work that you are most proud of, and we can make sure that your key compounds from these publications are in ChemSpider, on a platform freely accessible to chemists everywhere. If you are more interested in methodology you shouldn’t feel left out – ask us about ChemSpider Synthetic Pages.

 

ChemSpider related talks and workshops

Antony Williams (most-definitely the hardest working man I know) is giving a number of talks and workshops (details below) which are sure to be entertaining as well as thought-provoking and will be well-worth squeezing into your schedule.

We look forward to meeting you.

 

“Aligning scientific expertise and passion through a career path in the chemical sciences”

Colorado Convention Center, Room: 110, Sunday 28th August 2011, 1.40PM – 2PM

 

“Chemistry in the hand: The delivery of structure databases and spectroscopy gaming on mobile devices

Colorado Convention Center, Room: 110, Monday 29th August 2011, 9.05AM – 9.35AM

 

“ChemSpider: Does community engagement work to build a quality online resource for chemists?”

Colorado Convention Center, Room: 110, Tuesday 30th August, 10.10AM – 10.50AM

 

“An Introduction to ChemSpider – A Combination Platform of Free Chemistry Database, Free Prediction Engines and Wiki Environment”

Colorado Convention Center, Room 503, Wednesday 31th August 2011, 08.30AM – 11AM

 

“Structure representations in public chemistry databases: The challenges of validating the chemical structures for 200 top-selling drugs”

Colorado Convention Center, Room: 110, Wednesday 31st August 2011, 10.45AM – 11.05AM

As I mentioned in my blog post a few weeks ago, over the last few months we have been hard at work trying to improve how we organise all of the information and features that can be found when you view a ChemSpider record. And now you can see the fruits of our labour.

We hope that you find the changes we’ve made give you a better and easier user experience. While we think that the changes will be clear and intuitive, I’d like to highlight a few key features in my next few posts.

Inline help

When you look at compound pages and other useful pages, you should now see a lot more Question mark symbols dottedInline Help question symbol throughout the pages. We’ve called this approach inline help: rather than giving you an in-depth help resource on a separate page or as a PDF, it is much more useful to have a little snippet of help right at the point in the page where you need it. Clicking on the question mark symbol should bring up a yellow text box with short guidance (where there is a need to provide more complete help, we’ll provide a link to a page which contains much more detailed information). Of course, do let us know if you have any suggestions for improvements to the help text.

Inline hep text

 

Default infobox ordering*

Many users indicate they most often look for names (or name-structure associations), physical properties and spectral data, so we have put this information at the beginning of the record. Now when you come to a record, by default the Names infobox is the first box listed followed by the Properties, Spectra and the Articles infoboxes.

None of your favourite infoboxes have been removed (in fact we’ve created some new ones – see later). If you don’t like the default order, it is easy to change the ordering of the infoboxes by clicking on the titlebar and dragging them up or down the record. ChemSpider will remember your order and will use this for all future visits to the site from that PC (in the same browser/profile).

*If you have visited the site before ChemSpider will remember your previous settings. If you want to see the new default order you will need to clear your browser history or delete the ChemSpider cookies that are saved in your profile.

 

New infoboxes: Searches and Chemical Vendors

ChemSpider has always had great features, for instance:

The Similar Search – that allows you to find records for compounds that have the same skeleton, but have different stereochemistry or isotopic labels

The ability to load the structure from the current record into a structure search, so that you are able to modify it and construct a new search.

However, this hasn’t always been made very clear, in our redesigned compound page we have aimed to make these powerful search tools easier to discover and understand.

The Searches Infobox

Now you can find these all together in the Searches infobox – along with our Google Scholar custom queries which allow you to perform one search across publications using all of the validated synonyms (saving you from having to perform many separate searches for individual synonyms). We also help you to perform ‘structure searches’ of Google (in the form of an InChIkey search).

The Search infobox

The Chemical Vendors Infobox

We’ve also created an infobox  just to display Chemical vendor information, so that it is much quicker to find if the compound in the record is commercially available.

The record for Sparteine with it's Chemical vendors infobox

 

In my next post I’ll finish off discussing the improvements that we’ve made to the site. But of course, if you have any comments or questions about the features I’ve discussed here, please leave a comment below, or send an email to the ChemSpider inbox.

 

 

There are multiple structure drawing editors on ChemSpider. And we could add more! For example, one we don’t have is JSDraw and we also don’t have the ChemDoodle components in place, yet, though I am VERY impressed with the spectral display components that are integrated into the SpectralGame that ChemSpider supports. Compared to just a few years ago there is now an abundance of structure drawing editors in the form of Applets and JavaScript Editors. So many in fact that it can be confusing to the user. The user in reality should not worry about the technology behind the editor. It should be quite simple, especially when it comes to something as simple as the editor being the interface to querying ChemSpider. It should display perfectly on the browser(s) and platform(s) used by the user, it should be intuitive and easy to use (preferably without having to resort to reading help files), and essentially, it should “do what I want it to do”. Not at all an unreasonable list of demands right? Not so easy to deliver on mind you!

On ChemSpider we have multiple structure drawing editors. If you visit this page and open up the selection window by using “Click to Edit” you will see the editor below and, underneath the editor shown, a series of editors that you can choose from.

Structure Editors on ChemSpider

There has to be an order of listing the editors…the listed order is NOT a preferred order from our point of view. Just a list. We have heard feedback from numerous people about their preferred editor. Some live and breath the Java Molecular Editor (JME). Some prefer Accelrys JDraw because they already use Accelrys Draw. Many think that Elemental is a great Javascript Editor.

We are left with a choice….leave all editors (which has a cost in time to support them, keep them updated, tested etc) or reduce the number of editors to just a couple (or three). So, we welcome your input, on this blog post as a comment, or via the survey on SurveyMonkey here. We’d like your input to help steer our decision. Thanks

Previously there was ChemMobi, then there was our implementation of ChemSpider for a mobile browser and then ChemSpider SyntheticPages for a mobile browser. At next weeks’ ACS meeting in Denver we hope that the ChemSpider mobile app developed in collaboration with Alex Clark (innovator of the Mobile Molecular DataSheet, Reaction101 and Yield101) will be available for download from the iTunes store! The full details of the app, and some associated screenshots, are outlined on the SciMobileApps wiki here. A brief overview is given below…

“ChemSpider Mobile is a free iOS app (iPhone, iPod, iPad) for searching the ChemSpider online chemical database. It provides the ability to search by drawing a chemical structure, or entering a compound name. The app is very straightforward and easy to learn. Search results are shown in a list showing structure and names. Any search result can be examined in more detail by launching the mobile browser and viewing the structure on the ChemSpider web page.

Although the ChemSpider web page is designed to work well on mobile browsers, the mobile app is more convenient to use, and is currently the best way to search by structure from a mobile device. The structure drawing capabilities are provided by the embedded version of the Mobile Molecular DataSheet. The app was built by Molecular Materials Informatics, on behalf of the Royal Society of Chemistry.”

An early view screencast of the functionality of ChemSpider Mobile is now available.  New movies showing the details of the app will follow in the near future but this is an early view for interested parties.

We’ve rejigged our data to make searching more reliable.

What have we done?

We’ve regenerated all of the InChIs in the database with version 1.03 of the InChI code.

What does that mean?

The InChI (international chemical identifier) is a short piece of text that describes the structure of a molecule. Each one is generated by a free and open-source computer program, which guarantees that it should be the same and there shouldn’t be conflicting InChIs for the same molecule. You can’t really write them by hand, because they look like this:

InChI=1S/C10H22ClN2O5PS/c1-3-10(9-18-20(2,15)16)12-19(14)13(7-5-11)6-4-8-17-19/h10,12H,3-9H2,1-2H3

ChemSpider is built on InChIs. If two molecules have the same InChI, then they’re the same record in ChemSpider, and if you can’t InChIfy it, you can’t put it in ChemSpider. That’s why we can’t do, for example, polymers yet.

We’re proud to be founder members of the InChI Trust, which supports this critical element in the sharing of chemical compound information.

InChI Trust Member 2011

What does all this mean for ChemSpider?

Because there is an active community supporting InChI who look out for these things, version 1.03 contained some bug fixes which mean that a very small number of the InChIs themselves, only a few dozen out of the whole database, have changed.

  • P+–O bonds and P+–S are now treated slightly differently. This means that it will be easier to find the exact molecule you’re looking for, regardless of how it’s been drawn. (In principle this will also apply to analogous bonds containing arsenic, selenium, tellurium and antimony, but I can’t see any examples of this in the database.)
  • There was a small bug where the InChI generated for a molecule with an azide group in it sometimes varied according to the input drawing. But that doesn’t happen now.

This regeneration has also allowed us to catch and clean up some errors in the data.

What happens next?

Version 1.04 of the InChI code will be released soon. With our new framework for processing large amounts of data we’ll be able to update our InChIs much quicker. The main changes in 1.04 that affect the InChI are to how it handles radical atoms in aromatic rings, nobelium, lawrencium and rutherfordium, so we anticipate that there shouldn’t be very many changed InChIs!

COPIED FROM THE CHEMCONNECTOR BLOG

Unless you have no interest in sports, or have your head under a stone, you will be aware of the fact that the next Olympics will be held in London in 2012. Peter Scott (one of the editors of ChemSpider SyntheticPages) and I were recently discussing how much of a role chemistry plays now in modern sports. I’m a runner, cyclist, swimmer and overall sporting type of guy and depend on wicking materials to keep me cool, nutritional support to get me through my 100-150 mile bike rides in a day, glide stick to “stop me chafing” (ow!) and graphite grease to silence the rattling chain on my bike. In fact it doesn’t matter what sport I am doing it is easy to notice the influence that chemistry has on my improved performance at my tender age of, ahem, just over 40 (and holding, for a while now).

I was reminiscing with Peter that Sir Graham Richards and I were chatting about pyrenes about a year ago and we lamented on how Benzo[CD]pyrene, shown here, looks just like the Olympic rings. There is another rather well known “Olympic molecule” of course, already captured on Wikipedia and named Olympiadane. It looks rather complex to synthesize and personally I think the benzopyrene looks a lot more like the Olympic rings so I attached the synonym Olympicene to it! In fact, if you search ChemSpider using the name Olympicene you will find it.

In a recent discussion about our online crowdsourced database of syntheses, ChemSpider SyntheticPages,(and not distracted at all by the conversation about the Olympics going to the UK next year!!!)  I mentioned again to Peter the molecule Olympicene and he searched ChemSpider to find it. We agreed that it would be fun to know how easy it would be too synthesize it and if it was done it would be a good synthesis to add to ChemSpider SyntheticPages. That was enough to trigger Peter into action and chat with one of his colleagues to see if he can make it.

And so it starts…the trials and tribulations of how to synthesize the chemical Olympicene will be captured on ChemSpider SyntheticPages step by step. We’re not sure how complex a synthesis it will be..time will tell. It will be great to add the analytical data to ChemSpider too as it gets generated..including all the intermediate reaction steps and associated data. ChemSpider and CSSP were designed to support projects like this so it will be a fun story to watch it work through.

If YOU have any thoughts about good synthetic approaches for what seems like a simple molecule post them on this blog. Actually, why not try synthesizing yourself and add your syntheses to SyntheticPages!? Every contribution is issued a DOI for your publication list!

It might be ideal to get a  number of synthetic approaches posted on ChemSpider SyntheticPages and see which one is the best! Watch this space. Also, I’ve set up a Twitter account to capture the progress at @Olympicene. Enjoy!

We will be hosting a training session for ChemSpider at the ACS meeting in Denver. Please register early.

An Introduction to ChemSpider – A Combination Platform of Free Chemistry Database, Free Prediction Engines and Wiki Environment

Where: Colorado Convention Center
Room: 503
When: Wednesday, August 31, 8:30 AM – 11:00 AM

>> Click here to register for this workshop
ChemSpider has become one of the premier free online chemistry resources used by many thousands of chemists around the world every day. Hosting over 26 million unique chemical entities, sourced from over 400 separate data sources, ChemSpider provides access to experimental and predicted data, links to patents and publications and uniquely offers the ability to deposit and share their own data online. With the intention of integrating and curating public chemistry resources for the community ChemSpider encourages participation from chemists around the world. Integrated to Wikipedia, Google Patents, Google Books, Google Scholar and PubMed, as well as to the RSC Publishing platform, ChemSpider provides access to chemistry contained in millions of articles. This training session will provide an overview of searching ChemSpider and will discuss how to deposit data and participate in curating the existing information. We will also provide an overview of ChemSpider SyntheticPages, our venture into providing a community-based resource of semantically enriched synthetic procedures and allowing community peer review. This will be an interactive session and you are encouraged to bring your laptops to work along and ask questions regarding present and future capabilities. ChemSpider is built for the community and we welcome your comments about how to make it better for your needs.

I’m sure that by now everyone has noticed that the ChemSpider homepage design changed just over a month ago. A few features moved around, the Molecules of Interest section was retired and perhaps most significantly the Search box was given a dose of CSID: 5791, becoming bigger and more prominent.

The reason for this wasn’t just to make the site more attractive (though I think it does look ‘prettier’). Our motivation for the change is to deliver a site that makes it easier for users to interact with and understand. And by doing so, hopefully make it quicker and simpler for you to get your tasks done using ChemSpider. The refresh of the homepage is hopefully illustrative of this: We think that as most users come to ChemSpider to search for information – it should be easy to get straight into a search, hence the greater emphasis on this feature.

In the next few days we will release another upgrade to the interface which is centered on making it easier to understand the data presented in the compound Record View pages. I’ll post a blog entry dealing with some of the key features in the next few days.

The development of ChemSpider is an ongoing process, and we are aware that even after this upgrade there will be aspects of the compound Record View pages that will need more work (and also other parts of the site that still need development). It’s not going to be easy: ChemSpider brings together a rich and varied set of data from a large number of sources – this poses many challenges. We also realise that there are many different tasks that each of you – as users – want to perform, and it is always going to be difficult to reconcile all of the different opinions/needs.

However, we are trying to make the site better for you. And therefore, we’d really like to know your opinions on the changes (please test new features for a few days first). We welcome your feedback on the redesign either in the form of blog comments or email feedback (chemspider-at-rsc.org).

Over the next week – keep your eyes peeled for the upgrade and my accompanying blog post which will endeavor to give you a good introduction to the new features.

We have text mined compound names from all RSC 2008-2010 journal articles and loaded these into ChemSpider – adding about 26,000 new-to-ChemSpider compounds with links back to the published articles. We’ve also simplified the view of compound name and chemical/biochemical term highlighting within the Publishing Platform HTML view, so readers can link out from compound names (direct to ChemSpider for related compound information) and from chemical and biochemical terms (to other linked articles). We’ll be extending this to cover our 2011-and-then-ongoing publications, then looking to go further back into our journal archive. Later this week we should also have the compounds visible from the article home page, also linking through to ChemSpider

We have also worked with the Utopia Documents team (getutopia.com) to apply these enhancements to our PDF – so with the free Utopia Documents PDF viewer (originally developed in conjunction with Portland Press for the Biochemical Journal), readers get any enhancements overlaid on top of the PDF as they’re reading and can link out just as they can from the HTML. As this is powered from an API from our Publishing Platform, any additional links we make in future will be reflected in real time without having to update the PDF. Anyone who’s seen Steve Pettifer’s Utopia demonstrations tends to say “wow” at the potential, so many thanks to the Utopia team in Manchester for adding support for RSC articles. As above, this will work on 2008-2010 articles just being loaded, and as we extend the coverage Utopia will pick up and display the additional links for these papers

The RSC’s free chemical database ChemSpider has added RDF functionality to its interface, in collaboration with the University of Southampton’s School of Chemistry. The availability of RDF allows the database records to be found and understood by semantic web tools, another step in ChemSpider’s mission to create a public chemical information infrastructure.

Richard Kidd, Informatics Manager at the RSC says “we are delighted to work with top academic teams pushing forward what’s possible with semantic chemistry, and we hope others will use the RDF representation of ChemSpider to support their own developments”

ChemSpider as a Linked Data source for oreChem

The machine-processable representation was specifically developed in order to leverage the core competencies of the ChemSpider database: resolvable identifiers; high-quality, curated metadata; and rich linking to the extensive RSC corpus. Furthermore, as part of the Microsoft Research-funded oreChem project, OAI-ORE technology is being used to facilitate the discovery and re-use of the chemical information in the correct context.

Prof Jeremy Frey and Dr Simon Coles commented “it is a pleasure for Southampton to work with the RSC’s ChemSpider as a culmination of our contribution to the Microsoft-funded oreChem project. As a member of the Southampton Chemistry eResearch team, this work forms the core of graduate student Mark Borkum’s PhD thesis. ”

“Enabling open, semantic chemistry in this way is a monumental step forward for the domain,” notes Lee Dirks, director of Education & Scholarly Communication for Microsoft Research, “We’re thrilled to have played a role in facilitating the creation of this resource and extremely pleased to see Southampton and the RSC innovating and leading the field.”

Another oreChem participant, Carl Lagoze, the Associate Professor, Cornell University Information Science, Co-Director Open Archives Initiative added “it’s wonderful to see the results of our work on OAI-ORE in this exciting application. It fulfils our goal of making the results of research easier to disseminate and reuse”

Read the rest of this entry »

ChemSpiderman, Antony Williams, will be giving ficve presentations at the #ACS Denver meeting in the fall. The times and dates will be listed when they become available. The list of titles is below.

Structure representations in public chemistry databases: The challenges of validating the chemical structures for 200 top-selling drugs

Chemistry in the hand: The delivery of structure databases and spectroscopy gaming on mobile devices

Aligning scientific expertise and passion through a career path in the chemical sciences

ChemSpider: Does community engagement work to build a quality online resource for chemists?

Identifying “known unknowns” using mass spectrometry data and ChemSpider

It’ll be a different type of talk than most I have given before on ChemSpider…should be fun!

I am happy to announce that ChemSpider is a part of the Open PHACTS project. It is detailed in the following PDF file but can be summarized with the extract from the Wikipedia article I recently posted.

“Open PHACTS is an Innovative Medicines Initiative project aiming to deploy a highly innovative open standards, open access, semantic web approach to address key bottlenecks in small molecule drug discovery – disparate information sources, lack of standards and information overload. It will become an open, integrated and sustainable chemistry, biology and pharmacology knowledge resource for drug discovery. It will provide tools for a comprehensive platform of chemical, biological and pharmacological information, confederated from vast, distributed and variable data and information sources. This semantically enriched and fully interoperable platform will deliver information on small molecules and their pharmacological profiles, including pharmacokinetics and ADMET data as well as on biological targets and pathways. The proposed platform will be a widely accessible, open innovation platform that will serve EFPIA partners as well as biotech and academic drug-discovery for one of their core business activities: knowledge discovery and verification. Practicing drug-discovery scientists in both the public and the private sector will turn to this service to: – integrate data on target expression, biological pathways and pharmacology to identify the most productive points for therapeutic intervention – investigate the in vitro pharmacology and mode-of-action of novel targets to help develop screening assays for drug discovery programs – compare molecular interaction profiles to assess potential off-target effects and safety pharmacology – analyse chemical motifs against biological effects to deconvolute high content biology assays.

The Open PHACTS consortium comprises 23 European core partners, with leading experts in the fields of data mining, annotation, small molecule data storage and manipulation, target related bioinformatics, pathway annotation, protein structure analysis, massive daily in silico reasoning, chemical biology, and computational chemistry. These are: Pfizer, Universität Wien>, Technical University of Denmark, University of Hamburg, BioSolveIT GmBH, Consorci Mar Parc de Salut de Barcelona, Leiden University Medical Centre, Netherlands Bioinformatics Centre, Royal Society of Chemistry, Vrije Universiteit, Spanish National Cancer Research Centre, University of Manchester, Maastricht University, ACKnowledge, University of Santiago de Compostela, >Rheinische Friedrich-Wilhelms-Universität Bonn, AstraZeneca, GlaxoSmithKline, Laboratorios del Dr. Esteve, Novartis, Merck KGaA, Lundbeck, and Eli Lilly.”

The Open PHACTS website can be found here. There will be a lot more details about Open PHACTS in the next few months. Follow the conversation on Twitter here: http://twitter.com/#!/Open_PHACTS

 

 

Dotmatics Limited is pleased to announce that it will provide its web-based structure drawing tool, Elemental, to the leading chemistry community website ChemSpider. Elemental provides a zero install drawing tool that lets users draw simple chemical structures or complex structure queries directly within a webpage.

Antony Williams, Vice President of Strategic Development for ChemSpider comments “Elemental offers ease of deployment and flexibility in structure drawing to our community of users and we are happy to embrace this web-based structure drawing platform as an entry point to the rich resources of ChemSpider.”

Dr Mike Hartshorn, Director and CSO of Dotmatics, said “We are delighted to be working with such a well-known chemistry resource as ChemSpider. The new tools will allow simple access to the wide range of structures and related information that is maintained by ChemSpider and the RSC”.

About Dotmatics
Dotmatics Limited (www.dotmatics.com) is a leading provider of web-based database integration and visualisation tools for use within the life sciences industry.

About the Royal Society of Chemistry
The Royal Society of Chemistry is the UK Professional Body for chemical scientists and an international Learned Society for the chemical sciences with more than 47,500 members worldwide. It is a major international publisher of chemical information, supports the teaching of chemical sciences at all levels and is a leader in bringing science to the public. www.rsc.org

About ChemSpider
ChemSpider offers a structure-centric community for chemists to resource data.  Offering access to over 25 million unique chemical entities from over 400 data sources and by providing a platform for crowd-sourced deposition, annotation, and curation, it is the richest source of free integrated chemistry information available online.  ChemSpider delivers data and services to enable the semantic web for chemistry.  www.chemspider.com

Contacts:
Mike Hartshorn
Dotmatics Limited, The Old Monastery
Windhill, Bishops Stortford
CM23 2ND, UK
Tel: +44 1279 654123
Email: info@dotmatics.com
www.dotmatics.com

Antony Williams
ChemSpider, Royal Society of Chemistry
904 Tamaras Circle
Wake Forest, NC 27587
Tel: 919-201-1516
Email: info@chemspider.com
www.chemspider.com

 

 

Last week Antony Williams gave three presentations about ChemSpider as a chemistry resource at the 241st ACS National Meeting  & Exposition in Anaheim.

For those of you who were not able to attend here are the presentations:

RSC ChemSpider as an environment for teaching and sharing chemistry 

Hosting a compound centric community resource for chemistry data 

How the web has weaved a web of interlinked chemistry data 

In January of this year we held a meeting in London with a group of interested parties who wanted to discuss with us how ChemSpider can be used to support metabolomics. From my point of view that was a very successful meeting in terms of providing an overview of what ChemSpider is capable of today as well as garnering feedback and input from a community of users applying mass spectrometry to perform metabolomics studies.

As a result we will be holding a round-table discussion here in the United States in Research Triangle Park, North Carolina, in April 2011, again for scientists interested in further refining how ChemSpider can be extended to serve the metabolomics community. An outline of the meeting is provided below. If you are interested in participating please respond to me directly at williams”AT”rsc”DOT”org by the deadline listed below. We will cap the attendance fairly quickly and are specifically looking for people who can be vocal about their needs and how we may be able to help with ChemSpider as a platform.

Metabolomics Round Table – Delivering Value to the Metabolomics Community via ChemSpider, a Public Domain Database

Hosts: John Shockcor, Waters and Antony Williams, Royal Society of Chemistry

When: April 21st, 2011

Venue: To Be Determined, Research Triangle Park, North Carolina

The metabolomics community presently utilizes public domain databases such as KEGG, LipidMap, DrugBank and a myriad of other online resources to assist in the analysis of data. However, rich as these resources are, they are limited in scope, are challenged by known data quality issues, and are not directly focused on serving the needs of the metabolomics community. ChemSpider is an online resource for the chemistry community hosted by the Royal Society of Chemistry with the intention of linking together online chemistry resources, cleaning and curating chemistry related data and collectively serving a number of communities. ChemSpider has been used by members of the mass spectrometry community, including instrument vendors, for the past 3 years. This roundtable meeting is to provide an overview of how ChemSpider is presently used by scientists working in the domain of metabolomics and garner feedback from the existing user base as well as new potential users to help define how ChemSpider can be enhanced to further support the needs of this community.

Antony Williams, VP of Strategic Development and host of ChemSpider at RSC, and John Shockcor, Director of Life Sciences Business Development at Waters Corp, invite you to attend thismeeting to provide input to steer development of ChemSpider to address the needs of the metabolomics community. An agenda will be defined in the near future based on interest.

If you are interested in attending please express your interest by sending an email to williamsa@rsc.org

 

The Royal Society of Chemistry will be heading to California for the Spring ACS Meeting where Antony Williams, the VP of Strategic Development for ChemSpider will be presenting several papers and hosting a ChemSpider Training Session.

The Training Session – “ChemSpider: A Community Resource for Chemical Data”  will be held on  Wednesday, March 30th from  8:30-11:00 AM in the Anaheim Convention Center,  Room 211 A.

This should be a lively and interactive session and is your opportunity to give feedback regarding present functionality and how you would like to see ChemSpider develop in the future. These sessions have proved popular in the past, so make sure to register early for your place here.

The titles and locations of the talks are:

RSC ChemSpider as an environment for teaching and sharing chemistry – Division of Chemical Education. March 28, 2011 from 9:45 am to 10:05 am. Disney’s Grand Californian Hotel , Room: Trillium B

Hosting a compound centric community resource for chemistry data – CINF: Division of Chemical Information. March 28, 2011 from 3:05 pm to 3:30 pm. Anaheim Convention Center , Room: 204 A

How the web has weaved a web of interlinked chemistry data – CINF: Division of Chemical Information. March 29, 2011 from 3:00 pm to 3:40 pm. Anaheim Convention Center , Room: 204 A

For a more pesonalized demonstration of ChemSpider you can also visit the RSC at Booth 903.

We look forward to meeting you in Anaheim.

On Monday 31 January, ChemSpider and Waters partnered with Chemistry World to deliver its first international live webinar and active audience event at Burlington House, London - Connecting Chemistry & Mass Spectrometry on the Internet.

Dr Antony Williams (RSC, ChemSpider) and Dr John Shockcor (Waters)  presented a top class and engaging event.

You can view the event by registering here:

http://chemistryworld.gav.co.uk/webcasts/event-detail/5/identification-of-metabolite-structures-using-mass-spectrometry.html

Earlier in the day we also hosted a round-table discussion for scientists interested in further refining how ChemSpider can be extended to serve the metabolomics community. Our thanks go out to all those who attended for a lively discussion.

We would be interested to hear your views -  for example, are there any additional features that you would like to have available or any other data sources that we should  link to?

GGA Software Services and Royal Society of Chemistry’s ChemSpider Announce Partnership

Cambridge, MA – January 28, 2011 – GGA Software Services, a leading provider of scientific informatics services to the life sciences and related industries, and Royal Society of Chemistry’s ChemSpider, a leading provider of chemical services and data on the internet, announced today a strategic technology partnership that will provide significant benefits to users of the ChemSpider chemical database.  The agreement makes available GGA’s open source Bingo chemistry search engine for use on the ChemSpider website, enhancing the ability of users of the ChemSpider service to efficiently conduct searches of the nearly 25 million chemical structures within the ChemSpider chemical database.

Bingo is a RDBMS data cartridge that provides the next-generation, fast, scalable, and efficient storage and searching solution for chemical information.  Bingo sets the industry standard in structure and reaction registration and retrieval, implementing state-of-the-art indexing algorithms within an underlying database server and making chemical searching fast and reliable.  Its extensible indexing enables scientists to store, index, and search chemical moieties alongside numbers and text within one underlying relational database server.  Users can seamlessly combine chemical substructure, reaction, and exact structure searching with numeric and text SQL terms.  It also has features not present in other cartridges — for example, advanced tautomer search, resonance substructure search, and fast updating of the index when adding new structures.

Richard Golob, President and CEO of GGA Software Services, states “We are very pleased to offer GGA’s Bingo search engine to users of the ChemSpider service.  As a professional services company, GGA would like to partner and share with others in the scientific informatics community to develop best-of-class open-source solutions.  Bingo, our open-source offering that is the result of five years of internal development, provides all the necessary search features required by chemoinformatics applications.  With Bingo, we have succeeded in achieving superior performance in a data cartridge for various types of searches, especially substructure search.  Users of Bingo will enjoy unsurpassed search capability for molecules and reactions.”

Antony Williams, Vice President of Strategic Development for ChemSpider, adds “ChemSpider has become one of the primary internet resources for scientists to search for chemicals and related information. The ability to efficiently search across a database of almost 25 million structures is a significant challenge.  We chose GGA’s Bingo offering as a result of the superior responsiveness of the GGA development team and the fact that the software is open source, thereby allowing us to extend the platform as necessary.  We look forward to a long and productive relationship with GGA Software Services.”

About GGA Software Services LLC

GGA Software Services is a leading provider of outsourced scientific informatics services to the life sciences industry.  Headquartered in Cambridge, Massachusetts, with a best-of-class development center in St. Petersburg, Russia, GGA offers broad capabilities in scientific software engineering, algorithm development, and knowledge, especially in cheminformatics and bioinformatics.  Global pharmaceutical, scientific instrumentation, and medical device corporations as well as early-stage life science companies rely on GGA to provide ongoing scientific informatics services, including support, maintenance, and QA services, to enhance their internal resources.  Since 1994, GGA has successfully served as an extended workbench for its clients, allowing them to both achieve their critical objectives and maximize their scientific informatics budget.  For more information, please visit our website at www.ggasoftware.com.

About the Royal Society of Chemistry

The RSC is the largest organisation in Europe for advancing the chemical sciences.  Supported by a worldwide network of members and an international publishing business, our activities span education, conferences, science policy and the promotion of chemistry to the public.  www.rsc.org

About ChemSpider

ChemSpider offers a structure-centric community for chemists to resource data.  Offering access to almost 25 million unique chemical entities from over 400 data sources and by providing a platform for crowd-sourced deposition, annotation, and curation, it is the richest source of free integrated chemistry information available online.  ChemSpider delivers data and services to enable the semantic web for chemistry.  www.chemspider.com

Contacts:

Richard Golob

GGA Software Services LLC

One Mifflin Place

Cambridge, MA 02138

Tel: 617-491-5100

Fax: 617-491-0932

Email: info@ggasoftware.com

www.ggasoftware.com

Antony Williams

ChemSpider, Royal Society of Chemistry

904 Tamaras Circle

Wake Forest, NC 27587

Tel: 919-201-1516

Email: info@chemspider.com

www.chemspider.com

In response to your recent feedback we have now made it easier to see at a glance the Systematic name or PhysChem Properties for a compound.

Here is an example of the new record layout for glucose:

  glucose1

 

Clicking on any of the hyperlinks in the central column will expand the information available. The Search Google Scholar link will enable you to expand a search into the scientific literature based on the approved names and synonyms in ChemSpider.

Searching Similar will bring a table of compounds which share the same skeleton, but may have variations in the stereochemistry. The results are displayed in a grid format. New visual icons are now available to help you select the relevant record. These icons will tell you if there is information from Wikipedia, or if spectra are available for that compound.

 glucose2

Clicking on the structure image or the ChemSpider ID will take you back to the record view for each individual compound.

Other icons will be indicative of specific stereochemistry or double bond geometry or if the compound is a charged species or if it an isotope.

 glucose5

   = no, of defined stereocenters

  glucose6

                                = double-bond geometry

    glucose7

                                  = charged species

  glucose9

                              = non-regular isotope

glucose8                                   = spectra

    glucose4

                                           = Wikipedia

Scrolling down the record view will still give lots more information about the compound such as commercial vendors, links to biological, toxicity and safety data, as well as links to RSC journals, books and databases.

We have also made it easier for you to keep the information from different info boxes by adding a print button. For example you can now print a spectrum of interest.

 glucose3

 

Don’t forget that you can also Add your comments, compounds, literature references and spectra to ChemSpider by clicking on the right hand hyperlinks at the top of the record view.

For adding anything but comments you need to be a registered and logged on user.

Please do continue to let us know what you think about these enhancements and if there is anything else that you would like to see on the website.