The e-Bioinfra Gateway (e-BioinfraGateway) is a web application that provides facilitated access to the Dutch Grid infrastructure for researchers that need to analyse large data collections with complex or compute-intensive methods. The researcher just needs to follow some steps on a web interface using any web browser. The web link is http://www.ebioscience.amc.nl/ebioinfragateway/
The following steps are needed to use the e-BioinfraGateway:
The e-BioinfraGateway is developed and maintained by the e-BioScience group of the Bioinformatics Laboratory of the AMC, Department of Epidemiology, Biostatistics and Bioinformatics (KEBB). This research is supported by the BiGGrid project (NWO funding) and the AMC ICT Innovation Fund (research).
For more information please contact us.
Registration
In order to use the e-BioinfraGateway, users should register as follows:
1 Provide the needed information on the registration web page, and agree to the terms and conditions.
Our administration will inform you via email as soon as your account is activated.
NOTE:
Your account includes access privileges to the e-BioinfraGateway website and to the data staging machine, to which the data should be uploaded. The passwords to both accounts are the same.
Experiment Setup
In the documentation pages of each experiment type, there is usually a section which describes the supported input data and any additional step to prepare the input data. Please make sure that you read that section carefully and you provide the input data exactly as described in that section.
In addition, please note that we highly recommend to the users of the e-BioinfraGateway to start their data analysis with a small set of data (e.g., 2 subjects) and confirm the results before proceeding with the complete data set analysis.
Data Transport
The research data to be analysed on the Grid needs to be transferred to grid storage outside the AMC network in a secure manner. This is done with the aid of an intermediary storage server at the AMC, called data staging machine. Each researcher has a private space that can hold data for a limited time (1-2 weeks). All data copied into this machine are automatically transported to the Grid storage. Likewise, all results generated by calculations executed on the Grid are copied back to the data staging machine.
Files can be copied to/from this space by the researcher using regular ftp client programs such as WinSCP, FireFTP or FileZilla. For the examples on the user documentation we are using FireFTP. If you don't have administrative rights on your windows / AMC-OpenPC machine you can use this version of FileZilla, which does not require installation (i.e., unzip and use).
Important Notes:
Please avoid using "." (except the final dot before extension), and "spaces" in the file name. So "subject.001.nii" and "subject 001.nii" are not good but "subject-001.nii" and "subject_001.nii" and "subject-001.tar.gz" are good.
Please make sure that your input data is prepared according to each experiment requirements.
You have to select individual files (and not directories) in the 'input selection' page of experiment. (We suggest using a flat directory structure for input data but if you prefer to store your input files in sub-folders then you need to provide 'custom input data directory' on 'upload' page.)
S. Shahand, M. Santcroos, Y. Mohammed, V. Korkhov, A. Luyf, A. van Kampen, and S. Olabarriaga. Front-ends to Biomedical Data Analysis on Grids. In Proceedings of HealthGrid 2011, Bristol, UK, 2011. PDF
S. Shahand, M. Santcroos, A. H. van Kampen, and S. Olabarriaga. A Grid-enabled Gateway for Biomedical Data Analysis. Submitted.