We've recently finished a short video to help describe the services provided by
the Open Science Data Cloud and the need that drives our interest in providing this service.
If you're new to the OSDC ecosystem or just want to learn more about what the OSDC offers
offers, watch the video here.
Interested researchers can apply for an OSDC resource allocation here.
Big data is important to transforming research and the OCC is giving away a limited number of Discovery Awards to encourage scientists
to experiment with developing novel technology for analyzing big data. We also think it’s important to encourage use of big data in the
business community and are giving away a limited number of Innovation Awards.
Both awards will give users free computing resources on the Open Science Data Cloud.
Our Discovery Awards (for scientific research) are for 50,000 OSDC core hours and are available to selected
scientists and researchers. OCC Innovation Awards (for businesses) are for 30,000 OSDC core hours. We especially encourage small businesses to apply.
To learn more or to apply for a Discovery or Innovation Award, first apply for an OSDC resource allocation, then
send an email noting your application and a short paragraph describing what you’d like to do with the core hours awarded to email@example.com.
We’re proud to announce that The Ontario Institute for Cancer Research (OICR) is now a member of the
Open Cloud Consortium!
OICR will be involved with several OCC Working Groups, including the Open Science Data Cloud Working
Group and the Biomedical Commons Cloud Working Group, to build systems for cancer genomics analysis
and biomedical data sharing.
“Cancer genomics data sets are now too large to download over the Internet, and the compute resources needed to mine them for knowledge are out of reach for many researchers. Our collaboration will enable researchers from around the world to get the data, perform sophisticated analyses over it, and to extract knowledge that can be used to improve cancer diagnosis and care,” said Dr. Lincoln Stein, Director of the Informatics and Bio-computing Program at OICR.
“The Biomedical Commons Cloud (BCC) provides a medical research center a quick and easy way to get access to a secure and compliant cloud that contains a critical mass of biomedical data,” said Dr. Robert Grossman, Director of the OCC. “We are very excited that OICR will be one of the founding partners of this effort.”
The full joint press release is available here.
Learn more about how your organization can become a member of the OCC here.
The OSDC has a very active community of BETA users and demand for OSDC services is growing. To better
distribute available resources among interested researchers, the OSDC moved on August 1st to a new
resource allocation paradigm.
In the new paradigm, OSDC resource allocations generally run for 3 months at a time and begin on January 1, April 1, July 1,
October 1. All incoming applications for resources will be reviewed near one of these terms and are due
on the 15th of the month prior (e.g., December 15th for the allocation period starting January 1st). During
the survey process, a resource allocation extension can be requested if your research is not yet complete.
Established partner projects and Labs and OCC members that have contributed hardware will be given first priority.
To apply for a resource allocation during the period beginning on October 1st please use the OSDC Resource Allocation Application.
Special protected resources like the Bionimbus-PDC have their own separate application process.
Recipients of OSDC resource allocations are expected to:
- Make appropriate use of OSDC resources and use good social behavior (ie - terminating VMs when not in use).
- Cite the OSDC in any papers and publications
- Regularly respond to quarterly OSDC allocation surveys
- Submit tickets to the OSDC support ticketing system
when encountering technical issues not covered by the OSDC support documentation
Open Science Data Cloud researchers from all over the world gathered June 16-20
in the Netherlands at the University of Amsterdam (UvA) Science Park for the
annual OSDC Partnerships for International Research and Education (PIRE)
Workshop. At the workshop, this year's selected OSDC PIRE fellows kicked off
their fellowships by meeting their international summer research hosts
and being trained in the basics of data science and cloud computing from experts
in the field.
Over the course of the week, the fellows learned about open data repositories
such as the OSDC Public Data Commons,
the ENVRI project, the Global Biodiversity Information Facility,
data.tt out of Trinidad and Tobago, and Japan's Landsat-8
Real-time Release site. They worked through tutorials
on tools for data intensive research such as the Open Science Data Cloud
and projects like SAGA (Simple API for Grid Applications).
The fellows also learned best practices for data visualization and research
Armed with these new skills, the fellows formed teams to compete in a data
science hack-a-thon challenge with great results. Teams worked on projects
aimed at facilitating cross-disciplinary data analysis, using OSDC public datasets
for educating the public on extreme weather conditions, developing mobile apps
using public geospatial datasets, and making clouds like OSDC easier for scientists
The first place team, Cody Buntain (University of Maryland) and Nelson Auner (University of Chicago),
created a program they call "Mayfly," a toolkit that enables
reproducible research by allowing researchers to easily publish and share their
analysis, data visualizations, and results to Dropbox for others to view.
The team installed their toolkit on an OSDC public virtual machine snapshot
for any OSDC user to adopt and also made the source code and documentation
available on github for other users.
All teams delivered impressive results after only a few short days of work during
the workshop. Imagine what else could be accomplished!
The OSDC and Bionimbus were featured in a June 2014 article in Scientific American
called "Bioinformatics: Big Data Versus the Big C."
Analysing the genomes of 8,200 tumours is just a start. Researchers are “trying to figure out
how we can bring together and analyse, over the next few years, a million genomes”, says Robert
Grossman, who directs the Initiative in Data Intensive Science at the University of Chicago in
Illinois. This is an immense undertaking; the combined cancer genome and normal genome from a
single patient constitutes about 1 terabyte (1012 bytes) of data, so a million genomes would
generate an exabyte (1018 bytes). Storing and analysing this much data could cost US$100
million a year, Grossman says."
University of Chicago Pathologist and OSDC user Megan McNerney's discoveries (M. E. McNerney et al.
Blood 121, 975–983; 2012) are featured as a bioinformatics project that has shown the benefits of mining data.
Members of the OCC and OSDC team were present during the recent The Cancer Genome Atlas (TCGA)
symposium at which OCC Founder and Director Robert Grossman gave a keynote address that considered the
future of genomics and bioinformatics research.
Dr. Grossman framed the future of bioinformatics research and sharing large genomic datasets as an
extension of Garrett Hardin's 1968 publication, The Tragedy of the Commons.
The Bionimbus PDC and the OSDC's Public Data Commons
are excellent examples of Dr. Grossman and the OCC's efforts to provide shared, public resources in an open-source environment to the
both the genomics community and researchers across all disciplines to facilitate discovery.
You can watch the full speech here.
This week OSDC lead Maria Patterson will participate in the 2014 HyspIRI Symposium
in Maryland as part of the OCC’s collaboration with NASA, Project Matsu. Dr.
Patterson will give a talk on the Matsu Wheel for analytics that nightly processes
large volumes of satellite data. Stuart Fry, Dan Mandl, Pat Cappelcare and Vuong
Ly of Project Matsu will also be presenting and organizing.
The symposium will focus on enabling the evolution of land imaging by using new
approaches and products. Participants will discuss ways the HyspIRI mission and
other new technologies can help address sustainable imaging land requirements.
The HyspIRI mission includes two instruments mounted on a satellite. There is an
imaging spectrometer measuring from the visible to short wave infrared and a
multispectral imager measuring the mid and thermal infrared (TIR). You can
learn more about the HyspIRI mission here: http://hyspiri.jpl.nasa.gov/
One of the OCC’s key members, University of Chicago, is hiring for 4 positions in their Center for Data Intensive Science. These positions will work closely with our OSDC and OCC team.
If you’re interested or know someone qualified who might be, applications are being accepted for the following positions:
- Director of Security x1
- Bioinformaticians x4
- Linux System Administrators x4
- Software Engineers x4
To learn more:
Members of the OCC team are in Texas this week at the Open Big Cloud Symposium. The Symposium
aims to bring together the brightest minds in industry, academia, and research to discuss the
future of cloud computing and Big Data.
The conference will explore bringing the Cloud to the Enterprise, models and benefits, Cloud
Operation Model (DevOps), Open Technologies and best practices including software and hardware
disaggregation, Cloud and BigData for Scientific and Engineering workloads.”
To learn more visit: http://www.opencompute.org/community/events/ocp-on-the-road/open-bigcloud-symposium-and-ocp-workshop-2014
Maria Patterson, a research scientist at the Center for Data Intensive Science at the University
of Chicago and a lead for the Open Science Data Cloud will be giving a talk on 4.17 on the working
with large scientific datasets.
This talk will be an overview of the OSDC, one of the world’s largest general purpose science clouds
managed by the Open Cloud Consortium (OCC), and information on how to collaborate with the OSDC on
research projects involving data intensive computing. This talk will also discuss the NSF-funded
Partnership for International Research and Education (PIRE) fellowship opportunities for summer 2014.
Find out more about OSDC here https://www.opensciencedatacloud.org/
and the NSF PIRE fellowship here http://pire.opensciencedatacloud.org/.