The Exchange: Data Clouds, Data Commons and Data Ecosystems: How Data Science is Changing the Way We Analyze Scientific Data, Make Discoveries and Reproduce Research
Time 10/17/18 04:00PM-05:15PM
Room Pacifica Ballroom 7
Internet2 President and CEO Howard Pfeffer will offer opening remarks before introducing keynote speaker Robert L. Grossman whose presentation is titled: Data Clouds, Data Commons and Data Ecosystems: How Data Science is Changing the Way We Analyze Scientific Data, Make Discoveries and Reproduce Research.
Scientific data has grown too large for most research groups to host and analyze the data from large projects themselves. Data commons provide an alternative by co-locating data, storage and computing resources with commonly used software services, applications and tools for managing, analyzing and sharing data to create an interoperable resource for the research community. We give an overview of data commons and describe some lessons learned from the NCI Genomic Data Commons, the OCC Environmental Data Commons, the Bionimbus Data Commons, the BloodPAC Data Commons and the BRAIN Commons. We also describe how second-generation data commons are providing the foundation for data ecosystems and supporting reproducible research. We conclude by giving an overview of how an organization can set up a commons themselves.
For more information on our Keynote Speaker, please see our Featured Speaker page!
Keynote Speaker Robert Grossman
Robert L. Grossman is the Frederick H. Rawson Professor of Medicine, a Professor of Computer Science, and the Jim and Karen Frank Director of the Center for Translational Data Science at the University of Chicago. He is the principal investigator for the National Cancer Institute Genomic Data Commons (GDC), a platform for the cancer research community that manages, analyzes, integrates, and shares large-scale genomic datasets in support of precision medicine. The GDC is accessed by over 100,000 users each year. He is also the Director of the not-for-profit Open Commons Consortium that develops and operates data commons and data clouds to support research in science, medicine, health care, and the environment.
Primary track Nothing But Net(working)