Computational Science Center Seminar
"Computational Science Center Seminar — VisTrails: Using Provenance to Streamline Data Exploration and Visualization"
Presented by Juliana Freire, Department of Computer Science and Engineering, Polytechnic Institute of New York University, Brooklyn, NY
Monday, March 11, 2013, 11 am
Biosciences Seminar Room, Bldg. 463
Hosted by: Dantong Yu
Computing has been an enormous accelerator to science and industry alike and it has led to an information explosion in many different fields. The unprecedented volume of data acquired by sensors, derived by simulations and analysis processes, and shared on the Web opens up new opportunities, but it also creates many challenges when it comes to managing and analyzing these data. In this talk, I discuss the importance of maintaining detailed provenance (also referred to as lineage and pedigree) for digital data. Besides providing important documentation that is key to preserve data, to determine the data's quality, reproduce and validate results, provenance can also be used to streamline the data exploration process. Besides presenting techniques we have developed to efficiently manage and re-use provenance information, I will give an overview of the provenance infrastructure we have built for the open-source VisTrails system (http://www.vistrails.org). I will also describe emerging applications and novel uses of provenance for enabling collaborative data analysis, teaching science, and publishing reproducible results.