Research Data Management Workshop for graduate students from January 28 to February 1

Wednesday, Dec 19, 2018

Research Data Management Workshop
Monday, January 28, 2019 - Friday, February 1, 2019
138 Lewis Science Library

Join us for a five-day workshop on research data management for graduate students, organized and sponsored by Princeton University Library, Princeton Institute for Computational Science and Engineering, and OIT Research Computing. Topics covered include an introduction to research data management, processing and analyzing data, and best practices for sharing and archiving research data.

Apply now by Friday, December 21, 2018.

Application: Open to all current Princeton University graduate students. Seating is limited, so advanced registration is required. Participants are expected to attend all 5 days. All participants must have an active PU NetID. A link to the registration site, along with more registration information can be found here.

If you are unable to attend the entire workshop but would still like to attend a portion of it, please email with your availability and we will add you to the wait list. If seats are available, we will do the best we can to accommodate as many participants as possible.

This event is co-sponsored by the Office of the Dean for Research, the Graduate School, and the Center for Statistics and Machine Learning.

The goals of the workshop are for graduate students to:

  • Gain new skills in research data management
  • Learn best practices and effective tools for data management
  • Understand issues related to compliance with funding agencies' data management requirements
  • Become familiar with on-campus services and resources that help with research data management 


Day 1, January 28: Workshop Introduction and Overview of Research Data Management

1:00-2:00 p.m.  Welcome, Curt Hillegas, PICSciE/OIT Research Computing
                            Opening Remarks, Christine Murphy, Graduate School, and Dan Marlow, Physics
                            Introduction to the RDM Curriculum - Willow Dressel and Yuan Li, Princeton University Library

2:00-2:15 p.m.   Coffee Break

2:15-3:00 p.m.   Overview of Research Data Management
                             Willow Dressel and Yuan Li

3:00-3:30 p.m.   Hands-on activities
                             Create a data management plan for your project
                             Willow Dressel and Yuan Li

3:30-3:45 pm     Q&A, Wrap up, and online evaluation/assessment

Day 2, January 29: Creating, Collecting, Compliance

1:00-1:45 p.m.   Data Management in the Creation and Collecting Phase
                             Willow Dressel and Anne Marie Phillips, Princeton University Library

1:45-2:15 p.m.   Hands-on activities
                             File naming and folder structuring
                             Identifying types and stages of data
                             Willow Dressel and Yuan Li

2:15-2:30 p.m.   Coffee Break

2:30-3:30 p.m.   Legal and ethical considerations and special session on Human Subjects
                             Maureen Thompson-Siegel and John Jenkins, ORPA
                             Paul Hryvniak and Sheera Gaskin, RIA
                             Wesley D. Markham, Associate University Counsel, OGC
                             Yuan Li, Scholarly Communications Office, Princeton University Library

3:30-3:45 pm     Q&A, Wrap up, and online evaluation/assessment

Day 3, January 30: Processing and Analyzing Data

1:00-2:00 p.m.   Overview of Infrastructure for Storing, Moving, and Sharing Data
                            Curt Hillegas, Chris Tengi, OIT Research Computing,  Bill Wichser, Princeton Institute for Computational Science and Engineering (PICSciE), 
                            Martin Harriss, OIT, Natasha Ermolaev, The Center for Digital Humanities (CDH)

2:00-3:00 p.m.   Data Analysis Tools
                             Overview of data analysis tools, Oscar Torres-Reyna, Data & Statistical Services
                             Python and Pandas, Matthew Cahn, OIT Research Computing, 
                             Apache Spark, Ben Hicks, OIT Research Computing/CDH
                             TensorFlow TBC

3:00-3:10 p.m.   Coffee Break

3:10-4:00 p.m.   Describing, Classifying, and Protecting data
                             Ben Hicks, David Sherry, OIT, Gretchen Thiele, OIT Research Computing

4:00-4:10pm       Q&A, Wrap up and online evaluation/assessment

Day 4, January 31: Publishing, Sharing and Re-Using Data

1:00-1:45 p.m.   Data Publishing, Sharing and Reuse
                             Willow Dressel and Yuan Li

1:45-2:00 p.m.   Hands-on activities
                             Willow Dressel and Yuan Li

2:00-2:45 p.m.   Legal and ethical considerations and compliance: Publishing, Sharing, and Re-use
                             Robert Berness, Yuan Li, and Paul Hryvniak
                              Q & A

2:45-3:00 p.m.   Coffee Break

3:00-4:00 p.m.   Different approaches to creating and sharing data
                             April Clyburne-Sherin, Code Ocean,
                             John Wiggins, Princeton Neuroscience Institute (PNI) 
                             Matthew Cahn TBC & Damian Sian, OIT
4:00-4:10pm       Wrap up and online evaluation/assessment

Day 5, February 1: Preserving and Archiving Data 

1:00-1:45 p.m.   Preserving and Archiving Data
                             Willow Dressel and Yuan Li

1:45-2:15 p.m.   Hands-on activities 
                             Willow Dressel and Yuan Li

2:15-2:30 p.m.   Coffee Break

2:30-3:30 p.m.   Best practices on campus: DataSpace, Archives, Backing-up data
                             Mark Ratliff, OIT; Annalise Berdini, Princeton University Library; Gretchen Thiele, OIT
                             Research Computing

3:30-3:45 p.m.   Closing Remarks, Karla Ewalt, Office of the Dean for Research

3:45-4:30 p.m.   Reception, Lewis Science Library Atrium

Facilitators/Moderators: Ma. Florevel Fusin-Wischusen, PICSciE and Curt Hillegas, OIT Research Computing/PICSciE

Questions? Contact