NCI Office of Data Sharing (ODS) Data Jamboree (Abstract Submissions)
5 submissions
# | Starred | Locked | Notes | Created | User | IP address | First Name | Middle Initial | Last Name | Degree(s) | Position/Title/Career Status | Organization | Organization Address | Other (Please Specify) | Abstract Category | Abstract Keywords | Abstract Title | Abstract Summary | Upload Abstract | Operations | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
5 | Star/flag NCI Office of Data Sharing (ODS) Data Jamboree (Abstract Submissions): Submission #5 | Lock NCI Office of Data Sharing (ODS) Data Jamboree (Abstract Submissions): Submission #5 | Add notes to NCI Office of Data Sharing (ODS) Data Jamboree (Abstract Submissions): Submission #5 | Sun, 06/01/2025 - 23:20 | Anonymous | 10.208.28.181 | Zifan | Gu | M.S. | Graduate student | UT Southwestern Medical Center | Dallas, TX | zifan.gu@utsouthwestern.edu | Development or refinement of analysis pipelines or AI/ML algorithms | deep learning, predictive modeling, large language models, translational medicine | Looking for a project team to join! | My expertise is in developing machine learning pipelines for multimodal data, including both structured and imaging (H&E, multiplex) data. One of my key strengths is being able to communicate with both data scientists and clinicians, supported by my formal training in health data science and bioinformatics. I’m looking to join an interdisciplinary team that not only focuses on algorithm development but also considers how these models can be deployed into the real world. I’m particularly interested in projects that incorporate large language models as part of the predictive modeling pipeline, addressing outcomes such as remission, mortality, or disease progression. I’m well-traversed in HPC using PyTorch and Hugging Face, and I’d be excited to join a team that plans to use AWS or GCP as their computing platforms. |
||||
4 | Star/flag NCI Office of Data Sharing (ODS) Data Jamboree (Abstract Submissions): Submission #4 | Lock NCI Office of Data Sharing (ODS) Data Jamboree (Abstract Submissions): Submission #4 | Add notes to NCI Office of Data Sharing (ODS) Data Jamboree (Abstract Submissions): Submission #4 | Thu, 05/22/2025 - 13:19 | Anonymous | 10.208.28.130 | David | Higgins | Ph.D. | Informatics Program Manager | Children's Hospital of Philadelphia | Philadelphia, PA | higginsd@chop.edu | Development of tutorial and educational tools, data storytelling, infographics, and other creative uses of data | variant discovery, jupyter notebook, pyspark sql, genomics, training | Tutorial and Example Notebooks for the Kids First Variant Workbench | The goal of the Gabriella Miller Kids First Pediatric Research Program is to find common underlying genetic causes of pediatric cancer and structural birth defects. The Gabriella Miller Kids First Data Resource Center (Kids First DRC) produces high quality clinical and genomic datasets to support this goal, accessible and analyzable via our interoperable cloud platforms. The Kids First Variant Workbench powered by CAVATICA accelerates breakthroughs in pediatric medicine by combining Kids First participant conditions, genomic variants, and variant annotations. With each of these tools at their disposal, the Variant Workbench provides one workspace for researchers to make discoveries using Kids First datasets they have received access to, enabling them to accelerate variant discovery. On a technical level, the Variant Workbench is a series of tables containing information about Kids First participants, their genomic variants, and annotations such as ClinVar and CADD to put these in context. These tables can be joined together using PySpark SQL to isolate the specific fields of interest in the more than 400 million unique variants in the Kids First cohorts. The latest release contains germline and somatic variants from 9 cohorts, a total of ~3,900 participants. It is possible to query, analyze and display both types of variants at the same time for these studies. The goal of this project is to develop tutorial and educational tools to increase usage of the Variant Workbench. We have a specific interest in the development of written directions for integrating other datasets into the Variant Workbench such as TARGET as well as example Jupyter notebooks that can be executed by users to show the platform’s capabilities. Overall though, these tools could take many formats and we are looking forward to working with members of the research community to hear their feedback and ideas as well. |
||||
3 | Star/flag NCI Office of Data Sharing (ODS) Data Jamboree (Abstract Submissions): Submission #3 | Lock NCI Office of Data Sharing (ODS) Data Jamboree (Abstract Submissions): Submission #3 | Add notes to NCI Office of Data Sharing (ODS) Data Jamboree (Abstract Submissions): Submission #3 | Mon, 05/12/2025 - 15:46 | Anonymous | 10.208.28.143 | Stephanie | Spielman | Ph.D. | Data Scientist | Alex's Lemonade Stand Foundation | Bala Cynwyd, Pennsylvania | stephanie.spielman@ccdatalab.org | Methods to enable data interoperability | N/A | I am looking for a project team to join for this event. My background is in evolutionary computational biology, but I have worked in the pediatric cancer research space (either full-time or collaboratively) for 4-5 years. I primarily work broadly on pediatric cancer transcriptomtics in a purely computational environment. These days I am working primarily with R and UNIX but I have also worked in Python and with workflow tools like Nextflow. I am enthusiastic about open-source and reproducible coding practices (including GitHub), approaches that improve researchers' ability to obtain, manage, clean, and organize their research projects including data, and educating researchers about these frameworks. In addition, I have excellent written and oral communication skills with an extensive background in writing user-friendly documentation to support my software projects. My goals in joining this "data interoperability" group at the jamboree are to learn more about the limitations researchers experience when attempting to integrate different data sources and identify/begin implementing approaches (which might include technical software, documentation, or developing guidelines/recommendations) to reduce the barriers researchers experience when working with data from disparate sources. | |||||
2 | Star/flag NCI Office of Data Sharing (ODS) Data Jamboree (Abstract Submissions): Submission #2 | Lock NCI Office of Data Sharing (ODS) Data Jamboree (Abstract Submissions): Submission #2 | Add notes to NCI Office of Data Sharing (ODS) Data Jamboree (Abstract Submissions): Submission #2 | Fri, 05/09/2025 - 15:21 | Anonymous | 10.208.24.48 | Maggie | Cam | Ph.D. | Staff Scientist | NCI | Bethesda | maggie.cam@nih.gov | Methods to enable data interoperability | CRDC API, Pediatric Cancer, Immune Profiling, Reproducible Workflows, Immuno-Oncology Data Commons (IODC) | Reproducible Pediatric Immune Profiling Using CRDC APIs and Local HPC Analysis | This project will develop a reproducible workflow for immune profiling of pediatric solid tumors by combining CRDC API-based cohort selection with local analysis on NIH’s Biowulf cluster. The workflow will serve as a pilot for improving data reuse and supporting future methods development for the Immuno-Oncology Data Commons (IODC). | ||||
1 | Star/flag NCI Office of Data Sharing (ODS) Data Jamboree (Abstract Submissions): Submission #1 | Lock NCI Office of Data Sharing (ODS) Data Jamboree (Abstract Submissions): Submission #1 | Add notes to NCI Office of Data Sharing (ODS) Data Jamboree (Abstract Submissions): Submission #1 | Fri, 05/09/2025 - 08:50 | Anonymous | 10.208.24.48 | Jaclyn | N | Taroni | Ph.D. | Director of the Childhood Cancer Data Lab | Alex's Lemonade Stand Founda | Bala Cynwyd, PA | j.taroni@alexslemonade.org | Development or refinement of analysis pipelines or AI/ML algorithms | Seeking a project team to join | I am looking for a project team to join. I have a background in computational biology, machine learning, and data visualization. I additionally have experience in product management, short-format training, and tutorial/documentation writing as part of my current position, where I supervise scientists, UX designers, and software engineers. Given this range of experience, I would be happy to be placed on a team in many of the abstract areas. I can potentially contribute to the experimental design of AI/ML projects, programmatic visualization, or developing tutorials. |