Artwork

Innhold levert av Vanessa Sochat. Alt podcastinnhold, inkludert episoder, grafikk og podcastbeskrivelser, lastes opp og leveres direkte av Vanessa Sochat eller deres podcastplattformpartner. Hvis du tror at noen bruker det opphavsrettsbeskyttede verket ditt uten din tillatelse, kan du følge prosessen skissert her https://no.player.fm/legal.
Player FM - Podcast-app
Gå frakoblet med Player FM -appen!

Joys and Challenges with Big Research Data

34:01
 
Del
 

Manage episode 343889236 series 2556771
Innhold levert av Vanessa Sochat. Alt podcastinnhold, inkludert episoder, grafikk og podcastbeskrivelser, lastes opp og leveres direkte av Vanessa Sochat eller deres podcastplattformpartner. Hvis du tror at noen bruker det opphavsrettsbeskyttede verket ditt uten din tillatelse, kan du følge prosessen skissert her https://no.player.fm/legal.

Ana Trisovic is a Research Associate at Harvard School of Public Health and a Sloan Fellow at the Institute for Quantitative Social Science. Effectively, she does data engineering for her research group and works on reproducible data and software dissemination. First, Ana speaks of her background, from her first job at Microsoft Development Center Serbia, to CERN, UChicago, and Harvard. She shares what inspired her to pursue projects relating to open-source software, open data, and open science. Her work focuses on big data workflows and research reproducibility, and she shares her experiences working with particle physics experimental data, geospatial and climate data, and sensitive medical data. As a member of Consortium of Scientific Software Registries and Repositories (SciCodes), she contributes to research data and software sharing and preservation efforts. Her study shows that research software and code scripts are frequently shared with data, and she is working on better supporting those in the Dataverse data repository. We discuss data engineering roles in the broader RSE scope and recognize them as undervalued yet critical for research groups working on secondary data analysis. Ana speaks of the joys and challenges of working with diverse datasets and the value of open-source software, reusable data workflows, and adequate documentation. She shares recommendations for publishing research data with software and emphasizes the role of data repositories. We end the conversion with community engagement topic ideas.

  continue reading

144 episoder

Artwork
iconDel
 
Manage episode 343889236 series 2556771
Innhold levert av Vanessa Sochat. Alt podcastinnhold, inkludert episoder, grafikk og podcastbeskrivelser, lastes opp og leveres direkte av Vanessa Sochat eller deres podcastplattformpartner. Hvis du tror at noen bruker det opphavsrettsbeskyttede verket ditt uten din tillatelse, kan du følge prosessen skissert her https://no.player.fm/legal.

Ana Trisovic is a Research Associate at Harvard School of Public Health and a Sloan Fellow at the Institute for Quantitative Social Science. Effectively, she does data engineering for her research group and works on reproducible data and software dissemination. First, Ana speaks of her background, from her first job at Microsoft Development Center Serbia, to CERN, UChicago, and Harvard. She shares what inspired her to pursue projects relating to open-source software, open data, and open science. Her work focuses on big data workflows and research reproducibility, and she shares her experiences working with particle physics experimental data, geospatial and climate data, and sensitive medical data. As a member of Consortium of Scientific Software Registries and Repositories (SciCodes), she contributes to research data and software sharing and preservation efforts. Her study shows that research software and code scripts are frequently shared with data, and she is working on better supporting those in the Dataverse data repository. We discuss data engineering roles in the broader RSE scope and recognize them as undervalued yet critical for research groups working on secondary data analysis. Ana speaks of the joys and challenges of working with diverse datasets and the value of open-source software, reusable data workflows, and adequate documentation. She shares recommendations for publishing research data with software and emphasizes the role of data repositories. We end the conversion with community engagement topic ideas.

  continue reading

144 episoder

Alle episoder

×
 
Loading …

Velkommen til Player FM!

Player FM scanner netter for høykvalitets podcaster som du kan nyte nå. Det er den beste podcastappen og fungerer på Android, iPhone og internett. Registrer deg for å synkronisere abonnement på flere enheter.

 

Hurtigreferanseguide

Copyright 2024 | Sitemap | Personvern | Vilkår for bruk | | opphavsrett