285 subscribers
Gå frakoblet med Player FM -appen!
Datahub: Open Source Data Lake with Pardhu Gunnam and Mars Lan
Manage episode 287797638 series 1433319
As the volume and scope of data collected by an organization grow, tasks such as data discovery and data management grow in complexity. Simply put, the more data there is, the harder it is for users such as data analysts to find what they’re looking for. A metadata hub helps manage Big Data by providing metadata search and discovery tools, and a centralized hub which presents a holistic view of the data ecosystem. DataHub is Linkedin’s open-sourced metadata search and discovery tool. It is Linkedin’s second generation of metadata hubs after WhereHows.
Pardhu Gunnam and Mars Lan join us today from Metaphor, a company they co-founded to build out the DataHub ecosystem. Pardhu and Mars, and the other co-founders of Metaphor, were part of the team at Linkedin that built the DataHub project. They join the show today to talk about how DataHub democratizes data access for an organization, why the new DataHub architecture was critical to Linkedin’s growth, and what we can expect to see from the DataHub project moving forwards.
Sponsorship inquiries: sponsor@softwareengineeringdaily.com
The post Datahub: Open Source Data Lake with Pardhu Gunnam and Mars Lan appeared first on Software Engineering Daily.
143 episoder
Manage episode 287797638 series 1433319
As the volume and scope of data collected by an organization grow, tasks such as data discovery and data management grow in complexity. Simply put, the more data there is, the harder it is for users such as data analysts to find what they’re looking for. A metadata hub helps manage Big Data by providing metadata search and discovery tools, and a centralized hub which presents a holistic view of the data ecosystem. DataHub is Linkedin’s open-sourced metadata search and discovery tool. It is Linkedin’s second generation of metadata hubs after WhereHows.
Pardhu Gunnam and Mars Lan join us today from Metaphor, a company they co-founded to build out the DataHub ecosystem. Pardhu and Mars, and the other co-founders of Metaphor, were part of the team at Linkedin that built the DataHub project. They join the show today to talk about how DataHub democratizes data access for an organization, why the new DataHub architecture was critical to Linkedin’s growth, and what we can expect to see from the DataHub project moving forwards.
Sponsorship inquiries: sponsor@softwareengineeringdaily.com
The post Datahub: Open Source Data Lake with Pardhu Gunnam and Mars Lan appeared first on Software Engineering Daily.
143 episoder
Alle episoder
×1 Building a Unified Hardware API at Intel with James Reinders 38:42
1 Building a State Machine Backend with Adam Berger 47:41
1 Open Source Contributing with Brian Douglas 48:01
1 Building Pieces.app and the Future of Developer Productivity with Tsavo Knott 37:49
1 Simplifying Documentation with Sébastien Lorber 49:52
1 Cloud-native Authorization with Tim Hinrichs 55:40
1 Open-Source Cloud Asset Management with Yevgeny Pats 40:26
1 Distributed Open Source Databases with Jonathan Ellis and Spencer Kimball 1:00:11
1 Grouparoo Open Source Data Tools with Brian Leonard 50:55
1 Publishing Open Source Code with William Morgan 1:00:33
1 Wasp-Lang: Boilerplate Code with Matija Sosic 57:37
Velkommen til Player FM!
Player FM scanner netter for høykvalitets podcaster som du kan nyte nå. Det er den beste podcastappen og fungerer på Android, iPhone og internett. Registrer deg for å synkronisere abonnement på flere enheter.