You just unlocked $/£/€ 150 off a workshop. Use code BFCM26 at checkout to reserve your spot at the lowest price yet.
Unlock our largest short course discount of the year. Use code BFCM26* during your call with admissions. Start now. *T&Cs apply
You just unlocked 4 new courses. Apply between now and Dec 31 to waive your application fee*. Start now. *T&Cs apply
Disrupting Data Discovery with Amundsen by Daniel Won, software engineer at Lyft:
Data scientists at Lyft spend approximately 30% of their time in the data discovery—answering questions such as Does this data exist? Who owns this data? What previous analysis exist? And can I trust this data? While data discovery is a prerequisite to delivering good analysis, it does not in itself bring value to the company. Reducing the time spent on data discovery enables data scientists to spend more time building models and visualizations.
Amundsen is an open-source tool built at Lyft that aims to solve the data discovery problem. We index and serve metadata about data resources in a simple and intuitive interface. A user can run a search that will return a list of results sorted by relevance and popularity. Currently, we index tables and people, with plans to index dashboards and teams as well. At Lyft, we have reduced the time spent in data discovery by 75% from baseline.