Building Reddit's Custom Time On Site Metrics with Airflow and Google BigQuery

Washington, D.C. Campus

GA D.C.
509 7th Street NW, 3rd Floor
Washington D.C. 20004

About this event

Building Time On Site at Reddit with Katie Bauer, Data Science Manager at Reddit

Time on site is a foundational metric in web analytics, and building it seems straightforward enough. But modern websites are built on the backs of distributed systems, and distributed systems make it particularly difficult to figure out when something actually happened. In this talk, we'll discuss how we implemented our own time on site metric by building ETLs with Google BigQuery and Apache Airflow, the choices we made along the way, the problems those choices caused, and how we fixed them.

Bad Boys, Whatcha Gonna Do: Predicting Crime on the Streets of SF with Ruqaiya Shipchandler, Solutions Engineer at Dataiku

While San Francisco is most famous for being the technological epicenter of the world, the city's infamous past as the home of notorious criminals at Alcatraz makes us wonder: what is SF's current criminal landscape? And can we use data science to proactively fight crime in the city?

We'll share how we used Dataiku DSS to explore over 12 years of SF crime data, understand key trends, and build a predictive model that pinpoints the category of crime likely to occur at a given time and location.
