Exploring GitHub with BigQuery at GitHub

Published January 19, 2017, 19:16
Felipe Hoffa meets Alyson La, Data Scientist at GitHub. They explore how she uses BigQuery and other big data tools to do her job at GitHub.

Featuring 3 open datasets: GitHub Archive (a timeline of all GitHub events, githubarchive.org/), GitHub Data (the contents of GitHub open source files, ready to be analyzed, cloud.google.com/bigquery/publ...), and GHTorrent (similar to GitHub Archive, plus additional tables, ghtorrent.org/).

On the tools side, we show how Alyson works with the BigQuery web UI and how BigQuery connects to Tableau and Looker.

Sample queries from the GitHub Octoverse report: gist.github.com/alysonla/e14c0...

GitHub Event Types & Payloads docs: developer.github.com/v3/activi...

Blog post: github.com/blog/2298-github-da...