The Google Cloud Public Datasets program recently published the Python Package Index (PyPI) dataset to the marketplace. PyPI is the standard repository for Python packages. If you’ve written code in Python before, you’ve probably downloaded packages from PyPI using pip or pipenv. This dataset provides statistics for all package downloads, along with metadata for each distribution. You can learn more about the underlying data and table schemas here. Below, I’ll walk through a few examples of how you can leverage this data.
As a Python enthusiast who has helped build out a number of Looker packages, I was particularly interested in jumping into this dataset to learn more about how the libraries are being used. I began by looking at the number of installations each day over the past 12 months for packages whose name contains looker.
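A query along those lines might look like the sketch below. It is a rough illustration, not the exact query behind the results discussed here. It assumes the downloads live in the `bigquery-public-data.pypi.file_downloads` table with a `timestamp` column and a `file.project` field, and that the `google-cloud-bigquery` client library is installed with credentials configured. The function names `daily_installs_sql` and `run_daily_installs` are hypothetical helpers introduced for this example.

```python
def daily_installs_sql(months: int = 12) -> str:
    """Build a query counting daily downloads for projects matching @fragment.

    Assumption: the table and column names below match the public PyPI
    dataset; check them against the published schema before running.
    """
    if not isinstance(months, int) or months < 1:
        raise ValueError("months must be a positive integer")
    return f"""
        SELECT
          DATE(timestamp) AS day,
          file.project AS project,
          COUNT(*) AS installs
        FROM `bigquery-public-data.pypi.file_downloads`
        WHERE file.project LIKE CONCAT('%', @fragment, '%')
          AND DATE(timestamp) >= DATE_SUB(CURRENT_DATE(), INTERVAL {months} MONTH)
        GROUP BY day, project
        ORDER BY day, project
    """


def run_daily_installs(fragment: str = "looker", months: int = 12):
    """Run the query and return (day, project, installs) tuples.

    Requires the third-party google-cloud-bigquery package and valid
    Google Cloud credentials; imported lazily so the SQL builder above
    can be used without it.
    """
    from google.cloud import bigquery

    client = bigquery.Client()
    job_config = bigquery.QueryJobConfig(
        query_parameters=[
            # Pass the name fragment as a parameter instead of formatting
            # it into the SQL string.
            bigquery.ScalarQueryParameter("fragment", "STRING", fragment),
        ]
    )
    rows = client.query(daily_installs_sql(months), job_config=job_config).result()
    return [(row["day"], row["project"], row["installs"]) for row in rows]
```

The downloads table is very large, so a query covering a full year can scan a lot of data. Filtering on the date, as above, is one way to keep the scanned range bounded.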