6

GitHub - pldubouilh/spotify-gdpr-dump-analysis: analysis of complete spotify str...

 1 year ago
source link: https://github.com/pldubouilh/spotify-gdpr-dump-analysis
Go to the source link to view the article. You can view the picture content, updated content and better typesetting reading experience. If the link is broken, please click the button below to view the snapshot at that time.
neoserver,ios ssh client

spotify-gdpr-dump-analysis

Local analysis of complete spotify streaming dataset (endsong_*.json). Made in 3 hours alongside with chatGPT, fixing bugs as they appeared.

Ask for your GDPR streaming data dump here. It take a couple days to come.

That's a whole lot of data eyes

# deps
$ pip install geoip2 basemap pandas matplotlib

# get geodb for local ip lookup
$ curl -L -o city.mmdb https://github.com/lysenkobv/maxmind-geolite2-database/raw/master/city.mmdb

# create sqlite3 database from json dump
$ python makedb.py datafolder/

# run analysis !
$ python map-ips-city.py

a
$ python top-cities.py
df                     city         country  count
20                   Berlin         Germany   2629
...
$ python top-songs-per-country.py
DE                                                 La femme d'argent                        Air
DE  Piano Concerto No. 3 in D Minor, Op. 30: I. Allegro ma non tanto        Sergei Rachmaninoff
DE                                La mer, L. 109: II. Jeux de vagues             Claude Debussy
DE                                                   Samba da Bencao             Bebel Gilberto
DE                                      Merry Christmas Mr. Lawrence           Ryuichi Sakamoto
DE                                                        WEIGHT OFF                 KAYTRANADA
...

About Joyk


Aggregate valuable and interesting links.
Joyk means Joy of geeK