Mario Aksiyote

About

Work

Contact

Back to all work

Band on the Run: Connecting neighborhoods through live music

February 26, 2018

Company: Topos

Collaborators: Will Shapiro, Mahir Yavuz

Roles: data science, data visualization

The following article looks at how we can use machine learning to explore the ways that neighborhoods are connected by live music. Using Scikit Learn I was able to set up a collaborative filtering model to explore neigborhood, venue, and artist similarities based on touring patterns across the US. I then used D3 to make all the visualizations for the article. A special thanks to Will Shapiro for the writeup.


Every day’s an endless stream Of cigarettes and magazines. And each town looks the same to me, the movies and the factories…

—Paul Simon

From Right Here to Everywhere

A musical scene can indelibly define a place. The specific culture of a neighborhood can give birth to a new sound. From New Orleans Jazz to DC Hardcore, Greenwich Village Folk to Queensbridge Hip Hop, musical scenes have been intertwined with the identity of geographic areas ranging from specific street intersections (Haight Ashbury) to entire metropolitan areas (Nashville).

Left: The Grateful Dead at the corner of Haight and Ashbury (credit: Herbie Greene) — Right: Medieval troubadours painted by Simone Martini c. 1315 A.D.

At the same time, since antiquity, music has travelled — from medieval troubadours, to the traveling opera companies of the mid-nineteenth century, through to the decades long cross-country meanderings of the Grateful Dead.

With the rise of the internet and streaming services, new music can reach millions of geographically distributed fans at once, allowing highly specific genres like Chicago Footwork to develop a passionate following in London and even find new expression in cities as far away as Hiroshima. Increasingly the internet itself is the metropolis where genres like Vaporwave and Seapunk are born.

Left: Chicago Footwork in Japan (credit: John Calvert), Right: Vaporwave-A genre born on the internet (credit: Reddit)

Yet, despite the simultaneous, everywhere nature of the internet and music streaming — or perhaps because of it — touring remains a vital (if troubled) facet of the music industry. Musicians continue to connect with fans in neighborhoods across the globe through live shows. In this article we explore the links between these geographically distributed fanbases and ask: how do touring musicians connect neighborhoods?

Since forming Topos last February, we’ve been fascinated by a simple question: what does distance mean in the 21st century? While in past articles we’ve explored holistic understandings of distance that leverage a wide range of heterogeneous data and technologies (first in New York City, then more broadly across the US), in this article, we focus narrowly on a single dataset and technological approach. In particular, we take the tour dates of musicians traveling across the US from 2008 to the present as the basis for a machine learning model that allows us to develop a tour-based distance metric relating neighborhoods across the US. We then use this metric to algorithmically generate venue and neighborhood suggestions for touring musicians.

Below: A sampling of touring patterns across the U.S.

The jet set: Lady Gaga, Kendrick Lamar, Katy Perry

Home/away or on the tour bus: The Radioactive Chicken Head, Bob Schneider, Sam Evian

Hyper local: Andy Coe Band, Surprise Party, Just Another Folk Singer

A Model Built on Co-occurrence

Collaborative Filtering (CF) is one of the most widely used machine learning approaches for determining distance between entities. Once calculated, these distances are often used to power recommendations. From Spotify’s Discover Weekly to Amazon’s product recommendations, CF algorithms form an important part of many well known recommendation engines.

Neighborhoods that co-occur frequently on tour schedules become closely connected

The fuel for CF recommendations are datasets where candidate recommendable items co-occur. Amazon’s recommendations are fueled by the co-occurrence of items in users’ shopping carts; Spotify’s Discover Weekly is fueled by the co-occurrence of songs in user generated playlists and listening histories. In our case, the co-occurrence of venues and neighborhoods on the schedules of touring musicians provided the input for our CF-based similarity metric. We were able to construct these schedules by hooking into the setlist.fm API, an incredible resource that has data on concerts in the US dating back to 1850.

Exploration: Neighborhood Similarity

We start exploring our tour-based similarity metric by looking at three very different neighborhoods: Bushwick NYC, Downtown LA, and Maryvale, Phoenix.

Neighborhood similarities visualized in three dimensional space

Neighborhood similarities visualized in three dimensional space

Bushwick, NYC

Left: Acid Mothers Temple; Right: Acid Baby Jesus

Bushwick is most similar to other well known hipster neighborhoods across the US. It is perhaps telling that there is not one but two bands whose names start with the word “Acid” amongst the most popular acts in the list of similar neighborhoods.

Popular Musical Acts: Acid Mothers Temple, John Maus, Ty Segall, Widowspeak, Acid Baby Jesus

Downtown LA

The Trans Siberian Orchestra — a band that has only ever played arenas

Home to the 21,000-seat Staples Center, Downtown LA’s most similar neighborhoods are other centrally located neighborhoods surrounding big arenas like the Boston Garden and Philadelphia’s Wells Fargo Center. Popular acts in these neighborhoods tend to be top-of-the-charts musicians and — across the board — the Trans-Siberian Orchestra, a band famous for going directly to arenas without ever having played smaller clubs or opening for other bands.

Popular Musical Acts: Lady Gaga, Rihanna, Katy Perry,Justin Bieber, Trans-Siberian Orchestra

Maryvale, Phoenix

Brad Paisley, Toby Keith, Mötley Crüe

Maryvale — famously one of first planned communities in the United States — is located on the edge of the city of Phoenix; its most similar neighborhoods tend be outside of major metropolitan city centers (Fair Park Dallas), or to form small municipalities in their own rights (Tinley Park IL, Englewood CO). Mainstream Country and Metal are generally the most popular genres, with a solid showing from 70s Classic Rock Bands (Journey, Boston, Styx).

Popular Musical Acts: Brad Paisley, Toby Keith, Mötley Crüe, Slipknot, Journey

The Closest Connections

By allowing only the strongest links (>top .1 percentile) between neighborhoods to remain, we can observe some interesting neighborhood groupings. One striking aspect of these groups is their diversity: some are tightly connected geographically (Group 2) while others span the breadth of the country (Group 5); some have narrow genre preferences (Group 4) while others exhibit more eclectic tastes (Group 1).

Network diagram of neighborhood clusters

Group 1: Pacific Northwest

Separated by 182 Miles, this small cluster of two neighborhoods spans a wide range of genres. Within this stylistic diversity, the most frequent acts tend to be older, established medium-popularity performers

Top acts: Indigo Girls, The B‐52s, Brandi Carlile, Ziggy Marley, Aimee Mann

Group 2: Insider NYC

Separated by just 4 Miles, this small cluster is the tightest geographically. Group 2 is also the most ‘local’, with 16 of the top 20 (mainly alt/indie) performers based in NYC.

Top Acts: Widowspeak, Moon Hooch, Men and Whales, The Bottom Dollars, Sharon Van Etten

Group 3: Almost Country

Largely comprised of western neighborhoods (with Tinley Park IL as the sole exception), group 3 has a corresponding passion for country music; half of the top ten acts are mainstream country musicians (Brad Paisley, Luke Bryan, Toby Keith, Zac Brown Band, Rascal Flatts). As in the earlier exploration of Maryvale, Mainstream metal and 70’s classic rock are also favorites.

Top Acts: Brad Paisley, Slipknot, Luke Bryan, Mötley Crüe, Journey

Group 4: Central Downtown Areas

Connecting centrally located downtown areas, Group 4 is the most geographically dispersed. In contrast to this geographic diversity, Group 4 is tightly focused on a particular spectrum of sound — the pop-punk/emo/post-punk continuum (with some Comedic Metal — Steel Panther, Gwar — sprinkled in).

Top Acts: Steel Panther, Say Anything, All Time Low, The Used, Mayday Parade

Group 5: Arena Haloes

Centered around huge stadiums (NYC’s Madison Square Garden, The Boston Garden, Chicago’s United Center) the neighborhoods in Group 5 are visited by arena-filling superstars like Bon Jovi, Kanye West, and of course, Billy Joel whose monthly MSG residency (and accompanying helicopter commute) has become legendary.

Top Acts: Trans‐Siberian Orchestra, Bon Jovi, Roger Waters, Rihanna, Katy Perry

Recommendations

Similarity metrics are often constructed in order to power recommendation engines. And where Spotify might recommend an album or Netflix might suggest a movie, here we give examples of using our similarity metric to recommend venues and neighborhoods for touring musicians, focusing on locations where the musician rarely, if ever, performs. For each act, we also produce a list of similar musicians based on their touring schedules.

We then flip directions and focus on location, recommending musicians who have yet to perform but would be most likely to find an audience for a sampling of venues.

From Nodes to Edges

In this article, we’ve constructed a narrow, highly specific view of place, ignoring myriad factors that shape neighborhoods. While there is a small but statistically significant correlation between the similarity metric constructed here and the Topos Similarity Index (which increases for certain cities), these measures are largely orthogonal. The TSI is a holistic measure of similarity, encompassing everything from the form of the built environment to the ratio of big box stores to local retailers, while here we have worked with a single data source pertaining to one facet of culture.

Tour Based Similarity vs the Topos Similarity Index for Boston <> US. Pearson Correlation of .249, p <<.05

Yet even this narrow view reveals much about neighborhoods, from their form (the connected downtown neighborhoods surrounding large arenas) to their milieu (the hipster neighborhoods connected to Bushwick).

We believe this approach starts to demonstrate the potential of understanding location as a set of relationships rather than solely as a set of isolated points or regions to which metrics are ascribed. Many applications of Location Intelligence — from opening a new store to planning a trip, launching a political campaign to arranging a tour — are ultimately about relationships: Brand and customer, traveller and a foreign culture, politician and constituent, touring musician and fan. Understanding the manifold ways one place is similar to another provides rich context for expanding these relationships into new territories.


Did you like that last post?

Check out some more of my work

The Next Wave

Article looking at New York City through the lens of coffee in an attempt to explore a fundamental question of spatial economics – how are the locations of businesses determined?

The COVID-19 Compiler

App meant to provide a multidimensional view of COVID-19’s impact in counties across the US. It encompasses the mapping of vulnerable populations, state and local policies to reduce transmission, and data on medical / health care resources. Featured extensively on the Rachel Maddow Show.

FIT

A platform made to help commercial real-estate brokers fill retail leases with companies that will be both successful and good for the neighborhood.

[x] Everywhere

An art project meant to ask the question — What does distance mean in the 21st century? [x] Everywhere is a visual (and arguably insane) way to explore neighborhood similarities.