About



This ongoing project explores the index as a creative infrastructure for mapping knowledge and associative relationships within information. Using NLP to map textual data spatially, the system generates unique layouts of text on the page, where each index entry's position is determined by its semantic proximity to others in the dataset as determined by an approximate nearest neighbors algorithm, forming a multiply-determined visual topology.

The corpus for this instance of the project is comprised of the indexes of 3 books selected by the artist from the Cybernetics Library’s Uncomputable collection:

▫︎Updating to Remain the Same: Habitual New Media⸻Wendy Hui Kyong Chun
▫︎Code and Clay, Data and Dirt: Five Thousand Years of Urban Media⸻Shannon Christine Mattern
▫︎The Moment of Complexity: Emerging Network Culture⸻Mark C. Taylor

The webpage provides a visual and textual interface to explore a dataset of index entries and their associated metadata. An interactive 3D t-SNE visualization displays the vector space, each point representing an index entry. Visitors can generate a new speculative index by clicking any entry, which highlights a new set of entries, constructing a unique shape each time it is regenerated. The visualization can be navigated by clicking and dragging to move around it, and scrolling up ↑ to zoom in and down ↓ to zoom out.

View the slides to the first rendition of the project


Methodology

········pre-processing
················make each index structure uniform
················parse into entires, reassemble

········processing
················create vectors for each sequence of text (entry) in the corpus
················reduce dimensionality of each entry's vector with t-SNE 
················save the x, y and z values of t-SNE visualization of vectors to visualize the dataset

········generating
················choose 4 random entries from dataset of entries
················the position of each subentry on the page is determined by an approximate nearest neighbors algorithm