David Spivak

Visiting Assistant Professor
University of Oregon
Department of Mathematics

Office: 317 Fenton Hall
Email: dspivak at uoregon


"Information is the new physics."

Using mathematics to model the physical universe has proven quite successful (for both math and physics). In the modern world, it has also become imperative that we find better ways to classify, organize, process, and understand information. For this, we should again turn to mathematics.

I think that category theory and topology have a lot to offer in this regard. In order to find somewhere to start, I've been working on aspects of computer science which are relevant to the goal of understanding information. The idea is that we already do have ways of classifying, organizing, and processing information; by modeling them mathematically, we can understand their conceptual underpinnings, which in turn can help us create new and better systems in the future.

The following technical proposal (which is not too technical) may help make this more clear and explicit:

Databases and Networks. This is a proposal for a grant I received from the ONR (N000140910466). In it, I propose to use category theory to study databases, networks, and learning. It is a more technical version of the proposal Networking academia I had earlier (unsuccessfully) submitted for a grant called "Science and the Human Condition" at the University of Oregon. The idea here is to study networks, as found in computer science, economics, linguistics, sociology, and biology, under a single mathematical framework. The document is written for a layperson.



Simplicial databases. A paper that categorifies databases. In this model, a "schema" for a database is a labeled simplicial set, whose n-simplicies correspond to tables with n+1 columns. We form a simplicial set by taking the union of such simplices, and doing so reflects the links between objects of differing types. The data itself is then a sheaf on the corresponding site. This category of databases is closed under limits and colimits and these functors correspond to common operations on databases, such as joins, selects, and unions. [Submitted to Mathematical Structures in Computer Science, and the arXiv.]
I gave a talk on this subject at the 2008 ATMCS meeting in Paris, and wrote a more brief survey of the ideas here. written out as a .pdf document.
A primitive version of this talk in which we only discuss the category of tables, was given as a colloquium in the University of Oregon computer science department. It can be found here.

Higher-dimensional models of networks. In this paper, I make a case that many networks involve interactions between more than two entities, and as such should not be modeled by graphs but by simplicial sets. I give a variety of other models and suggest ways to determine which model is most applicable for a given situation, and how to transform models from one type into another. The paper is written mainly for a CS audience familiar with the basics of category theory. It can also be found on the arXiv. A more mathematical paper on networks and information processing is in the works.

Metric realization of fuzzy simplicial sets. Like weighted graphs, fuzzy simplicial sets can be used to model networks. The idea is that the connection between a group of people is a number between 0 and 1. The linked paper does not discuss these real-world applications -- just some preliminary thoughts on the mathematics involved.