Skip to Content

Sponsors

No results

Tags

No results

Types

No results

Search Results

Events

No results
Search events using: keywords, sponsors, locations or event type
When / Where
All occurrences of this event have passed.
This listing is displayed for historical purposes.

Flyer Flyer
Flyer
Abstract:
Examples of data that lie in a simplex abound in a variety of fields, such as biology, geology, and sociology. However, it can be challenging to model this type of data, particularly because each observation must sum up to one. If not handled correctly, this might induce what Karl Pearson called "spurious correlation". As a trivial example of this, take an observation that has two values and sums up to one. Then, the values must be negatively correlated with each other. While traditional approaches for this type of data involve analyzing the log ratio transforms, this might be problematic if any of the observations have a zero as one of their values or for interpretation.

Additional challenges arise if this data changes over time. The first two chapters lay the framework for how such data may be modeled. The first chapter proposes doing so using a general affine transformation for the overall change and a sufficiently rich error model for the difference between the overall transformation and the observation at the next time point. Of the three models explored, the Rotational Geodesic Error model is most promising. However, it might not be appropriate to assume that the direction observations moved in is uniformly distributed. Using ideas from directional statistics, we discuss in the second chapter how to model directions that appear to be similar for observations with similar values. In both chapters, we run simulation studies and analyze the income proportions from Los Angeles County. In each case, our analysis is able to discover trends consistent with larger macroeconomic ones and provide further details about these trends.

The last chapter discusses tree based mixtures of probability simplices. In other words, the simplices share vertices in a way that can be represented by a tree with the root node corresponding to a vertex shared by all simplices and the leaf nodes corresponding to vertices present in one simplex. We show when such models have posterior consistency and demonstrate how to efficiently fit them using geometric methods. Indeed, we apply them to analyze a subset of articles from the New York Times, uncovering meaningful topics and interesting semantic relationships between these topics. While we leave it to future work, these methods might also be combined with the ones from previous chapters to model how sub-regions of data that lie in a simplex change over time.
Flyer Flyer
Flyer

Explore Similar Events

  •  Loading Similar Events...

Back to Main Content